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Percentile Performance Criteria for Limiting Average 
Markov Decision Processes, Filar, Krass , and Ross . 

This paper addresses the question of whether a policy can be 
found that achieves a specified value (target) of the long-run limiting 
average reward at a specified probability level (percentile) for a 
Markov Decision Process (MDP). This is motivated from ‘the fact 
that an optimal policy for an infinite horizon MDP with a limiting 
average reward criterion is insensitive to the probability distribution 
function of the long-run average reward. Therefore, it is possible that 
an optimal policy might carry an unacceptably high probability of low 
values of the long-run average reward. For a problem with a single 
objective, a complete classification of both the maximal achievable 
target levels and of their corresponding percentiles is presented. Also, 
in the case of a communicating MDP, it is shown that every target 
level can be achieved with only two possible values: zero or one. 
The approach is constructive and thus furnishes an algorithm for 
computing a deterministic policy for any feasible target level and 
percentile pair. Problems with multiple objectives and/or constraints 
are also studied. These are considered in detail for communicating 
MDP’s with constructive solutions obtained for most cases. 

Design and Analysis of Fuzzy Identifiers of Nonlinear 
Dynamic Systems, Wang . 

This paper develops identification algorithms based on “fuzzy” 
IF-THEN rules for nonlinear dynamic systems. These rules can be 
either collected form human experts or generated from sensory mea¬ 
surements. Two fuzzy identifies are developed based upon different 
rules structures. Both algorithms are shown to be globally stable in 
the sense that all variables are uniformly bounded and, under mild 
conditions, the associated prediction errors converge to zero. The 
algorithms are applied to identify the chaotic glycolytic oscillator. 

Stochastic Approximation with Averaging and Feedback: 
Rapidly Convergent “On Line” Algorithms, Kushner and 
Yang. 

This paper is concerned with the problem of stochastic approxi¬ 
mation with averaging of the type considered by Poyak and Juditsky, 
The authors point out that while the averaging approach has many 
advantages, choosing the appropriate slowly time-varying gain can 

’This section is written by the Transactions Editorial Board. 


be a nuisance. The authors propose a new stochastic approximation 
model with constant gain and a feedback term. The rate ofc<*nvgo*, 
gence of the approximation sequence arid the averages m analyzed. 
This result can be applied to feted parameter estimation pmWepxi 
The feedback approach is further extended to time-varying parameter 
cases. The authors conclude the paper with a tuimikrtton suitfyof the 
proposed algorithms. 

Exponential Stabilization of Pfouhckmmk Chained 
Systems, Sfrdalen and Egeland. 

This paper analyzes the stabilization of two-input drlftlesa nom 
holonomic systems in chained farm. It is shown how to achieve 
global asymptotic stability wife exponential convergence about any 
reference configuration by using a nonsmootb time-varying feedback 
control law. The notion of K-exponential stability is used. Simulation 
results are given in addition to theoretical analysts. 

Global Total Least Squares Modeling of Multivariable 
Time Series, Roorda and Heij. 

This paper concerns the problem of modeling discrete-time multi¬ 
variate umc series by a finite dimensional linear system. The paper 
differs from other papers on identification in its viewpoint: Model 
error is defined as the minimal adjustment of the data required to 
make the time series obey the system equations so that no distinction 
need be made between system inputs and outputs. The goal of finding 
an ideal realization can thus be formulated as a total least squares 
problem. An iterative algorism is devised to obtain an optimal modal, 
and several numerical simulations are presented as illustrations. 

Control of Chained Systems. Application to Path 
Following and Time-Varying Point Stabilization of Mobile 
Robots, Samson. 

This paper studies the control of a class of nonholonotnic kinematic 
systems. Such systems are nonholonomic in that they are subject to 
nomntegrable constraints, and they are kinematic in that the controls 
are velocity controls. The class of systems studied here—systems in 
chained form—has proven to be particularly amenable to analysis. 
Here it is shown that systems in chained form may be put into 
a convenient skew-symmetric chained form, and some open-loop 
and closed-loop control problems are solved, including global point 
stabilization by time-varying feedback. 
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Percentile Performance Criteria For Limiting 
Average Markov Decision Processes 

Jetty A. Filar, Dmitry Krass, and Keith W. Ross, Senior Member , IEEE 


cf—In tills paper we address the tallowing basic fea- 
flMf&j problem for Infinite-horizon Markov decision processes 
fM?S#Y>s can a policy be found that achieves a specified value 
(target} of the long-run limiting average reward at a specified 
pfWbaMHty level (percentile)? Related optimization problems of 
hinidhivtalftg the target for a specified percentile and vice versa 
ate also considered. We present a complete (and discrete) clas¬ 
sification of both the maximal achievable target levels and of 
their corresponding percentiles. We also provide an algorithm for 
competing a deterministic policy corresponding to any feasible 
Uu'&t-percenttte pair. 

Next we consider similar problems for an MDP with multiple 
rewards and/or constraints. This case presents some difficulties 
and foods to several open problems. An LP-based formulation 
provides constructive solutions for most cases. 

I. Introduction and Definitions 

I NFINITE horizon Markov decision processes (MDP’s, for 
short) have been extensively studied since the 1950’s. One 
Of the most commonly considered versions is the so-called 
limiting average reward"* model. In this model the decision¬ 
maker aims to maximize the expected value of the limit- 
average (“long-run average”) of an infinite stream of single- 
stage rewards. There are now a number of good algorithms 
for computing optimal deterministic policies in the limiting 
average MDP’s (e.g M see [4], 16], [11]). 

U should be noted, however, that an optimal policy in 
the above “classical” sense is insensitive to the probability 
distribution function of 1 the long-run average reward. That 
is, it is possible that an optimal policy, while yielding an 
acceptably high expected long-run average reward, carries 
with it unacceptably high probability of low values of that 
same random variable. This “risk insensitivity” is inherent in 
the formulation of the classical objective criterion as that of 
maximizing the expected value of a random variable, and it 
is not necessarily undesirable. Nonetheless, in this paper we 
adopt the point of view that there are many natural situations 
where the decision-maker is interested in finding a policy that 
will achieve a sufficiently high long-run average reward, that 
is, a target level with a sufficiently high probability, that is, a 
percentile. The key conceptual difference between this paper 

Manuscript received January 22, 1992, revised Apnl 12, 1993 This work 
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and the classical problem is that our controller is not searching 
for an optimal policy but rather for a policy that is “good 
enough,” knowing that such a policy will typically fail to 
exist if the target level and the percentile are set too high. 
Conceptually, our approach is somewhat analogous to that 
often adopted by statisticians in testing of hypotheses where 
it is desirable (but usually not possible!) to simultaneously 
minimize both the “type 1” and the “type 2” errors. See 
Bouakiz [5] for a review of similar approaches in economics 
and operations research literature and White [16] for a review 
of various approaches to nsk-sensitivity in MDP’s. 

We start out by considering a problem with a single objec¬ 
tive. It will be seen (Section IV below) that for our target level- 
percentile problem it is possible to present a complete (and 
discrete) classification of both the maximal achievable target 
levels and of their corresponding percentiles (see Theorem 
4.3 and its corollaries). The case of a communicating MDP 
is particularly interesting as here every target level can be 
achieved with only two possible values: zero or one (see 
Theorem 4.1 and its corollary). In all cases our appioach is 
constructive in the sense that we can supply an algorithm for 
computing a deterministic policy for any feasible target level 
and percentile pair. 

In Section V we turn our attention to problems with multiple 
objectives and/or constraints. The connection of this to the 
problem with sample path constraint of [13] and [14] is dis¬ 
cussed. In Section VI, we show how the techniques developed 
in Section IV extend directly to these problems, provided that 
the problems can be solved for a communicating MDP (see 
Theorem 6.3). The multiobjective/constrained problems are 
considered in detail for communicating MDP’s. with construc¬ 
tive solutions obtained for most cases (see Theorems 6.1 and 
6.2). Conclusions and some open problems are presented in 
Section VII, 

Our analysis is made possible by the recently developed de¬ 
composition and sample path theory due to Ross and Varadara- 
jan [14]. The logical development of the results is along the 
lines of Filar [8], The latter paper, to the best of our knowledge, 
introduced the percentile objective criterion in the context of 
a limiting average Markov control problem, but substituted 
the long-run expected frequencies in place of actual percentile 
probabilities since the decomposition ajid sample path theory 
of [14] was not known at that time (the unusual way of 
evaluating risk in [8] was also pointed out in [16]). Some 
earlier related work appeared in Mitten [12], Sobel [15], and 
Henig [10]. In the remainder of this section we shall introduce 
the notation of the limiting average Markov decision process. 


0018-9286/95104.00 © 1995 IEEE 



filar ntffc umrm average mark&v decision processes 

A fluke MOP, I\ is observed at discrete-time points n = 
li 2, • - . The state space is denoted by S « {1, 2, ■ - *, |S|}. 
With each stale ie 3 we associate a finite action set A(t). At 
any time point n, the system is in one of the stales, and an 
action has to be chosen by the decision-maker. If the system is 
in state i and the action a E A(i) is chosen, then an immediate 
reward r(i, a) is earned and the process moves to a state 
j 6 S with transition probability p tQJ , where p taJ > 0 and 
Pt s 

A decision rule u n at time n is a function which assigns 
a probability to the event that action a is taken at time u. 
In general u n may depend on all realized states up to and 
including time n. A policy (or a control) it is a sequence 
of decision rules: u = (u 1 , « 2 . ■ • ■ , u”, • ■ ■). A policy is 
stationary if each u 71 depends only on the current state at 
time w, and it 1 = li 1 = ■ ■ ■ = u n = • ■ ■ . A pure (or 
deterministic) policy is a stationary policy with nonrandomized 
decision rules. A stationary policy / induces a Markov chain 
P(f) with the transitions P(f) tJ = £ ofej4(l x for 

i, j E 5, where f m is the probability (under /) that action 
a is chosen whenever state t is visited. If P(f) consists of a 
single recurrent class of states, then / is called an irreducible 
policy. If P(f) contains some transient states in addition to a 
single recurrent class, then / is called a unichain policy. MDP 
F is called unichain if every stationary policy in V is unichain. 

Let X n and be the random variables that denote the 
state at time ti and the action chosen at time n, and define the 
limiting average reward as the random variable 

1 N 

J?:=r Jim inf -Yr(X n , A fl ). 

n=sl 

It should now be clear that once a policy u and an initial 
state X x = are fixed, the expectation </;(u, .s x ): — E u [ft \ 
A"i = »i] of P is well defined and will, from now on, be 
referred to as the expected average reward due to a policy u. 
The classical limiting average reward problem is to find an 
opLimal policy u* such that for all policies u 

<t>{u*, ,si) > </j(w, for all s\ £ S, (1.1) 

It is well known (e.g., see [4]) that there always exists a pure 
optimal policy u*. 

II. Problems Relating to Percentile Objective Criteria 

We shall say that any pair (r, a) such that r e R and 
a E [0. 1] constitutes a target level-percentile pair. We shall 
address the following problems. 

Problem 1: Fix u x E S. Given (r, ot) E R x [0, 1] does 
there exist a policy u such that 

Pu{R>r\ X x =*i)>rt? (2.1) 

If (2.1) holds for some policy u, then we shall say that u 
achieves the target level t at percentile c*, and r will be called 
^achievable. 

Problem 2: Given a E [0, 1] find 

r rt : sup {t | r is a - achievable}. (2.2) 


Problem 3: Given r E R find 

a r : = sup {a € (0,1] 13 a policy ti s.t. (2,1) holds}. (2.3} 

Remark 2.1: It should be dear that in many situations the 
natural goal of maximizing the target level will be in direct 
conflict with the goal of maximizing the percentile value, this 
is because r a is a nonincreasing function of a, while a r ■* * 
nonincreasing function of r. 

III. Preliminaries 

We shall develop our results within the framework of 
the decomposition and sample path theory due to Ross and 
Varadarajan [14] (for a related decomposition, see (21). this 
decomposition approach has also proved instrumental in solv¬ 
ing other limiting average MDP problems with nonstandard 
criteria (see 114] and (3|). In this section we collect some 
results from 114] that will be needed for the proofs in the 
subsequent sections. 

In [14j it is shown that the state space 5 has a unique par¬ 
tition Ci, (7 2 , • • •,C/ v -, T, whose properties are summarized 
below. 

Theorem 3.1 (Proposition 2 of 114]): For any policy ti, we 
have 

K 

5>«(*t I -Y, . m) = 1 

*=t 

where 

= {A'„ e C\ almost always) 

(where “almost always” means that 0 C* finitely often). 

The sets Ci,- ■, C\ and T are referred to as strongly com¬ 
municating classes and the set of transient states, respectively. 
For a given strongly communicating class (7*. denote by f(fc) 
the MDP restricted to C*.. Thus, the state space of T(fc) is C* 
and the action space ,4*-(i), i e Ct, is given by 

'MO = fa € A{i): p lttJ =0 Vj $ ( 7 *}. 

From [14| we know that 4 *,(*') is nonempty for all t 6 
Ck and that T{k) is a communicating MDP. Recall that 
a communicating MDP is such that for any pair of states 
i, j £ S, there is a pure policy under which j is accessible 
from i. Now consider the following linear program LP(k) 

max ^2 53 a ) x ><* 

S.t. ^ ^ ~~ Ptaj)^w ~ A j E 

53 53 x *» = 1 

a£ (*) 

•£*a 5: 0, i E Ck , o E Ak{i)- 

Lei Vk denote the optimal objective function value of LP(k). 
We can now state die following result. 
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theorem 32 iLemma 4 [14]): For all policies u, all initial 
states «i e S, and all k = 1, ■ ■, K, we have 

P U {R £ V* | Xi *«i)pl 
whenever Xi » aj) > 0, 

iv! Basic Results for the Single Reward Case 

We shall first solve Problems 1-3 for die case of T being 
a communicating MDP. In this case, there is one strongly 
communicating class Cl, and T is empty; thus, S - C\. 

Consider then LP{ 1) and to simplify notation denote v. = 
v*. Also let {x* 0 } be an optimal basic feasible solution of 
LP{ 1) and g* be a stationary optimal policy constructed from 
{*!„} (e.g„ see [11], [9], or [13]). Then g* satisfies 

</>($*, «!) = v, Si € S (4.1) 

where v is the maximal objective function value in LP( 1). 
Moreover, the Maikov chain P{g*) associated with the policy 
g* has at most one recurrent class plus (a perhaps empty) set 
of transient states. 

Theorem 4.1 : In a communicating MDP T there exists a 
policy that achieves the target level r with percentile « if and 
only if r < v, If r < v, then the pure policy g* achieves the 
target level t with percentile a, for any a e [0, 1] 

Note: This result (at least for the unichain case) seems to 
be well known in the “folklore” of MDP's. A proof for the 
communicating case can be found in Asriev and Rotar’ [1] 
(who establish this result for a much more general stochastic 
dynamic control model). Since our proof is quite simple, we 
present it here for completeness. 

Proof: Since g * gives rise to a Markov chain with one 
recurrent class, we have 


(e,g., see [13, Proposition 1 (iii)|). Combining this with (4.1) 
gives 


P r {R - v | Xi - sj) = 1. (4.2) 

From Theorem 3.2, we have 

Pu{P<v |Xj=»,) = l. (4.3) 

The result then follows from (4.2) and (4.3). 

As a direct consequence of Theorem 4.1 we have the 
following corollary. 

Corollary 4.1: In a communicating MDP T, r a = v for all 
a 6(0, 1], moreover 

_ (1 if < v 
f * r — \0 if r > v. 

Problems 1-3 have now been solved for the communicating 
MDP’s. We return to the general case, where we have strongly 
communicating classes , • • ■, C*- and the set T of transient 
states. Denote by gl the pure policy of Theorem 4.1 associated 
with r (k), the MDP restricted to CV 


Corollary 42: For a fixed k g {lr • •, K}, let g be a pure 
policy that coincides with g k on C k and is defined arbitrarily 
elsewhere. Then 


Pg(R 5= Vlt | X\ — «l) = 1 


if P g (* k , X l = Sr) > 0. 

Proof: Note that the definition of implies that C k 
must be reached in finite time (P g — a.s.). Thus, the result 
follows easily from the proof of Theorem 4.1. 

NexL we shall consider a fixed target level r and associate 
with it an index set J T =* {fc: 1 < k < K, v k > t}, and an 
auxiliary “0-1 MDP” r r , whose states, actions, and transition 
probabilities are the same as I\ but with rewards defined by 



if t e C k and k 6 I T 
otherwise. 


It is easy to see that for an arbitrary policy u, the expected 
average reward in T t is given by 


<A T («. «i) 


= E U 


N 


lim jv— ou inf “^ ^ l{X n € C\) | X\ = s\ 

n=lk£l r 

= |*t =*t) (4.4) 


fc€/r 


where 1( ) is the indicator function. Note that the last equality 
above follows from Theorem 3.1 and the definition of 
since P u - a.s. after some finite time / we must have X n £ Ck 
for all 7i > t and some A- € {1, ■ ■, A'} 

Theorem 4.2: Let g* be an optimal pure policy in T r which 
coincides with gl on Ck for k £ I T [ There exists a policy a 
satisfying 


P U {B >t \X\ = .si) > (\ (4.5) 

where n is the percentile, if and only if <t> T (g*, t*\) > (v. 
Further, if the target r can be achieved at percentile r*, then 
it can be achieved by the pure policy g*. 

Proof ■ From Theorem 3.1 we have that for any policy u 


K 


Pv(P >r I Xl = > T I **., Xl = *0 


fc = 1 


Pu(*fc I =-i). (4.6) 


From Theorem 3.2 we have 

Pu{R > T I $*, Xl = flj) = 0 k#l T . (4.7) 

From Corollary 4.2 

1 = Pq*{R > r | X\ = 

> P U (P >T\$ k ,Xi = s x ) k<tl T (4.8) 

1 Note thai there is no loss of generality here, because gl ensures that once 
the process enters Q it remains there forever. Thus it yields the maximal 
reward of one for every slate i € C*, k € U . 
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when the inequality follows from die optimality of & far 
r(fc). Combining (4.6H4.8) gives 

< Pu(R >r|^ *4) <P t .(R> r | Xi = * x ) 

= j> 9 .(* t |X]=ei) 

k€l r 

from which the result follows. 

It is important to note that Theorem 4.2 provides a con¬ 
structive answer to Problem 1 of Section II concerning a- 
aehievability of the target level r. We shall now address 
the problem of determining r a —the maximal achievable per¬ 
centile for the fixed level r. Towards this goal we assume 
without loss of generality that the strongly communicating 
classes C \, • ■ ■, Ck are ordered so that 

Vl > V2 > ' " > VjK ■ (4.9) 

Recall the definition of the MDT (here u* is the target 
level). To simplify the notation, we will refer to F Vk as T*. and 
to the corresponding expected average reward as </>* instead 
of <p Vk . 

Theorem 4.3 Let gl be an optimal pure policy for MDP 
Tk chosen as in Theorem 4.2. Wc have for a e(0, 1] that 

r a = t*: = max {| <//(<;£, «,) > a, k = 1, • • ■, A'}. 

(4.10) 

Proof: Let l be the largest index that achieves the maxi¬ 
mum in (4.10), l is well defined since <f> K {g * h , * 1 ) = 1. Since 
t* = vi , we have / r . = {1, 2,- ■,/}. Thus, from Theorem 
4.2, we know that 

/’ 9 ;(i?>r* \Xi =«])>«. (4.11) 

Hence, t* is ^-achievable, implying that r n > r*. If strict 
inequality were possible in the preceding statement, then there 
would exist a r' > t* and a policy n such that 

Pu(R>r'\X l =*!)>«. (4.12) 

Now let 

m: = max {k\ ok > r'} 

noting that if Vk < r' for all k = 1, ■ - •, K, then the left side 
of (4.12) equals zero contradicting the hypothesis a > 0. By 
the definition of m we have 

v m >r‘ >t*. (4.13) 

Applying Theorem 3.1, Theorem 4.1 and optimality g* n to 
(4.12) yields 

K 

ft < > t' | * fc , X x = S])P u (4> fc | X x = «,) 

m 

= £>„(* > r' | x x = *i)P u ($ fc I Xi = •,) 

fc= 1 
m 

< I Xi = *i) 

= (4.14) 


* 

But. by the definition of t\ (4.14) implies r* > which 
contradicts (4.13). < 

Corollary 43: The maximal a-achicvabie target level, r 0 . 
is a monotone nonincreasing step-function of or, defined on 
the interval (0,1). 

Proof: Choose gl as in Theorem 4.3. Let 
<f> k (gl . «i) for A; = 1, • • ■, K, so that 0 < «i < «2 £ >• • < 
ax = 1. If we define to: = «i, then by Theorem 4.3, r a =s m 
for all a 6(0, ft]]. Similarly, r„ = r*, a constant for all 
a 6(ft*., ft/k+i], where r* > t*+i for each fc * 1. • * •, K - 1. 

Corollary 4.4: Choose gl as in Theorem 4.3. The maxi¬ 
mum percentile n T for a given tatget level r, is a mono¬ 
tone nonincreasing step function of r defined in the interval 
[tt/i, vi]. In particular for r €(«]+], ?>*] we have 

ftr = <t> k (.Ql, si) 

for each k — 1, • • •, K - 1. 

Proof: This follows easily from the monotonicity of 

J, «i) in the index k. 

Remark 4.1: Corollaries 4.3 and 4.4 demonstrate the 
strength of the percentile objective criteria. Namely, the 
decomposition of states into C’i, ■ • ■, Ck and T, and 
the subsequent computation of policies gl together with 
“breakpoints” t*, and v* for r„ and n T , respectively, allows 
for a flexible and practical evaluation of gain-risk trade-offs 
in an average reward MDP. 

In view of Corollaries 4.3 and 4.4 the only “reasonable” 
choices of tv and r are of the special form (r, tv) with r = r a 
and tv = ft T ; these correspond to Pareto-optimal solutions. 

The preceding results are summarized in the following 
algorithm, which (for a fixed initial state jm) finds all target- 
level-percentile pairs of the indicated "special form" 

Step I: Apply the algorithm of [14] to find the decompo¬ 
sition C], • • ■. Ck, T. 

Step 2: For each k € {1, • ■ •, K }, find the value m of T(A). 
Order the strongly communicating classes so that 

> <’K- 

Step J. For each k € {!.■••, K }, form the MDP T* and 
find the optimal policy r/J'. Let ft] = <j> k {yl, #i). 

Step 4: Let .7 = {(«».«*) | k = 1, If 

(ti, ft), (t 2 , ft) 6 .7 and t x > rj, then elimi¬ 
nate (r 2 , a) from J. Continue until no further 
eliminations are possible. 

Step 5: We have constructed the set .7 = {(r, a) =s 
(t„, u T ) | r 6 R. ft G (0, 1]]. 

Note that Step 2 is not hard computationally, since very 
efficient algorithms are known for communicating MDP's. 
In Step 3 one should use the aggregated MDP method of 
114], where each strongly communicating class is replaced 
by one state. In addition to computational efficiency, this 
method will automatically yield a deterministic optimal policy 
satisfying the conditions of Theorem 4.2. Also, since when k 
is incremented by one, only one immediate reward changes in 
the aggregated problem (the rest of the data stay the same), 
perhaps a parametric solution method can be used (e.g. by 
using LP algorithms for solving the aggregated problem). 
Further computational efficiency can be gained by quitting 





IEEE TRANSACTIONS WAUTOMATIC CONTROL, VOL. 40, NO. I, JANUARY 1999 


Step 3 as soon as a* # i for some k (since all subsequent 
a*'* are automatically equal to one). Note also that steps l 
and 2 need not be repeated for different storting states. Finally, 
m note fhet before starting die algorithm, one should check 
whether MDP r is communicating (easily verifiable conditions 
cm be found in (9]). If so, the complete characterization is 
immediately available from Corollary 4.1. 

' V. Problems with Multiple Rewards 
and Constraints—Formulations 

A natural extension of the percentile objective criteria is to 
the case Where each action carries multiple immediate rewards, 
i.e., each immediate reward is actually a vector of some fixed 
length L 


r(i, a) € R L for i 6 S, u G A(i). 


policy u to be u = (u 1 , g) (i.e., u uses die decision rule it 1 at 
time 1 and follows g thereafter). It is easy to see that 


Pu 




04 


and thus u achieves both the specified target-levels at the 
corresponding percentiles. In fact the same conclusions apply 
for any n, r 2 € ( 0 , 1 ). 

Under the second approach to a problem with multiple 
rewards, a separate target level is specified for each compo¬ 
nent of A, and all target levels are required to be achieved 
simultaneously at a single percentile level a. 

Problem 5: Fix $i € S. Given a vector r € R L and 
a 6 [0, 1], does there exist a policy v such that 


The definition of the limiting average reward in Section I leads 
to a vector A of random variables. 

At this point two approaches can be taken. Under the 
firat approach, a separate target level-pcrcentile pair would be 
specified for each of the L components of A = ( R\ , ■ ■ ■, Ri). 
This leads to the following multi-objective version of Problem 
1 (of Section II). 

Problem 4: Fix aj € S. Given a pair of vectors (r, a) € 
R^ x [ 0 , 1 ] L , does there exist a policy v such that 

Pu(Ri > ri | Xi as *i) > m for / = 1 ,■ ■,LI 


This problem appears to present serious difficulties and is 
presently unsolved. Below we present an example showing that 
stationary policies do not suffice for this case, thus indicating 
that it is unlikely that a simple extension of the results and 
methods developed in Section IV will work in this case. 

Example 5.1: Consider the following MDP with L = 2 


r(l, 1 ) = ( 1 , 0)1 ;>ui = 1 
r( 2 , !)*((), 1 ), P 2 i 2 = l. 


/ = 2 

r(l, 2) = (0, 0 ), pi 22 = 1 


Thus action 1 is absorbing in both states, and action 2 in state 

1 results in immediate rewards of ( 0 . 0 ) and transition to slate 

2 with probability one. 

Suppose the initial state is one and that the target level- 
percentile pairs are ( 5 , 5 ) for each component of the reward 
vector. Note that for any stationary policy / 


PfUh » 1 I JTi = 1) = 1, 

Pf(Ri = 0 I X l « 1 ) = 1 , if fn = 1 


and 

P f {Ih = 0 | Xi = 1) = 1, 

P f {R 2 = 11^ = 1)^1, if fn < 1. 


Therefore, no stationary policy suffices. 

Let g be the deterministic policy that takes action 1 in each 
state, and let u 1 be the decision rule that takes actions 1 and 2 
with probabilities 1/2 each in state 1 . Define the nonstationary 


Pu{Pl > Tin l = U •■■»£ | Xl = .Si) > a? (5.1) 

Intuitively, the difference between the two approaches is that 
in Problem 4, the requirements are placed on the marginal 
distributions of the components of A, while in Problem 5 
the requirement Is placed on the joint distribution of A. 
Henceforth, we will refer to these two approaches as the 
“marginal-probability’' and the “joint-probability” formula¬ 
tions, respectively. While the marginal-probability formulation 
provides for more modeling flexibility, the advantage of the 
joint-probability formulation is that it leads to more tractable 
problems: most of the results obtained for Problem 1 will be 
extended to Problem 5. 

We note that an extension to multiple rewards is especially 
important, since some of the components of r(/, a ) can be re¬ 
garded as negatives of the costs. In this case, the corresponding 
components of R and r can be regarded as constraints which 
must be satisfied at the specified percentile level (either singly 
in Problem 4 or jointly in Problem 5). This can be seen as 
a generalization of the “sample path constraint” of Ross and 
Varadarajan [13) and [14| (which in our notation corresponds 
to a =■ 1 and L = 1 ). 


VI. Basic Results for the Multiple Rewards Case 

In this section we consider Problem 5 for the case of com¬ 
municating MDP’s. We employ the linear-programming-based 
techniques developed in [ 6 ), [11], and [13] and introduced 
in Section III above. It should be noted that the problem 
considered in this section is related to the problem of finding 
optimal policies in average-reward communicating MDP’s 
with state-action frequency constraints. It is well known (see 
e.g., [7, example 9.31) that stationary policies may not be 
optimal in this case. For a problem with one inequality 
constraint, [13] shows that an e-optimal stationary policy can 
always be found (the policy is generally not deterministic). 
This result is related to Theorem 6.2. 

In the sequel, all vector inequalities and equalities are 
defined componentwise, that is A > r means Ri > tj, 
l = L. 
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We first state some preliminary definitions and results. 
Consider the following polyhedral set 

{ SoeAW Jr,a ~ Sj€S Soe.4(j) P)at T J<f » £ S 

WfS IdaM(i) 1 

®»o 2? 0, j € S. o € A(i) 

(6.1) 

(this is simply the feasible region of LP(k) of Section III). 
For x = (a‘, 0 ) € A, define a stationary policy 

f V Jw - if i £ S’, £ a € A(i).r„ > 0 , 

I LutM 'I 

u € A(<) 


f(«). a = 


■RTfT 


if / € A, 53 m€ 4(,) fio — 0. 


( 6 . 2 ) 


Define the following random variable (whenever it exists), 
representing long-term average stale-action frequencies 

1 T 

Z la = ^liiii^ —5^1 [X t - i, A t = ft]. (6.3) 


i=i 


We will need the following well-known results, the proofs 
of which can be found in [ 11 J and 113J. 

Lemma 6.1: Let T be an arbitrary MDP. 

0 If f is a stationary policy then Pj -almost surely, Z is 
well-defined, Z e A, and 


Proof: 

i) The proof is analogous to the proof of Proposition 2 in 
U4]. 

ii) If b* > 0 then /* is irreducible by Lemma 6.1-iv) and 
thus, by Lemma 6.1-iii), x* — Z Pf- -a,s. It now follows 
by Lemma 6.1-i) and the feasibility of m* that 

R > TPf - a.s. 

This result is a counterpart of Theorem 4.1 for the single¬ 
objective case. When conditions in part ii) uf Theorem 6.1 
hold, it provides a solution to Problem 5 for any n € [0, 1 ], and 
if the condition in part i) holds then Problem 5 has no solutions 
for any r». Unlike Theorem 4.1, however, the current result 
does not provide a complete characterization of solutions. If 
the optimal value b* of LP is equal to zero and fix*) is not 
unichain, then it is not currently known whether Problem 5 
has any solutions or any solutions in stationary policies. The 
following example shows that it is possible to have a situation 
where no feasible stationary policies exist when 6 * = 0 . 

Example 6.1 • Consider the following MDP T with L = 2 

i = l i = 2 

r(l. 1) = (1,0), /Mu = 1 r(2, 1) = (0, 1). p» 12 * 1 

r(l, 2) = (0, 0), = 1 r(2. 2) = (0, 0), p 221 = 1 


R = r ('- 0 ^*"- 

te-Stit 1(0 


where = 1, r = (1/2, J/2) and a > 0. Then LP j$ 
feasible with If = 0 and 


n) If / is a stationary policy and / is unichain, then 

Z ,n = 7T(/),/, a Pf ~ a.S. 

where ir (/) is the unique stationary probability vector 

of P(f). 

lii) If x e A and f(x) is unichain, then 
Z = xPf^x) ” a 

iv) If r is a communicating MDP and x e A, x > 0, then 
f(x) is an irreducible stationary policy. 

Let t be a given target vector and or a given percentile level. 
We now define the following linear program LP 

max b 
Subject to 

]£h€<a( 0 ,r,fl = Har-4(j) Pj<"- r j« 1 ^ ^ 

SaG4(/)‘ r '^ = 1 

Sae^CO “ T/ * = 1, * • •, L 

Tin >b, i € 5, 0 e A(t) 

b > 0 . 

Note that the feasible region of LP is contained in A x R. 
Theorem 6.1 : Let V be a communicating MDP. 

i) If LP is infeasible, then for any policy u 

P u (R>r\ A'i = .si) = 0. 

ii) Suppose LP is feasible. Let (x\ If) be an optimal 
solution and /* = /(**). If b* > 0 , or f* is unichain, 
then 


x* = (-in, r 12 . .raj. .r «) 7 * 0/2, 0, J/2, 0) r . 

Thus f(x *) is not unichain, and m fact it is easy to see that 
for any stationary policy /, I)(R > t) = 0 . It is not known 
whether there exists some nonstationary policy u such that 

P U (R >r)> 0. 

Il might be reasonable to suppose that when If = 0 , the 
presence of feasible stationary policies might be detected by 
checking whether f{x*) is umchain for any optimal basic 
feasible solution x* of LP (this would cover Example 6.1 
where there is only one optimal basic feasible solution). The 
following example, however, shows that it is possible that 
If = 0 , f(x *) is not unichain for any optimal basic feasible 
solution, and yet, feasible stationary policies exist for any 
rv > 0. 

Example 6.2 * Consider the following MDP T with L = 3 

( / == 1) i*2 

r(l,l) = (l,0.0), Pm = l r(2,1)*(0.1,0), pa U s 1 

r(l,2) = (0.0,0), /M22 = l r(2,2) = (0.1.0), p 22; , = 1 

r (l,3) = ( —1.0,0). /Mji = 1 
i = 3 

r(3,l) = (0,0,1), 7.313 =1 
r(3,2) = (0,0,0), 7.322 = l 

and let .<n = 1. Take some n > 0 and r = (1/4,1/4. 1/4). 
It is not hard to verify that the resulting LP has the optimal 
value b* = 0 with only two optimal basic feasible solutions 

** = ( x ll! X12, J'13, J2I, -f22) *31, *32)^ 

= (1/4, 1/4, 0, 1/4, 0, 1/4, 0) T 


P r (R>T\X l = »i) = l. 
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** = (1/4, 0, 0, 0,1/4, 1/4,1/4) T . 

Neither /(**) nor /( x 2 ) is unichain and, in fact, the proba¬ 
bility of achieving r is zero for both of these policies. If we 
take the feasible point 

*' ** 5* 1 + 5* 2 - ( l / 4 ’ V 8 ' °1 VS, 1/8, 1/4, 1/8)* 

then /(**) is unichain and Pf(x*)(R > t | Xi = # 1 ) = 1 
We will call a target vector r for which LP is feasible and 
has optimal value b* => 0 an indeterminate target vector. 

We note that this case does not arise in the case of unichain 
MDP’s (since one of the two conditions of Theorem 6.1 must 
be met), and we have the following corollary. 

Corollary 6.1: Suppose T is a unichain MDP. Then either 
LP is infeasible, in which case Problem S has no solutions for 
any a > 0, or LP is feasible with an optimal solution [b*, x *), 
in which case for / = /(**), Pj{R > v \ X\ = si) - 1. 

In the remainder of this section we show that the difficulties 
associated with indeterminate target vectors can always be 
avoided by a slight relaxation of the target vector and that 
indeterminate target vectors are sufficiently “rare” in the set 
of all possible target vectors. We will need the following result. 

Lemma 62: Suppose T is a communicating MDP. Take 
X € A. Then for any f > 0 and e 5, there exists an 
irreducible stationary policy / such that 

Pj(\\Z-x\\< f I X, = «!) = 1 

(where || • || is an arbitrary vector norm). 

Proof: If x > 0, then / = f(x) is irreducible by Lemma 
6.1-iv), and it follows by Lemma 6.1-iii) that Z - x Pf-a.s. 

Now suppose that x,„ = 0 for some i € S, a 6 A(t). 
Let g be a completely randomized (stationary) policy (i.e., 
9w = for any i 6 5, a € j 4(*)). Since T is a 

communicating MDP, g must be irreducible. By Lemma 6.1 - 
ii), Z = x(g) P,-almost surely, where x{g), a = x {g),g, a for 
i 6 S, a G i4(i). By Lemma 6.1-i), x(g) > 0 and x(g) € A. 
Let 

*(A) at Ax + (1 - X)x(g) for A € (0. 1). 

Clearly, *(A) € A and x(A) > 0 for any A < 1. It follows 
by continuity of *(A) with respect to A that we can choose 
A* 6 (0,1) so that 

ll*(A*) — x|| < f. (6.4) 

Let / - /(x(A*)). By Lemma 6.1-iv), / must be irreducible. 
It follows by Lemma 6,1-iii) that Z = x(A*) P/-a.s. The result 
now follows immediately from (6.4). 

We are now ready to prove the following result. 

Theorem 62: Suppose T is a communicating MDP and r 
is an indeterminate target vector. Let 6 be an arbitrary vector 
with positive components and choose c > 0. Then there exists 


an irreducible stationary policy / such that 

Pf(R > T - | Jfi = Si) = 1, 

Proof: Let (6*, x*) be an optimal solution of LP with 
target vector t. By assumption, b* = 0. By Lemma 6.2, for 
any 7 > 0 , there exists an irreducible stationary policy / such 
that ||ar / - x*|| < 7, where x' = Z is a constant P/-almost 
surely. Choose 7 small enough so that 

5Z a) > T - e6 Pf - a.s. (6.5) 

t€Sa€A(t) 

(this can always be done since the left-hand side of (6.S) is a 
continuous function of x' and Yhes £o€A (0 x *a r (h < l ) > & 

Since / is irreducible, by Lemma 6.1-ii), P/-almost surely 
x' > 0, i.e., x ' > V > 0 for some V 6 R. It follows from (6.5) 
that (6', x') is feasible in LP with the target vector r-e6 and 
consequently, the optimal value for this LP is positive. The 
result now follows from Theorem 6.1-ii). 

Thus, any strict relaxation of an indeterminate target vector 
produces a target vector for which Problem 5 can be solved by 
an irreducible policy for any percentile level nr. Geometrically, 
the situation is as follows: let T = {r | LP is feasible}. T 
must be a closed set. Let 

TS = {t | Problem 5 has a solution in unichain policies 
for any percentile rv e [0. 1]}, 

■ TF = {t | LP has optimal value b * > 0}. 

. 77 = {t | r is an indeterminate target vector}. 

Then TSuTl = X, TF C TS, and TI C boundary^ 1 }. Note 
also that the intersection of TS and TI need not be empty, as 
shown by Example 6.2. We illustrate the relationships between 
these sets in the context of Example 6.1, where 

T = {ti < 1, t 2 < 0} U {n < 0, r 2 < 1} 

U {r x + r 2 < 1, T[ > (1, r 2 > 0} 

(represented by the shaded region on Fig. 1) 

TS = X\{ r i + r 2 = L > 0, r 2 > ()}, 

TI = {ti = 1, t 2 < 0} U {ti < 0, r 2 = 1} 

U {ti + r 2 = 1, r A > 0. r 2 > 0} 

(the boundary of X), and 

TF = X\X/. 

Remark 6.1: In summary, our results for the communicat¬ 
ing case with multiple rewards closely parallel the correspond¬ 
ing results for the single reward: for “most” target vectors, 
either Problem 5 has no solution or else it is solvable by 
irreducible policies for any percentile level. The differences 
from the single reward case lie in the fact that the feasible 
stationary policies might have to be randomized (for single 
reward deterministic policies sufficed), and in our inability to 
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! Fig. 1. The set of passible target vectors for Example 6.1. 


; handle (completely) the indeterminate target vector case, If the 
modeler has some flexibility in setting the target vector, the 
I indeterminate case can always be avoided. In some cases, how¬ 
ever, such flexibility might not exist. Therefore, we consider 
the further study of the indeterminate case to be important. 
Specifically, the following questions should be addressed: 

1) Must nonstationary feasible policies exist when r is 
indeterminate? In particular, do such policies exist in 
Example 6.1? 

2) Can the indeterminate target vectors for which Problem 
5 has a solution in unichain policies (i.e., r £ TInTS) 
be characterized? It would be particularly interesting to 
find a computationally simple characterization or show 
lhal one does not exist. 

Wc now turn our attention to the general (multichain) 
MDPs. Surprisingly, the extension of our communicating MDP 
results to this case can be done quite easily by employing the 
decomposition and sample path theory. 

As in Section IV above, let C\ s * • , , T be the strongly 

communicating classes and the set of transient states of 1', and 
let V(k) be the MDP restricted to (\. Define the index set 

J T — {k | 1 < k < K, The optimal value of LP for V(k) 
is positive}. 


: For k € J r * let /£ be the stationary policy of Theorem 6,1-ii) 
associated with the MDP T(k). Following Section IV, define 
MDP 1\ whose slates, actions, and the transition law are the 
same as F, but with the rewards defined by 


r r (/\ a 



if i € C/.. a £ A(i) and k £ J T 
otherwise. 


( 6 . 6 ) 


Theorem 63: Assume the target vector r is not an indeter¬ 
minate target vector for any F(A:), k = (1, ■ ■, K}. Let /* be 
an optimal stationary policy in F T which coincides with on 
Cy for k £ J r ? There exists a policy u satisfying 


P U {R > r | Xi = j»i) > n (6.7) 


if and only if t*i) > «. Further, if the target vector r 

can be achieved at percentile a, then it can be achieved by 
the stationary policy /*. 

Proof: Exactly the same as in Theorem 4.2. 

This result provides a constructive answer to Problem 5, 
provided r satisfies the assumption of Theorem 6.3. 

2 See the footnote in Theorem 4.2, 


Remark 62: Similarly iso Problem 3 of Section It define 

ol t « sup {<* 6 (0, 1] | 3 a policy u $,l (5,1) holds}. 

It is clear from the proof of Theorem 4.2 that at • $ T (/*, aj) 
and the policy /* constructed above achieves r at percentile 

(*T 


VII. Problems for Further Research 

In the preceding sections we have outlined several prob¬ 
lems for further research. For the multiple reward case, a 
satisfactory treatment of the marginal-probability formulation 
described in Section V would be very useful. Open problems 
connected with the indeterminate target vector case were 
discussed in Remark 6.1. 

Another important open problem associated with the per¬ 
centile objective criterion is the satisfactory treatment of the 
discounted case. Analysis of a similar problem for this case 
can be found in Bouakiz (51; however, no algorithmic results 
were obtained. Problems 1-3 of Section II appear to be 
much harder in this case, mainly because no equivalent of 
the decomposition theory is known (or perhaps, possible) for 
discounted MDP’s. 
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Design and Analysis of Fuzzy Identifiers 
of Nonlinear Dynamic Systems 

Li-Xin Wang, Member, IEEE 


Abstract —In this paper, we use fuzzy systems as identifiers for 
nonlinear dynamic systems. We provide a theoretical Justification 
for the ftizzy identifiers by proving that they are capable of 
following the output of a general nonlinear dynamic system to 
arbitrary accuracy in any finite time interval. The fuzzy iden¬ 
tifiers are constructed from a set of adaptable ftizzy IF-THEN 
rules and can combine both numerical information (in the form of 
Input-output pairs obtained by exciting the system with an input 
signal and measuring the corresponding outputs) and linguistic 
Information (in the form of IF-THEN rules about the behavior 
of the system in terms of vague and ftizzy words) into their 
designs in a uniform fashion. We develop two fuzzy identifiers. 
The first one is designed through the following four steps; 1) 
define some ftizzy sets in the state space U C Rr of the system; 
these fuzzy sets do not change; 2) construct fuzzy rule bases of the 
fuzzy identifier which comprise rules whose IF parts constitute 
all the possible combinations of the fuzzy sets defined in 1); 
3) design the fuzzy systems in the fuzzy identifier based on 
the fuzzy rule bases of 2); and 4) develop an adaptive law for 
the free parameters in the ftizzy identifier. The second fuzzy 
identifier is designed in a similar way as the first one except 
that: a) the parameters characterizing the ftizzy sets in the state 
space change during the adaptation procedure; and b) the fuzzy 
systems and the adaptive law are different. We prove that: I) both 
fuzzy identifiers are globally stable in the sense that all variables 
in the ftizzy identifiers are uniformly bounded, and 2) under 
some conditions the identification errors of both ftizzy identifiers 
converge to zero asymptotically. Finally, we simulate the ftizzy 
identifiers for identifying the chaotic glycolytic oscillator, and 
the results show that: 1) the ftizzy identifiers can approximate 
the chaotic system at a reasonable speed and accuracy without 
using any linguistic information, and 2) by incorporating some 
fuzzy linguistic IF-THEN rules about the behavior of the system 
into the fuzzy identifiers, the speed and accuracy of the ftizzy 
identifiers are greatly improved. 


I. Introduction 

E XISTING identification schemes determine a model for 
a system based on the input-output pairs which are 
collected by exciting the system with an input signal and mea¬ 
suring the corresponding outputs [8], [20], For many complex 
systems, however, an important portion of information comes 
from another source: human experts, who are very familiar 
with the systems and can provide linguistic descriptions about 
the behavior of the system in terms of vague and fuzzy 
words. For example, although we may not know the exact 
mathematical model of a car, we can describe the behavior of 

Manuscript received February 6, 1992; revised January 11. 1994 and April 
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the car as fdllow: “IF apply ‘more* force to the accelerator 
of the car, THEN the speed of the car will ‘increase’/* 
where the ‘more* and ‘increase 1 arc characterized by fuzzy 
membership functions. Although these linguistic descriptions 
are not precise, they provide important information about the 
system. In fact, for many industrial process control problems, a 
human operator can determine a set of successful control rules 
based only on the linguistic descriptions about the processes 
[5], [11J, 113], [17], |38]. Unfortunately, existing identification 
schemes ignore this important source of information and 
cannot incorporate the linguistic descriptions directly into |he 
identifiers. The goal of this paper is to develop identifiers of 
nonlinear systems which combine both linguistic descriptions 
and input-output pairs in a uniform fashion into their designs. 

Fuzzy systems are powerful tools for achieving this goal 
because, on one hand, fuzzy systems are constructed from 
fuzzy IF-THEN rules so that linguistic descriptions can be 
naturally incorporated into the fuzzy identifiers; on the other 
hand, adaptive laws can be developed (as shown in this paper) 
to adjust the free parameters of the fuzzy identifiers to make 
them match the input-output pairs. Specifically, the initial 
identifiers arc constructed from linguistic information, and the 
parameters are then adjusted on-line based on input-output 
pairs. In this way, the fuzzy identifiers combine both linguistic 
and numerical information into their designs. 

The idea of fuzzy systems was introduced by Zadeh [41] 
very early in the literature of fuzzy sets. Research on fuzzy 
systems has been developed in two main directions. The first 
studies fuzzy systems in the same conceptual framework as 
classical systems and has given birth to a body of abstract 
results concerning the stability |4], [14], reachability [6], [25], 
observability [6], [25], etc., of fuzzy systems. The second 
direction is the linguistic approach to fuzzy systems [22], [30k 
143], [44], in which a fuzzy system is constructed from a 
collection of fuzzy rules (propositions) and a fuzzy inference 
engine which uses techniques in approximate reasoning [1], 
[45]. This approach was used to synthesize fuzzy logic con* 
trailers [18], [19], [22] which has been successfully applied 
to a wide variety of practical problems during the past decade 
[5], [11], [13], [17], [30], [40]. We adapt this second approach 
to fuzzy systems in this paper. 

The basic configuration of a fuzzy system is shown in Fig. 1. 
There are four components in a fuzzy system: 

1) Fuzzy rule base which comprises fuzzy rules describing 
how the fuzzy system performs; it is the heart of the whole 
system in the sense that other three components are used to 
interpret and combine these rules to form the final system. 


00I8-92S6/95$04.00 © 1995 IEEE 
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H*.I. Basic configuration of fuzzy systems. 

2) Fuzzy inference engine which uses techniques in approx¬ 
imate masoning to determine a mapping from the fuzzy sets 
in the input space U C R n to the fuzzy sets in the output 
space V c R m . 

3) Fuzzifier which maps crisp points in the input space into 
fuzzy sets in die input space. 

4) Defuzzifier which maps fuzzy sets in the output space 
into crisp points in the output space. Depending upon whether 
there are fuzzifier and defuzzifier, we have two classes of 
fuzzy systems. The first class of fuzzy systems comprise only 
the fuzzy rule base and fuzzy inference engine and therefore 
operate in a pure fuzzy environment, i.e., inputs and outputs to 
these fuzzy systems are fuzzy variables. If there is a feedback 
(as shown in Fig. I by the dashed arrow line), we have the so- 
called fuzzy dynamic systems [4], [14], [32], [33]. The second 
class of fuzzy systems comprise all the four components and 
perform mappings from crisp U c /?” to ensp V c R m . 
In the literature, these second-class fuzzy systems are called 
fuzzy logic controllers [ 18], 119], because their most successful 
applications have been to control problems. In this paper, 
we use these second-class fuzzy systems as identifiers for 
nonlinear dynamic systems. Also, we consider only static 
fuzzy systems. 

In Section II, we present a detailed description for each 
component of the fuzzy system and show the universal ap¬ 
proximation properties of the fuzzy systems to nonlinear static 
and dynamic systems. In Sections III and IV, we develop two 
identifiers of nonlinear systems based on the fuzzy system 
models and study their stability and convergence properties. In 
Section V, we simulate the two fuzzy identifiers for a chaotic 
system and show how identification performance is improved 
by incorporating linguistic descriptions about the system into 
the fuzzy identifiers. Section VI concludes the paper. 


II. Description and Analysis of Fuzzy Systems 

Fig. 1 shows the basic configuration of the fuzzy systems 
considered in this paper. The fuzzy system performs a mapping 
from crisp U C R n to crisp V c R m . We assume that 
V = t/] x• • xU n and V = V\ x• ■ • x V m , where U„ V 3 c R, 
i = 1, 2, • ■ • ,n, and ] — 1, 2, • ■ •, m. We first briefly review 
some basic concepts in fuzzy sets and systems theory which 
are useful in describing the fuzzy systems and then present a 
detailed description for each component of the fuzzy system. 


A fuzzy set [ 42 ] F of a universe of discourse U is 
characterized by a membership function hf : U -*■ [0,1] 
Which associates with each element u of U a number hf( u) in 
die interval [0, 2] which represents the grade of membership 
of u in F. The label F of a ftizzy set is often some linguistic 
term like “small,” “large,” etc.. A linguistic variable [ 44 ] is 
a variable whose values are words in a natural or artificial 
"language, and these words are labels of fuzzy sets. A fuzzy 
IF-THEN rule [ 44 ] is an expression of the form “IF A THEN 
B," where A and B are statements about what values the 
linguistic variables take on. A triangular norm ‘★’is defined as 
a mapping from [0, 1] x [0, 1] to [0, 1], and the most commonly 
used triangular norms are [ 18 ] 

min[x, y\ intersection 

x ★ y = xy algebraic product (1) 

max [0, x + y - 1] bounded product 

where x, y € [0, 1], A triangular conorm “+’ is a mapping 
from [0, 1] x [0, l] to [0, 1], and the most commonly used 
triangular conorms are [18J 

max [x, y] union 

x+y = ,r + y - xy algebraic sum (2) 
min [1, x + .y] bounded sum. 

Let Fj, • • ,F„ be fuzzy sets in X u ■ • ,X n , respectively; the 
cartesian product of Ft,- -,F„, denoted by Fi x ■ x F n , 
is a fuzzy set in the product space X = A', x - x X„ with 
membership function 

/JF.x xn(A,"',J - n ) = /Jr,(j'i)*"-*W n (j n )- (3) 

Let R and S be fuzzy sets in X x W and WxY, respectively; 
the supstar composition of R and 5, denoted by R o 5, is a 
fuzzy set mXxy with membership function 

HR O s(-c, y) = sup u , feW [hr{x, w)+ys(w, y)]. (4) 

Let A and B be fuzzy sets in X and V, respectively; the fuzzy 
implication A —► B, or ‘IF A THEN B\ is defined as a fuzzy 
set in X x Y with membership function 

Ha->b{x, y) = HA{x)+HB(y)- ( 5 ) 

Operations other than ★ may be used in (5) to define the 
fuzzy implication (see 118]). 

A fuzzy rule base R is a collection of fuzzy IF-THEN rules 
R = [R\R 2 ,--,R M ] (6) 

with 

R l : IF (xj is F{ and • • • and x p is F p )', 

THEN { Vl is G[,- ‘ ,y q is G l q ) ( 7 ) 

where £ - (xi,• ■ • ,x„) T and y = (yir•• ,y m ) T are the 
input and output vectors to the fuzzy system, respectively, F, 1 
and G l } are labels of fuzzy sets in (/, and V 3 , respectively, 
1 < p < n, 1 < <7 < m, and l = 1, 2, • • •, Af. It is clear that 
R! of ( 7 ) can be decomposed into a collection of q rules 


(8) 
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where 

Fj: IF {xi is and and x p is F^); 

THEN (*/, is G}) (9) 

j = 1, 2,"-,f. 

A fuzzy inference engine uses the rules in the fuzzy rule 
base to determine a mapping from fuzzy sets in V to fuzzy sets 
in V based on fuzzy logic operations. In the fuzzy inference 
engine, the IF part of Fj defines a cartesian product of 
F/, *, Fp, and the Fj itself is viewed as a fuzzy implication 

F* x * • • x Fp —> G l j. Let A be an arbitrary fuzzy set in U % 
then each Fj of (9) determines a fuzzy set A o Fj in Vj 
based on the supstar composition 

o /?' (j/j) = su Pap€l/[/^A(£)^Mf , 1 / x xFUGj fcl/j)] 

= s up £ et/hU)*/i F ;(xi)* ■ • ■ */if< (i P )t/i G ; (jj)] (10) 

where yj € V 3 . The final fuzzy set A o R 3 (R 3 = 
[Fj, • ■ ■, It™]) in Vj determined by the fuzzy inference engine 
is obtained by combining (10) for / = 1, 2, ■ ■ -, M using the 
triangular conorm + 

Ha o A^Vj) O R){Vj)+ o (H) 

In summary, the fuzzy inference engine determines a mapping 
from fuzzy set A in U to fuzzy set A o F in V by 1 

Ha o n(y) — (ha o H\ (y\)) ■ ■ * * ma o /t™ (i/m)) (12) 

A fuzzifier maps a crisp point r = (j*i, ■ • ■. x n ) T € U 
into a fuzzy set A x = [A Xl , ■ • ■, A Xtt ] in U = U\ x • ■ • x U n , 
where A Vi is a fuzzy set in U,, If A a> is a fuzzy singleton with 
support .r,, i.e., //^ 4 (xJ) = 1 for J'[ = .r 7 and Ha T i (*\) = 0 
for all other .r' € [/,, then we call this a singleton fuzzifier. 
There are other types of fuzzifiers; see [34]. 

A defuzzifier maps a fuzzy set in V to a crisp point in V. The 
defuzzifier is needed because for most practical applications; 
a fuzzy system is required to give a crisp output, no matter 
whether it is used as a controller, a decision maker, etc.. 
Because the output of the fuzzy inference engine is a fuzzy set 
A o R in V, the defuzzifier maps A o R into a crisp point 
y E V. In the literature, the most commonly used defuzzifier 
is the following center average defuzzifier 

EiliVj(^Aon‘(y l j)) 

Vj = - 7-iyT (13) 

where j/* is the point in V } at which (i G i (Vj) achieves its 
maximum value, n A 0 R < {yj) is given by (10), and j = 

1, 2,'■ • , TTl. 

We propose a modified center average defuzzifier, defined as 

_ El»,(#)/<) 

S ‘ E£.(/<A.w,ttS)/<>?) 

where <r' 2 is a parameter characterizing the shape of hg 1 (Vj) 
such that the narrower the shape of Hg 1 } (s/j), the smaller is cr l2 \ 

1 We assume that each for 1 < j < m appears at least once in the R l 
of (7), therefore ft a o r(v) i* an n?-dimensional vector. 


tte «et of ttitezy system 



for example, if y. G , (j/,) = exp[-((y, -pJ)/oJ) a ], then o' 2 is 
such a parameter. The modified center average defuzzifier is 
justified as follow. Common sense indicates that the sharper 
the shape of fi G i(yj) is, the stronger is our belief that the 

output j/j should be nearer to y\ — argsup v / €V ^ (/t G i 
[according to the rule 7?' of (9)1. The standard center average 
defuzzifier, (13), is a weighted average of the 17*’s, and die 
weight /t 4 0 R i (y‘j) determined by (10) do not take the shape 
of i L a' (tfj) into consideration. This is clearly not satisfactory 
based on our common sense. An obvious improvement is the 
modified center average defuzzifier (14). 

Note that if we use the center average or modified center av¬ 
erage defuzzifiers, we do not need to calculate the (i A o n, (Vj) 
of (11); we only need to calculate the n A 0 R i (y.) of (10) in 
the fuzzy inference engine. 

We see that the fuzzy systems of Fig. I comprise a very rich 
class of static systems mapping from U C R n to V c R m , 
because within each block there are many different choices, 
and many combinations of these choices can result in useful 
subclasses of fuzzy systems. If we view all the fuzzy systems 
as a set of functions, then each combination of these choices 
corresponds to a subset of the fuzzy systems, as shown for 
some examples in Fig. 2. We now show the specific functional 
forms of two subclasses of fuzzy systems which will be used 
later as basic building blocks of nonlinear system identifiers. 

Definition i: The set of fuzzy systems with singleton fuzzi¬ 
fier, center average defuzzifier, and product inference is all 
functions / = (/i, • • •, J m ) T from 17 to V of the following 
form 


tf,) 


where £ = x n ) T € U, 1 < p < n, and yj is the 

point at which n G i (y 3 ) achieves its maximum value. Usually, 
/i G i (y‘j ) = 1. (15) is obtained by substituting (10) into (13), 



t4 
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{tejpladng Hr with the algebraic product in (1), and using the 
fact that if 4 h * fuzzy singleton with support g (i.e., 04 (g) = 
l and ma(c') * 0 for all &' t z). *en n Ao #\y,) = 

• • • *n*j(®*)*l*G‘ (jfi). 

Definition 2: The set of fuzzy systems with singleton fuzzi¬ 
fier, modified center average defuzzifier, product inference, 
and Gaussian membership function is all functions / = 
(/j, • • f m ) T from U to V of the following form 


£2W 

nr=i a ! cx p 


)}/w 


nf=i a !«p(-i( 

;^) ! )] / <*!) ! 


which is obtained by talcing p F i(x t ) - a\ exp(-(l/ 2 )((x, - 
SiM'))*). = ex P("U/2)((t/j - p‘)/«!) 2 ). using 

the modifier center average defuzzifier (14) with 0 a 0 r?< (g 3 ) 
given by (10) and dr = algebraic product, and noticing that 
0 a(£) =* 1 and ha(s!) = 0 for all x' ^ x, where 0 < a[ < 1 , 
<r{ > 0, b l > 0 , Tj 6 U„ and yj e V } are parameters. 

The following theorem was proven in [34]. 

Theorem 1. Let Y be the set of fuzzy systems in Definition 
2, and assume that U is compact. Then, for any real continuous 
function ^(g) => ( 51 (g), • • ,ff m (g)) T from U to V and 
arbitrary e > 0, there exist / = (/i(g), •■•,/mU)) T G Y 
such that 

8U P*€t/l/j(£)-flj(£)l < c (17) 

for all j = 1 . 2 , 

Next, we study the capability of using the fuzzy systems 
in Definition 2 to approximate dynamic systems over a finite 
time interval. Consider the dynamic system 


g = Ax + g(x, a) ( 18 ) 

l = h(x, u) ( 19 ) 

Where u. € u is the input, y e V is the output, x e W c R r is 
the state, A is a known Hurwitz matrix, and g, h are unknown 
continuous functions. We make the following assumption. 

Assumption 2 1 For any u in compact U and any finite 
initial condition g( 0 ) = g°, the solution x(t) of (18) satisfies 
|g(/) -g°| < 6 for some positive constant b and all t € [0, T], 
where T is an arbitrary positive constant. 

Define it to be the set 

K - ((g, y,) € JF +n : |g - g°| < b + c, u 6 U] (20) 

where e > 0 is an arbitrary constant, and g° is any finite initial 
condition. Because U is compact and |x| < jg°| + fc + f < oo, 
K is compact. Assumption 2.1 assures that for u € U 
and finite g°, (18) generates (g(f), y.(t)) € K for t € 
|0,T]. Let 5(1, Hie.) = (3,(g ) li|© a ),---,5r(g,u|e 9 )) T 
and /i(g, aJG h ) « (fri(g, a|G fc ), • ■ •, fi m (g, 3 i|©s)) T be the 
fuzzy systems in Definition 2, where © 3 , ©/, are collections 
of the parameters o{, i?J, oj, and gj. Define ©J and ©J be 
such that 


su Pi.a€K St e «) “* a)| < €9 (2D 


StiPg,u€K' |^($* — ^(iEi M)| ^ (22) 

for some arbitrary t g > 0 and ej, > 0. Theorem 1 assures the 
existence of the ©J and ©J. We now show that by replacing 5 
and h in (18) and (19) with die $ and h, we obtain a dynamic 
system whose output can approximate the output of (18) and 
(19) to arbitrary accuracy over any finite interval of time. 
Theorem 2: Consider the dynamic system 

i = 4g + 5(i.al©;) (23) 

^=fi(g,u|©h) (24) 

where g( 0 ) = g( 0 ) = g°, Ji G U, 5 and h are the fuzzy 
systems in Definition 2, and the parameters 9* and 0,* are 
defined in (21) and (22). Then, for any f > 0 , finite T > 0 , 
and properly chosen e g and eft, we have that 

su Pre[o,r] |£(0 “P(0| < f - (25) 

Proof of this theorem is given in Appendix A. 


III. The First Fuzzy Identifier 
Consider the identification of the nonlinear system 

i = f(x) + g{x)u (26) 

where / and g are unknown functions from U = U\ x ■ ■ x U n 
to V = V X x ■ x V n , U t1 V t C R, 1 = 1, 2, ■ •, n, and the 
input it € R and the state x € R n are assumed to be bounded 
and available for measurement. Our purpose is to develop 
an identification model where the / and g are replaced by 
fuzzy systems /(^| 0 /) and g{T\Q g ) and an adaptive law for 
updating the parameter matrices 2 0/ and S g such that the 
identification model converges to the real system (26). First, 
we rewrite (26) as 

Z = /(®|B/) + 5UI©J)u + w (27) 


where 


ul s [/(g) - /(g|©/)] + (gU) - s(d©;)k (28) 


0 } = argmin e/en/ [sup £€l/ 
©J s argmin e>ent [sup £e{/ 


Hi.) ~ /Ul©/) ], (29) 
</U) ~ ff(zl© 9 )|] (30) 


and Slf and tl q are bounded feasible sets of 0 / and @ g , 
respectively. We assume that ilf = [0: tr(00^) < Mf] 
and Slg s [0: tr(00 :r ) < M g ], where Mf and M g are 
given constants. Because of Theorem 1 and the boundedness 
of u> |u/| can be arbitrarily small; therefore, without loss of 
generality we assume that w 0 = sup t > 0 \yi{t)\ is finite. We use 
the following serial-parallel identification model [24] 

f = -a£ + a£ + /(.£|@/) + g{£\& g )u (31) 


where nc is a given positive scalar. The whole identification 
scheme is shown in Fig. 3. The goals of identification are the 
following. 

2 In the first fuzzy identifier we collect the parameters of the fuzzy systems 
into matrices, m the second fuzzy identifier we collect the parameters of the 
fuzzy systems Into vectors 
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00<O) are determined ldenuAcguon Model 

based on the linguistic 
descriptions about f(x) 
and g(x) 

Fig. 3 Identification model using fuziy systems. 

Identification Goals: Specify the fuzzy systems /(^|0/) 
and g(x\Q g ), and develop an adaptive law for the parameters 
6 / and 0 9 such that: 

a) all signals involved in the identification model must be 
uniformly bounded, i.e., it must be guaranteed that i E L^ 
tr(0/0j) < M/, and tr(0 5 0j) < M u (the input u and the 
system state x are uniformly bounded by assumption), and 

b) the error e = x - x should be as small as possible. 

To emphasize the point that our fuzzy identifiers can make 
use of linguistic descriptions about the unknown system (26), 
we make the following assumption. 

Assumption 3.1: There are the following linguistic descrip¬ 
tions about the unknown functions f(x) and g(x) 

R'j: IF j'i is A\ and ■ • ■ and x n is A r n , 

THEN f\{x) is C \, ■ ■ •, / t ,U) is C' n (32) 

and 

Eg'. IF x i is D[ and - and x n is 

THEN tn U) is Dl , • - •, g„ (r) is D* (33) 

respectively, where A\ and B* are fuzzy sets in V„ C'f and 
D* are fuzzy sets in V - , which achieve membership value one 
at some point, r = 1 , 2, • ■ ■, Nj, and s = 1 , 2 , • • ■ , N g . We 
allow Nj = N g = 0 which means that there are no linguistic 
descriptions about / and g. 

Design of the First Fuzzy Identifier 

Step I: Define m, fuzzy sets Ff l in U, such that for any 
x t e U,, there exists at least one p F }>(x,) ^ 0 , where 
i = 1 , 2 , • • •, n, and ji - 1 , 2 , -, m,. We require that these 

F /”s include the Aj"s and B ’’s in (32) and (33). These 
fuzzy sets are fixed and will not change during the adaptation 
procedure of Step 4. 

_ Step 2: Construct the fuzzy rules base of fuzzy system 
/(sl©/) which consists of the following H " =1 m « m * es 

R 1 : IF xi is F / 1 and • ■ • and x n is F^ n , 

THEN h (*) is G \, • ■ •, /„(£) is G' n (34) 


is 


where F/' are defined in Step 1, G\ are fuzzy gets in 
Vi with p G ‘{Q\j) = 1 for some parameter 6\ } € V ti 

©/ = &/ = 

jt = 1 , 2, • • ’, m„ * = 1 , 2 , -, n, and I * 1 , 2 , ", JXm-i m ' 

with each l corresponds to a combination of (jl,- ■ •, jn). 
Because the fuzzy rule base consists of all the possible rules 
concerning the fuzzy sets F/' of Step 1 which include the 
A*’$, it includes the linguistic descriptions in (32). 

The initial parameters are chosen as follow: if 

the IF part of Gj in (34) agrees with an IF part in (32). 
choose 0 ^( 0 ) = argsup v , eV ', (»«)]: otherwise, choose 
(0) arbitrarily in V,. Therefore, we incorporate the linguistic 
descriptions into the fuzzy identifier by constructing die initial 
fuzzy identifier based on these descriptions. 

The fuzzy rule base and initial parameters of y($\Q g ) ate 
determined in exactly the same way as /(£| 0 /); we omit the 
details. 

Step 3: Choose /(r|0/) = (/iUI£i/)> • ■ - J*(x\Lf)) T 
and fl(x| 0 9 ) = (si(a:| 0 1(| ), • • ■ ,.o„(a:| 0 nfl )) T to be fuzzy 
systems with singleton fuzzifier, center average defuzzifier, 
and product inference (Definition 1) based on the fuzzy rule 
bases constructed in Step 2 


/.Ul If) = 


E nii 

jl = l 


’ IZjri’Ll (^1 ) * * ‘ l l Fi" (*»»)] 


E mt 
j 1=1 




(35) 


5Zj 1 =J ‘ * ’ ]Cjn"=l 0[ g [lt F o{Xl) * • ■ 


* / I/. \ _ j r H » t *tt ' ' 


(36) 


where ? = 1 , 2 , • ■ *, n, and f = 1 , 2 , • ■ ■, nT=i m » with each l 
corresponds to a combination of (jl, ■ ■ •, jw). Because of the 
way we define the fuzzy sets F / 1 in Step 1, the denominators 
of (35) and (36) are nonzero for all x E U. Defining the fuzzy 
basis function [39| 


V U) ” - Um ^ - r- /~ \ — r W < 37) 

zZjl = l ' ‘ ’ Ylyn=zl[l l F{ x ( x l) * * ’ Vf^ 

(/ is defined as in (35) and (36)], collecting them into an 
n,~i m % * 1 vector and collecting the 0 l lf and 8[ g into 
nr=i m t xl vectors 8 lf and d, g in the same order as p(x), 
we can rewrite (35) and (36) as 


f'Ulhf) = (LlfPU) ( 38 ) 

and 

S.(£|£. 9 ) = CgE(&) (39) 

respectively. Recall that 0/ s (&/,••■,£„ f) 1 and 0 9 = 
- ■ ,i„ g ) T and substitute (38) and (39) into (31); the 
fuzzy identifier becomes 

i - -ax + Q!£ + 0 f p(z) + 9gP(z)u. (40) 
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, Step 4: Update die parameter matrices 0/ and 0„ using 
idle following adaptive lew 


liefis) tf («*(©/©/) < Mf) 
a or (tr (0/0j) » Mf and £ T 0/g|j£) < 0) 
W/SS P[ 7 i«£ T (£)}if{tr{e/eJ) 

= M/ and & T Q/p(k) > 0 ), 


C4D 




^Sf( £ )uif(tr ( 0 t 9 f)<M a ) 

or (tr (0 fl 6j) = M g and e r 0 s g(£)u < 0 ) 
P{7a£P T (a)«] if M© 9 ©J) 

* M B and e r 09 P(£)u > 0 ) 


where 71 and 72 are positive constants, P[*] is the projection 
operator [7] defined as 


P[l\S^{$)] = 7iS£ T ($) “ 7i 1*$] *,, (43) 

P[72Sg T (i)u] = 72 £P T (s )u - 72©s (44) 


£ = £ - i, and the initial 0 /( 0 ) and 0 g (O) are determined 
in Step 2. 

Properties of the above fuzzy identifier are summarized in 
the following theorem. 

Theorem 3: The fuzzy identifier (40) with the adaptive law 
(41) and (42) guarantees the following properties: 

a) tr(e/0p < M f , tr(0 9 0j) < M g . and x 6 

b) there exist constants k\ and le? such that 

/ |e(r)| 2 dr < k\ + k 2 [ | u >( t )| 2 dr (45) 
Jo Jo 

for any t > 0 , where w{t) is defined by (28); and 

c) if 1 £ € 1/2. then limt_.ee |e(t)| = 0 . 

Proof of this theorem is given in Appendix A. 

Remark 3.1; From'c) of Theorem 3 we see that to have 
limt_oo |£(f)| — 0 , we require that w is squared integrable. 
From the definition of w (28M30), we see that w is the sum 
of die minimum approximation errors of / and g to f and g 
over the entire state space U. From Theorem 1, this minimum 
approximation error should be small if we properly choose 
the fuzzy systems / and g. 

Remark. 3.2: From Step 2 we see that the linguistic descrip¬ 
tions (32) and (33) about the unknown functions f(x) and 
g[g) are used to construct the initial fuzzy identifier. If these 
descriptions provide good pictures of f(x) and 17 ( 1 ), we can 
expect that the adaptation procedure will converge fast, be¬ 
cause the initial identifier constructed from good descriptions 
should be close to the true system. If no linguistic descriptions 
are available, the fuzzy identifier becomes a regular nonlinear 
identifier, similar to the radial basis function [3], [27] and 
neural network [28] identifiers. 

Remark 3.3: In this fuzzy identifier, we fix the fuzzy sets 
in U and consider fuzzy systems whose IF parts are concerned 
only with these fuzzy sets. An advantage of doing so is that the 
fuzzy systems in the fuzzy identifier are linear in the parameter, 
therefore: I) we were able to use a relatively simpler adaptive 
law to update the parameters, and 2) convergence of the 


adaptation procedure is expected to be faster because we are 
not concerned with complicated nonlinear search problems. 
A disadvantage is that we have to consider all the possible 
combinations of the fuzzy sets in U, because there fuzzy sets 
cannot change so that we should have rules to cover every 
region of U, where by ‘cover’ we mean that for each £ 6 U 
there should exist at least one rule in the fuzzy rule bases 
of / and g whose ‘strength’ (p FJ i(xi)■ ■ -p F ^n(x n )) is not 
very small. Therefore, if the dimension n of a problem is 
large, then we have to choose the m,’s relatively small so 
that the computational requirements of the fuzzy identifier 
do not exceed the capability of computing sources available. 
Clearly, the bigger the m,’s, the more the rules, and therefore 
the smaller the minimum approximation error t£, which in 
turn means the smaller the identification error £ [see (45)]. In 
practical applications of this fuzzy identifier, we have to find 
a compromise between complexity and accuracy. 


IV. The Second Fuzzy Identifier 

As discussed in Remark 3.3, the fuzzy identifier in Section 
III may require a large number of rules for higher dimensional 
systems. An obvious way to overcome this rule explosion 
problem is to allow the fuzzy sets in U also to change during 
the adaptation procedure so that in principle any rule can cover 
any region of U\ therefore, we only need a small number 
of rules. This is the basic idea of the fuzzy identifier in this 
section. The price paid for this additional freedom is that the 
fuzzy identifier becomes nonlinear in the parameter, so that 
we have to use a more complicated adaptive law. 

We consider the identification of the same system (26) 
and use the same serial-parallel model (31) (Fig. 3). The 
identification goals remain the same as in Section III, and we 
still consider Assumption 3.1. The design of the second fuzzy 
identifier, however, is quite different from the first one. 


Design of the Second Fuzzy Identifier 

Step 1. Define 2 N fuzzy sets Fj and G[ in each (/, of 
U and 2 N fuzzy sets E\ and Jfj in each V, of V with the 
following membership functions 


p F i{x t ) = exp 

= e *P 

dE[(Vi) = ex P 
VH[{y*) = exp 


1 ~ • r !/) z 
. 2 (*[,)* + r 

1 (a:, - x[ g ) 2 

. _ 2«,) 2 + e_ 

1 (i/» -i/!r) 2 l 
2 ( 0 2 + *. 
1 (If.-gilt) 8 ] 

r2<oij*+‘j 


(46) 

(47) 

(48) 

(49) 


where c > 0 is a small constant, l = 1 , 2 , • ■ ■, N (in general, 
N 4, 'ti € ll„ y% € V„ t — 1 , 2 ,'**, n, and x % j, 

o‘ f , x‘ s , trig, y[ p , of P i y\ h , and o[ h are free parameters which 
will be updated in the adaptation procedure of Step 4. The 
purpose of adding the small e > 0 to the fuzzy membership 
functions (46M49) is that even if the <r’s= 0, the fuzzy 
membership functions are still well defined. This modification 
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will make the adaptive law simpler because we do not require 
tile tr’s ^ 0. For this fuzzy identifier, we assume that the 
membership functions of A|\ B% C', and D\ in (32) and (33) 
are in the form of (46H49), respectively, and that N > Nr 
and N > N g . 

Step 2: Construct die fuzzy rule bases of / and g as 
R\‘- IF xi is F( and • ■ ■ and x„ is Ff,, 

THEN hU) is El---. /„(*) is El (50) 
and 

R l g ' IF xi is G[ and • • ■ and x„ is G l n , 

THEN fli(x) is H l v ■ ■ ■ ,g„(r) is H‘ n (51) 

respectively, where Fj, E[, G{, and H[ are defined 
in Step 1, and l = 1,2, • -,N. Choose f{s\0f) - 
(hU\df)r--J n U\i f )) T and = (<hUI£ fl )-• • ■ . 

ffn(£|£„)) to be fuzzy systems with singleton fuzzifier, 
modified center average defuzzifier, product inference, and 
Gaussian membership function (Definition 2) based on the 
fuzzy rule bases of (50) and (51), respectively 


f,U\i f ) 


eZi t, [rr;„«p (-1%^)] ) 2 +<i 

/ ko j +- 


6 UIL) 


Ef.A[m.i«pjj / IW,) , + < 


1 


2l±_ _ gig - /« 
d< b t ((<rj e )* + f) 2 ’ 

SL _ t 

&x\f ~ K («r' e )* + e‘ + (' 

df, Vl^l 1 i 2 («j-Ji/) a gjj 

d<t\ } ~ ft, «) 8 + f ((ffj/) a + c) 2 


! "S-TST^, 


&. = /((<x!,.) a +o 


l ss 1 . 2 , •••,#, and /, j ~ 1 , 2 ,•••,«; and (dgj&g J A ), 
(dg,/d(T l lh ), (dgJXE 1 ^), and (dgJOa 1 ^) are computed in 
exactly the same way as (54M57), respectively, with the 
subscripts “e” replaced by “ft” and “/” replaced by "g." 

Step 4; Update the parameter vectors Of and 0g using the 
following adaptive law 

if (\0 f \ 2 < Mf) 


(|g / | 2 = M / and£ I (^-) r g / <o) 


0 /(£l«#) 


if (|£/| 2 = Mf and L ,r (g^) if > 0 ^, 




where Of and 0 q are 4 nN x 1 vectors which are 
collections of the free parameters. Specifically, Of = 

{if It ■ * * ’ » Unt ’ * * ’ i 2/1 1 ’ ‘ ’ i Vne i ^lf > ’ ’ » ’ ’ * » ^*1 r ' ’ ‘ ’ ’ 

_ ... -71 ... "F* ... -rW_. zrl ... rrl ... 


^nc ’ lT l/’ ’ ' ‘ 1 J n/i ' * ' » r l/’ ‘ ’ ‘ G \/»■“’ ^n/i ’ " ’^1/’ 

an ^ * s defined exactly the same way as 
Of with the subscript l V’ replaced by “h” and replaced 
by “p”. Different from the first fuzzy identifier where we 
collected the free parameters into matrices 0 / and 0 g , 
we collect the free parameters into vectors 0 f and 0 g in 
this second fuzzy identifier. We still use the serial-parallel 
identification model (31), with /Gr| 0 /) and g(x |0 fl ) replaced 
by the f{z\0f) and of (52) and (53), respectively. 

Step 3: Compute the gradient matrices {df{sc\O f )/dO f ) = 
((df\ (x\O f )m f ), • ■ •, (df n (x\0 f )/d0 f )) and 

(dg(3L\i g )/ dig) = mMi s )/dl g ), • • ■, (d9n{SL\l g )/dlg)) 
using the following back-propagation algorithm which is 
derived in Appendix B 


M( 0 2 + £ )’ 


12-^ni \n\lg\ 2 <M q ) 

( l^l 2 = M o and <?($-) i„v < °) 

- p[„ 2 ^! £ .] " <6,) 

if (l«,l a = M,a,dr I -(|L) 7 »,»>o) 

where 71 and 72 are positive constants, P[*] is the projection 
operator [7] 

Of _ df -t a 

71 W.- = 7l - ~ 71 i T. 


p I dg 1 dg £ 

^172—eu| =72^-f«-72- 


(^) 


(63) 


£ = £“£. and the initial parameters fi/( 0 ) and £«( 0 ) are 
determined in the following way: for rules R l f and Iv q in (50) 
and (51) that agree with the linguistic rules (32) and (33), set 
their initial parameters equal to the corresponding parameters 
in the linguistic rules; otherwise, set the initial parameters 
arbitrarily. Therefore, same as the first fuzzy identifier, we 
incorporate linguistic descriptions about / and g into the fuzzy 
identifier by constructing the initial fuzzy identifier based on 
these linguistic descriptions. 

The following theorem summarizes the properties of the 
second fuzzy identifier. 
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Theorem 4: The fuzzy identifier (31) with / and y given by 
(52) and (53) and the adaptive law (60) and (61) guarantees 
the following properties: 

a) |fi ,| 2 < M f , \i g \ 2 < M g , and £ € I*,; 

b) there exist constants ki and & 2 such that 


/1 


r(r)| 2 dr < k\ + 


I'ht)? 

Jo 


dr 


where 


(64) 


v = 


ofuiifo) ajulif) 


Mj 


00 j 


0</U|0,o) VyUlftn) 


00 .. 


oo a 


(0) - Of) 


{O'o -iJu + 'UL 


(65) 


0/o = A x i f + (1 - A,)£. = A 2 g, + (1 - A 2 )0; 

for some Ai. A 2 G [0, 1], and w , 0}, (9j are defined in 
(28M3Q) with 0} and 0J replaced by 0} and 0*; 

c) if v 6 1 / 2 * then limine, k(OI = 0. 

Proof of this theorem is given in Appendix A. 

Remark 4 / ■ Because we have much more freedom in 
choosing the parameters of this fuzzy identifier than the 
first fuzzy identifier, the w for this fuzzy identifier should 
be smaller than the first one. Therefore, the key factor that 
influence the value of v is the closeness of the 0 f and 0 f/ 
to their optimal values 0j and 0*. From (65) we see that if 
|0/ — 0f \ and |0J - 0J arc small, then v will be small. Because 
the initial parameters 0^(0) and 0^(0) are determined based 
on the linguistic descriptions about / and </, good descriptions 
should make 0 f {0 ) and close to 0) and 0* and therefore 
make v small. 

Remark 42: By using the adaptive law (60) and (61), it 
is possible that some a parameters are zero at certain points. 
To deal with this problem, we added a small r > 0 to the 
fuzzy systems (52) and (53) so that the fuzzy identifier is well 
defined for zero a parameters. 


V. Simulations 

in this section, we simulate the two fuzzy identifiers for the 
following chaotic glycolytic oscillator [ 10 ] 

•ri(0 = -,r 1 (/)j-^(0 +0.999+ 0.42cos(1.75f) ( 66 ) 
j-i(l) = ■ri(1)4{l) ~ M0- (67) 

Fig. 4 shows the trajectory of this chaotic system in the 
phase plane from f = 0 to t = 50 with initial condition 
.ri(0) “ .ri»(0) = 1.5. For this system, wc have = 

+ 0.999, /‘’(jl) = t\j *2 ~ Jr 2 , c/i (x) = 0.42, and 
V 2 (iL) -- 0 . For simplicity, we assume that y\ and y 2 are 
known, i.e., wc onty consider the identification of f\ and 
/■z* For each fuzzy identifier, we simulate two cases: 1 ) 
without linguistic descriptions about /1 and / 2 , and 2 ) with 
the following linguistic descriptions 

77}: IF j'\ is “near 1 ” and j* 2 is “near T\ 

THEN fi is “near 0". / 2 is “near 0“, ( 68 ) 



Fig 4 Trajectory of the chaotic glycolytic oscillator in the phase plane from 
1 = 0 lo t = a0 with initial condition ii(0) = 12 ( 0 ) = 1 r j 

77}: IF j-i is “near 1“ and .r 2 is “near 2”, 

THEN /1 is “near - 3, / 2 is “near T\ (69) 

77}: IF .r\ is “near 2" and ,r 2 is “near 1”, 

THEN /1 is “near - 1, / 2 is “near 1 (70) 

where “near v'\ r =1, 0, 2, -3, or -1, is a fuzzy set vr with 
membership function = exp[-(l/2)((s - r)/0.25) 2 ]. 

77} - 77} are obtained by evaluating /] and / 2 at three points 
£ = (1, l) T , (1, 2) t , and (2, 1) T , and then fuzzifying the 
£- f(j :) pairs. For practical systems, this kind of information is 
provided by human experts who are familiar with the behavior 
of the systems. In all the simulations in the sequel, we chose 
rv = 1, 7 ]_ - 1, Mj = 10 c , and < - 10 _(j , and solved the 
differential equations using the MATLAB command “odc23” 
which uses the second/third order Rungc-Kutta method. 

To simulate the first fuzzy identifier, we first need to 
define some fuzzy sets to cover the state space. From Fig. 4 
we see that most values of .rj and s 2 are in the interval 
[0.5, 3.5], therefore we define the following seven fuzzy sets 
F J (j — 1 , 2 , * ■ •, 7) for both :rj and .r 2 (i.e., m 1 = rn 2 = 7): 
///.',(.r,) = exp [-(l/2)((.r,-7')/0.25) 2 ], where T* =0.5x j, 
j — 1, 2, • ■, 7, and / = ], 2. Clearly, these fuzzy sets include 
the fuzzy sets in the IF parts of 77} - 77}. For this choice of 
F J , we have 49 fuzzy rules to construct each f, (v — 1 , 2 ), 
i.e., each /, has 49 free parameters. We simulated two cases: 
I) without incorporating the 77} - 7?}, i.e., all elements of 
0 /( 0 ) were chosen randomly in the interval [- 2 , 2 ]; and 2 ) 
incorporating the 77} - 7?}, i.e., the elements of 0 /( 0 ) that 
correspond to the conditions in the IF parts of 77} - Rj were 
chosen according the corresponding THEN parts of 77} - 77}, 
and other elements were still chosen randomly in the interval 
|- 2 , 2]. Figs. 5 and 6 show the states and x 2 ( t ) with 
the corresponding ii (0 and .t 2 (7). respectively, for the first 
case, and Figs. 7 and 8 show the same results for the second 
case. Comparing Figs. 5 and 7 and Figs. 6 and 8 , we see that 
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Fig ^ u(0 (solid line) and r (/) (dashed line) using Ihe first fuzzy 

identifier without incorporating the linguistic descriptions (68H70) 



Tip f> iJO (solid Imw and i>(f) (dashed lint) using the first lu/zy 
identifier without mcoiporaung Uil linguistic descriptions (68H70) 



Fig 7 »i(0 (solid line) and ij/) dashed line) using the first fuz/y 

identiher after incorporating the linguistic descriptions (68H70) 

the adaptation speed and accuracy were greatly improved by 
incorporating the linguistic rules /?} — Rj 

We simulated the second fuzzy identiher tor the same two 
cases 1) without incorporating the R l j - R], l e , all elements 
of 0,(0) were chosen randomly in the interval [-2, 2], and 
2) incorporating the /?} - Rj, le, some elements of 0^(0) 
were chosen accoiding to the fuzzy sets in R l f - Rj, and 
the remaining elements were chosen randomly in the interval 
[-2, 2] We chose N ■= 10, so that on average each f, 
U = 1,2) contains 40 tree parameters [see (52)]. Figures 9, 
and 10 show the ri(t) f i 2 (/), and a 2 (f). i 2 (0 curves for the 
first case, respectively, and Figs 11 and 12 show the same 



Fig # f 2 (f) (solid line) and i 2 (f) (dashed line) using the first fuz/y 

identiher after incorporating the linguistic descriptions (68M70) 



! 0 5 10 Ts 20 23" 10 v“ 40 43 SO 

Fig 9 rjf) (solid line) and ijf) (dashed line) using the second fuz/y 
identiher without mLorpoiaiing the linguistic descriptions (68H70) 



'o 3 10 13 2(T 25 10 ~ 1*5 40 43 30 

Fig 10 i J/) (solid line) and j >(/) (dashed line) using the second fuz/y 
idcntifiei without incorporating the linguistic descnplions (68)~(70) 

results for the second case Comparing Figs 9 and 11 and 
Figs 10 and 12, we see that the adaptation speed and accuracy 
were greatly improved by incorporating the linguistic rules 



Comparing Figs. 5-8 with the corresponding Figs 9-12, we 
see that although the second fuz/y identifier used less free 
parameters than the first fuzzy identiher, Us performance is 
better This may be caused by two reasons. 1) although each 
fuz/y system in the first fuzzy identifier has 49 free parameters, 
some of them were never used because from Fig 4 we see 
that the states of the system never enter some regions in the 
state space so that the parameters responsible for these regions 
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Fig. 11. J'i(t) (solid line) and ii (t) (dashed line) using the second fuzzy 
identifier after incorporating the linguistic descriptions (68H70) 



Fig. 12. ,r/(t) (solid line) and i^f) (dashed line) using the second fuzzy 
identifier after incorporating the linguistic descriptions (68M70). 


have no influence on the performance of the identifier, i.e., the 
number of actually useful free parameters of the second fuzzy 
identifier is more than that of the first fuzzy identifier, and 
2 ) the first fuzzy identifier is linear in the parameter so that 
in principle it should converge to a single solution, whereas 
the second fuzzy identifier is nonlinear in the parameter so 
that it has a larger functional space to search. In principle, 
the second fuzzy identifier has the danger to be trapped at a 
local minimum; but this seems not happen in our simulations, 
especially when we incorporated the linguistic rules. 

VI. Conclusions 

In this paper two identifiers of nonlinear dynamic systems 
were developed based on the fuzzy system models. We proved 
that all signals in the fuzzy identifiers are uniformly bounded 
and provided conditions under which the identification errors 
converge to zero. We also proved that the fuzzy identifiers are 
capable of following the output of a very general nonlinear 
dynamic system to arbitrary accuracy in any finite time in¬ 
terval. The most important advantage of the fuzzy identifiers 
is that linguistic descriptions about the systems (in terms of 
fuzzy IF-THEN rules) can be directly incorporated into the 
fuzzy identifiers. We simulated the two fuzzy identifiers for 
the chaotic glycolytic oscillator, and the results show that: 1 ) 
they could identify the chaotic system at a reasonable speed 
and accuracy without using any linguistic descriptions, and 2 ) 


by incorporating some linguistic descriptions, the speed and 
accuracy of the fuzzy identifiers were greatly improved. 


Appendix A 

Proof of Theorem 2: From (16) we see that g and h 
are continuous functions and therefore satisfy the Lipschitz 
condition in the compact set K of (20), i.e., there exist 
constants b g and such that for all u), (x^ 2 \ u) G K 

|£(£ (1 \ u\®g) - mI©$) < M^ (1) - £ (2) l> ( A -0 

\h(x^\ u|@/ t ) - h( r A 2 \ ujOh) < 6 /, \x^ - x^|. (A.2) 
Define e, = i - x, then from (18) and (23) we have 

t x = Ae x + g(i , u\&* g ) - g{x, u) (A.3) 

whose solution can be expressed as 

Cr(0 = / u(t)|0*) - g{x(r), u(r))]dT . 

Jo 

(A.4) 

Since A is a Hurwitz matrix, there exist positive constants 
v and n such that ||e Af || < ct~ at for all t > 0. Let 
e g = then from (A.4), (A.l), and (21) we have 

that 


M 0 i 

- , 9 (x(r), u(t) I 0*) I dr 

+ f ||r A(t_T) |||. 9 (a:(r), w(r)| 0 *) - g{r{r), m(t))|</t 

J 0 

< f (■(•- n{t ~ T) b ll \c J .{T)\dT 

Jo 

+ f l ce- n ^- T) ^f- rbJn dr 

Jo 

< cb v f\-"^\^(T)\dT + ^~ CbJn - 

Using the Bellman-Gronwall Lemma [ 9 |, wc obtain 


(A.5) 


r( 0 l < ^ ‘ r ’ 


dr 


2bh 


a(t - T) </n(t-r)] < J__ 

“ 2 b h 


(A.6) 


Therefore, \x - j: 0 ! < |x - :r| + |t - x°| < (e/2 bh) + b from 
Assumption 2.1. Without loss of generality, we assume that 
bh > 1; therefore, (i, «) G K. Hence, letting tu = </2, using 
(22), (A.2) and (A. 6 ), and considering the fact that (x, u) G K 
and (£, u) G K, we obtain 


IjKO - y (.01 < |M£, - h(x, M |0;,)| 

Ms, - Ms, m)| 

.£,(*)| +t /2 < f 


+ 
S bh 


for all t 6 [0, 7’]. 


(A.7) 
Q.E.D. 
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Proof of Theorem 3: a) From (41) and ( 43 ) we see that 
if tr(@/ 6 j’) = M/, then: if e T & f p(x) < 0 , we have 

[*( 0 /©/)] = + 0 /©J) 

=■ tr(7i ep r (£)©/ + 7i0/P(£k r ) 

= 27ie r 0/p(x) < 0 (A.8) 

where we use the properties of trace in the last equality: and 
if t T e f p(x) > 0 . we have 


we can prove that :Wtr(e p e£))tr(e fl $p < 0 . 

Therefore, from (A, 12 ) we have 

V < —a|e | 2 + e T w < ~a |£| 2 + io 0 |e| (A.14) 

(recall that w o s sup (>0 |w(/)j which is assumed to be finite). 
Hence, if |r| > (wo/nr), we have V < 0, which means that 
? £ Loc- Since e = £ <- x and y is bounded by assumption, 
we have i £ Loo- 
b) From (A, 14) we have 


^[tr( 0 / 0 /)] = 2 tr ^i£P T (i)BT - 7ij ~ - ^ B/B^ 
= 27i£ 7 0/p(ji) - 27ic r 0/p(r) = 0. (A.9) 

Hence, we always have tr(ByBj) < Mf. Similarly, we can 
prove that tr(B^Bj) < M (f . To prove s £ L&C* define the 
Lyapunov function candidate 

V = ±|d a + + ^tr(*,*J’) (A. 10) 

where 4>/ = BJ - B j and 4>_ g H B* - B f/ . Subtracting (40) 
from (27) and using (38) and (39), we obtain the error dynamic 
equation 

t - -or -f $jp(£) + 4\,p(,r)f/ + w. (A.l 1) 


The lime derivative of \ T along the solution of (A.l 1) is 
V = -fv|r| J + L 7 ^JP{£) 4- r 1 4%p(dl)n + r 7 tv 
4 ±tr(i f *}) + -ir(* 9 *l) 


= -f*ld 2 + L 7 w + — tr [(7iO' J U) - © / )*/] 
7i 

+ — tr[(720' r (i)i/ - 0,)$J] 

72 

, •> r C 7 B fp('l') ... 

= -«ld 2 + C r JE + 7;~-%{ tr(0y*J) 


tr(0/0y ) 


r 0,y>(jl)i/ 




(A.12) 



Integrating both sides of (A. 15), wc have 


fvXr )| 2 dr < ~(|V( 0 )| 4 |V(/)|) + -i /W)| 2 dr. 

(A.16) 

Define k\ = £(|V(U)| + sup t>0 |V 7 (/)|) and k 2 ~ 1/o 2 ,(A.16) 
becomes (45). k] is finite because r. tr(4>j4 > 7 ) and tr(4> fl 4 > 7 ) 
arc bounded from a), therefore V £ |sce (A.I0)|. 

c) If w £ L‘ 2 , then from (45) we have that i £ Li. Because 
r, 4>/, 4\ ; , p{s), u, and m are all bounded (;>(£) is bounded 
because from (37) wc see that each element of p(r) is not 
greater than one), from (A. 11 ) we have that r £ L^. Using 
the Barbalat’s Lemma |29| (if r £ L^nL^ and r e/> v , then 
lim r _^ \r(t)\ - 0 ), we have lim,^ \r(t)\ = 0. Q.E.D. 

Proof of Theorem 4: a) From (60) and (62) we sec that 
if \0f\ 2 = A//, then: if L r {{Of/()0j)) r flj < 0 , we have that 

(d/dl)(\O f \ 2 ) = 20j0j - 2 7l r 7 ((<l/ /i)f)j))' 1 Of < 0: 
and, if vj ((Of/OS/)) 1 Of > 0 , we have 

that (r//</O(| 0 ,| 2 ) = 2('nc r (0j/m r )) T £j - 

= 0. Hence, |^| 2 < M s . 
Using the same procedure we can prove that | 0 V | 2 < M, r To 
prove .r £ define the Lyapunov function candidate 




(A.17) 


where /* = 1 (/.] — 1) if the second line of (41) |(42)( 
is true, 1\ = 0 (ij = 0) if the first line of (41) 1(42)] is 
true, and we use the fact that 4>y = —By. We now prove 
that / 1 *(c T 0f£(r)/tr(0y0j))tr(0y4> 7 ') < 0. If/* = 0 , 
the conclusion is trivial. For l{ — 1 , which means that 
tr(B^Bj) = Mf and r 1 £)fp(r) > 0 , we have 

tr («/*}) 

- tr 0 y$y - ^$y4»y - ^/4>y 

= itr ( 0 ; 0 f) - itr( 0 y 0 7 ) - < 0 

(A. 13) 

because tr(0)BJ T ) < Mf = tr (B/B 7 ) and tr($/^J) > 0. 
Hence, I* (eU-)/p(£)/tr (B/B 7 )) tr (B/<f>^) < 0 . Similarly, 


where <f f = fP f - and 0^ = - 0 q . Using (27) and 

(31) (replace B^ and B* by 0} and 6 (J), we obtain the error 
dynamic equation 

<1 = “ f *fl + [/(sdff/) “ fUlftj)} 

+ LvUI^) - + (A. 18) 

Using the Taylor scries expansions of /(j|£}) and y(.r\(£ g ) 
around Of and 0 rr we have 

im)) - i f + /o. (A.i9) 

«UI^) - flU|g 9 ) = t, + 9° (A - 20) 
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where fa and go are the higher order terms. Using the Mean 
Value Theorem, we have 




f df(s.\£f)\ T , 

l^rj *-< 


9 fjx\ifo) g/tejg/) ' 


dO f 


dOj 


J ~ ! ' 


„ _ ./ m.x \ (dg(x\d g )'\ 

go = g(x\O g ) - g{.c\t,) - J i v 


dg(x\i g u) ®g(£ |g 9 ) 




do,. 


<t> 

—9 


(A.21). 


(A.22) 


where 0 /o = \x0 f + (1 - Ai)g) and 6 g0 = X 2 0 g + (1 - A 2 )0* 
for some X 2 € [0, 1 ]. Substituting (A.19) and (A.20) into 
(A. 18), we have 



• 0^ u. + /o + fjuu 4* w- (A.23) 


The time derivative of Vi along the solution of (A.23) is 

+ £ T + IL 1 HI 

+ -£+, +-fy,- (A.24) 


Using (60M63) and 0 ^ = -if, ^ = -i g , we have that 
Vi = -«|d 2 + £ T /<> + t 7 00M -f £ 7 jW! + /*-|^j2- 


r £ 7 Gt;) 

. i ; - — 


l««l 2 


(A-25) 


where f* = 0 ( 1 ) if the condition of the first (second) line of 
(60) is true, and J 2 = 0 ( 1 ) if the condition of the first (second) 
line of (61) is true. Using the same arguments as in the proof 
of Theorem 3 and noticing that |0 | 2 = tr (00 r ), we can prove 
that the last two terms of (A.25) are nonpositive. Hence 

Vt < -o|f;| 2 + L r /q + p T gou + r T w 

< — rjf|r| 2 + |e| • |/ 0 | + |e| • |<j 0 | ■ |u| + |e| • |w|. (A.26) 


From (54H59) we see that all the elements of {Of/dO) 
and (iig/dB) are bounded: therefore, from (A.21) and (A.22) 
and the boundedness of </>^ and 0 ^ (we have proven that 
\l s \ 2 < M f and |£/ < M fl ; \6}f*< M, and \6* g \ 2 < M g 
by definition), we have that |/ 0 | and |r; () | are bounded. Hence, 
ft = |/o| + \<h\ ■ |«| + |l£| is finite. From (A.26) we have that 
if |r| > tf/tx. then Vi < 0; therefore, |t;| is bounded. Since 
i — £ - l and £ is bounded by assumption, we have £ € L-*,. 





b) From (A.21), (A.22), (65), and the first line of (A.26), 
wc have that 



2 T 1 . ., 

- t T ll+ —:: « 2 - 

<» n z 



S-tf + i-UI 1 - 


(A.27) 


Integrating both sides of (A.27) and defining k] = 
(l/a)(|Vi(())| + sup , >0 |V(/)|) and k 2 = l/o 2 , we obtain (64) 
(sup f>0 |V(/)| is finite because r, </^ and (j) arc all bounded). 

c) If r € h 2 » then from (64) e E L 2 . &om a), v 6 L XJ . 
Because in the proof of b) of this theorem we showed that all 
the variables in the right-hand side of (A.23) are bounded, 
we have r £ L^. Using the Barbalafs Lemma, wc have 
lim,-,^ |r(0| = 0. Q.E.D. 


APPENDIX B 


The fuzzy system / t (x|0y) of (52) can be represented as 
the feedforward network of Fig. 13. Using the chain rule on 
the f, of Fig. 13, we have that 


Vfi . 

c\ 





Vytc 

" 6,’ 





Of, 


Dr[ 

(g‘„ - 

l) 

I 

to 

X! 

da\ c 

dc[ 

a* 

b. 


((O 2 + 0 2 

Vf, 

. 

dz l 

tL - 

l 

1 

{ * l jf ' 

dz l 

rtjf 

b, 

( 

ff .'e ) 2 + f 


■z 1 - 

■ r J - 

r l 

X 3f 




1 


’■ + <’ 



Of, . 

_dj 1 

dz l 

y'„ - 

/. 

1 

do\ s ' 

0z l 

a °\j 

b, 

( 

rrj, y + ( 


,| 2(^-5W, 
((*J/) a + 0 a 


(B.l) 

(B.2) 


(B.3) 


(B.4) 
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where j = 1, 2,• ■,n, l - 1, 2,,TV, and a,, 6,, and 
are defined in Fig. 13. The (Oyi/d& g ) can be computed in 
exactly the same way as (B.1HB.4), with the subscripts V* 
replaced by "/T and ik J" replaced by “g 
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Stochastic Approximation with 
Averaging and Feedback: Rapidly 
Convergent “On-Line” Algorithms 

Harold J. Kushner, Fellow, IEEE , and Jichuan Yang 


Abstract —Consider the stochastic approximation A"„+i = 
-Vf.+«ii/?(-V a ^ M ),whereO < n„ —*^0,= x and {£,.} is the 
“noise” sequence. Suppose that n n —► 0 slowly enough such that 
fi„/ri„ +1 - 1 + <>(<'„ )■ Then various authors have shown that the 
rate of convergence of the average ± A", is optimal in 

the sense that y/n( X„ - 9) converged in distribution to a normal 
random variable with mean zero and covariance I \ where \ was 
the smallest possible in an appropriate sense. \' did not depend 
on „}. The analogs of the advantages of averaging extend to the 
constant parameter systems .Y», M = A’, f + f</(A\,,£„ ) for small 
f >0. The averaging method is essentially “off line” in the sense 
that the actual SA iterate ,V fl is not influenced by the averaging. 
In many applications, A"„ itself is of greatest interest* since that 
Is the “operating parameter.” This paper deals with the problem 
of stochastic approximation with averaging and with appropriate 
feedback of the averages into the original algorithm. It is shown 
both mathematically and via simulation that it works very well 
and has numerous advantages. It is a clear improvement over the 
system .Y„ by itself. It is fairly robust, and quite often it is much 
preferable to the use of the above averages without feedback. 
We will deal, in particular, with “linear” algorithms of the type 
appearing in parameter estimators, adaptive noise cancellers, 
channel equalizers, adaptive control, and similar applications. 
The main development will be for the constant parameter case 
because of its importance in applications. But analogous results 
hold for the case where n>, ->0. 


I. Introduction 

i 

IONSIDER the stochastic approximation 

41 ” • £/i) 

where 0 < a„ 0, £ n rj„ - x and {(„} is the “noise” 
sequence. There is a large literature and many sophisticated 
analyses available for such processes. Under various sets 
of conditions, there is 9 such that X„ —► 9 cither with 
probability one or weakly. Also, under appropriate conditions 
(A\, - 9)/yJTi~ t converges in distribution to a normal random 
variable with mean zero and some covariance matrix Vo. The 
matrix V’o and the scale factors {«„} arc a measure of the 
“rate of convergence.” 

Suppose that a 1t —* 0 slowly relative to I/ 7 /. More partic¬ 
ularly. let 

«ri/«ij + i = 1 + o(n ?t ). (1.2) 
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Under some additional conditions, Polyak and Juditsky [11] 
showed that the rate of convergence of the average 

= X, (1.3) 

n " 

is optimal in an important sense. In particular, \fn(X n - 9 ) 
converged in distribution to a normal random variable with 
mean zero and covariance V, where V was the smallest pos¬ 
sible in an appropriate sense. V did not depend on {a T ,}. This 
development promises to be quite important in applications. 
By now, extensive simulations have supported the theoretical 
conclusions. This superiority would not be true if u 71 decreased 
as (){l/ti). Choosing the a n sequence was a major nuisance 
in the past. The importance of this problem is now much 
reduced. Larger values of u n (larger than 0(1/^)) cause the 
process to “jump around” more, but the averaging in (1.3) 
compensates for this and greatly improves the values. Ruppert 
fl2| developed similar results for a one dimensional problem. 

Additional work on the averaging problem was done by Yin 
f 14], [15] who generalized the earlier results. In [9], Kushner 
and Yang showed that the averaging results were generic to 
stochastic approximation (SA), and held under conditions of 
considerable generality. That work shed much light on the 
reasons why the averaging gave a better result when (1.2) 
held. In general, that reference used sums of the type (1.3), 
but with various moving windows and where the lower index 
of summation goes to infinity as 7 / — x. As pointed out in 
|9|, the basic reasons for the success of averaging stem from 
the fact that the “time scales” of the original sequence A’ T1 and 
the averaged sequence A'„ can be separated, with the time 
scale of the former sequence being “faster.” This time scale 
separation also accounts for the difficulties in the analysis of 
the averaged sequence. Time scale separations have been used 
as a standard tool in the analysis of stochastic approximations. 
But, in the past it was the separation of the scales of the X tl 
and the noise £„ sequences which was used. Now there is a 
third lime scale to be considered. Similar results have been 
obtained for the constant parameter case 

An+i = -Y„ + £„). (1.4) 

The averaging method is essentially “off line” in the sense 
that the iterations (1.1) and (1.4) are not influenced by the 
averaging. In many applications, A',, itself is of greatest 
interest since that is the “operating parameter.” Indeed, if 
A\, were substituted for X r , on the right sides of (1.1) or 
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(1-4)* or in the j(-) function alone, then the advantages of 
averaging would be totally lost. One can, of course, run a 
"‘training’* sequence and then use the end value of the averaged 
sequence as the future operating value. This would often be a 
useful idea. Nevertheless, the question remains whether it is 
possible to exploit the averaging to improve the actual primary 
stochastic approximation {A„}. This paper deals with the 
problem of stochastic approximation with simultaneous use of 
averaging and feedback. It is shown both mathematically and 
via simulation that it works well and has numerous advantages. 
It is a clear improvement over the systems (l.l), (1,4) by 
themselves. 

We will deal only with “linear'* algorithms of the type 
appearing in parameter estimators, adaptive noise cancellers, 
adaptive antenna arrays, channel equalizers, and similar appli¬ 
cations. Stability problems make the general nonlinear problem 
much harder at present. But, as noted above, the special case 
has numerous important applications and is the subject of an 
enormous literature itself. These are adaptive problems of great 
current interest, particularly in the communications, parameter 
estimation and signal processing area. For such problems, 
it is common practice to use (1.4) for small e. even if the 
parameters of concern arc not time varying. So, we concentrate 
on this case. More detailed discussion and motivation for the 
constant parameter case is given at the end of Section III to 
which the reader is referred. 

The multiple time scale point of view and the clear (to be 
seen below) advantages of feedback with which this paper is 
concerned imply that the use of various types of averaging 
methods in stochastic approximation is just beginning. In 
Section 11 wc describe a simple but important averaging 
result (with no feedback), taken from |9J. While this result 
will not be used explicitly in the sequel, it illustrates a 
basic property and value of averaging in a simple context 
and it is useful for motivation. In Section Ill the actual 
problem of interest will be described. We work with (1.4) 
and use various approximations to (1.3) which are equivalent 
to different windows of averaging or “forgetting rates." All 
the results have analogs for the case where a n —► 0 and ( 1 . 2 ) 
holds. The basic estimates of the asymptotic orders of the 
errors for the problem with feedback is given in Section IV. 
This section makes use of the so-called perturbed Lyapunov 
function methods. The order estimates show that the averaged 
iterate (in the feedback case) is still better than what one 
would get without averaging. Once the estimates of order 
are available, an asymptotic weak convergence analysis can 
be done. This appears in Section V, where the limits of the 
distributions of normalized iterates are obtained: In particular, 
it will be seen that the limiting distribution is that of the 
stationary random variable of a Gauss-Markov diffusion, and 
that the limit covariance of the process with feedback is much 
better than that for the original sequence without feedback and 
frequently better than that for the vsequence of averages (1.3) 
of the original sequence (1.1). Numerous simulations have 
strongly supported the theoretical conclusions, and typical 
simulation results for canonical systems fitting the framework 
of the various applications cited above appear in Section VI. 
The time varying parameter case is discussed in Section V-B. 
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II. The Advantages of Averaging, a Simple Example 

A very useful and relatively simple result which demon- ' 
strates the advantages of averaging without feedback can be 
readily obtained via a weak convergence method, under quite 
broad conditions. We will use a “minimal window** of iterates, 
smaller than in (1.3), yet of considerable practical interest. Our 
interest in this section is simply in showing that various sets of 
conditions in stochastic approximation give the desired result. 
We wish to show only that the averaging works if there is 
a basic rate of convergence result for X n , thus implying that 
the advantages of averaging naturally accompany any classical 
rate of convergence analysis. The material in this section is 
taken from [9] and is for motivation only. The result will not 
be used explicitly in the sequel, but is important for motivation 
and for putting the subsequent results into perspective. 

Lei us set up the required terminology. Define the “interpo¬ 
lated time" />*, = with 1$ = 0, In this section, we 

will set 9 = 0 [see the definition below ( 1 . 1 )] for notational 
simplicity. In the asymptotic analysis, it is usual to examine 
the tails of the SA sequences. To do this we define the “tail’* 
processes as follows. For each n > 0 , define the interpolated 
processes V n (-)< X n ( ■) by 

u n (t) = X n +tly/a n + k for 

f E [^n-fi — f n y ffl + i+3 f n )i f ^ Dj 

X"(t) = X n+I for t € [/ fJ+ , - f„,f w+ Hi - f„)./ > o. 

The basic result below is stated in terms of weak conver¬ 
gence. It is not necessary to be familiar with the details of 
the weak convergence method to deal with the rest of the 
paper. Loosely speaking, weak convergence of a sequence 
of processes l r ”(-) to a process (/(■) means that if /(•) is 
a continuous functional of the process paths, then /(I/ f, (-)) 
converges in distribution to /(?/(•)). It is a considerable 
extension of the usual convergence in distribution for vector 
valued random variables. Of course, to speak of convergence 
in a function or path space implies a particular path space 
and topology on it. The most frequently used path space is 
the space of functions which are continuous on the right and 
with left hand limits. The most commonly used topology on 
this space (and the one used here) is the so-called Skorohod 
topology Z) r [ 0 ,oo) [ 21 , [3], | 6 ], where r is the dimension of 
X , All that we need to know here is that the topology is weaker 
than that determined by uniform convergence on bounded time 
intervals. Thus, if y n (-) is a sequence of paths in I) T [0, oo) and 
jVr?(') —* j/(') m the sup norm on each bounded lime interval, 
then the convergence is also in the Skorohod topology. Let => 
denote weak convergence. 

In Theorem 2.1, we will use the following assumption. 

A 2.1: There is a matrix G whose eigenvalues lie in the 
open left-half plane and a positive semidefinile symmetric 
matrix Ro such that A' n ( ) => zero process and U n (') =» [/(.), 
where !/(■) is the stationary solution to 

dU = GUdt + R^dw (2.1) 

where «>(■) is a standard R 1 —valued Wiener process. 
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Comment on (A2.1): We state condition {A2. 1 ) in the given 
form since it is the most convenient way to illustrate the main 
point. Many different sets of conditions imply (A2.1) and 
getting such a result has been one of the main goals of work in 
stochastic approximation. Our primary goal in this section is 
to show that such a commonly available weak convergence 
result can be easily extended to show the possibilities in 
improvement in the rate of convergence due to averaging. The 
references |1J, [ 6 J, [7], flO] contain various sets of conditions 
which guarantee (A2.1). 

For T > 0, define Z n ( ) by 

n+T/ a„ 

z " m= -m= £ 

In sums such as (2.2), the limits of summation are always 
taken to be the integer parts of the indexes. 


A. A Convergence Theorem 

Theorem 2.1: Assume (1.2) and (A2.1) and define V — 
G~ l Ro(G')-*. For each T, Z"(T) converges in distribution 
to a random variable with mean zero and covariance Vj = 
V + (?(1/T). 

Discussion: The proof of the theorem is in (9]. By the weak 
Convergence in (A2.1), as n —* no we have (loosely speaking 
and in the sense of distributions) X n ~ (V((), «„Vn), where 
N(0,M) denotes the normal distribution with covariance 
M and mean zero, and V r 0 is the stationary covariance of 

(2.1) . The theorem implies that for large T and large n. 
X„ ~ N(0 t a v V/7'). Thus for large T, averaging is a dislincl 
advantage. The sum in ( 2 . 2 ) can be replaced by Yl'Un-r/n,, ■ 
Other relevant comments arc in f9], [11], [14], with various 
windows of averaging. 

Minimality ofV: It is noted in [9], [II] that V is minimal 
in an important sense, which w e now describe. Suppose that 

(1.1) is used and U n = X n / converged in distribution 

to a normally distributed random variable V with mean zero. 
Then the fastest asymptotic rate of convergence is obtained 
with «„ = K/n, for K a positive definite matrix. Then, under 
appropriate conditions (see, e.g., [ 1 ], [ 6 ] (/’'(•) !/(•), where 

U(’) is the stationary solution to 

dll = ^ + KG^j Udt + dw (2.3) 

where C and /?'« arc as in (A2.1), and it is supposed that 
+ KG) is a stable matrix. If we optimize the trace of the 
covariance matrix of (2.3) over K , we get the best value of 
K as 

K = -cr 1 . 


With this value of A\ the covariance of U( 0 ) is just the V 
used in Theorem 2.1. In this sense, the result in [9], [II] and 
of Theorem 2.1 is optimal 

Constant Gains: Suppose that a ri - £\ a small gain param¬ 
eter. Define 


1 


»+T/f 

E 




Then if n and T are large enough, we have X n ~ N( 0 , eVo), 
and X„ ~ N( 0 , eV/T ), again an advantage over the nonaver- 
aged variate. 


III. Averaging With Feedback Formulation 

In this section, we describe the algorithm of concern. The 
analysis will be done in the following section. We would like to 
be concerned with algorithms of the general type, for positive 
real valued A 

X TI+1 = X n + tg(X,M + iA(X n - X n ) 

where the feedback is linear and proportional to the gain <. 
This form does allow the use of the averaged iterates m an 
intuitively reasonable way in the primary algorithm. Indeed, 
the results of the sequel suggest that a larger feedback will not 
work well. It is hard to analyze the fully nonlinear case, and 
we will work with a simpler form which is very important in 
applications. 

The Model: Consider a classical algorithm for parameter 
identification. Let 9 denote the system parameter, <j> 7 , a se¬ 
quence of random “input vectors” with = G > 0 

(positive definite), a sequence of real valued random 
variables (with mean zero, without loss of generality) and 
(j) f is the transpose of </>. The observed output is the real 
valued //„ = 9 f (f> n + (/>„. A common lorm ot the identification 
(alternatively, adaptive noise canceller, etc.) algorithm is 

0 n + ] - 9 n + e<pn[yn - On, (3.1) 

where 9 V is the Nth estimate ot 9. Define 



One also might use the finite window ol averaging, wheie 9„ 
is defined by (for T > 0 ) 


1 

w 


E i. 

'- 1 / 


Then (see comment after the proof of Theorem 2.1) the theory 
of the last section (equivalently, of |9|) can be used to show 
that for large T, the average 9 U has a much smaller asymptotic 
covariance than does the original 9 n . 

A Feedback Form of (3.1), (3 2). Consider the following 
form ol (3.1) and (3.2), where the average is fed back. For 
A > 0, redefine 9„ and 9 U by 


0„ + l = K + E<J>n[yn ~ 0'„<t>n ] + - 0„] 



(3.3) 


Defining A'„ = 0„ - 0, X„ = 0 n - 0, rewrite (3.3) a& 
X„ +l = [/- £C7].¥„ + + eAfin - -Y n ]. (3.4a) 



Z"('T) a 


(3.4b) 
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where 

6 * = [G ~ 4>n<t>' n ]X a + <j> n 1p u = $„X n + 0 „ 

where the definitions of and p n are obvious. 

The sequence { X n } can be arbitrarily well approximated 
for large n by the solution to the equation 

X 0 = X\ = A'o and, for n > 1, 

^».+i = ~ + —■ (3.5) 

In fact, we will work with the more general forms defined by 
either (3.6) or (3.7) 

A 71 + 1 (1 ^71 )A T , + Af) (3.6) 

where 

bn ‘ 0) ^ ^ bfi = cXj 

or 

i = (1 - CXC)X U 4 (\lX u (3.7) 

where rv > 0 is small. The form 

71 -f 1/nr 

A\, = w £ X, 

seems to be better in simulations although it is much harder 
to analyze. 

Algorithms of the torm (3.1) are commonly used m adaptive 
applications throughout communication theory and in signal 
processing [13], [5|. The simulations in Section VI are for 
a canonical problem in these areas. In such applications, one 
generally lets e be a constant, selected a priori The decreasing 
gain case <i n —► 0 is more rarely used, even when the basic 
parameters to be adjusted to are thought to be constant. This 
is due partly to the way that the systems are automated. Also, 
in applications one often seeks the largest value of t for which 
the system will be stable and still have reasonably small noise 
effects. It is important to keep in mind that the errors are 
relative to a bias, since the system order is rarely known. (In 
this biased case the results here hold, as they should, for the 
errors centered about the bias.) 

A possibly ideal algorithm, in lieu ol (3.1), would be 
linearized least squares. This has excellent asymptotic prop¬ 
erties, but due to the heavy computational overhead, one 
generally prefers to avoid it in the applications cited. A 
possible alternative to linearized least squares replaces r by 
(i„ —► 0. This is often unsatisfactory. The best rate is a„ = 
0(1/7/), which has serious robustness problems. Furthermore, 
unless a n = K/n for an appropriate matrix A, the data 
for the different coordinates can be greatly skewed. In many 
applications one prefers not to use such a matrix due to the 
more complex implementation and the difficulty in getting a 
good value for it. 

As seen from Section II, letting a n go to zero slowly 
enough and using averaging accomplishes the same thing 
asymptotically as does the use of linearized least squares. 
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without much additional computation, since (loosely speaking) 
X n ~ N(0, V/n), where V is minimal, The best algorithms 
of Section II use a window of averaging which is of the order 
of n, the current iterate number. One might wish to avoid this 
due to robustness considerations. In addition, the benefits of 
averaging do not extend to the iterate X n itself. If On —► 0 
slowly enough, then over practical operating ranges there is 
not much difference between using a fixed t or a n 0. We 
work with a constant window of averaging and the constant 
gain case. The scheme will be seen to be robust. In Section IV 
it is shown that the use of averaging and feedback and a finite 
window does not hurt the iterate and yields an average that is 
0(f 3 ) in typical cases. This is a vast improvement over the 
widely used constant gain case. In Section V, it is shown that 
the result is actually much better, and the result is quantified 
in terms of the feedback gain and the window of averaging. 
The considerable improvement that it can provide over the 
standard fixed gain case for both the direct iterate and the 
average are clearly seen from the estimates in Sections IV and 
V, and from the simulation results, 

IV. Asymptotic Orders 

Stability results and estimates of the orders of the asymptotic 
errors generally form the basis of asymptotic analysis of 
stochastic approximations. Such estimates are the subject of 
this section. Informal calculations suggest that the estimates 
(4. lb) and (4 .1 c) might be conservative. The stability estimates 
will be exploited in the next section, where we will get 
more precise asymptotics and a more explicit estimate of the 
errors as a function of the feedback gains and the window 
of averaging. As written, Theorem 4.1 shows that the use 
of feedback does not hurt the direct iterate and that the 
averaged iterate is much better than what one would gel 
without feedback and averaging. In Sections V and VI, it will 
be seen that the direct iterate can be much better as well. 

Assumptions: Define E rl as the conditional expectation 
given {-Vo,< 56 ,, 0 ,,? < «}. For real A'o, we require, uniformly 
in ri 


(Xj 


771=71 

OO 

(A4.1) 

^ Ejiptn — Kq. 

(A4.2) 

j < 00 } is bounded. 

(A4.3) 


Note: Conditions such as (A4.1)-(A4.3) are quite conve¬ 
nient and not loo restrictive in practice. They imply a certain 
rate of decrease of the correlation; i.e., a rate of decrease of 
the expectation of future “noise values” given the remote past. 
They are implied by various types of mixing conditions. See 
[3], [ 6 ] for examples. They do exclude the Gaussian case, 
but would include any reasonable truncation of a Gaussian 
sequence. 

Theorem4,l: Assume (A4.IHA4.3). Then, for (3.4a), 
(3.6) 

liinsup£|-V n | 2 = 0(e). (4.1a) 
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limsupi?|X n | 2 = 0(e 2 ). (4.1b) 

n 

If Enflt = 0, a > n, then 

Umsup£|X| 2 ~0(c 3 ). (4.1c) 


If (3.7) is used, then the right-hand sides of (4.1b), (4.1c) arc 
replaced by 0(f). 

Comment: We have EnPi = 0,/ > n , if the observation 
noises {Va» } are mutually independent, have mean zero, and 
are independent of the {<//„}. 

Proof: The proof will be divided into several parts. The 
basic technique is the perturbed Lyapunov function method 
[6], which has been found to be very useful for stability 
problems with non-Markovian systems. In Part 1, wc try 
to apply a standard Lyapunov function method to the A M . 
X„ sequences and show that there is a problem due to 
certain “bad” terms. We then proceed to define the perturbed 
Lyapunov function, which is designed to deal with these bad 
terms. The procedure is straightforward, but a little tedious. 
The technique requires that a certain “fixed parameter” system 
be defined, and this is done in Part 2. Certain auxiliary 
functions (the perturbations) are defined in Part 3. In Part 4, 
the perturbed Lyapunov function is defined and appropriate 
estimates of the perturbation given. 

Part l: First, we use b„ 0 and attempt to use standard 
stochastic Lyapunov functions. It will be seen that these will 
not yield inequalities which can be used to prove the estimaies 
(4,1). The problem concerns certain “bad” terms which appear 
and the inability of a classical Lyapunov function approach to 
exploit time scale differences. We will be able to cancel the 
effects of these bad terms via the use of suitable perturbations 
to the Lyapunov functions, which allow us to cxploii time 
scale differences. 

Define C = [A1 + 0] M and Q = Al 4 G. It is 
convenient to work first with a “centered” process defined by 
X„ = X n - ( \Y n . We can write 

*,. + 1 = (/ - cQ)Xn + ctn + b n C(X n - X n ) (4.2) 
where the noise term £„ is defined by 

(n — QjiX,, + p n -= -|- C\Y,|] + p„. (4.3) 


We can write 

En IJLhMA'mI 2 

~ 2A t / i E,,[A„ f i - A„] -f E n |A t1+ i - A\,| 2 , (4.4a) 
2X'„E„ [a„ + , - A„] 

= -2eX'„QX„ + 2eX'„E n Cn + 2b n X’„C 
x [A'„ - A,,]. (4.4b) 


We now put (4.4b) into a form which will allow a better 
estimate to be obtained. For small positive A*i, wc will use 
the inequality 


& «||X M I ^ bn 




k„ 


(4.5) 


The h] above and the positive real numbers A 2 , A\ q which 
will be introduced below can be as small as needed. But they 


do not depend on e, n . The purpose of inequalities such as 
(4,5) is to facilitate use of the time scale differences between 
X n and X n , as will be seen. We use (4.5) in several of the 
following inequalities. If the term linear in A’i is dominated 
by other terms for small A,'i, then we might omit it and adjust 
the orders appropriately. 

Using the inequality (4.5) and writing X n = A\, + GX n , 
yields 

2 A'£,’„[A „ +1 - A„] < -2eX' n QX n + 2eA 

+ fciO(«) |A „| 2 + (;(^)1|X„| 2 . 

Since we are concerned with asymptotic analysis, where 
7? -4 rx. and e is small, we always suppose (without loss of 
generality) that c and the ratio b n /c are small, the latter going 
to zero with n. Then in all of the following estimates, we omit 
the terms which arc dominated by others which appear; e.g., if 
for any “quantity” there is a term 0(f) x quantity + 0(fc fl ) x 
quantity, we will omit 0{K) x quantity. From (4.4a), we can 
now write 

K\Xn+i\' 2 - \X n \ 2 < -cX;,QX„ + 2cX’M* 

+ o(-J)^|A „| 2 + 0(c 2 )En\(„\ 2 . (4.6) 

Using X tl = X n -f OA tm we can also write 

|X.+ , I 2 - IX ,| 2 - 2 fc„IX ,| 2 + 2b n X^ n (X tl + rx.) 

+ 0(6 2 )(|X,| 2 + |.Y fl | J ), (4.7) 

We will bound the right side of (4.7) by a more manageable 
function by using a form of (4.5), as follows. For small /, i > 0, 
use 

bn | X n 11 <Y,, | b b n 

Again, when (4.8) is used, if ihc term linear in is dominated 
by other terms when A* 2 is small, ihen we might omit it and 
adjust the orders appropriately. Now rewrite (4.7) as 

|A, l+ 1 | 2 -|A fl | 2 <-2ft„A'(/-r)A„ 

+ o(bMx l ^ + ()(i, l Mn\ 2 r- 

K‘2 

(4.9) 

Part 2-Auxihary Processes: Inequalities (4.6) and (4.9) 
cannot be used directly to get the moment estimates due 
to the appearence of certain “bad” terms. To deal with the 
“bad” term 2tX' n E„G, in (4.6), we need to exploit the time 
scale differences between the X n and sequences. This 
will be done by modifying the Lyapunov functions slightly. 
But first, wc need to introduce various “fixed-./” processes. 
These arc needed since the noise is not only correlated, 
but depends on the state. The approach is analogous to the 
one used successfully in ( 6 ]. First, wc define some auxiliary 
sequences. In the next part of the proof, perturbations to 
the Lyapunov functions |A r fl | 2 and |A \,| 2 which use these 
auxiliary sequences will be defined and estimaies of their 
values obtained. The proof will be completed in the Parts 4-6. 




\ 2 + h\X, 


(4.8) 
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The noise in (4.3) depends on the state X „. Exploitation 
of the time scale differences is aided by the introduction of 
“fixed-tf ” noise process. This useful idea has wide applicability 
for getting limit results, whenever the noise is state dependent; 
e.g., sec [6]. For each j, define the “hxed-j” noise process 
+ Ph For each //, define the noise sequence 
{C*™ > "} by 

C =* *mXn+l> m - (4 10) 

Analogously, for each n and rn > n define the '1ixed-.r 
process’* by A "(r.j) = .r, and tor m > ti 

-V:+iU^) = + (41D 

Note that ,i and ./ are parameters here Define {A’”, 7 n > n} 
by x;; = X n , and A'", = A’ t ”(A" n , A'„) tor r// > // i.e., for 
m > 

Am ti = (/ - t</)An, f v;; - X, (4 12) 

Thus, the parameter (j.j) in (4 11) is set to (A' M A'„). i.e., 
m | A ",. m > n\, the noise evolves (altei lime //) as though 
the state never changes, thus the term "fixed r” process. 

Pait 3 We will define vanous auxiliary sequences, which 
will be used for ihe Lyapunov turn lion perturbations and 
obtain some estimates ol their values For each n, define the 
sequence 

/] = Y b„,E„ (4.i3) 

m~~n 

In the Appendix, it will be shown lhal 

|/i| = n(^)(|\, , | 4 \\ u \) + (Hb„) (4 14a) 

11 K u p, — 0, f r n, then wc will get the estimate 

«(^)(|A’ (1 |+|T„|) (4.14b) 

To umly tcnnmology, and write estimates such as (4.14a) and 
(4.14b) as one compact expiession. wc will use the symbol 
/: If wc assume E n p t = 0, / ^ n, then J = U, otherwise it 
takes the value 1. 

In the Appendix it is shown that the conditional difference 
satisfies 

B»fnn-j«~- h » x »* () (-' + *»*) 

X (|IY„ | + |,Y„|) f l()(b„f) (4.15) 

We have 

X 

= -i>nX„+ Y *>,»£.[A 1 ;;, 41 -x::,) <4.i6) 
We now define one last pcturbation. Define 

ft = 2eX' n 'E ii *&- (4l7a > 

Then 

ft = 0(e)(|A„| J + |A„f) + lO(c) (4.17b) 


where the estimate follows from (A4.1HA4.3), This pertur¬ 
bation will be used to cancel the 2cX' n E„£„ term in (4.6), 
Now. evaluate the conditional difference (recall that Q *= £„) 

£../» 2 +1 - ft = -2eX'M, + T,\ + T* (4.18) 

where wc define 

/;! = 2 tEjx n+1 - xj Y A «+iC +1 

m -- ti 41 

ft -= 2cX'„ Y ^IC +, 'C]- 

/r*-=>i4 l 

We have 

|7;!| 4 |T,f| “ 0(u" + //„:) fO(* 2 -F h u e) 

*(|A'„| 2 4 |X„| 2 ) (4.19) 

Part 4 We are now prepared to define and use the per- 
tuibed siochashc Lyapunov functions. Define the perturbed 
Lyapunov function l r „ ~ |.Y ri | J -4 f*. Then, via (4.6), (4.18), 
and cancelling terms whete possible or where they arc dom¬ 
inated by others 

E t Xn -ti " ^ H j ~y A,, Q A n 

bo(^ + ^)|X II | J j i +()( *) (4.20) 

Note, in particular, that the - 2 r X' n E tl i n in (4.18) cancels a 
lenn in (4 6) This is the essential reason for the form selected 
foi /-. Now, using the bound on the right side ol (4.17), we 
see that there is some h \ ■*> 0 such that 

F„V„^ V„ < 4 +-)|A,^ + 0(r 2 ). 

(4.21) 

Next, for h> > 0 define the Lyapunov function V 7I ~ 
A A An/t + |A„| 2 Foi large enough Aj, there is A\t > 0 
such that (4.21), (4 9), and the fact that I - (' is positive 
definite, imply that 

1 A. h\\' ri 4 0(ch n ) (4.22) 

Thus, taking expectations, iterating, and using the condition 

yields 

limsup EV„-()U) (4.23) 

Since (h'ili,,/. )\j/\ = 0(/i m )(|A'„| j + |X„| 2 + I). for small 
e and b n we have 

|X„|’ < 2V„+()(!,„) 

Thus (4.23) implies that 

Inn sup =• O(c). (4.24) 

Now, taking expectations in (4.21), and substituting (4.24) into 
(4.21) yields 

Ihiisup £(|A'„| 2 + j, 2 ,) - (J(e) 

Now, (4.1a) follows from this and the bound in (4.17). 



30 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL, <H>, NO. I, JANUARY 1995 


Part $: To date, we have mainly exploited the time scale 
differences between the {X,,, X „} and {$„./»«} sequences. 
To improve the estimate (4.1a), we need to do a Oner analysis. 
This will require the use of an additional perturbation,_to 
exploit the time scale differences between the X„ and X n 
sequences. We get the estimates in this part and do the Anal 
Lyapunov function analysis in Part 6. 

Define 7n = ‘^X' n fn- By (4.14) and the inequality IW) 
< O(ft) + 0(b) |x| 2 , we get 

1711 = o( j)[|X| 2 + \Y n \ 2 } + iO(b n ). (4.25) 

We next evaluate the conditional difference, +1 - 
We can write 

Enfl+l ~T n = ~2bnX' n Xn 

+ 2E n (X n +l “ hY En + lfn + l 

+ 2x' n f; b m E„ix:'+ i - xm 

£ -2b n X' n Xn + + rl (4.26) 

where the T*, are defined in the obvious way. From (4.14b), 

|Til = 0(6„)(|X| + \X„\)()(- ^)[|X| + IX, I + 0(e)] 

= [\X n \ 2 + |X,| 2 ] + 0(b 2 ). (4.27) 

For k > 0 and q > 0 (which will be small numbers, but not 
going to zero with v or r)), we can write 

|X||X|< jlX| 2 +'~|X| 2 , 

0((> n£ )/|X| < 0(h n e)I -|X„| 2 + - . 

l € tfj 

Using these bounds and the estimates of the sum in (4.16) 
contained in the two right-hand terms of (4.15), we gel 

|T*| = o(^+t n e)(l+J)|X| 2 

+ °(j + !>n<f)^|X| 2 + iO(b„e) ^|X| 2 + ~ . 

(4.28) 

Part 6: We now improve the estimate (4.1a) and get (4. lb, 
c). Define the perturbed Lyapunov function V n = \X fl \' 2 4- f H . 
Then, from (4.7), (4.26H4.28), and cancelling or dominating 
terms where possible, wc get (where k,k\ and q are as small 
as desired, but do not go to zero with n and e) 

E n v n+l -V n < -2 b n x’ n (I - C)X„ 

+ o(f)(|X.| 2 + |X„| 2 ) 

+ o(b 2 n ) + o(%- + b n£ s )( k - + i)|xi 2 

+0^+/; n ^|X| 2 + /0((>, 1 e)^|X' , + ^ ■ 


Note, in particular, that the term -2b„X' n X„ in (4.26) can¬ 
cels a similar term in (4.7). Taking expectations, dominating 
terms where possible, and using lim supf?|X| 2 = 0(e) yields 
for small S > 0 (which goes to zero as n -+ oo and £->0) 

tiV n+l -v„<- h n Ex' n (i - a - bi)x n 

+ 0(b„ + 6„e 3 ) + I0(b n e 2 ). 

For large enough u and small enough e, there is b\ > 0 such 
that 

EV„ +1 -V„ < -b,b n EV„ +0(b 2 +b„£ 3 ) +IO(b n e 2 ). 

Thus 

lim sup EV n = 0(e A ) + IO(e 2 ). 

n 

Now, for small b„ and e 

|X n | 2 <2F„+o(M|XI 2 + ^„). 

Equations (4.1 b)-(4. Ic) follow from (4.1a) and the last two 
equations. 

The last assertion of the theorem (where b n -= ac) also 
follows from the above estimates. 

V. Rates of Convergence 
Limits of Normalized Iterates 

A. Stochastic Differential Equations jor the 
Asymptotic Normalized Errors 

In Theorem 4.1, it was shown that the use of feedback yields 
an asymptotic error of ()(e) for X n , and of (7(e 2 ) or (){e J ) 
for This does not tell the full story. Using these results, 
we will get a finer estimate of the asymptotic variances in this 
subsection. In the following subsection, similar results will 
be obtained for the time varying parameter case. We obtain 
weak convergence or rate of convergence results analogous 
to that assumed in (A2.1). The improvement in behavior 
due to feedback will become even clearer. Define U n = 
Xfi / s/e ~U n = X n /y/e. Define the continuous parameter 
interpolations U € (’) by: U £ (i) = U n for / C [7ie \he + e), 
and similarly define U (■ ). By (3.4a) 

f^i+i = (/ — eQ)Uh + 4 eAU n . (5.1) 

The methods in either (6| or in [8|, [1] can be used to get 
the asymptotic distributions of both U £ {t) and U c (t ) for large 
t. To simplify our work, we adapt the method of (6, Chapter 8, 
Section 3], This requires the addition of two conditions which 
are quite unrestrictive. Recall that if X n = 0 (the limit value), 
then („ — s p„. Let us assume the following. 

(A5,l): The sequence of processes Pi> (pn = 

converges weakly to a Wiener process with covariance 
Ro = E-no E PnPa* as e —► oc. 
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{A5.2): There are n f —♦ 3c as e —* 0 such that en, —> 
and such that (in the sense'of probability) 

1 *>+»>, 

- E - G. 

?=rt 

| n + n, 

— 52 ~* °- 

n e 

Comment on the Assumption: The second expression in 
(A5.2) is implied by (A5.I), bul we keep it to help matching of 
the conditions with those in the reference. In the development 
in [6], the limit of the second sum in (A5.2) can be any 
constant vector. This simply creates a bias and can be used 
here too. (A5.2) is just a law of large numbers. Condition 
(A5.I) is also not very restrictive. It seems preferable to word 
the condition as it is since there is a very large literature on 
convergence of such sums to a Wiener process [6], [2|, |3], 
and the convergence occurs under many different sets of more 
explicit and practical assumptions. 

Theorem 5.1: Assume (A4.l)-(A4.3), (A5.1), (A5.2), and 
let b u —■» 0. Then there are N c —> yj such that IP {cN L + •) 
converges weakly to the stationary solution of 

-QVdt + nl/ 2 dW (5.2) 

where VT( ) is a standard Wiener process. 

If b„ = <\c for small c, then Theorem 4.1 says that 
limsup,, E\X ti \ 2 = 0(e). But the actual situation is much 
better, and an analysis of this case gives additional insight 
into the case where h n —► 0. The constant b„ case is of 
practical importance, since it is the case where there is slow 
“exponential forgetting/ 1 and is similar to the case where we 
average with a window of size ()(\/etx). 

Theorem 5.2: Assume (A4.1 )-(A4.3), (A5.1), (A3.2), and 
let b„ — ae- for small n\ Then there are TV.- rxj such that 
((/ ; (eTV + ■), If 1 (eN r + -)) converges weakly to the stationary 
solution of 

di J = -QUdl + AUdt + ltl/ 2 dW 

dU = alfdl - (vUdi (5.3) 

where W r (-) is a standard vector valued Wiener process. 

Proof of Theorem 5.1 : In view of the stability estimates 
of Theorem 4.1, the proof is essentially that in [6, Chapter 8, 
Section 3], and wc only discuss the minor alterations. Note 
that our notation is different from that in the reference. Let 
b 7l -v 0. The proof in [6] concerns the asymptotics without the 
feedback. But the only difference between that case and ours 
is the eAlJ n term in (5.1). Since limsup^ E\U n \ 2 = ()(e) by 
Theorem 4.1, the added term docs not contribute to the limit. 
The cited proof shows that there is a sequence of real numbers 
t c = cN t —p rx such that lF(t f + ■) => U(-) where U() is 
the stationary solution to (5.1), and /?« is defined as in (A5.1). 

Q.E.D. 

Comment on Theorem 5.1: Loosely speaking, (5.2) implies 
that X n ~ TV((), eTo(Q)). where V () (Q) is the stationary 
covariance of the solution to (5.2) and is 

vow - r 

Ji) 


This means that the stationary covariance oi Li ( < tll , . 

as A increases, being 0(1/A) for large .1. i,t\. lot M -uli ii/„ 
c, as .4 increases the asymptotic covariance oi \, ,| (lU 
This is an immediate benefit of the feedback. Of mui-,. <, )K . 
cannot let A grow without bound if f is th.<! s „ Kl . iiu- 
algorithm will become unstable. We find in priuiiu* I.," > K 
asymptotic results improve until near the point when- 
does become a problem. 

Proof of Theorem 5.2: Again, the proof is a mild 
sion of that in (6 Chapter 8, Section 3). and only a leu 
comments will be made. We can write 

U „+1 = (/ - eQ)V„ 4 4 eAU„. (5.4) 

t^n + i -= (1 -ate)U„ 4 F<yV u , (5.5) 

Using the stability results in Theorem 4.1, the analysis in 
[6] can be readily extended to get that there is a sequence of 
real numbers t t —> exj such that 

(v r (t, + -)M r (i. + -))=>(in-).Tf(-)) 

where the limit process is the stationary solution to (5.3). 

The Order of the Errors in Theorem 5.2 for Small a and 
Large: rv A. We will show that (for the stationary solution) 

E\V(I)\ 2 = ()(\/A), (5.6a) 

E\U(I)\ 2 = ()(o/A). (5.6b) 

Thus, the asymptotic variances of l T (t) is inversely propor¬ 
tional to the effective window of averaging, as for the case of 
Section II without feedback. 

Proof of Claim: Since (7 is positive definite and symmet¬ 
ric, we can diagonalize (5.3). Write G - P~ { DP, P 1 - P\ 
where P is orthonormal and J) is diagonal with positive 
entries. Let <l w denote the smallest diagonal entry in D . Define 
V = P(KV = PU* with coordinate variables l',. etc. Then 

(IV = -(/) 4 Al)Vdl 4 AVdl 4 Pll\f l dW 

dV = tt(V - V)iII. (5.8) 


By It6\s Lemma and using |/?o| = trace Ho 


(It 

<IE\V\ 2 
d1 

dtrrV 

dl 


‘MEVy, - EY*), (5.9a) 

-2AE\V\ 2 - 2EVDV 4 2AEVV 4 


aE\V\ 2 - nV'V - ev'dv 

(5.9b) 

4 A(E\V\ 2 - EV'V). 

(5.9c) 


The deterministic part of the system (5.8) is stable. For the 
stationary solution, we have 

EV,V, = EV*. (5.10a) 

2 AE\V\ 2 4 2 EVDV = 2 AE\V\ 2 4 |W«|. (5.10b) 
aiE\V\ 2 + EV'nV = (xE\V\ 2 . (5.10c) 




In (5.10c), we used (5.10a) to cancel the coefficient of A in 
the right side of (5.9c). Using (5.10a) again and taking lower 
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bounds in (5.10b) and (5.10c) yields 

(d m + o)JE|F| 2 < ai?|V| 2 , (5.11) 

2(A + d m )ElVl 2 <2AEIVI 2 + lfiol (5.12) 

from which we get (5.6). 

B . Tracking Time Varying Parameters 

Feedback and averaging can also be used when the pa¬ 
rameters to be tracked vary with time. Extensive analyses of 
tracking algorithms are in [1), [4]. We view the main practical 
applications of averaging with feedback to be for constant 
parameter systems, and the applications to communications, 
parameter estimation, adaptive noise cancellation and else¬ 
where in signal processing cited earlier. But, for robustness 
considerations it is important to know that the basic behavior 
is continuous under small time variations (small ft below). The 
dependence on ft is seen in (5.16). If ft = o(cr), then (5.16) 
can be improved. 

Let b n = ae, and let 0„ denote the value of the parameter 
(replacing 6) at time n. For ft a real number, suppose that the 
parameter evolves according to 


4 1 ““ “1“ ftc W n . 

Here, {w,,} is a driving sequence of bounded random variables 
with mean zero, independent of {0 n , 0 7J } and satisfying 

jr E„w m < K 0 (A5.3) 

for some Ko < oo and all n, a condition analogous to (A4.1). 
Note that we do not require {w n } to be mutually independent. 
We will also use the conditions [the analogs of (A5.1), (A5.2)| 
(A5.4): The sequence of processes w « con ‘ 

verges weakly to a Wiener process with covariance matrix 
i?i - E-lx ^ti; n iu ( V 

(A55): There is a sequence of real numbers n £ —* oo as 
£ 0 such that —•> 0 and (in the sense of probability) 





The Evolution Equation: Redefine A\, = 0„ - 0„. A' n = 
0 n - 0 n , where 

0„ — (1 - (ye)0n + w0 n . 

Then (the analog of (3.4a)) 

^n+i = {l ~ ^0)X n -f 4 . eA(X n — \ n ) + (0 n — 0 n +i) 

where is defined as in Section III, and 

A ,i + i = (1 — fVf )3T„ + i*eX„ + (0 n — 0 n+ 1). 

The entire analysis for the non time-varying case can be 
carried over to the present case with only trivial changes. By 
a proof almost identical to that of Theorem 4.1, wc have the 
following result. 


Theorem 5.3: Assume (A4.1HA4.3) and (A5.1HA5.3). 
Then for small f) and a 


limstip E|A' n | 2 = 0(e). 

n 

limsupiJ|A’ fl | 2 = 0(e). 

n 

With Theorem 5.3 in hand, precise analogs of Theorem 
5.2 can be obtained by exactly the same analysis, and we 
will only_write the appropriate formulation. Redefine U„ = 

X n /y/e,U n = X„ye. Then 

U„+i = (/ - eQ)U„ + \/e£,„ + eAU n - y/e(iw„, (5.13) 
Tf n+i = (1 - ne)U n + enU„ - \feftw n . (5.14) 

Then the analysis which led to Theorem 5.2 and the order 
estimates which followed yield the following. 

Theorem 5.4: Assume (A4.1MA4.3) and (A5.1HA5.5). 
Then, for small fi and rv. there arc A', -* no such that 
(U e (eN F + (eN F + ■)) converges weakly to the stationary 
solution of 

dU = -QUdt + AU(l)di + Iil ,2 dW + fiR\ / ' 1 d\V\ 
dU = aUdt - vtTJdt + (Ul\ /2 dWi (5.15) 

where W(■) and Wj (■) are mutually independent vector valued 
Wiener processes. If fi - ()(<v). then (for the stationary 
solution) 


E\U(t){ 2 - O 
E\U(t)\ 2 = () 


A/i’ 2 

+ 0(\/A). 

(5.16a) 

/ Afi 1 > 

V J 

| +O(o/A). 

(5.16b) 


VI. Simulation Daia 

Numerous simulations have supported the theoretical con¬ 
clusions. and the following tables are typical of the runs 
taken. The simulations were made with a five-dimensional 
system, where the five-dimensional vector 0„ was computed 
as </>„ = (0n, 0n-ij 07 i -21 0n-3? 4>n- 4 ), where the sequence of 
random variables 0„ is obtained from the dynamical equation 
0 7 ,+i - 0 7i /2 + Cn where C„ is a sequence of mutually 
independent Gaussian random variables with mean zero and 
variance 1.0. The parameter 0 is fixed at 0 = (4.0. -4.2, 3.0, 
2.7, -3.0). The standard deviation of the observation noise V\i 
was 6.0. The noise level was chosen high enough to allow us 
to distinguish the general differences among the algorithms, 
since for small noise levels all the algorithms worked well. 
If one wishes to distinguish between the small errors at low 
observation noise levels, the nonfeedback algorithm is often 
preferable. Asymptotically, the best values of e are very 
small and decrease as the run size increases. But for run 
lengths of fixed finite size, the performance will degenerate 
if e becomes too small. Generally, we have tried to compare 
the performance of the algorithms with and without feedback 
using the parameters at which each works well. The actual 
averaged value that was used in the (both feedback and 
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TABLE I 

■ = .05, A 3 = 5, Window = 150. 


case_parameter# 


II 

1 

2 

3 

4 

5 

no FB 

E9 

1.133 

1.212 

d m 

1.251 

1.014 

no FB 

a 

.230 

.297 

.257 

.370 

.207 


E9 

.275 

.238 

.216 

.278 

.255 

l_FB_ 

X. 

.065 

.037 

.019 

.086 

.060 


TABLE n 

= .05, A = 22, Window — 150 

case_parameter# 



1 

2 

3 

1 4 

5 

no FB 

El 

1.133 

1.212 

1.173 

1.251 

1.014 

no FB 

x. 

.230 

.297 

.257 

.370 

.207 

FB 

KM 

.206 

.191 

.217 

.299 

.325 

FB 

li 

.045 

.014 

.020 

.083 

.095 


TABLE 111 

‘ = A, A = 2.o. Winijow - 1 S(> 

case _parameter# 


L 

1 

2 

3 

4 


no FB 

E9 


7.633 

8.070 

5.780 

~<i .(‘If 

no FB 

H I 

1.126 

.968 

.949 

.757 

ft: i 

FB 

E9 

2.266 

2.820 

3.334 

2.974 

"4409 

FB 

X n 

.327 

.146 

.202 

.214 

.142 ! 


TABLE IV 

= 02, 1 = 95, Winixw - 100 
case_parameter# 


i _ 

1 

2 

3 

4 

5 

no FB 

E9 

0.350 

0.441 

0.385 

0.489 

0.350 

no FB 

Xn 

.086 

.125 

.085 

.194 

.088 

FB 

E9 

.061 

.105 

.131 

.067 

.096 

FB 

X n 

.013 

.064 

.090 

.027 

.059 


nonfeedback) algorithm was 


A 


_ J 
window 


t 

n -window 


The averaging slarted al ileration 50, using a growing win¬ 
dow ot averaging until the desired window size was reached, 
alter which we averaged only over the desired window si/e. 
'The computation of the mean square errors commenced al the 
300th iteration so as to minimize the effect of the transient 
period. The tables list the sample mean square errors at 5000 
iterations; namely 


1 

5000 - 300 


5000 

E w. -*f 


where 0' is the /lh component of 0. The sum is quite stable 
and the values were essentially the same after 1 000 iterations. 
Hence, they can be taken for good approximations to the actual 
mean square errors. Generally, the algorithm with feedback 
is not as sensitive to the value of e and can work well at 
somewhat larger values of e, although c cannot be made too 
large without causing stability problems. This ability lo work 
well with somewhat larger e is a considerable advantage, in 
that it yields a shorter transient period. 

In all cases, both with and without feedback, it can be 
seen that averaging yields substantial improvements. This 
was true even when the original iterates A\, were poor, as 
in Table 111 (and also Tables I, II, V, VI, to some extent) 
for the algorithm without feedback. Clearly, as expected, the 
performance improves as the window of averaging increases. 
Refer to Table III, where e and As arc too large for good 
performance. But even there, with the poor values given by 
the A rl , averaging was a distinct advantage and more so for 
the feedback case. 

If the observation noise level is not too small, then the 
averaged iterate ~X n for the feedback case is generally better 
than that for the nonfeedback case and is often much better as 
seen from the tables. For such a case, the errors X u are also 
generally smaller in the feedback case. This is important if 


TABLE v 

- = .1)5, .4 = 5, Window = 300 


case parameter# 


1 

1 

2 

3 

4 

5 

no FB 

E9 

1.133 

1.212 

1.173 

1.251 

1.014 

no FB 


.124 

.131 

.096 

.225 

.110 

\msm 

E9 

.274 

.229 

mmm 

.239 

.242 

IE1 


.049 

WiTTl 



■a 


TABLE VI 

= 05. \ = 22. Window — 100 


case _parameter# 


ii . _ 

1 

2 

3 

4 

5 

no FB 

BM 

1.133 

1.212 

1.173 

1,251 

1.014 

no FB 


.124 

.131 

.096 

.225 

.110 

FB 

E9 

.246 

.197 

.329 

.243 

.320 

FB 


.064 

.007 

.114 

.010 

.070 


one wishes a good “on-line’ 1 algorithm. Indeed, in some cases 
the errors X n with feedback arc as good as the errors A\, 
without feedback. While the last word has not been said on 
the general family of algorithms considered and more testing 
and comparison needs to be done (as is usual for a new idea), 
it is clear that the use of feedback is quite promising and 
has many advantages, particularly for an algorithm where one 
wants a good estimate “on line. 11 

Wc also note that the use of feedback makes the behavior of 
the X fl less sensitive to the value of G\ as seen from the forms 
of the stochastic differential equations derived in Section V. 


Appendix 

Details of Estimates in Theorem 4.1 
Proof of (4.J4) By definition, for m > ri 


tn-l 

X” - (/ - cQr-X„ + £>(/- W" '~'C- (A.la) 
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We will also use the representation [(A la), but starting at 
time a + 1] 


*£ = (/-eQ ) m - ,, - 1 * n " +1 + £ e(J - eQr—'G 

tasn-f 1 

(Alb) 


By (A la) 


OO 

ti='Eb m (i-?Qr-"x n 

m—n 

oo w-1 

+ EJW/-eQr , ‘ 1 ^ 

m =7i i=s7i 

(A 2) 


Using the fact that £^_ n (7 - cQ)" 1 = 0(l/e), the first 
term on the right side of (A 2) is 0(b n /e) |X„| Using 
(A4.1MA4 3) and the definition (4 10) to evaluate the second 
term on the right side of (A 2) yields 0(/)„)(|An| + |X n |) + 
IO(b„) Thus (4 14) holds 

Pioofof(4 15) Using (A4 1)—(A4 3) and the representa¬ 
tion for A'£, from (A 1b) and for A"", +1 from (A la) (with ?i +1 
replacing n there), the component of the sum in (4 16) which 
is due to the "propagation of the initial conditions” (rather 
than due to the noise sequences m > ii + 1}) can 

be seen to be 

U, f; b rn (I-eQr - l A, i+1 

m — u f 1 
nc 

- Y, b m (i-sQr " ’ a ; v , 

m JS7i-f 1 

Since 

A„+i = (/ - sQ)X„ + et„ + b„C( Y„ - Y„) 

= a;; m + 6 n r(X„ - x n ) 

the above difference equals 

52 b m (I - cQ) m - n - l b„C(X„ - X H ) (A 3) 
which equals 

o(^)(|X«| + |T"|) (A 4) 

The component of the sum in (A 4) due to the “noise” 

(l !, +1 m > n + 1} is 

<x m -1 

52 12 b ^ r - eQ) m -'~ l En (C +1 - Cl 

m — n +1 i ss?i -f 1 

Noting that £" +l - £" equals 4»,(A M +1 - A n ), and using 
(A4 1MA4 3), we get the order [recall the definition of I 
below (4 14b)] 

()(b n e + (>?,)(|A,,| + |A„|) + T()(b n e) (A 5) 

(A 4) and (A 5) yield the right side of (4 15) 


Rgferencfs 

[Ij A Benvemste M Metivier and P Pnouret Adapti\e Algorithms and 
Stochastic Approximation New York Spnnger Verlag 1990 

[2] P Billingsley Convergence of Piohabiltty Measures New York Wi 
ley 1968 

|3] S N Ethier and T G Kurtz Maiko\ Processes Chat attenuation and 
Conva genet New York Wiley 1986 

[4] L Guo L Ljung and P Pnouret Tracking performance analyses of 
the forgetting factor RLS algonthm ‘ in Proc 1/itConf Deus Conti 
Tucson AZ 1992 pp 688-693 

|3] S Haykin Adaptne Film Ihtoiy Englewood Cliffs NJ Prtnhce 
Hall 1990 

|61 H J Kushner Approximation and Weak Comeigence Method? jot Ran 
dom Piotesxes with Applications to Stochastic System 7 heoi\ Cam 
bridge MA MIT Press 1984 

17) H J Kushner and D S Clark Stochastic Appioximation/oi Const/aincd 
and Unconstiained Systems New York Spnnger Verlag 1978 

[81 H J Kushner and A Shwartz Weak convergence and asymptotic 
properties of adaptive filters with constant gains Ihl F Turns Infoim 
Theoiy vol IT 30 pp 177-182 1984 

[9] H J Kushner and J Yang, Stochastic approximation with averaging 
Optimal asymptotic rates ot convergence lor general processes SIAM 
J Conti Optim vol 31 pp 1043-1062 1993 
|I0| H J Kushnei and Ci Yin Asymptotic properties ol distributed and 
communicating stochastic approximation algonthms SIAM / Conti 
Optim vol 23 pp 1266-1290 1987 
11II B T Polyak and A B Juditsky Acceleration of stochastic approx 
imation by averaging SIAM / Conti Optim vol 30 pp 838-833 
1992 

1121 D Ruppcrt Fffitienl estimators Horn i slowly convergent Robbins 
Munro process School of Operations Research and Induslinl 1 ngi 
neering Cornell Umv lech Rep 787 1988 
113] B Windrow and S D Steams Adaptiu Signal Pun i\\m\ Englewood 
Cliffs NJ Prentice Hall 198S 

II4J G Yin On extensions ol Polyak s aveiaging ipproach to stochastic 
approximation Stochastics vol 36 pp 243-264 1992 
113] G Yin Stochastic Appioximation Via A\a aging Polsak s Appiotuh 
Reusiud (Lecture Notes in Fcononucs ind Mathematics! Systems) 
Berlin Spnnger Verlag vol 374 1992 pp 119-134 


Harold Kushner (S 34-A 36-M 30 SM 73 1 74) 
received the BSEF degree hom Cily College 
ol New York NY in 1933 and the PhD degree 
in eleclncd engineering 1mm the University ol 
Wisconsin Madison in 1938 
He is Director of the Lcfschetz Center toi Dynam 
leal Systems Brown University Providence RI His 
current research interests include large deviations 
wide bandwidth systems wcik convergence nieth 
ods systems approximations and limit theorems 
singularly perturbed systems and he ivy traffic ap 
proximattons 

Dr Kushner has won the IEEE Field Award in Control Systems and the 
Franklin Institute Louis F Levy medal He has wnttcn seven books and over 
130 papers on virtually all aspects of stochastic systems 




Jichuan Yang was bom in Sicbutn China in 
1963 He received the B S degree in mathemat 
ics from Qinghua University PR China in 1983 
and the Ph D degiec in applied mathtmatics from 
Brown University Providence RI m 1991 
Dr Yang is currently working in a postdoctoral 
position in the Division of Applied Mathematics 
Brown University His cui rent research interests 
include numerical methods in stochastic control 
optimization stochastic approximation algorithms 
and financial denvative securities pricing models 



I EEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. I. JANUARY l»5 


Exponential Stabilization of 
Nonholonomic Chained Systems 

O. J. S0rdalen, Member , IEEE , and O. Egeland, Member, IEEE 


Abstract —This paper presents a feedback control scheme for 
the stabilization of two-input, driftless, chained nonholonomic 
systems, also called chained form. These systems are control¬ 
lable but not asymptotically stabilizable by a smooth static-state 
feedback control law. In addition, exponential stability cannot be 
obtained with a smooth, time-varying feedback control law. Here, 
global, asymptotical stability with exponential convergence is 
achieved about any desired configuration by using a nonsmooth, 
time-varying feedback control law. The control law depends, in 
addition to the state and time, on a function which is constant 
except at predefined instants of time where the function is 
recomputed as a nonsmooth function of the state. The inputs 
are differentiable with respect to time and tend exponentially 
toward zero. For use in the analysis, a lemma on the exponential 
convergence of a stable time-varying nonlinear system perturbed 
by an exponentially decaying signal is presented. Simulation 
results are also shown. 

I. Introduction 

N ONHOLONOMIC chained systems can be used to rep¬ 
resent a large class of mechanical systems. Important 
and well-known examples are unicycles, four-wheeled cars 
and ^-trailer systems. Accordingly, control schemes for non¬ 
holonomic chained systems has a potential of being applied 
to a large number of mechanical systems. The problem of 
designing stabilizing feedback controllers for nonholonomic 
chained systems is a challenging one since the system is 
not stabilizable by a smooth static-state feedback law [4]. 
Moreover, the problem of exponential stabilization, which 
is the topic of this paper, cannot be solved by any smooth 
feedback law [10]. 

The nonholonomic chained system considered in this paper 
is the chained form [26] 

£ j = ^1 
j'2 ~ U 2 

Xi = x,- jU], i6 {3, 

A constructive procedure to transform a nonholonomic system 
with two inputs into this chained form was presented by [25] 
under certain conditions on the input vectors. Necessary and 
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sufficient conditions for convening a nonholonomic system 
into chained form were derived in [23] based on the theory 
of exterior differential systems using the Goursat normal form 
theorem. One of the results from this work is that all two-input, 
regular nonholonomic systems in Lhree and four dimensions 
arc locally feedback equivalent to a nonholonomic chained 
form. In [33], the kinematic model of a car with n trailers was 
converted into chained form. This conversion was global in the 
position of the system and local in the orientations. Another 
conversion into chained form was proposed in the framework 
of exterior differential systems in [35] based on [33], [23], and 
[30]. This conversion of the kinematics of the n-trailer system 
allowed any orientation of the last trailer. Control strategies for 
nonholonomic chained systems can, therefore, be used for the 
control of a broad class of nonholonomic, mechanical systems. 

The control of chained form and of more general non¬ 
holonomic systems is a very active field of research. The 
problem of nonholonomic motion planning was introduced 
by [17] who proved that a car-likc robot with one non¬ 
holonomic constraint is controllable. Open-loop planners for 
low-dimensional mobile robots have been proposed in [18], 
[ 1 ], [ 19], Because of the invertible transformation in [33], these 
planners can also be used to plan a path for low-dimensional 
chained systems. Other open-loop strategies have explored 
control theoretic approaches using differential geometry tools 
lo control nonholonomic systems. Sinusoids were proposed by 
[24] to steer in open-loop nonholonomic systems on a special 
canonical form including chained form. A generalization of 
the use of sinusoidal inputs to generate motion at a given level 
of Lie brackets of the input vectors was given in [16], [15J, 
[12], [22] for nilpotent and nilpoteniizable systems using an 
extended system wilh additional input vectors corresponding 
to higher order Lie brackets of the original system. Since the 
nonholonomic chained system is nilpotent, these open-loop 
strategies can also be used to steer such a system in open loop. 

To make the control more robust with respect to distur¬ 
bances and errors in the initial condition, some stabilizing 
closed-loop approaches have also been proposed. Character¬ 
istic for the nonholonomic systems encountered in robotics 
is that they cannot be stabilized by a smooth static-state 
feedback control law which has been shown with Brockett's 
theorem [4]. This has been further discussed by [2] and [8] 
for nonholonomic mechanical systems. Therefore, a discon¬ 
tinuous feedback control law was proposed by [5] to make 
the kinematic model of a three-dimensional robot globally, 
exponentially converge to a given configuration. This system 
is equivalent to a three-dimensional chained form. Another 
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discontinuous feedback approach was proposed by (3] for 
the same system with acceleration inputs instead of velocity 
inputs. This approach makes the system reach the origin in 
finite time in the case of no disturbances, 

The use of time-varying feedback control laws is another 
approach to stabilize nonholonomic systems about a constant 
configuration. This approach was first studied by 130] for the 
stabilization of a cart. This approach was further developed in 
£291 for a car-like mobile robot with a steering wheel. This 
system is locally equivalent to a four-dimensional chained 
forth. Constructive approaches were presented in [31], [28]. 
The existence of stabilizing time-varying feedback control 
laws for more general nonholonomic systems was studied in 
{6J. The design methods in [28] were extended in [7] to the 
more general situation given in 16]. An algorithm for comput¬ 
ing time-periodic feedback solutions for nonholonomic motion 
planning was presented in [10], This approach considered the 
extended system using Lie bracket completion vectors such 
as in the open-loop strategy of [16]. This feedback algorithm 
was based on multi-scaling averaging techniques and highly 
oscillatory inputs. These time-dependent approaches also work 
for chained form. Another time-varying smooth feedback 
control law to stabilize chained form was derived in [34] for 
power forms based on the work on the use of sinusoids. 

These smooth time-varying feedback control laws yield 
asymptotic stability but not exponential since time-periodic 
smooth feedback cannot be exponentially stabilizing [9], [10]. 
To improve the rate of convergence, a feedback control law 
was proposed by 127] to obtain local exponential convergence 
to a neighborhood of the origin for a three-dimensional chained 
form. The asymptotic behavior was obtained by letting the 
control law be time varying. The exponential convergence to 
the neighborhood was obtained by letting the control law be 
nonsmooth at the origin. No feedback control law, however, 
has been presented in previous work that exponentially stabi¬ 
lizes a nonholonomic chained form of an arbitrary dimension 
about any constant configuration. 

In this paper, the problem of exponential stabilization is 
addressed and a new feedback approach is proposed for 
chained form. The stabilization is achieved by letting the state 
feedback control law depend on time and on a function which 
is nonsmooth with respect to the state at discrete instants of 
time. The proposed feedback control law globally stabilizes the 
system about the origin with exponential convergence. Asymp¬ 
totical stability about any desired configuration is obtained 
by using a coordinate transformation. The resulting closed- 
loop system is not exponentially stable as defined by [14, p, 
168], but it is shown to have a property which will be termed 
^-exponential stability. 

The paper is organized as follows: Hie concept of K- 
cxponential stability is proposed in Section II. A lemma on the 
exponential convergence of a nonlinear time-varying system 
which is perturbed by an exponentially decaying signal is 
given in Section III, The nonholonomic chained system is 
presented in Section IV, The control law is presented in 
Section V. The convergence of a part of the system is analyzed 
in Section VI. The stability of the total system is analyzed 
in Section VII, In Section VIII, a coordinate transformation 


is presented such that a control law to control the chained 
form to the zero configuration can be used for the control to 
any desired configuration. The stabilizing feedback strategy 
is illustrated by a simulation example in Section IX, The 
conclusions are given in Section X. 

II. ^-Exponential Stability 

In this section, the concept of /C-exponential stability is 
introduced. It will be shown in later sections that the con¬ 
troller proposed in this paper makes the chained system 
/C-exponential stable. First, we need the following notion [11, 
Definition 2.5J, 

Definition 1: A continuous function a: /?+ —> i? + is said to 
be of class /C (or belong to class K) if it is strictly increasing 
and r*(0) - 0. 

One property of class JC functions is that, [14] and [111 
{■* 1 , 0:2 € class/C =* fVion^S: class K. (1) 

We then define the following. 

Definition 2: Consider the nonlinear, time-varying system 

r = /(j, t) .r € D C R'\ t > t 0 . (2) 

System (2) is /C-exponentially stable about x p iff there exist 
a neighborhood c D about s In a positive constant A, and 
a function /*,(■) of class /C such that all solutions jc(t) of (2) 
satisfy 

V.r(/„) € H r V/ > t 0 . IWO-rJ < /,(||.r(/„) - .r,,||), 

(3) 

where the constant A and the neighborhood are independent 
of to. and || * || denotes a norm in R n . 

If (3) is satisfied for il p = D, then system (2) is globally, K- 
exponentially stable about According to this definition, if 
system (2) is /C-exponentially stable at J' p , then it is uniformly 
asymptotically stable as defined by e.g., [14, 4.3], and in 
addition it has an exponential rate of convergence. 

The concept of /C-exponential stability is called exponential 
stability in [21]. The term exponential stability is however 
often used for the special case where the function /i( -) is 
linear, [11] and [14|, i.e., 

/i(||t(/ 0 ) - j>||) = - j>II- 

Here, r is a positive constant independent of to and x(t 0 ). This 
means that /C-exponential stability corresponds to a weaker 
form of stability than the usual concept of exponential stability, 
^-exponential stability and exponential stability are equal, 
however, with respect to the rate of convergence. Therefore, 
the notion “exponential stabilization 1 ’ is used in the title of this 
paper; although, only /C-exponential stability will be proved. 

Locally, a similar definition of exponential stability was in¬ 
troduced in [13] in terms of homogeneous norms. This concept 
has been used in [20] to investigate the convergence rates 
for controllers for low dimensional nonholonomic systems in 
so-called power form which is equivalent to chained form, 
[34]. 
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111. A Lemma on Exponential Convergence 

The following lemma is useful for establishing exponential 
onvergence for a class of time-varying systems. It will be 
ised in the convergence analysis of the control law. 

Lemma I: Consider the nonlinear, one-dimensional, lime- 
varying system 

j =-a(x,t).r + d(.r,t). t > / 0 , € R (4) 


under the following assumptions: 

• There exists a solution x(i) of (4) for any ,r(/ 0 ) and any 
l > / 0 ; see Remark 2 below. 

• a(.r.t ) has the property that for all x(i) 



- A )(It 


< P, 


Vf > t 0 


(5) 


where A, and P are positive constants. 

• The signal d(.r,t) is bounded for any t > 1 0 and any 

■r (0 by 


|r/(.r(f), /)| < D( ( 6 ) 


In the case that A = 7 we find 

k(#)| < « P k^o)k' A, ‘" 10 ’ + J>t ir (l - /„)< ' 

In the case that A / 7 we find 

k( 0 | < f p |j-(f«)|< + 

A - 7 

Since 

1 - (< ft, V/i > 0 

then 

k(OI<'*'’k('o)|r- X(, - , " , + /A 2|, (/-/ 0 ) < “C ( 10 ) 

for all A > 0 and 7 > 0 where o - min {A. 7 }. From (10) 
we have that 

MO| < (/.(f-/o) + rr), 

= (/>(/ — f„) H- rr)» ~ (n) 

where 


for some positive constants I) and 7 . 

Then 

Vt > 0, |j(0|<r(|j(/o)| + D)r- t '* )( '-' u) 


where 

(\ “ min{A, 7 [ > 0 , c — max 
Proof We denote 

F(l) = I a(i (t) s t)(It 




where r(1) is a solution of (4). Multiplying (4) with r 1 (t) 
gives 


j t W) f r(f) ) = rf(i-(/)./v r(, \ 


This implies 

■>(0< F(t) 


-•I (*o) + / f F(r, rf(.i(r) i T)f/T. 
■//o 


Dividing by then gives the following (implicit) expres¬ 
sion for .r(/) 

.r(/) = r f (r) ^.r(/o) + f 7 (r) fi(.r(r) t T)f/T^. (7) 

Property (5) implies that 

!*\0-A(f-/o)| <P (8) 

which is equivalent to 

-P + A(/ - /(,) < F(t) < F + A(/ - / 0 ). (9) 

By using (7), (9), and ( 6 ) we get 

|x(f)| < ,u) ^k(io)| + D jf 


P = Dt lV . rr = f /> |r(k()| (12) 

since < '») < i | or i > / () By comparison we find 

that 

(l>1+ ct)(-' < i V/>() (13) 


if 

I — < >' , ft > ecr 
s = S ff 

( rr. /? < 

Since 

<i 


when f) > evr, (13) will also be satisfied with the following 
choice of t, 


C = 



ft > rn- 
p < rcr 


(14) 


Then, from (11), (12), (13) and (14) we get that 

|r(f)| < ^ 

< («-)(!-*«) 

< Kk(/o)| H 

where n = min{ A. 7 } and r = max^ r . □ 

This lemma implies that a solution ,r(t) of (4) converges 
exponentially to zero if u(sj) and r/(./, /) have the properties 
(5) and ( 6 ). 

Remark I. By choosing c = a we see that 

f v lP 

max |.r(/)| < (|.r(£o)| + D) max < v p ,- 

f>fn ( r> 

where ** - min{A, 7 } > 0 . 

Remark 2; If a(xJ) and d(r, t) arc continuous in .r and /, 
then there exists at least one solution of (4), [21, Theorem 2.3J. 
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IV. The System 

The nonholonomic chained system considered in this paper 
is the so-called chained form introduced in [ 26 ] 

ii = ui 
i?2 — W 2 
£3 = X2U\ 

X n = Xn— 1 ^ 1 * ( 15 ) 

The input vector fields are 

gi(x) = [1,0,12,3:3, ••■.Zn-i] 7 ’, 

4/2(2) = [0,1,0,0, • ■ •, 0] r 

where x = [xi,X 2 , ■ •• , x n ] T . Repeated Lie brackets can be 
defined recursively by 

ri gl g 2 = 92, ri' gl g 2 = [ffi.ad*; 1 ^]. 

It can easily be shown that for system (15) 

rig~ 2 92 = (—l) J ej, j € {2,3, ••■,«} 

where e } = [6i J ,02 j. -, 'i0 n,/] T where b tJ is the Kronecker 
delta. Since 

Vx e R n , span{ji,ad"^, • • •, ri^ 2 g 2 \{x) = R n 

the chained form (15) is completely controllable and the degree 
of nonholonomy is n - 1. In spite of the controllability, (15) 
cannot be stabilized by a smooth static-state feedback control 
law as stated by Brockets Theorem [4]. From f 10] we know 
that to exponentially stabilize (15), the controller cannot be 
smooth, even if it is time dependent. 

V. The Control Law 

A. Outline of the Control Law and Analysis 

In this section, we will propose a control law to globally, 
/C-exponentially stabilize the nonholonomic chained system 
(15) about the origin. Since there is no smooth static-state 
feedback control law which can stabilize (15), we let the 
feedback control law be time dependent, as first proposed by 
[32] for a nonholonomic cart. The main idea of the control 
scheme to be presented is based on the observation that if u\ 
is a function of time only, then the state variables 

z = [x 2 ,---,x n ] 3 (16) 

satisfy a linear time-varying state space model with input U 2 . 
This linear time-varying system with input U 2 and state z can 
be made /C-exponentially stable using feedback from z, which 
will be shown in the following. The problem which remains 
to be solved is how to achieve /C-exponentially stability when 
Xi is included into the state vector, that is, when the complete 
system with state vector x and inputs ttj and U 2 is considered. 
The solution proposed in this paper is as follows: Define 


a sequence hr ■ ■) where the equidistant terms are 
defined by 

t, = iT (17) 

where T is a strictly positive constant. Then, the control u\ 
is given by 7/1 = ui(t } x{t t )),t £ [tn^+i)i that is, u\ is a 
function of time and not of the state whenever t, < t < i.+i. 
By appropriately selecting ui(t,x(t t )), it is then possible to 
achieve exponential convergence in x\(t) and, in addition, 
by patching together the solutions for z(t ), /C-exponentially 
stability for the complete system can be established. To obtain 
exponential convergence, the time-varying feedback control 
law is nonsmooth in the origin with respect to the state x(t t ). 

In this section the control laws are presented without further 
explanation. Then, in the following sections the stability of 
the closed-loop system is analyzed, and it is shown that the 
proposed control laws have the desired properties. The analysis 
is done in two steps: First the stability of the linear system with 
state vector z and input t/ 2 is analyzed. To this end a vector 
z d (x,f): R n -1- R+ —> R n ~ x is defined, and it is shown that 
for the proposed control law the vector £(j-,/) = z - z d (x,t) 
converges exponentially to zero. Then, this result is used to 
prove exponential convergence of z(t). At this point the state 
xi is included, and /C-exponentially stability of the complete 
system with state vector x is established. 

Concerning the notation, throughout the paper the one- 
norm will be used, i.e., the norm of an n dimensional vector 
X = [x l ,---,x„] T is given by 

lkll = £k,|. ( 18 ) 

J = 1 

Furthermore, the index 1 will refer to terms in the time 
sequence (/. 0 , f 1 , ■ ■ •)* 

B The Control Law for 

Let fc( ) be a bounded function 

k: R n —► R (19) 

which satisfies for a strictly positive constant K 

Vx e fl r, |fc(:r)| < K y Vx e R n /{i)}k(x) ± 0, k{ 0) = 0. 

( 20 ) 

The idea is now to let ui be given by a time-varying function 
multiplied with the function k(x(L)) such that the part of the 
system which is represented by the state variables z with input 
is linear and time-varying in the time interval (/,,/f+i). 
The input u 2 is then used to make [ar 2 (f)< ■ ■ * ,x n {t)] T expo¬ 
nentially converge to zero. The sign and magnitude of fc(j?(/,)) 
will be chosen such that x\(t) also converges exponentially to 
zero, and the inputs remain bounded. 

We introduce a time-varying function /: i?+ —► R which 
has the following properties; 

PI) /(t)€C°°[*o,+oo) 

P2) 0 < f(t) < l,Vt> $o 
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P3) /(*,) = 0,i, € 

P4) for any j € (3, ■ ■ ■; n}, there are positive constants 
rjj and Pj such that 


\ft p e {to,ft.■■■}, vt > t p 


jy 


-3 


(r) - rjj](h <Pj. 


A function satisfying conditions Pl)-P4) is 

/(0 = (1 - cosw/)/2, u >=y (21) 

where T = /,+1 - f, is a constant, (17). Incidentally, the 
function /(/) is not restricted to be time-periodic to satisfy 
Pl)-P4). 

The control law for iii is defined by 

«i =fc(x(t,))/(f), t €[/„/ l+ i) (22) 

where 

A-(.r) - sat(— [.r! + sKu(.r I )f;(||r||)]/), K). (23) 


Here 


and 


sat(r/, k) = 


J « 

l A sgn(ry), 


l</l < K \ 
l‘/l > K / 


(K 


= M 


/* = 


Ji f ; fl /(r),/r 


(24) 

(23) 


where /r is a positive conslanl and sgn(.i*i) is defined as 


sgn(./-L 



./■i > 0 

■r-i < 0 ' 


All the norms are one-norms as defined in (18), The saturation 
function is used to guarantee global stability. The idea of using 
saturation functions was also used in 134]. 

Note that due to PI )-P3) and the bound on A*(*), ihe control 
ii] is bounded and continuous with respect to time. 


C. The Control Law for 112 

With the input iij given by (22), we get form (15) and (16) 
that z is given by 


where X v j e {2, are any positive coim.ui i, 

dentally, we see from this definition of anJ !'! i (lVl 
9jm(t) is smooth 

The definition of <fjm (0 from (27)—(30) can Ik* ji, 

the following diagram where a ~+ b means that l> d.*(K m) „■«., 

92,3 ,92, n—2 . 92 , 11-1 

\ \ t \ T \ 1 

\ i \ i \ i 

f/n-.5,n-2 i — 

\ 1 \ 1 

,V?i —2,r» —J ( /ri- 2 ,m 

\ I 

On -1,« • 

The control law for is defined for t £ [/„/,+. i) by 
_ fr(fc(3-(t,))./) 7 2. 

“ 2 - \o, m, o m) 

where 

s = [j- 2 ,- -.J-r.] T € It'‘ \ 

Y(k.l) = [V 2 (k,t),---Sn(kJ )] 1 €ir~ l 

and for j £ {3, ■ ■ •. n} 
r2(^^ 0 = -^2 + f*g 2,a 
ry (A-, /) = /( A 2 /c/2j + 2/*/2j + /.^ f + 

The smooth functions </ 2/ (/) are given by (27M30). The 
function A;(.t(/,)) is given by (23). 

The feedback control law (22) and (31) depends on A(-) 
which is a nonsmooth function of the state .r(/,). One might, 
therefore, raise the question about the existence and uniqueness 
of the solution j;(/) of (15). Since in each time interval 
[/,,/,+x) the closed-loop system (15) with the control law 
(22) and (31) becomes a linear, time-varying system, global 
existence and uniqueness of r(t) can be shown in any time 
interval [/,, f,+i) for any by applying e.g., Theorem 2,3 
in [14]. The time intervals can then be patched together to 
show existence and uniqueness of r(l) for all / £ /?+ and for 
all initial conditions s(i o). 


.r 2 = 1/2 

h = A'(.r(/|))/(/).f 2 

i T , = fc(.r(/,))/(*)./•„- i- (26) 

In this subsection, we derive a feedback control law for <i 2 
to make z(0 = [j' 2 (0>■ ■ *<^n(0] T globally, exponentially 


converge to zero. 

Define g )m : /?+ —> U for j,m £ {2, ■ ■ ,r?} by 

g n ~ i, n = — A ri (27) 

9j-i,m(0 = -9, m [A,/ 2j - 2 (() + 2(j - 1 )/(*)] 

+ /(0[ff,m(0 + ff|,m+l(0/(0J (28) 

— ~Xj + / 2 (0.9j,7+i(0 (29) 

9 Jf) = 0 if p < j or p = n + 1 (30) 


VI. Convergence Analysis oh z(() 

In this section, it is shown that the control law (22) and (31) 
makes z(t ) (16) globally exponentially converge lo zero. 

To this end, define the functions z' 1 = [rfj, • ■ ■ ,J' d ] T : R" x 
Il + —» ft" -3 as follows for t € [(,,t 1+ j) 

J d n = 0 (32) 

■rfU’O 

A / 2 (0 5vm=j + l fljm(0 Jt"'- j(x(t,))‘ rm ‘ ^ 0 

\o, 3(/,) = 0 

(33) 

Note from this definition of j '2 that the control law for u 2 
(31) satisfies 


u 2 = -X 2 (t 2 - x 2 ) + ± d 


(34) 
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whenever x(t x ) / 0. This lelation will be used in the 
convergence analysis. 

By assuming that the control laws yield a continuous 
solution x(t) of (15), we see from the definition (33) of 
xf(x,t), PI), and P3) that x d (x(t),t) is continuous for all 
t > 0 and x d (x(t t ), t t ) = 0. In addition, z d has the properties 
shown in the following lemma. 

Lemma 2: Let z d = [x 2 , ■ • •, .r3] T be defined by (32) and 
(33). Then, for j € {3, - ■ -, n}, t G [f,, f,+i) 

x d n = 0 (35) 

®j_ 1 fc(x(t,))/(t) = -*jf 2 , - 3 (t)[xj - x d (x,t)] + i d (s,t). 

(36) 

Proof: Equation (35) is trivially satisfied from (32). 
Equation (36) will be proved by calculating x d (x,1). The 
arguments of the functions f(t), k(x(t,)), and x d (x,t) will 
be omitted in the proof for simplicity. From the definition of 
T d , (33), we find for t G (f,,f,+i) in the case r(t,) ^ 0 by 
using x m = a- m _i kf from (15) 


x d - ( 2 j 2 )/ 2j 3 f g jm -p m _ } x r , 


m=/ + l 


+ f 23 2 5 Z tm-/™ + 9jm Um-i-l Xm “ 1 ■ 


m—j+l 


We see that by assuming a continuous solution x(t) of (15), 
we have due to PI) and P3) 

lim x d (x(t),t) = lim x d [x(t),t) = 0 . 

i-.t^ t—t, 

Therefore, (37) is valid for all t > 0. Now, we will express 
x d _ x by using x d from (37). From (33), (28H30) we find 






=/”- 4 W 4 + /’^) 


+ E 


m=ej + l 


fcm-j +1 


[(V 2j - 2 + 20-1)/). 9 „ 


+ /(fljm + 0j,m+l/)]} 

m=; +1 

+ / 2, ” 4 {(2j-2)/ £ 

' m=j+l 


n ^ 1 

+ / 5 Z tm-j+l 1 " 


m=j + l 


— t '*' 4 + r ?- 


Equation (36) follows readily. 


Define die functions a = [£ 2 , ■ • -, x n ] T : R n x R+ -* R" 


Xj(x,t) = x, -x d (x,t), je (2,••■,«}. (39) 

From the model (15), Lemma 2, and the control law (22) and 
(31) with the expression (34) for the input U 2 , we find that 
z satisfies 

£2 = -A 2 J 2 (40) 

£3 = A 3 / 3 (t) £3 + kf(t)x 2 


Xn = -A„/ 2(n 2)+1 (/)x„ + fc/(t)£„_i. (41) 

The following lemma shows that Jj(jr (/),/) tends exponen¬ 
tially toward zero. 

We denote 

S.J - [^ 2 ,^ 3 , • • ■ ,-P T ] r , j€ {2, •••,«} (42) 

and use the one-norm || ■ || as defined by (18). 

Lemma 3’ Consider system (15) with the control law (22) 
and (31). Let z(x(i ), t) be defined by (39) where ,r(f) is the 
solution of (15). Then, for any j € {2, -,n}, there is a 

b > 0 such that 

Ve 2 , ,rj 6(0,6), 

> 0, 3t j > 0, 

Vfp €{/„,*!. •■},Vz J (fp)Gi? J - 1 
|x(a-(£),/)| < £- J |k J (f P )||e _ ' )j(t_t ' ,) , W > t p (43) 

where the constants 7 3 can be defined as 

7 j = aij - £j > 0 (44) 

a q = mm{\ q r) q ,at q ^i - e 9 -i), qe{ 3,- ,j} (45) 

012 = ^ 2 . (46) 

The constants T) q are found from Property P4) of /(f). 

Proof The proof will be given by induction. Assume 
that (43) is satisfied for j = m - 1 E {2, ■ • • ,77 - 1} for a 
t p € {fo, fi, ■ ■ }. Then, from (40H41) we have that 

£ = -A m f 2m -\i)x m + fc(.r(U)/(t)£m-i 

= -a(f)£ m + d{t) 


a(t) = 

d(f) = A-(3-(t,))/(f)£ m _j(x(f),f). 

From Property P4) of f(t) we have that for all t p E 
{to,t\r m } and for all t > t p 

f (a(r) - \ m rim)dT < A m P m . 

Jtv 

Since (43) is assumed to be satisfied for 3 = m - 1 and since 
|/(f)| < 1 from PI), and |fc(:r)| < K from (20) and (23) we 
have 
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here D m - tfc m _i||£ m _ 1 (* p )||. From the definition of 
x,t) (39) and since x^(x(t p ),t p ) = 0 for all t p e 
o,*i. • • ■}• Section V-C, we have im(x{t p ),t p ) = x m (t p ) = 
, n (t p ). From Lemma 1 we can then conclude that 


Theorem I: Consider system (15) with the conn 
(22) and (31) and z defined in (16). There ,ue a in 
h z : R + —► of class A." and a constant 7 - > () such tl\ 
any initial condition z(t 0 ) € It "' 1 


Ve m > 03 Om 3 7r , | 

|x m (f)| < c m (|U m _i(f P )|| + |im(<,»)l)e -Ym(t “ ,p) 

= c m||l m _l(fp)||e"' 7m ^ _1 '' ) 

where the constants j m and c m can be defined as 

Km ~ “ E-m 

Ck r n ~ niin{A m ^7m ? 7m- 1 } 

r m = max je AmPni , e 2Xrn Pm - C ™~ 1 1. 

The constant 7 m is strictly positive if 

0 < Ej < olj, j E {2, ■ ■ ■, m}. 

Consequently, there exists a constant /» such that e 2 < - • , e n E 
(0,A) implies that 7 m > 0. 

We have, therefore, proved that if (43) is satisfied for 
j = m - 1 for a t p E ■ ■ •}, then (43) is satisfied for 

j = m, for all 7 n E {3, ■ • ■ , 77 }. The induction is completed 
by showing that (43) is satisfied for j = 2. From (40) we find 
that for all x{t P ),t P € •■} 

M-KO.OI = \i 2 (x(tp)J p )\e~ Xi(t ~ tr) 

= IU 2 ( / p)l|f" Vi(f_ ' ,,) 


since x ( j(x(t p ) 1 t p ) = 0 for t p E {to,tu“} and hence x 3 
(.r(f p ),/p) = Tj (t p ). Equation (43) is then satisfied for j = 2 
by choosing c 2 = 1. (In this case, (43) is also satisfied for 
e 2 = 0.) Since (43) is satisfied for all t p E when 

3 — 2 then (43) is proved by induction to be satisfied for all 
tp € {toJi, ■ ■ •}. □ 

Remark: Note from (45) that arbitrarily large values for 
c*2, ■ • ■, «n can be obtained by choosing A 2 , • ■ •, A TI appropri¬ 
ately, which means that the exponential rate of convergence 
7 j = oij - £j can be chosen arbitrarily large since e 3 can be 
chosen arbitrarily close to zero. 

The definition? of x d (32)—(33) and x (39) can be used to 
show that the state x 3 can be expressed as a weighted sum 
of the functions r E {j, ■ ■ •, 71 } as stated in the following 
lemma. 

Lemma 4: Let x d (x, l) be defined by (32H33) and x 3 (r , t) 
by (39) for 3 E {2, ■ ■ ■, n} and / E [* t , f l+ i). If x(t t ) / 0 then 
x 3 satisfies for all j E {2, ■■■ , n - 1} 

Xj = Xj(x,t) + f 2l ~ 2 (t) Y, BjrWj 

(47) 

The functions g jr : R+ —*■ R are bounded and smooth. 

Proof: See Appendix A. □ 

From Lemmas 3 and 4 we can now show global, exponential 
convergence of z{t) to zero. 


MOM < MII*(MI)O^- t(0 , v# 

The constant 7 . can be giv^n.ny 


where 7 „ is given from ^ 

Proof: Sec Appendix A.\^ * 


□ 


VII. Stability Analysis ot ,r(t) 

In this section, we prove global ^-exponential stability of 
the state x(1) of the system (15) about zero. 

By showing that 11 1 = k(.r(l,))f(t ), (22), makes /](f) 
converge to zero as z(f) converge to zero. Theorem I can 
be used to show the following lemma. 

Lemma 5' Let the control law be given by (22) and (31). 
Then, system (15) is /C-exponentially stable about the origin, 
i.e. p in a neighborhood about x = 0 there exist a function 
//(■; T) of class K and a constant 7 > 0 such that 

Vx(to) € n||.r(0H < H\\x(t 0 )iT\\)e-^\ V/ > (0 

(49) 

where the function h(-;T) depends on time parameter T ~ 
t.+i -U, (17). 

Proof: See Appendix A. □ 

The neighborhood il is given by 

«={- r| kil<~, <?(MII*II))<^} (50) 


where h z (-) is a function of class K from Theorem 1 and 
2 = [ r 2 , ■ ■ ■ ,.r 71 ] T (16). The function 6 ’(-) is defined in (24), 
and fi is defined in (25). The constant 7 can be defined as 


7: 

27/ -4 



> 0 


where 7 „ is given from Lemma 3. 

Remark 1: The exponential convergence rate 7 in (49) can 
be selected arbitrarily large by choosing A 2 , ■ ■ ■, A n appropri¬ 
ately, that is large enough. See also the remark to Lemma 
3 in Section VI. For all A 2 , ■ ■, A 7l > 0 the system is K~ 
exponentially stable for all positive T where T — t t +i - t,. 
The class AJ function /i(-; T) increases exponentially, however, 
with the parameter T. This is due to the fact that it takes at 
least the time T for the input u\ to drive xi to an arbitrarily 
small neighborhood about zero. 

Remark 2. The neighborhood fl can be chosen arbitrarily 
large by choosing the parameters K , K, and ft (or T — 
“*i) appropriately. Therefore, the stability is semi-global. 



42 


IfifcE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 40, NO. I, JANUARY 1993 


Remark 3: The bounds on J|;r(f)|| given by the function 
k(’;T ), ( 88 ), and 7 # can be very conservative and do not in 
general provide quantitative information. 

Lemma 5 does not prove global, ^-exponential stability 
because of the saturation function in the definition of k(.r(l ,)), 
(23). A function //,(■) of class AJ, however, can be constructed 
to yield also global, A.'-exponential stability as shown in ihe 
following theorem. 

Theorem 2: Let the control law be given by (22) and (31). 
Then system (15) is globally. A-exponentially stable about 
the origin, i.e., there exist a function //,(■:']') of class K and 
a constant 7 > 0 such that V/ > / () 

Vj-(l v ) t /f"||j(/)|| < M|k((o)||;7V »“ *»>. (51) 


Proof; Sec Appendix A. □ 

As for Lemma 5, the constant 7 can be defined as follows 


y - 


7 . 


2n - d 


> 0 


7r» 

lz = Y 


where % is given from Lemma 3. 

The remarks after Lemma 5 on the rate of convergence 7 
and the dependence of T for the class K function //,( :Y’) 
also apply here. 


VIII, Stabilization About Abritkaky Conmgi-ration 

We have shown that the controller (22) and (31), and (23) 
makes system (15) globally, A,-cxponentially stable about the 
origin, i.e., about [x |.* • • ,x„] f — 0. In this section we show 
that the same control law can be used to A-exponentially sla- 
bilize the chained from (15) about any configuration. Indeed, 
here we show that any strategy to control the chained form to 
the origin can be used to control 11 to any desired configuration. 

Let the desired constant configuration be given by 

/'*« K.xg , 1 t It". .r‘‘ = U. 


Now we introduce the following variables 


in — 1 


A r'* 




i 




(m - ))' 


(■'■1 


.ryn ■ t 


m 


<= {!•• •»} 


(52) 

Here, x t is a state variable of the chained form (15) 
satisfying x\ - u\, The vector x } — [. 1 ^,* ,,/J,]^ is thus 

given from (52) as a smooth function .r' - */>(.rj: x 1 *). 
Lemma 6• Let x r — [.r* * ■ , be given by (52). Then 


x[ =0 (53) 

>2 = » (54) 

K, = i«i4 m € {3, ■•.!#}. (55) 


Proof: Fmm (52) we have that x\ ~ .r 1 / and x\ = ./ 2 . 
Since and x% are constants, (53)-(54) follow readily. 
Equation (55) can be prgved by induction. Assume that there 
is an index m £ {3. • ■ *, n - 1} such that 


f r = r r 

’* Hi 11 1 


I’M- 


Since £{'„ = 0 . rn € {1,••>,«}, differentiating x ^ t+1 from 
(52) gives 


^=S<- ( - ,+ T~ ) ! (m + 1 - 7) 

J Jr 2 x 7 

xO'i-xiT’-'tii -j-i 1 ) 

m . 




J-2 


"1 




Consequently, if (55) is satisfied for index rn then (55) is 
satisfied for rn F 1, loo. The proof is then completed by 
showing that r\ = From (52) we get 


=t5«, = 


Wc now define 


n 


A 

X m — X’m 


mt {!.•••,«}. 


(56) 


From system (15) and Ixmma 6 it follows that 

,7-| - i/j (57) 

■r 2 = iiy (58) 

K, o, & (59) 


This system has the same structure as the chained form (15). A 
control law lor (15) controlling / - [.n, * ■ ■ , .r,,] 7 to zero can 
he used to control > — [.r \. ■ - ■, 7„] 7 to zero. The coordinate 
transformation between r and .7 is then given from (52) and 
(56) by 

r' =J‘-0(.r j; .rn-T(.r:^) 

x = 7 + ,r' - 7 F 0(.r 1 ;.r ,) ) 

-74 F .r , 1 , ;.r / ‘) = r(7;.r / ') 


where r( ;.W') and r(•;./■'') are smooth functions. 

Wc can then conclude with the following theorem. 

Theorem 3 Given the system (57)-(59) where 7, r , and x] n 
are given by (56) and (52). Then, a control law for (57)—(59) 
making 7 = [7]. - - , J,,} 1 converge to zero makes ./• converge 
to the desired configuration .r , \ The convergence of x to x v 
is exponential if 7 converges exponentially to zero. 

Proof. From (56) and (52) we have that 


IN — i 


x,„ - x tu -F x m - x* n F ^2 ,r7 / I^ m _ jy*i 
F x 1u , m € {L ■ ■ ■ ,w}. 


Therefore, the control of 7 to zero implies the control of x 
to .r f \ We also see that exponential convergence of x to zero 
implies exponential convergence of x to x l \ □ 
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IX. Simulations 

A simulation with n = 4 was done in MATLAB at a SPARC 
nation 1. The control law for ui was chosen as follows 

ui = k(x(U))f(t), f(t) = (1 - cos t)/2 

where 

* = + sgn(*i(f > ))G(||*(*t)||)J/S, K), K = 2 

as defined in (23). By studying the time integral of / 3 (t) and 
/ 5 (/), we see that Property P4) of f(t) is satisfied by choosing 

5 63 _ i _ i 

m - 16 , v a - 256 > Pi ~ 2 ' P *~ 2' 

The controller parameter «in (?(•), (24), was taken to k = 3. 
The constant (3 is given by (25) 

P = 1 / / f(r)dr = - 
Jo * 

where T is the time-period of the function f(t), i.e., T = 2ir. 
The instants of time t, where k(x(t,)) may switch are given 
by the set { 0 , 27 r, 47 r, 67 r, ■ ■}. 

We find the control law for u 2 from (31) and (27M30) 

«2 = - (A(l + / 3 + / J ))x2 

- (/ \(/A + f 3 A + f b X + 2/ + 8 f 2 f)/k)x 3 
~ (/A(/ 5 A 2 + 4/A/ + 6 / 4 A/ + 8/ 2 + 4//)/fc 2 )a: 4 . 

Here, we have chosen 

A = A 2 = A 3 = A 4 . 

In the simulation, A was taken to A = 1 . 

The initial state was chosen as 

(xi(0),x 2 (0) ) x,(0),X4(0)) = (0,-0.1,0.1,1). 

Euler’s method was applied for the numerical integration 
where the time-step was taken to 0.05. In Fig. 1, z(t) = 
j, 4 ( 0 F is plotted versus time showing the con¬ 
vergence to zero. The state variable xi(t) is plotted in Fig. 2. 
We see that xi(t) converges to zero, too. Note, however, 
from the time-axes that the rate of convergence of x\(t) is 
slower than the one of z(t). This coincides with the relation 
between 7 and 7 * in Lemma 5 where 7 = 7 */ 4 . To show 
that the convergence is exponential, log||z(t)|| is plotted in 
Fig. 3 where the one-norm is chosen. We see from this figure 
that Iog||r(/,)|| decreases toward -00 implying that ||x(0|| 
exponentially converges to zero. 

By using the coordinate transformation from [33], we can 
interpret the variables and Z 4 as the x - and y-position 
of the midpoint of the rear axle of a four-wheeled car. The 
path (x(f), y(t)) is presented in Fig. 4. We see that the motion 
seems natural when interpreted as a parking maneuver. 

In Figs. 5 and 6 the inputs ui(t) and u 2 {t) are shown 
as functions of time. We see that they are continuous and 
converge exponentially to zero. 



Fig. 1. Exponential convergence of ;(f) = [ 1 - 2 , r 4 | 7 to zero 



Fig 2 Exponential convergence of to zero 



f 

Fig. 3. Convergence of log||r(f)|| to -oq. 

X. Summary and Conclusion 

A feedback control law has been proposed to globally 
stabilize a chained nonholonomic system of any dimension. 
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u, 



4).2 l . . . — > > --1 —- i-1- l. ■ j-1 

- 4 - 3 - 2-101234 

J- 

Fig. 4. The resulting path in the r y —plane when applying the exponential 
eonvergent control law. The variables r = .ri and y = r% are interpreted as 
the planar position of a four-wheeled car 



Fig. 5. The input u t (t) from the exponentially convergent control law 

The resulting rate of convergence is exponential and global, 
^-exponential stability was achieved. K -exponential stability 
is a weaker form of stability than exponential stability in the 
usual sense, but it possesses the same rate of convergence. 

The feedback control law depends on a time-varying func¬ 
tion, not necessarily periodic, and on a function which is 
nonsmooth with respect to the state at predefined discrete 
instants of time. The exponential rate of convergence can be 
arbitrarily fast by choosing the controller parameters appro¬ 
priately, The controller cannot, however, drive the system to 
an arbitrary small neighborhood of the desired configuration in 
shorter time than T i.e., the length of the time intervals used in 
the control law. The class K, function which defines a initial- 
state-dependent upper bound of the time-evolution of the 
system, has T as a parameter. Simulation of a four dimensional 
chained form indicated the exponential convergence. By using 
a coordinate transformation similar to the transformation from 
chained form to power form, a strategy to control the state 
to zero can be used to control the state of any desired 
configuration. 



Fig. 6 The input «_>(/) from the exponentially convergent control law 


The idea of introducing a function which varies with the 
state at discrete instants of time seems, therefore, to be a useful 
approach to obtain good stability properties for nonholonomic 
systems. The control law has a simple structure, though the 
stability analysis is quite involved because of few existing 
mathematical tools for such systems. 

A drawback of this approach is the possibility for numer¬ 
ical problems at digital computers as the parameter k(v{l t )) 
approaches zero, since there are divisions by /.(/(/,)) in the 
control law for u> 2 . (These divisions do noi cause unbounded 
quantities, since A~(.r(/,)) always dominates the numerator.) 
Such numerical problems were, however, not observed during 
the simulations. Since the parameter k(.r(t,)) only changes at 
U € {to . / 1 , -■}, the control of .ri may be more sensitive to 
disturbances and modeling imperfections than if k = A (./(/)) 
i.e., function of the state at all t > to. Depending on the 
physical application, feedback from the state only at t, e 
{/o,/ 1 ,* ■ ■} may be sufficient to make ,/*i exponentially con¬ 
verge to zero. Note that the state variables 2 — [,r 2 , ■ • ■ ,./„] 7 
are controlled by the control law a 2 - u 2 {z(t),k) which 
depends on 2 (f) for all t > to- 

In further work, an extension from k = k(j'(1 t )) to A = 
k(s(t)) by redefining z tl , (32)-(33), can be studied. The 
structure of k(x(t,)) in the present control law indicates 
that the use of A: = k{s(1)) results in a nonsmooth, time- 
varying feedback control law. Also, more mathematical tools 
are needed to analyze such systems. The need for the definition 
of K -exponential stability for the stabilization with exponential 
convergence of chained systems and other nonholonomic 
systems should also be analyzed. 

Appendix 

proofs 

Proof of Lemma 4: The arguments of fr(x(t t )),.r(r, f), 
x d (x, t ), /(/), and g Jr (t) will mostly be omitted for simplicity. 
From the definition (39) of we have that 

*fi-X = *rt-l ( 60 ) 
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om (32) we have that x d n = 0 which means that x„ = x„. It 
trivially seen from Lemma 2 assuming that x(l t ) ^ 0 that 


/2n —4 / _ \ \ ^ 


t*-i =i tt -i+/^- 4 (-A„)-x fl (61) 

vhieh implies that 

fin — l,7i ~ f}n-\,n = “A yi . (62) 

We can then calculate from the definition (33) of and 
<60)-(62) 

■Til- 2 ~ -Gi- 2 4 ■ / ’n~2 

_ , r'2n-b(„ J 'n-i - r n\ 

— Jn — 2 4 J - 2,>i — 1 ^ i flu— 2,n ^ J 


— Tn—2“h / 2,7i —l ^ *T|i-1 

4 (fin-2 ,n 4 .9n-2,n - 1 /Ai-2 ) ii/ 2/< ~ 4 ) ry 


— -i' 7 |_2 4 f“ fin - 2 ii— 1 £ ?i—l 4 2,n jJ2^' n 


fin—2,n -1 = r/n — 2,n- 1 

fin 2,n = . < 7n-2,» 4 //n — 2,» ]</»» — l,r»/ • (65) 

The rest of the proof will be given by induction. Assume that 
(47) is satisfied for ,y 4 1..J + 2, - • •. - 1, i.t\. 


• t in + 1 — - 1 /r*4-l 


+/ 2, “ £ 


</ ”'+ 1 * .» i) 


i', (66) 


where r« € {./■ J 4- — 2}. Wc then find from the 

definition (33) of r d assuming that .;•(/,) j- 0 

,r i =*,+ J j 

~ J J 4 2 (0 (7/1 . , _ ! J ' r 


= +/ 2, - 2 (0 E »»* 

r=j + l 

x :r r + f 2 '- 2 E 


fir m , m , -' l 


Equation (67) shows that if (47) is satisfied |„i , 

2, then (47) is satisfied for j as well i.n ,, 

(61H62) and (6?) imply that (47) is satisfied («i .,11 
{2, • • •, n -1}. We see that ;y, P is given by .md ,Uk! 
are known. Because of the smoothness and hound, j i - o< 
f(t)<<ljr{i) and g,„ r (l) for rn € {j + 1.-- ,/ ■!}</,, ud, 
be smooth and bounded as well. 

Proof of Theorem I: The point ot time t, denotes dn 
largest term m the sequence (/o,( 2 . ■) such that I ; t, 

If .r(1,) = 0 then [i»i(0- U 2(0] T = 0 from the control law 
(22) and (31) for t € [t,,/,+i). From the model (15) wc 
then see that ,r(() = 0, for I € implying that 

[«i(0>«2(0] t = <* for I e [/ l+ i,/ l+2 ), etc. and x(l) = 0 
for all 1 > t,. In that case, relation (48) is satisfied for all 
I > I, for any function h.(-) of class 1C. In the following, we 
analyze the case where r(i,) f 0. The norm || • || denotes the 
one-norm (18). 

From Lemma 4 we have that tor j e {21} 

J-j(') = Jr,(.r(/).0 + 

f-) +1 V V }! 

where the functions g, r {i) are bounded, say by the constant 
(7, such that |</ J7 (/)| < G for all t > /(>. Since |/(/)| < I, 
Property P2) of /(/), we gel 

k,(0l < \.ijlr(t)J)\+ E 6, | A ik/l))]'- ~'' ?l(j(/) ' f )l' 

j 6 {2, • ■ •, n — 1}. (69) 

Now, we will show that there are constants h, such that 

= A' »r |*-C-r(/.))| > h, k, (.r(/)-01'^ (70) 

for all j f (3, ■ ■ ■, n). From (23) wc find that if |A(.r(f,))| < 
K then 

l^(^(/,))| = [|./i(/,))| 4 C;(||r.(/ i )||)]/7. 

Combining this with the definition of G(-), (24), implies 

k(.r(/ 1 ))|>G'(lk(/,)ll)/^ = /^ll--(L)||^ 1 
>/MM,)ll^- 
./ e {2.-(71) 

Lemma 3 implies 


= ■' / 4 f 2 ' 2 {t) E Ujr 
r=/ +1 


*(*('.))! >^^k,W0,0l) " 


1 + f 2> 2 E V"" /.r-j 


= .i-, +/ 2 ' 2 (/.) 5^ 


,9jr — fljr 14-/' jmr 

m=j +1 


= k.WO.OI 17 ^ 7 , J e {.3.• • •.»}. 

(72) 

__1 

Therefore, (70) is satisfied by defining n, = fine 1 ” s . 

From (70) and (69) we then get 


M0I < k,(x(/),/)| + c 53 ^-livWO-OI 

r =/+1 *- 

+ -2 7 kr(.rk)./)| 1 -^l ( 

«r J 
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for all j € {2, • • •, n-1}. From Lemma 3 we get, with t p =s i 0 

+ G £ f^7^li^(<o)l|e-^ (, - <o) 

■T™j + 1 ' 

/ c . \ 1- TS^n 

+ (-pl|&(<o)l|e- 7 ' (, - to) J 

< Mk(*o)||)e" Cj( ‘ -to) 

for all /€ {2,• ■ • ,n - 1}, since ||z(fo)|| > (|z r (to)||, where 
M«) * c ><7 + G £ +i [^=7 Cr9 + (^7 < ?) ^ 

0 = tmnj^r^l - }, {ji,•••,«}. (74) 

From (44H46) we see that 

o < 7* < 7n-i < • ■■ < 72 < A 2 . 

Equation (74) then implies that 



Note that the function hj(q) is of class 1C. Since x* =0 we 
have from Lemma 3 

j*n(f)| = |5n(®(0»*)l < C n ||2(t 0 )||fi' _ ' l ' n(t_to) . 

We define /i n (||z(*o)||) = c„||z(to)|| and C n = 7 „ such that 
from Lemma 3 

The proof is completed by noting that 
||z(t)|| = |X 2 (t)| + ---|x n (t)| 

< E h i(ll*( < o)ll)«‘ C,(t "* o) = fc.(ll*(*o)ll)e"' y - (t_lo) 

. 7=2 

where 

n 

■h ■«(«)* (75) 

3-2 

7* = min{C 2 , -,Cn} = <2 = y. (76) 

□ 

Proof of Lemma 5: The outline of the proof is to show 
that we can find a small enough neighborhood ft of the origin 
such that the controller u\ does not get saturated due to the 
convergence of z(t) from Theorem 1. It will then be shown 
that the control law uj = k(x(ti))f(t) is chosen such that 
X\(t) converges to zero as z(t) converges. A function /*,(•; T) 
will be constructed. The norm || ■ || denotes the one-norm (18). 
Let the neighborhood be given by 

«={®ll®il<§,G(^(NI))<^} 

where h z (-) is a function of class K from Theorem 1 and 
z = [* 2 , ■ • ■ , *n] T (16). The function G( ) is defined in (24), 


and P is defined in (25). By induction, we will show that if 
*(to) € fl, then 

|/c(x(tj))| < K , Vtj E {to,fi, ■ ■ ■} (77) 

which implies from the definition of k(-), (23) 

k{x{ti)) = -[*i(ti) + sgn(*i(ti))G(||z(ti)||)]/9, 

Vti G {to, *!,•■■}. (78) 

Note from Theorem 1 that 

MOII < MlkMIl), vt > to- 
Since G(-) is of class K, (24), this implies 

G(||z(t)||) < G(MN*o)ID), Vt > to. 

Assume for a t m € that 

k{x(t m )) = -[*l(t m ) + Sgn(*i(t m ))G(||z(t m )||]^. 

Integrating ii = u\ = k(x(t m ))f(t.) from t m to t m+1 then 
gives 

*i(t m +i) = -sgn(*i(t m ))G(||z(t m ))||) (79) 

which implies that 

l*i(t m+ i)| = G(||z(t m )||) < G(fc,(l|z(t 0 )||)). 

By assumption, x(to) € (1 which implies that G(/i z (||z(/, 0 )||)) 
< Pj and therefore 

|®l(tm+l)| < 

Since 

[|*i(Wt)l + G(||*(t m+1 )||)]/J < (yp + Tp) 1 * = K 

then from the definition of k(), (23) 

k(x{t m+ 1 )) = -[xi(t m +i)+sgn(xi(f m+ i))G(||z(i m+ i)||)] / £/. 

Equations (77) and (78) are proved by noting that since 
x(to) € then 

[l*i(*o)l + G(||z(tj)||)]/? < (|j + = K 

which implies that 

k(x{t Q )) = -|xi(f 0 ) + sgn(xi(^ 0 ))fj f ( | |^(^o) I Dis¬ 
integrating xy = uy in (15) from U to t < U+i with 
u\ - k(x(ti))f(t) and k constant gives 

M*)| < |*i(fi)| + |A| / f(r)d,T 

Jti 

<M*<)I + \k\ f' +1 Hr)dr 

Jti 

= l*i(fj)l + |*i(*i+i) - *i(fi)l 
< 2|xi(ti)l + |xx(t i+ i)|. 


( 80 ) 
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nee x(to) 6 0 then k(x(t ,)) is given by (78) and we get 
•m (79) and (80) 

*(01 < 2(7(||z(i._i)||) + G(|k(f,)||), ie {1,2,-}. 

(81) 

From Theorem 1 we have that 

ll*(fc)ll < fc-(ll*(*o)||)e- 1 '- (, ‘”* #) . (82) 

From the definition of G( ), (24), we have 

G(ae~ bt ) = K(ae~ bt )x±7 = e^G(a) (83) 

where a and b are positive constants. We denote for simplicity 

9 = MII*(<o)ID- 


Equation (83) combined with (81) and (82) implies 
In(/)| < ■2G(qe-^ t '- 1 - to) ) + G(qe~^ {t '- to) ) 

= 2 G{q)e^ (t '- l ~ to) + G(q)e^ ( ' t, ~ to) 
< 3 G(q)e*& {t ' l ~ to) . 

By convention we have for all ? € {1,2, - ”} 


where 


h(||a:(to)il.T) = Mlk(<o)|l + h J || 1 f/„ ,||. 

o = -2f- 
2n - 4 

where 7 * is given from Theorem 1 . 

Proof of Theorem 2. The theorem will be proved iv> 
showing that z{t) = [x 2 (f), ■ • ■,J’ n ( 0 ] r and .r,(M ,n< 
bounded and reach the neighborhood 12 defined in Lemma 5 
in finite time. After this finite time, exponential convergence 
of x(t) is ensured by Lemma 5. A function /t r (11j (f 0 )||; 7 ) 
is constructed by using expressions Iot the finite time and the 
bounds on ||.r(0ll and fi(||ar(to)||;7’) in Lemma 5. The norm 
|| - || denotes the one-norm (18). 

We will seek a time r(||.r(fo)||) such that for all I > 
t, x(t) € f2. First, define the time function 2. : H + —* R + as 

f llu G(h,(h t {q))M, G{h,(h'{q))) > ~ 

t.( 9) a 7 ’ A w . 

(o. G(MMff))) < ^ 

(89) 


1 i—i = t,+i -2T>t-2T 

since t £ [f,,/ l+ i). This implies 

ki(0l < 3 G(q)c e in-i o) 1 t >i\. (84) 

If i E [foifi) we find the following bound on |xi(0l by 
integrating .r a = k(.r(t 0 ))f (t) where k is given by (78) and 

m >0 

ki(')l < MMI + [kiko)l + g(lkko)ID) jt 7 / 7 7 

fto f( T ) dr 

< 2 ki(to)| + G(||a(to)||). 

Since t E [^o, f 1 ) this implies 

MOI < [2kt(*o)| + G(ikko)||k-"^, t e (toki). 

(85) 

Combining (84) and (85) and using t\ - tu = T implies 

kt(0l<{3C(Mlk(MII)V^ 

+ [2|xi(t 0 )| + G(|k(to)||)]e^}e :;2 ^ !fli 

< /i 1 (|k(fo)||;7’) e ‘^-^“ ) , t > to (86) 

where 

h(ll*(fo)lhD = 3G(Mlk(to)ll))^ 

+ [2|k(f«)|| + G(||x(to)||)]e^. 

Since G(-) and h x (-) are functions of class JC, hi( ;T) is 
also of class /C, (1). The proof is completed by noting from 
Theorem 1 and ( 86 ) that 

lk(OII = M*)l + lk(')ll 

< hi(||x(«o)||; T)e =a ^ al + /i z (|k(f 0 )||)e -7 ' (t- ‘ o) 

< [ft 1 (||*(to)||;T) + Mlk(*o)!l)]e^^ 

= Mlk( i o)||;7 1 ) e ~ 7(t_t ° ) i t>to (87) 


Consider G(/? 2 (|| 2 (f)||)). By inserting (48) and accounting for 
ll-ll < IMI and the fact that G( ) and h z ( ) are of class /C, 
it is seen that 

Vf > T s (|k(fo)||) + /(), G(/ij(|k(f)||)) < £ (90) 


Then, integrating = u\ = A(j(/,))/(/) from t, to t lJf \ 
gives 


f-sRti(j-i(0)G(|k(/ < )||, |A-(.r(f,))| < K 


*(/.+t)=< 


J-l (#.) - SgIl(Ti(f,))^-. |*(x(/,))| = K 


Define the time function r: 7?+ —♦ R+ as 


r(g) = max 


{ r - w 4 "} 


(91) 

(92) 


One can show from (90) and (91) that for all l„, £ {to, /i, • ■ •} 
which satisfies t tn > 1 0 + r(|kko)||) 

ki(' m )l<4- (93) 


By integrating = k(.r(t n1 ))f{t) one can show that xi (/) 
is a linear interpolation between .ri(/ m ) and \) for all 

t E [f Tn ,f m +i]- From the definition of f!, (50), and from (90) 
and (93) we find 


V/ > to + 'r(lk(fo)||), x(t) € ft. 


Denote by t p the smallest element in the sequence (fo^i •' * ■) 
such that t p > io+r(||x(/o)||)- From Lemma 5 it then follows 
that 

vf>t„. |kWII<fi(lk(k)ll;7’K Mf “ t ' ) - W 

To include / E [fokp)» an expression for the maximum of 
|k(f)|| must be found. 
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We can show from (91) by using the convergence property 
yf e(i), theorem 1, that 

max^i (<)| < max{|xj (to)l, G<MH*(*o')l1) : ):} 

t£lo ' 

< max{||x(( 0 )||,G(MIM*o)l|))} 

= *im(||*(*o)ll) (95) 

since [ja:{to)|| ^ || 2 {<o)||> and7*i( ) are of class K. Here, we 
have defined the function x lt „: R + —* R + which is of class 1C, 
From (95) and Lemma 5 we have 

W)||<*i m (IM*o)ll + fc.(IW<o)ll 

= x mac (||x(to)||), vt > to. 

Here, we have defined the function x mx : R.+ —+ R + which 
is Of class K.. The bound in (94) can then be extended to an 
exponential bound for all t > to and all x(to) 6 R n 

||*(0ll < lA(*m«(«);T) + 

where q denotes |Jx(fo)||. From the definition of r(-), (92), 
we then obtain for all t > to 

Vz(to) e R n , ||x(f)|| < MII*(<o)||;T)e-^- f ») 

where the class K. function h r : /?+ —♦ i?+ is defined as 

h x (q\T) 4 [h(x mx (q)\T) + x mx (r,)]e^<«) 

and q denotes ||x(to)||. □ 

Acknowledgment 

The authors would like to thank Dr. C. Canudas de Wit 
for valuable discussions on the stabilization of nonholonomic 
systems and Dr. C. Samson for comments on the stabilization 
about any configuration. 

References 

[t] J. Barraquand and J.-C. Latombe, “On nonholonomic mobile robots and 
optimal maneuvering,” Revue d'Intelligence Artificielle, vol. 3, no. 2, 
pp. 77-103, 1989. 

B] A. M. Bloch and N. H. McClamroch, “Control of mechanical systems 
with classical nonholonomic constraints,” in Proc. 28th Conf Decis. 
Contt \, Tampa. FL, Dec. 1989, pp. 201-205 

(3] A. M. Bloch, N. H. McClamroch, and M. Rcyhanoglu, “Controllability 
and stabilizability properties of a nonholonomic control system” in 
Proc. 29th Conf. Decis, Confr,, Honolulu, HI. Dec. 1990. pp. 1312-1314. 

(4] R. W. Brackett, “Asymptotic stability and feedback stabilization,” in 
Differential Geometric Control Theory , R. W. Brockett, R. S. Millman, 
and H, J. Sussman, Eds. Boston: Birkhuuser, 1983, pp. 181-208. 

(5] C. Canudas de Wit and O. J. Sordalen, “Exponential stabilization of 
mobile robots with nonholonomic constraints,” IEEE Trans. Automat. 
Confr.:vol 37, no. M. pp. 1791-1797, 1992. 

16] J.-M. Coron, “Global asymptotic stabilization for controllable systems 
without drift,” Maih. Contr ., Sig., Syst., vol. 5, pp, 295-315, 1991. 

|7[ J.-M. Coron and J.-B. Pomet, “A remark on the design of time-varying 
stabilizing feedback laws for controllable systems without drift,” in 
Proc. Nonlinear Cmtr. Syst Design Svmp., Bordeaux, France, June 
1992, pp. 413-417. 

(8J B. d" Andrea-Novel, G. Bastin, and G- Campion. “Modeling and control 


of nonhotohomie wheeled mobile robots,” in Proc. 1991 IEEE Int. Conf. 
Robotics Automation, Sacramento, CA, Apr. 1991, pp. 1130-1135. 

[9] L. Gurvits, “Averaging approach to nonholonomic motion planning,” 
in Proc. 1992 IEEE int. Conf. Robotics Automation, Nice, France, May 
1992, pp. 2541-2546. 

[101 I- Gurvits and Z, X. Li, “Smooth timc-pcriodic feedback solutions for 
nonholonomic motion planning,” in Progress in Nonholonomic Motion 
Planning , Z. X. Li and J. Canny, Eds. Norwell, MA: Kluwer, 1993, 
pp. 53-108. 

[11] W. Hahn, Stability of Motion. New York: Springer-Verlag, 1967. 

[12] G. Jacob, “Motion planning by piecewise constant or polynomial 
inputs,” in Proc . nonlinear contr. syst. design symp., Bordeaux, France, 
June 1992, pp. 628-633. 

[13] M. Kawski, “Homogeneous stabilizing feedback laws,” Contr . Theory 
Advanced Tech., vol. 6, no. 4, pp. 497-516, 1990. 

[14] H. K. Khalil, Nonlinear Systems. New York: Macmillan, 1992. 

[15] G. Lafferriere, “A general strategy for computing controls of systems 
without drift,” in Proc. 30th Conf Decis. Contr., Brighton. England, 
Dec. 1991, pp. 1115-1120. 

[16] Ci. Lafferriere and H. J. Sussmann, “Motion planning for controllable 
systems without drift,” in Proc. 1991 IEEE Int. Conf. Robotics and 
Automation , Sacramento, California, Apr. 1991, pp. 1148-1153,. 

[17] J.-P. Laumond, “Feasible trajectories for mobile robots with kinematic 
and environment constraints,” in Proc. Int. Conf. Intelligent Autonomous 
Syst., Amsterdam, The Netherlands, 1986, pp. 346-354. 

|18[ __ “Finding collision-free smooth trajectories for a nonholonomic 

mobile robot," in 10th Int. Joint Conf. Artificial Intelligence, Milano, 
Italy, 1987, pp. 1120-1123. 

[19| J.-P. Laumond, M. Taix, and P. Jacobs, “A motion planner for car¬ 
like robots based on a mixed global/local approach,” in Proc. hit. Conf. 
Intelligent Robots Syst., Japan, 1990, pp. 765-773. 

[20] R. T. M’Closkey and R. M. Murray, “Convergence rates for non¬ 
holonomic systems in power form,” in Proc. Amer. Contr. ConJ'., San 
Fransisco, CA, June 1993, pp. 2967-2972. 

[21] R. K. Miller and A. N. Michel, Ordinary Differential Equations. New 
York: Academic, 1982. 

[22] S. Monaco and D. Normand-Cyrot, “An introduction to motion planning 
under multriate control,” in Proc. 31th Conf. Decis. Contr., Tucson, AZ, 
Dec. 1992, pp. 1780-1785. 

[23] R. M. Murray, “Nilpotent bases for a class of nonintcgrable distributions 
with applications to trajectory’ generation for nonholonomic systems.” 
California Inst. Tech.. Pasadena, Tech. Memo, CIT-CDS 92-002, Oct. 
1992. 

[24] R. M. Murray and S. S. Sastry, “Steering nonholonomic systems using 
sinusoids," in Proc. 29th Conf. Decis. Contr., Honolulu, HI, Dec. 1990, 
pp. 2097-2101. 

[25] __ “Steering nonholonomic systems in chained form,” in Proc. 30th 

Conf. Decis. Contr.. Brighton, England, Dec. 1991, pp. 1121-1126. 

[26] R. M. Murray and S. S. Sastry, “Nonholonomic motion planning: 
Steering using sinusoids,” IEEE Trans. Automat. Contr., vol. 38, no. 
5, pp. 700-716. 1993. 

[27J R. M, Murray, G. Walsh, and S. S. Satry, “Stabilization and tracking 
for nonholonomic control systems using time-varying state feedback,” 
in Proc. Nonlinear Contr. Syst. Design Symp., Bordeaux, France, June 
1992, pp. 182-187. 

[281 J.-P. Pomet, “Explicit design of time-varying stabilizing control laws 
for a class of controllable systems without drift," Syst. Contr. Lett., vol. 
18. no. 2, pp. 147-158. 1992. 

[29[ C. Samson, “Time-varying feedback stabilization of nonholonomic car¬ 
like mobile robots,” INRIA-Sophia Antipolis, Tech. Rep. 1515, Sep. 
1991. 

[30] __ “Velocity and torque feedback control of a nonholonomic cart,” 

in Advanced Robot Contr. Proc. Int. Workshop Nonlinear Adaptive 
Contr: Issues in Robotics France, Nov. 1990, pp, 125-151. 

[311 C. Samson and K. Ait-AdbcTTahim, “Feedback stabilization of a non¬ 
holonomic wheeled mobile robot,” in Proc. Int. Conf Intelligent Robots 
Syst., Japan, 1990. 

[32J __ “Mobile robot control, part 1: Feedback control of nonholonomic 

wheeled can in cartesian space,” INRIA-Sophia Antipolis, Tech. Rep. 
1288, Oct. 1990. 

[331 O. J. Sordalen, "Conversion of the kinematics of a car with n trailers into 
a chained form,” in Proc 1993 IEEE Int. Conf. Robotics and Automation, 
Atlanta, GA, May 1993. pp. 382-387. 

[34J A. R. Teel, R. M. Murray, and G. Walsh, “Nonholonomic control 
systems: From steering to stabilization with sinusoids,” in Proc. 31st 
Conf. Decis . Contr., Tucson, AZ, Dec. 1992, pp. 1603-1609. 

[35] D. Tilbury, R. Murray, and 5. Sastry, “Trajectory generation for the 
h- trailer problem using goursat normal form,” University of California, 
Berkeley, Itch. Rep. UCB/ERL M93/12, Feb. 1993. 



iDAL&N AND BGELAND NONHOLONOMIC CHAINED SYSTEMS 



include nonlinear control 


Ole Jakob Sprdalen (S*92-M’93) was bom in 
Krager0, Norway, in 1965 He received the Diploma 
Engineer (Siv Ing) degree in 1988 and the Dr 
Ing degree in 1993 in electrical engineering and 
Computer Sciences at the Norwegian Institute of 
Technology, Trondheim, Norway 
Dr Sordalen is currently a Researcher at ABB 
Corporate Research, Norway He has held visit¬ 
ing appointments at Laboratoire d’Automatique de 
Grenoble, France, University of California, Bcrke 
ley, and University of Tokyo His research interests 
of mechanical systems and nonholonomic systems 


Olav Egeland (S’8V-M 86) u as bom m I, M 
Norway, in 1959 He received the Si* U\ K 
in 1984 and the Dr Ing degree in 198? m H 
Engineering at the Norwegian Institut o' 1, , 

ogy, TYondheim, Norway 
Dr Egeland became Assistant Protess > r 
and Professor o! Robotics in 1989 ,\1 the I it |> , , , 
of Engineering Cybernetics, the Norwcgi m 1 1 im 
of Technology In the academic year 1988/89 | K 
a Visiting Scientist at the Gcmian Aerospace \\ K 
search Establishment (DLR) in Oberpt at! enhokn 
Germany His research interests include control of manipulators surl u c 
vessels, flexible structures and nonholonomic vehicles 




so 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40. NO. 1. JANUARY 1995 


Global Total Least Squares Modeling 
of Multivariable Time Series 

Berend Roorda and Christiaan Heij, Member , IEEE 


Abtlract —In this paper we present a novel approach for the 
modeling of multivariable time series. The model class consists of 
linear systems, Le., the solution sets of linear difference equations. 
Restricting the model order, the aim is to determine a model with 
minimal fa-distance from the observed time series. Necessary 
conditions for optimality are described in terms of state-space 
representations. These conditions motivate a relatively simple It¬ 
erative algorithm for the nonlinear problem of identifying optimal 
models. Attractive aspects of the proposed method are that the 
model error is measured globally, it can be applied for multi- 
input, multi-output systems, and no prior distinction between 
inputs and outputs is required. We give an Illustration by means 
of some numerical simulations. 

I. Introduction 

T HE basic problem in time series modeling is to find 
a reasonably simple model which gives a sufficiently 
accurate description of the data. Procedures which have been 
developed for this problem differ in the specification of the 
model class and in the way the complexity and accuracy of 
models is evaluated. 

In this paper we will restrict our attention to models 
described by linear difference equations with fixed coefficients 
and finite lags. Within this classical setting several modeling 
procedures have been developed. For an overview, we refer to 
the textbooks [l] f [4], [6]. Well-known examples are the least 
squares identification of input-output systems in polynomial 
form and, more generally, the maximum likelihood identifi¬ 
cation of ARMAX systems, i.e., input-output models where 
the disturbances follow a moving average process. These and 
most other methods require that several structural aspects of 
the model should be specified a priori. 

1) The number of equations, that is, the number of inputs 
and outputs of the system, and the decomposition of the 
system variables into inputs and outputs. 

2) The orders of each of the equations, that is, the so-called 
structural indices of the system. 

3) The stochastic properties of the disturbances, in particu¬ 
lar the joint correlation structure between inputs, outputs, 
and disturbances. 

4) The choice of a canonical parameterization, to avoid 
problems of nonidentifiable parameters. 

Manuscript received July 22, 1993; revised February 5, 1994 and May 
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The identification procedure that we propose in this paper 
differs in some crucial aspects from the methods just described. 
Our aim is to decompose a given multivariable time series, 
denoted by w , into two parts, i.e., 

w = w + w (1) 

where w represents a regular part and w the corresponding 
deviation. The aim is to keep the approximation error as small 
as possible, under the condition that the approximating time 
series w is sufficiently regular. To make this more explicit we 
next describe our notions of regularity and model error. 

A time series is called regular if it satisfies linear, time- 
invariant difference equations of finite lag. Let q denote the 
number of system variables, p the number of independent 
equations, and n the total lag, i.e., the sum of the lags of 
the individual equations. Time series are more regular when 
m: = q - p and n are smaller, i.e., the more equations they 
satisfy and the smaller the number of initial conditions. This 
can also be formulated as follows. Define the complexity of 
a linear, time-invariant, finite dimensional system by the pair 
(m, n), where m is the number of system inputs and n the 
(minimal) number of state variables. A system is called less 
complex if it has fewer inputs, i.e., unexplained variables, and 
if it has less states, i.e., initial degrees of freedom. Then a 
time series is more regular if it can be generated by a less 
complex system. 

The model error is evaluated as follows. For expository 
reasons we restrict ourselves in this paper to time series which 
are specified over the infinite time axis Z and which are square 
summable, i.e., we assume that w € The main results in 
this paper can be extended for modeling time senes observed 
over a finite time interval, but we will not treat this issue 
here to simplify the presentation. The error in approximating 
an observed time series w by a regular part w is measured 
by the f 2 -norm of the deviation w = w - w, denoted by 

IH|:= w(t) T M*)} 1/2 - 

In our approach we will assume that the required regularity 
of the approximating time series w has been specified a priori. 
Stated otherwise, we impose an upper bound on the complexity 
of the system that can generate the approximation. We denote 
by B 9,m,n the set of all time series that can be generated by 
systems with m inputs, q - m outputs, and n states, i.e., all 
time series that satisfy q-m independent linear, time-invariant 
difference equations with total lag n. Under this restriction we 
wish to minimize the approximation error as defined above, 
i.e., 

min{||iu - ti||; w € fl ,,m ' n }. (2) 
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S ving this problem for different values of (//?, 7 /) gives 
ii impression of the involved trade-off between the requued 
k ulanty and the resulting approximation error The search 
fc an acceptable model complexity is facilitated by the lau 
il at the error decreases for increasing {m n ), as B q 1,11 11 C 
h 1 nl * Vi if < m i and «i < n 2 Note that this criterion 
i lows tor deviations in all the system variables and that the 
t nor is not measured locally e g , as a prediction error but as 
ilie global ^-distance between the observation and the regutai 
part Theietore we give the name ‘global total least squaies 
10 the identification criterion (2) 

As compared to classical procedures, our formulation of the 
time series modeling problem involves less a pnon specih 
cations With respect to the loui structuial aspects discussed 
before, our approach has the following features 

1) The number of inputs and outputs is specified a pnon 
but the system variables are all treated alike so that there 
is no need for a specification of inputs and outputs 

2) The structural indexes need not be specified but only 
the total lag and this can be varied easily 

T) The problem formulation involves no stochastic speuli 
cations although these may be mcoipoialed by adjusting 
the norm on 1 2 

4) The cnterion is nonpaiameiric so that any representation 
may be chosen as it suits 

This papei has the following structuie To give some feeling 
for the global total least squares problem we fust describe 
the well known and lelatively simple case oi static total leasi 
squares in Section 11 lhe basis foi oui modeling theory is the 
behavioral approach to systems and this is bncfly discussed 
in Section 111 Foi further treatment of the behavioral approach 
in systems theory we rcfci to [101 and (ll| In Sections 
IV and V wc develop a highly structuicd type of system 
representations isometric state icprcscntations which lorm the 
cornerstone of our modeling theory Section VI concerns the 
question of how to determine an optimal appioximation ol 
an observed time series within a given model This linear 
optimization ptoblcm is solved by a projection algorithm 
which was introduced in [12] In Section VII wc treat the 
problem of determining an optimal system This is a nonlinear 
optimization problem over a nonconvex set We piopose lhiee 
model improvement constructions based on the results m 
Section VI These constiuctions arc used in Section VIII to 
estimate locally optimal models In Section IX wc describe 
three simulation experiments that illustrate the use o! the 
global total least squares method, and Section X contains some 
conclusions 

11 Static Total Least Squares 

We first consider the well-known case ol total least squares 
in static models Although it may be somewhat artificial in 
hts case to consider observations in 1 2 1 c , on an infinite time 
nterval, it gives a better introduction for the dynamic case 

Static total least squares involves the appioximation of 
1 given time series by a regular one that satisfies linear 
rondynamic relations For a required numbci of independent 
quations, the objective is to keep the approximation error 


as small as possible, le to imnimi/c ||u 
denotes the observed lime senes and a tin. »ppm\ u 
If the required number of independent cquilmns 1 , t 

b> p then the regularity of the approximation in h \\ 

Rtv - 0 tor sonic matrix R ol rank p This int uis 1 \w 
approximation w has iank at most m <j p \n | UI j 
(2), this class of regular time scries n given b\ B* w i i 
n =- 0 corresponds to the exclusion ol dynamic equations I hi > 
leads to the following formulation of static total least squaus 

Definition 2 1 (Static fatal l cast Septan v) ( 01 a giu n 
time series u £ !*{ and minimal required mimhu ot 
independent relations p - q - m find a decomposition 
tt - ui 4* n with n c B q m 0 and with ||u || minimal 

The solution to this problem is given b) the sinuulai value 
decomposition (SVD) Wc denote the usual Euclidean norm 
on R q by | | and the induced norm on /j b\ || |l 
Proposition 2 2 (Sint>ulai \alui Decomposition m I ) F vtrv 
11 t /{ can be decomposed as ti , \ // 1 with 

1) > d ^1 > 0 called the smgulai \alues ol u 

2) u £ R /Xl with \u | = 1 and 11 1 u f l) for / y y called 
the left smgulai vectors of u 

T) / ( i\ with ||? || — 1 and / J_ 1 j lor / ■/ y called the 

light singular vcclois of 11 

I he singular values arc uniquely dUcmnncd and if the) aie 
distinct then the singular vectors aie also uniquely determined 
up to a sign 

Pi oof The existence and pmpeities ol the SVD for finite 
matnces aie discussed meg [ I,et II denote the empirical 
covariance matrix of 11 ic H ^Z f K ^ u(t)n(l) 1 As TI 
is symmetne the left singular vectors aie equal to the light 
ones so let its SVD be given by 11 - , p n uj Let 

l - [//1 iij] ihen the cmpnical covanance mamx ol 

l 1 it is given by diag(//i p j) so l 7 11 consists ol ry 
orthogonal components ol nomi yjp It is easily verified that 
foi i (1 / V /7T)/y z ti and A — ^fp, u - £ 1 ] A 11 ,i , is 
an SVD ot u □ 

I he SVD solves the static total least squares problem as 
follows 

Pioposition 2 1 (Optimal Statu Appioumation) Let 11 — 
\,u t f be the SVD ot it C /' Then u , Aim, 

is the optimal approximation of 11 in B 1 ' 1J saiistying the 
static equations u J n 0 for t - /// f 1 ij 

Pi oof I his icsult follows immediately tiom a coire- 
spondmg pioperty of the SVD (or finite matrices ct Ml U 

Futniph One of the essential features ol total least squares 
is lhai all variables are treated in a similar wav For simplicity 
wc consider simulated data from a model with this type 
ol symmetry, namely an errors in variables model Hus also 
gives us the oppoituml> to relate total least squares to othei 
well-known identification methods and to discuss the role of 
stochastic assumptions We considei the model 

"(i) -(/) + '/(/) v(0 (0 f (0 O) 

The observed variables consist ot ti = (u //) and is 
an unobseived latent variable and r and 7 / aie unobseived 
distmbances Three of the possible methods foi the estimation 
of the parameter a are regression of // on u regression ol 11 



TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. I. JANUARY 1995 


52 IEEE 


on y, and total least squares. These methods can be interpreted 
as maximum likelihood methods, in the case that e and 17 are 
independent white noise processes with variances crj and 
and if in addition respectively cr* = 0, a] = 0, or <j 2 £ = <?%. 

As an example, we consider data generated by the model (3) 
with 0 = 1 and 0 % = a* = 0,5 and where z is a sample of 20 
observations of a white noise process with variance one and 
independent of 17 and e. The SVD of the resulting observation 

W = 5 - 64 (o.'69) Ul + 2 - 31 (~072) ,;2 (4) 

where v\ and v 2 are two orthogonal vectors of unit length. 
According to Proposition 2.3, the optimal static approximation 
tfc of w of rank 1 is given by the first term in (4), and 
this satisfies [-0.69, 0.72]w; = 0 . This corresponds to an 
estimated value 0.69/0.72 = 0.96, which is close to a — 1. 
The corresponding approximation error is ||u/ - xD|| = 2.31. 
Regression of y on u yields an estimate of 0.69, and regression 
of u on y gives 1.36. These results are depicted in Fig. 1. 

This example illustrates the fact that the static total least 
squares (TLS) scheme is in between both regressions. The 
method is easily adapted for the case cr 2 ^ nr 2 , by using 
a weighted / 2 -norm for yu = (u, y ), defined by ||u)|| 2 : = 
a 2 ||u || 2 + \\y\\ 2 with a = rr^/rr^. The regressions correspond 
to the extremal cases with infinite weight on one of the com¬ 
ponents. To choose an appropriate weighted l 2 -norm we need 
information on the relative errors involved in the measured 
system variables, c.q., the relative weight one attaches to 
deviations in the different variables. In this paper we will not 
further address these problems. 0 

Ill. Global Total Least Squares 

In the rest of this paper we are concerned with total least 
squares in dynamical systems. For this purpose we describe 
in this section our systems concept and corresponding notions 
of complexity and misfit. 

As stated in the introduction, we consider systems described 
by difference equations. From the formulation of the global 
total least squares problem ( 2 ), it is obvious that sets of 
equations with the same solution set are equivalent as they 
yield the same approximation error for every observation. 
Hence not the equations themselves, but their solution set 
is the essential object in our modeling procedure. This is 
a strong motivation for adopting the behavioral approach to 
systems, as introduced in [ 10 ] and [ 11 ], in which a system 
is defined by the set of time series that are compatible 
with the system laws. This set is called the behavior of 
a system. The system laws themselves are considered as a 
description or representation of the behavior. The properties 
we impose on difference equations are reflected by the set- 
thcoretic properties of the corresponding behavior. Linear, 
time-invariant difference equations correspond to linear, shift- 
invariant behaviors. If in addition the equations have finite 
lag, it can be decided if a time series belongs to the behavior 
by scanning it through a finite window. This property is 
called completeness, which is further explained in Appendix 
A, (see Definition A.l), We will further restrict the attention 
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Fig. 1. Scatter diagram of the data w the solid line denotes the TLS model 
and the dashed lines the two regression models; the dotted lines indicate which 
distance is minimized in the three different models. 

to behaviors in l 2 . The resulting class of systems is defined 
as follows. 

Definition 3.1 (l 2 -Systems): / 2 -systems are linear, shift- 
invariant, complete subspaces of l\ 

As a measure of the restrictiveness of a set of difference 
equations we take into account the number of independent 
equations and their total lag, as described in the introduction. 
We call the equations independent if the number of equations 
cannot be reduced without changing the solution set, i.e., if 
there exists no equivalent smaller set of equations describing 
the same system. On the set theoretic level of / 2 -systems, we 
define the complexity of a system in terms of its rank and its 
degree. The rank of a system is defined as the number of the 
degrees of freedom at each time instant, which is equal to the 
number of inputs, and the degree of a system corresponds to 
the dimension of the state space (see Definition A.3). 

Definition 3.2 (Complexity): The complexity of an / 2 - 
system B is defined as c(B)\ = (m, n), where m denotes 
the rank of B and n its degree. 

Let B 9,m,n denote the class of / 2 -systems with rank at 
most m and degree at most n. Then B 9,m,n consists of the 
solution sets of at least q-m independent linear time-invariant 
difference equations with the sum of their lags at most n, i.e., 
w € S 9,m,n if and only if there exists a system B € B 9,rn ’ n 
with w e B. 

We summarize this in Table 1. For further clarification we 
also relate these concepts to the classical characterization of 
systems in terms of input-output mappings. 

We define the following concept of the misfit of a system. 

Definition 3.3 (Misfit): The misfit of an / 2 -system B with 
respect to an / 2 -time series w is defined as d(w, B): = 
infri € *||w - t6||. 

This leads to the following reformulation of the global total 
least squares, problem (GTLS) as described in the introduction 
(see (2)). 




- i>ORDA AND HEU: MODELINO OF MULTIVARIABLE TIME SERIES 


TABLE I 

Comparison of- Sysi-em Concepts 


Difference Equations 

/a- behaviours 

I/O-mapping 

linear 

linear subspace 

linear 

time-invariant 

shift-invariant 

timp'invariant 

; finite order 

/ 2 -complete 

rational transfer function 

q~m indep. equations 

rank m 

rn inputs, q-m outputB 

minimal sum of lags is n 

degree n 

McMillan degree n 


Definition 3.4 (GTLS): For an observation w E l\ and 
given tolerated complexity (m, ri), determine an /^-system 
B* E such thal d(u;, B*) = min^ € B^ * d(w, B). 

This involves a double minimization. The inner minimiza¬ 
tion, evaluating the misfit d(w, B), amounts to optimization 
over a linear space. Secondly, wc have to determine a sys¬ 
tem for which the misfit is minimal. This is a nonlinear 
optimization problem over a nonconvex set. 

Leading Example: We describe a simple example, which 
will be used in the following sections to clarify the introduced 
general framework. We consider a time series ill B 2,1,1 , that 
is corrupted by white noise. The regular part w r consists of 
two components, u r and y r , where t/ 7 is the realization of a 
white noise process with unit variance, and y, satisfies 

Vl (/) - 2/3y r (f - l) + 2v, (1) - 2ti f (/ - 1). (5) 

The observation w consists of two components u and //, with 

u(t) = u,{t) + ri(1)\ u{l) = v,(1) + e(t) (6) 

where ?/ and e are independent white noise processes with 
variance 0.25, both independent of it ,. The data consists of a 
time series of length 100 which is generated by system (6). 
To obtain a time scries in I 2 , the observation (u, y) is taken 
to be zero outside the observation interval. 

Of course, this simple example could be solved by brute 
force as a nonlinear parameter optimization problem, disre¬ 
garding any system theoretic interpretation of the problem. 
In more complicated cases, however, this becomes hardly 
feasible. Therefore we will follow a system theoretic approach, 
that also gives more insight in the problem. 

The GTLS results in the following sections will be com¬ 
pared with those obtained by three other methods, namely 
regression, the “output error" method, and the “local total least 
squares" method. Here we mention that the procedures for 
GTLS and local total least squares have been implemented in 
Matlab and that for the regression and output error method 
we used, respectively, the procedures ARX and OE of the 
System Identification Toolbox. The regression model B reftr i*> 
obtained by regressing y(t) on y(t - 1), u(t) and u(t - 1). 
For the observation w this gives 

= {(«. V ) € <b y(t) = W. 10y(/ - 1) 

+ 1.42u(t)-0.78u(t-l)}. (7) 

This equation yields optimal one-step ahead predictions for 
y(t ), given y(t — 1), u(t), and u(t — 1). According to 
our terminology, we would call this method “local ordinary 
least squares." By ordinary we mean that only one of the 
components of w is approximated and by local that only 


the first step ahead predictions of difference equ.cm 
taken into account. In the GTLS scheme we appioxnr m ,• 
components and take full account of the global, imjlu i 
forward and backward implications of different e u| U »wi , 

The output error model B Qe is the system with ih n MM t 
that for the given input the corresponding system tu< u* 
is as close as possible to the observed outpul y. In hi,i 
method has some similarity to GTLS, the difference ben. * \l>' 
the input is kept fixed and only the output is approximated. In 
our terminology it is a “global ordinary least squares" method. 
The estimated output error model is 

B ue - {(«. y) 6 lb y(t) = 0 .G 8 y(l - J) 

+ i.irm(/)-j.r>iM(/-i)}. («) 

Finally we use a simple modification of the static total least 
squares method of Section II to determine a first-order model, 
as follows. Define the block Hankel matrix H E by 
H(t) — [ ,,, ( ( ;- ) 1) ], then static equations for H correspond to 
first-order equations for the observation w. The optimal static 
equation for H is obtained by applying Proposition 2.2 with 
q = 4 and m — 3. Clearly, the quality of the corresponding 
first-order model for w is evaluated only locally. By this 
we mean that, for example, the second-order restrictions 
on (w(t - 2), w(t - 1), w{i)) r implied by the model are 
not taken into account, and the same holds true for higher 
order restrictions. Therefore we call this the “local total least 
squares" model. For the data of this example this gives 

B ]ih = {(u,y)elly(l) = ^\y(t-i) 

+ 1.87u(/) - \.H‘2u(t - l)}. (9) 

This example will be continued in the next sections. 0 

IV. State Representations 

One of the crucial questions in our modeling theory is 
how to calculate the misfit of a system with respect to a 
given observation, cf. Definition 3.3. Obviously this requires 
a numerical representation of the system. In Section V we 
develop a representation that is extremely useful for this 
purpose, namely the isometric state representations. They also 
play a central role in the construction of optimal models, as 
discussed in Section VII. We now first introduce general state 
representations, which will be abbreviated as SR. Let a denote 
the shift operator, defined as a.r(/):= .v(t F 1). 

Definition 4.J (State Representation). A state representa¬ 
tion (A, B , Cl D) of an / 2 -system B is a description of the 
form 

B = {w E ij; 3 T E /£, v e r 2 ri such that 

c it = Ax + Bv and w = Clr -f Dv} 

with A E R” xr \ B E R" xm , C E R* x,, t D E R* x ™ and 
rt, rn E N. 

Here v is an auxiliary input, x is a state trajectory and w 
a system trajectory, 1 n denotes the number of auxiliary inputs 
and v the number of state variables. The system defined by 
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this representation is denoted as B(A , B, C, D). We mention 
that the condition that x and v are square summable involves 
no Joss of generality, as it can be shown that for every 
(x, v); Z -+ R" x R w generating w € there also exists 
(x\ v f ) £ /£ x V <2 generating the same time series w . Systems 
in i.e., with rank at most m and degree at most n, 

are precisely those systems that admit an SR with m auxiliary 
inputs and n states, cf. Proposition A.4. 

Representations are called equivalent if they describe the 
same system. They are called minimal if the number of state 
variables and the number of auxiliary inputs are both minimal. 
These two quantities can be minimized simultaneously, so for 
every system there indeed exists a minimal SR. From a given 
SR we obtain equivalent ones as follows. 

Proposition 42 (Equivalent State Representations): Let be 
given a state representation (A, B, <7, D) of an ^-system B. 
Then for all invertible S e R r ' xn , invertible R £ R mXm , and 
F £ R mxn , (S(A 4- BF)S -\ SBR , (C 4- DF)S~ l . DR) 
is also a state representation of B. Moreover, if (A, B, C\ D) 
is minimal, then all minimal state representations for B are 
obtained in this way. 

Proof: By definition, w e /£ is contained in 
B(A , B, (7, D) if and only if there exist an v and .r such 
that ox = Ax 4- Bv and w = Cx -f Do. These equations are 
equivalent to o(Sx) = S(A + BF)S~ l (Ss) + SBR(R~ 1 (v- 
Fx)) and w = (<7 + DF)S^(S:r) + DR(R~ l (i> ~ Fx)), 
provided that S and R are invertible. This shows the 
equivalence of the representations. For a proof of the fact that 
for minimal representations all equivalent representations are 
obtained in this way we refer to 15, Corollary IV.3-4]. □ 

Observe that minimal SR’s for a given /^-system are highly 
nonunique. The choice of basis in the state space, corre¬ 
sponding to lS\ is a well-known nonuniqueness of stale-space 
representations. In our fraihework the auxiliary input v is 
merely a tool to describe the system behavior and need not 
have additional external significance. This allows for a basis 
transformation for the auxiliary input, corresponding to R. 
Further the behavior is invariant under a static state feedback 
F to this auxiliary input. This is in contrast to the common 
notion of feedback to the actual input of the system, which 
would affect the set of compatible input-output pairs. As a 
consequence, in our framework the spectrum of the A-matrix 
is not an intrinsic property of a system. In the next section 
we exploit this nonuniqueness to obtain SR’s with convenient 
properties for computing the misfit of a model with respect 
to data. 

Minimality of a representation can be expressed in terms of 
rank conditions on the matrices (A, B, (7, B), as follows. 

Proposition 4.3 (Minimal State Representations for 12- 
Systems): A slate representation (A, B, C\ D) is minimal 
if and only if: 

1) (A. B) is controllable 

2) VF e R" IXW , (A 4 BF, <7 + DF) is observable 

3) kerB - 0. 

Proof: See Appendix B. 

By eliminating the state, SR’s induce an image repre¬ 
sentation of / 2 -systems. For the exposition in the sequel, 


representations with A asymptotically stable are most rele¬ 
vant. Every / 2 -system allows for such a representation, which 
follows from Propositions 4.2 and 4.3.1. 

Proposition 4.4 (Image Representations): Let B be an l\- 
system in B 9,m n , with state representation (A, B, C, B), 
where A is asymptotically stable. Let G : Vf / 2 be defined 
as <7 = C(ol - A)~ l B + B, i.e., (Gv)(t) = Dv(t) 4- 
£r=i CA k - l Bt>(t. - k). Then B = imG. 

Proof: G is well defined, as A is asymptotically stable. 
It is easily verified that the equations ox = Ax + Bv and 
w = Cx + Dv imply that Gv = w. □ 

Leading Example (Continued): Consider the / 2 -system cor¬ 
responding to (5), 0 ex : = {w £ /j; w = (u, 1 /), with y(1) = 
2 f'3y(t - 1) 4- 2 u(t) - 2 u(1 - 1)}. This can be written in 
input/state/output form as follows 

;■(/ + l) = 2/3x(f)4 tt(0;tf(0= -2/3 j(/) + 2m(/). (10) 

From this it is easy to obtain an SR by taking the auxiliary 
input i) to be equal to w, which gives 

£U = S( 2/3,1, (11) 

This representation is minimal. The corresponding image 
representation G = £*=.0 has coefficients ( 7 0 - \\] , 

and G k — [_( 2 /i)^]- An SR equivalent to (11) is given by 

^1, 1/2, which follows from Proposition 4.2 

by taking S — 1, R = 1/2, and F — 1/3. Note that in 
this representation the auxiliary input is given by ij. This also 
illustrates that the eigenvalues of A are not intrinsic for the 
system B. 0 

V. Isometric State Representai ions 

In this section we define isometric state representations 
(ISR's), which are defined by a local isometry property in¬ 
volving the state variable. 

Definition 5A (Isometric State Representation): A slate 
representation (A, B, (7, B) is called isometric if for all 
x e R 7 \ v € R"\ uu £ R^ and z £ Tt n such that z =-- A.r+Br 
and w = Cx 4- Do there holds 

M 2 + M 2 = M' 2 + l~l 2 - 02) 

Equivalently 

(? ”n)-(!s 0 

Minimal ISR’s can be constructed from arbitrary minimal SR’s 
as follows. 

Proposition 52 (Construction of/SR): Let (A, fl, (7, B) 
be a minimal state representation, and let K £ R riXTI be the 
unique symmetric positive definite solution of the algebraic 
Riccati equation 

K = A t KA - ( B r KA + D t C) t (B t KB + D T D)~' 

■{B’ i KA + D t C) + C t C. (14) 
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Let the matrices S € R"*", F € R mx " and 7? <= R mxni be 
solutions of the equations 

S t S=K (15) 

RR t = (B t KB + D t D)- 1 (16) 

F = ~(B t KB + D t D)- 1 (D 1 KA + /; T f). (17) 

Then (S{A + BF)S~\ SDR, (C + DF)S~\ DR) is an 
equivalent isometric minimal state representation. 

Proof: See Appendix B. 

The following proposition gives a necessary minimality 
condition for ISR’s and states that they are unique modulo 
unitary transformations. 

Proposition 5.3 (Minimal ISR)' 

1 ) If (A, fl, C, D) is an ISR, then A is stable. If the rep¬ 
resentation is minimal then A is asymptotically stable. 

2) Two minimal ISR's (A, B , (\ D) and (A 1 , fl', C* m D f ) 
are equivalent if and only if there exist unitary 
matrices U and V such that (A\ fl', f", D') = 
( UAV T , VB\\ CU T , I)V). 

Proof: See Appendix B. 

ISR’s induce a description of /j-systems as the image of 
an isometric operator. This is made explicit in the following 
proposition. 

Proposition 5.4 (Isometric Image Representations): Let 
(A, B. f\ L)) be a minimal isometric state representation 
of an /system B. Then the corresponding image operator 
(1 : 1 2 —► 1 2 defined by G = ('(a I — A) 1 fl 4 I) is isometric, 
i.c., \\Go\\ = ||/’|| for all v G Pf * 

Proof ■ If io - llv then w(t) - C.i(t) 4 Po(1) for 
.r = (crI - A) ] Bi\ so that (ta A./ 4- Bo. Summation 
of (12) over I G Z yields ||/>|| 2 4 |//»|| 2 4- ||rr.r|| 

Clearly ||.r|| = ||<t.i-||, from which the result follows. □ 

Summarizing, every 1 %-system can be represented as the 
image of an isometric q x rn transfer function with an ISR 
as its realization. In the literature an isometric operator G is 
sometimes called lossless, and if it is in addition stable it is 
called inner. So an ISR is a realization of an inner transfer 
function that displays the isometry in a local way, in terms of 
the state variables. 

Leading Example (Continued): We apply the construction 
of Proposition 5.2 to the SR ( 1 1) of / 2 -system B ^. This yields 
K = 4/9, S = 2/3, /? = 3/7, and F — 4/21, resulting in 
the isometric representation 

cr.r = 6/7:r 4“ 2 / 77 ); w = ^ 3 I- r + lK 08) 

According to Proposition 5.3, B ey has a unique minimal ISR, 
modulo sign changes for the state and for the auxiliary input, 0 

VI. Optimal Approximation within a System 

In this and the next section we consider the GTLS problem 
of Definition 3.4. As stated before, this involves a double 
minimization. In this section we discuss the computation of 
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Fig. 2. Projection scheme, 

the mislit, and in the next section we treat the problem ol 
selecting an optimal model. 

For the computation of the misfit we will use the adjoint 
G* of an operator G : Ilf —► /£. This is defined by the 
condition ( w , Gv) - (G*w< v) for all 0 G Pf and to € /!{, 
where (■,•) denotes the inner product on / 2 . h follows that 
G is isometric if and only if G*G = 1 . For the image 
operator G: — (\oT — A)" J fl 4- D the adjoint is given 
by G* = B T {o-U - A r )-HV + D' 1 . This can also be 
written as G* ~ Ti ru G T 'R q where : P 2 -+ /£ denotes 
the time reversion operator defined by 7 Z k w(1)\- w(-t) and 
G 1 : = B T (rrl - A r )~ l C lT 4- D 1 . 

Theorem 6.1 (Optimal Approximation Within a System) 1 Let 
be given an observation w G /.> and an I ]-system B C B 7 "' T '. 
Let G : 1 2 —♦ 11 be an isometric image representation of 

B. Then the optimal approximation to e B is given by 
10 = GG*w, with misfit d(w, B) - ||(i q - GG*)w\\. 

Proof ■ This is a well-known result. To be explicit, we 
should minimize || w - 6 y 7;|| over v G /?'. Let o' = 0 - G*w, 
then \\w - Gv\\ 2 = ||«ii - GG* ”'" 2 GG*w\\ 2 + 

||C/i/|j 2 . as (w-GG*w, GV) = (G*m- G* //>, o') = 0 where 
we use that G is isometric so that G*G = 7, n . So the minimum 
is achieved by taking o' = 0, and 10 - GG* to. □ 

The optimal approximation within a system B gives rise to 
a decomposition to = w 4- with to G B regular and with w 
the corresponding approximation error. We will now show that 
//> also exhibits regularity. As 10 is obtained as the projection 
of w on fl, it follows that w G B 1 - \w G / 2 '. (m'< w f ) — 
0 for all w' G B\. This set is clearly linear and shift-invariant. 
The following theorem states that it is an / 2 -system. 

Theorem 6.2 (Orthogonal Complement). Let B be an / 2 - 
system with rank m and degree n. 

1 ) The orthogonal complement B 1 of B is an Z 2 -systcm. 

2 ) B hB B ± = 

3) B L has rank q - rn and degree n. 

4) Lei ( A , B , C. D) be a minimal isometric state repre¬ 
sentation of B, and let B , P be such that ^ C L ^ ^ j 

is a unitary matrix. Tlicn (A, B , C\ D) is a minimal 
isometric state representation of fl 1 . 

5) Let w be the optimal approximation in B of w G / 2 - 
Thcn the approximation error Co: — w -Co is the optimal 
approximation in B 1 of w. 

Proof: See Appendix B. 

We summarize this result in the following projection scheme 
as shown in Fig. 2. Here G and G denote the image repre¬ 
sentations corresponding to, respectively, (A, D) and 
(A, fl, D), and G: = [G (7]. In the literature G is some¬ 
times called a lossless embedding of G. Further v and v are 
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TABLE U 
Misfit of Models 


model 

Btx 

Boe 

Breqr 

Bnu 

Brand 

misfit 

5.22 

5.62 

6.84 

5.30 

13.53 


the auxiliary inputs corresponding to w and w ) i.e., w = Go 
and w = 6v. According to Theorem 6.1 and 6.2.5, there holds 
that v = G*w and v = G*w as shown in Fig. 2 . 

Leading Example (Continued): We apply the projection al¬ 
gorithm of Theorem 6.1 to determine the optimal approxi¬ 
mation w E B ex of the observation w described in Section 
HI. Let G denote the image operator corresponding to the 
ISR (18), and let (A, B, C, D ) denote the corresponding 
matrices. First compute f>:= G*w , which is given by the 
backward state equations x(1) — A T x(t + 1) + C T w(t)\ 
v(t) = B T r(t -I- 1) + D T w(t ). Then w = Gv is given by 
+ 1 ) = Ax(t) + Bv(t)\ w(t) = Cx(t) + Dv(t). 
According to Theorem 6.2, has ISR (6/7,3/7, 

Tp\\~?n\)' This corresponds to the equation 

y(t) * y(t - l) - l/Mt) + 1/2 u(t - l). (19) 

So every time series in l\ that is orthogonal to B vx satisfies 
this difference equation, in particular the approximation error 
w: = w - w. The misfit of B ex equals ||i7;|| = 5.22, as 
compared to ||u>|| = 22.72. This is considerably smaller that 
the / 2 -norm of the white noise by which the observation was 
corrupted. Recall that w = w, + u>„, with w v G B ex and 
w n white noise. In our example, ||w ri || = 7.62. The optimal 
approximation is simply obtained by projecting the noise w„ 
on B ox , with a resulting decomposition w t1 = w n ■+ w 7l , where 
t&„ € Sox and «/„ 6 B t 4- In our case, \\ib T ,|| = 5.56 and 
||ii» n || = \\w\\ = 5.22. We also determine projections onto the 
models B M , gr , S OP , and Buis, as defined in Section III, and onto 
a randomly chosen system Brand € B 21i . The parameters of 
an SR of B ram i were obtained by a random sample from the 
standard normal distribution. The resulting misfits are listed in 
Table 11. It turns out that the local total least squares model is 
of relatively good quality in this example. This, however, may 
be completely different in other situations, as we will illustrate 
by an example in Section IX-B. 0 

VII. Model Improvement Constructions 

In this section we discuss the second part of the GTLS 
problem, namely determining an I 2 -system with minimal misfit 
with respect to a given observation. Formulated in terms of 
state representations, this amounts to the following. 

Definition 7.1 (GTLS in terms of SR): For given observa¬ 
tion w 6 I* and tolerated complexity (m, n), determine a 
stale representation (A, B, C, D) with m auxiliary inputs 
and n states and an auxiliary input v £ Vf such that for 
— (C(nl -A)~ 1 B + D)v, the error ||«j~w|| is minimal. 

We follow an iterative approach for this nonlinear problem. 
In each step we keep some parameters fixed, such that the re¬ 
sulting vsubproblem becomes sufficiently simple. For instance, 
for fixed (A, B, C, D) the resulting problem in v is solved 


by the projection scheme discussed in the foregoing section. 
We consider the following subproblems. 

Problem l — Optimal C and D: For given A, B and t), 
solve the GTLS problem for C and D. 

Problem 2—Optimal B and D: For given A, C and i ), solve 
the GTLS problem for B and D. 

Problem 3 — Optimal B , D and v in an ISR: For given A 
and C with A r A + C T C = i„, solve the GTLS problem 
for B, Lh and v, under the restriction that (A, B, C, D) is 
isometric. 

In the next theorem we given constructive solutions of these 
problems. 

Theorem 72 (Model Improvement Constructions): 

1) Construction 1 (Projection of the Approximation Error): 
Let x be defined by ax = Ax + Bv, and let £ : - {w e 
l q \ 3 C 6 R q * n , D e R"*™ such that w = Cx + Db). 
Let Px + Qu denote the orthogonal projection of uj onto 
£, then Bi*.= S(A, B, P, Q) solves problem 1. 

2) Construction 2 (Dual version of construction 1): Let 

JF;= {iff g 1 q 2 \ 3 B 6 R"*"\ I) G R ,yy ™, such that 
w — ( C(al - A) _] B + Let the orthogonal 

projection of w onto T be given by {C[oI - A)~ 1 P + 
Q)v, then 62 := (A. B, <7, Q) solves problem 2 . 

3) Construction 3 (SVD on auxiliary inputs): Let B G 
R nXq , D G R qXq be such that ^ is a unitary 

matrix. Let v: -= (B 7 (o ’ 1 1 - A 1 )“ J C T + D 1 )w have 
SVD r = and let ?/„,:= 

Then Ba: — B(A, Bf/ rrj , C\ DU m ) solves problem 3. 

Proof: Parts 1 and 2 follow immediately froin the defi¬ 
nitions of £ and 7. For part 3, observe that (A, /i, C. D) is 
an ISR if and only if there exist B, /), and a unitary V such 
that [B B] — BV and [/) D] ~ I)V For this representation 
the projection scheme gives = V T H. with misfit ||? 1 ||. By 
taking V — [u ,\, ■ ■ ■, u q ], this misfit is determined by the q-m 
smallest singular values of v , which is minimal. □ 

We use these results in an iterative algorithm for the GTLS 
problem. At each step we use one of the three constructions 
to improve the model. The resulting model parameters are 
transformed to ISR, which also involves an update of the 
A-matrix. The projection scheme is then applied to update v. 

Proposition 7.3: The above method leads to a sequence of 
models with monotonically decreasing misfit. 

Proof: This is immediately evident from Theorem 7.2.D 

So the algorithm leads 10 a convergent sequence of misfits. 
In the next section we show that, in the limit, the correspond¬ 
ing models are stationary points with respect to the GTLS 
criterion. 

Leading Example (Continued): To illustrate the foregoing, 
we consider the data w (//, y) described in Section 111 
(see (5) and ( 6 )). To investigate the effect of the choice of an 
initial model, we apply the model improvement constructions 
to the models B ex , B OP , B^, Bjtis 1 ar *d #ran<i as described 
in Sections III and VI. The results are in Table III. The first 
row shows the initial misfits, cf. Table II. The next three rows 
contain the misfits of the models obtained by applying each 
of the constructions separately and only once. This shows that 
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TABLE III 

Model Improvements 


model 

&ex $ot B'regr Bltla Brand 

misfit 

5.22 

5.62 

6.84 

5.30 

13.53 

construction 1 

5.14 

5.12 

6.10 

5.23 

6.21 

construction 2 

5.14 

5.13 

6.13 

5.30 

6.26 

construction 3 

5.21 

5.15 

6.78 

5.30 

6.59 

limit 

5.11 

5.11 

5.11 

5.11 

5.11 


TABLE IV 
Order Shfction 


order 

n = 0 

71 = 1 

71 = 2 

71 = 3 

n = 4 

misfit 

6.71 

5.11 

5.02 

4.91 

1.91 


each individual construction can give a significant decrease 
of the misfit. The last row shows the misfit resulting from 
applying these constructions ileratively until convergence. 

For each initial model convergence occurred after about 20 
iterations. The limiting model is the same in all four cases, 
which suggests that it is optimal. It is given by 

is = {(u, y) G / 2 2 ; y(t) = 0.67;/(/ - i) + 1.85u(f) 

- lMu(t - 1)}. (20) 

The parameters of this system are relatively close to those ol 
the data generating system, cf. (5). 

Next, we investigate whether the model order can be de¬ 
duced from the data. For this purpose we compare in Table IV 
the optimal misfits for models of various degree. The misfit 
of the optimal static model is given by the smallest singular 
value of w. This clearly motivates the choice of a first-order 
model. It is significantly better than the static model, and an 
increase of the order gives only small improvements. 0 

VIII. Optimality Conditions 

From the model improvement constructions in Section VII 
we can derive necessary conditions for optimality, as for an 
optima] model the constructions can give no improvement. 
We express the optimality conditions in terms of empirical 
covariances. For two sequences a € l 2 an d & € l 2 is 
defined as cov(a, i>): = a (0b(t) 7 € K kxi . Further, 

by cov([oi, u 2 ], [ 61 , b 2 ]), we denote the covariance matrix of 
the combined trajectories [a{ a^Y and [b{ b^Y . 

Theorem 8.1 (l 2 -optimality Conditions): Let B denote a 
GTLS model for an observation w 6 l 2 . Let w e B denote the 
optimal approximation of and w £ B 1 the corresponding 
approximation error. Let x , v denote, respectively, the state 
and auxiliary input corresponding to w in a minimal state 
representation of B , and let x, v be defined analogously for 
w. Then the following equivalent conditions hold: 

1 ) cov(t), v) =■ 0 , cov(f>, j’) = 0 and cov(x, D) = 0 ; 

2 ) cov([fi, i], [5, x]) = 0 ; 

3) cov([ 7 ;, x, w, ox), [ii, Jr, u», ax]) = 0. 


Proof: See Appendix B. 

In practice, to evaluate how far these condition* Sll <‘ 4, a 
it may be useful to consider the empirical comelau.ms, 11 \>. 
covariances scaled by the magnitude of the variables. 

Next we investigate how far these conditions are sulliun,. 
for optimality. It is not difficult to check thai the number ol 
free parameters in (A, B , C, D), modulo the equivalence ol 
Proposition 4.2, is given by nq + m(q - m). This is precisely 
the number of equations in Theorem 8.1.1. In fact, these 
conditions characterize the stationary points with respect to 
the GTLS criterion. Wc call a system B a stationary point 
for an observation w if all the derivatives of the GTLS misfit 
d(w, B(A , /?, (\ D)) with respect to the system parameters 
are zero for a minimal SR of B. As all its minimal SR*s 
are linearly related this is equivalent to the condition that all 
minimal SR's of B arc stationary points. 

Theorem 8.2: An / 2 -system B satisfies the optimality con¬ 
ditions of Theorem 8.1 if and only if B is a stationary point 
of the GTLS criterion. 

Proof: See Appendix B. 

This shows that the GTLS algorithm can only converge 
to stationary points. This does not, however, establish con¬ 
vergence of the systems. For what it is worth, we mention 
that we never encountered convergence problems in any of 
our simulations. A thorough discussion of the convergence 
properties of the algorithm falls beyond the scope of this paper. 

The foregoing results can also be used to analyze whether a 
proposed system B is close to optimality. This is, for example, 
relevant in the formulation of stopping criteria for the iterative 
algorithm of Section VII. Probably the most convincing way to 
evaluate optimality is to consider the distance between B and a 
GTLS model B* . as defined in Definition 3.4. This is in general 
not feasible, however, as it would require the knowledge of 
B*. Instead of asking how far the system should be changed 
to become optimal for the observed data w. wc will consider 
the question of how far these data should be changed to make 
the given system optimal. For pragmatic reasons we consider 
the distance to the nearest stationary point, defined as 

min{||«7||; B is stationary for w - w). (21) 

Because it seems difficult to evaluate this distance exactly, we 
present an upper bound that is relatively easy to compute. This 
upper bound is obtained by allowing only adjustments of the 
data that belong to B 1 , so that the optimal approximation of 
the data within B is not affected. This leads to the following 
definition of the optimality margin. 

Definition 83 (Optimality Margin): The optimality margin 
of a system B with respect to an observation w is defined as 

min{|M|; w e B 1 and B is stationary for w - in}. (22) 

The following result shows that the compulation of the opti¬ 
mality margin is indeed relatively easy. 

Proposition 8.4 (Optimality Margin): Let w denote the op¬ 
timal approximation of w in B and let w : — w — w denote the 
corresponding approximation error. Further define 2: = {z € 
B 1 : B h stationary for w -I- z}. Then 2 isr a linear space, and 
the optimality margin is given by ||in - 11 / 1 |, where w f is the 
orthogonal projection of w on 2 . 
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TABLE V 

OriMALHY Margins 


model 

B&tl* Bws &oe Brngr &ltla Brand 

optimality margin 

4.58 * 10 -8 | iJDQ 1 2.29 ] 3.32 | 1.23 | IU9 


Proof: S$e Appendix B. 

These results can also be used to determine a lower bound 
for the achievable misfit. This indicates the quality of a 
proposed model relative to the optimal one. For this purpose 
we will assume that a proposed model B is not only stationary 
for the adjusted data w - w , but even globally optimal. 

Proposition 8.5 (Bounds for the Minimal Misfit): Let be 
giyen an observation w, and let v* be the minimally achievable 
misfit under a certain complexity constraint. Further let B be a 
(JTLS model of tolerated complexity for adjusted data w — w 
with misfit c: = d(w, B). Then there holds 

e - 2||W|| < «* < a. (23) 

Proof: Let B* denote a GTLS model for the original 
data w, so that d(vj, B*) = e*. Then the upper bound 
follows from the optimality of B*. For the lower bound 
we use the properties of the misfit that d(w, B) < ||w|| 
and d(w\ -f B) < d(w i, B) + d,(w 2 , B), so that e = 
d(w, B) < d(w, B) + d(w - w, B) < ||wJ|| + d(w -w,B*)< 
||tZJ|| + d(iu, B*) + d(w, B *) < 2||t/7||±e*. Here we have used 
the optimality of B for w - w in the second inequality. □ 

This shows that for models with a small optimality margin 
the corresponding misfit is nearly optimal. 

Leading Example (Continued): The covariances of Theo¬ 
rem 8,1 give a first indication of the optimality of a model. 
To make this scale invariant we consider the correlations in an 
ISR. For the nominal model B ox they are around 0.3, for the 
regression model B rpgI around 0.35, while for the randomly 
chosen model H raiKl correlations of 0.8 occur. For the model 
0 K ti fi in (20) the conrelations are approximately zero, below 
10 ~ 5 . 

From the optimality margins we obtain more precise infor¬ 
mation about the optimality of the systems. They are listed 
in Table V. This shows that it requires only a change of 
the observation of the order ||W|| « 10” 5 to make u a 
stationary point, cf. (22). Now assume that Bg t i« is globally 
optimal for w - W , which is reasonable assumption. The 
evaluation of the bounds in Proposition 8.5 for the data in this 
example with c = 5.1095 and 2||uJ|| « 10 -5 shows that the 
optimal misfit 5.1095 is determined within an accuracy 
of l(r r \ This also shows that B#h is optimal within this 
accuracy level. 

IX. Simulation Experiments 

We illustrate the use of the GTLS algorithm by three 
simulation experiments. The first example concerns model 
reduction, i.e., the approximation of a system by one of lower 
complexity. We use weighted / 2 -norcns to determine the / 2 - 
optimal approximation of a systems impulse response. In 
the second example we show that the algorithm can handle 
noncausal systems without any additional difficulty. As a final 
example we identify a system with multiple outputs, described 


TABLE VI 

Properties op Reduced Models 


model 

Bgtls 

Bial 

&hank 

misfit with respect to vj 

0.32 

0.36 

0.38 

error in impulse response 

0.54 

0.46 

0.57 

Hankel norm distance 

0.93 

0.80 

0.74 


TABLE VII 
Effect or* Scai inc. 


scaling factor 

1 Qf = 1 0 = 10 Q = 100 

||.|| tt -misfit with respect to w 
error in impulse response 

0.3204 

0.5407 

0.4457 

0.4477 

0.4476 

0.4476 


by more than one equation, and wc discuss the choice of the 
model complexity. 

A . fa-Mode l Reduction 

The algorithm of Section VII can be applied to arbitrary 
time scries in i 2 , Here we analyze its performance for very 
special data, a system impulse response. The aim is to reduce 
the dimension of the state space in such a way that the error 
in the impulse response is as small as possible (cf. |9] and the 
references therein). We compare the results of our algorithm 
with those obtained by balanced reduction and optimal Hankcl 
norm approximation that have been developed especially for 
model reduction (see [2] and [7]). We consider the single-input, 
single-output system B with poles in ±0.9/ and -0.7 ± 0.G/, 
so B ={[w 1 /] T £ / 2 ; y(t) - 0.5u(J) - 1 - J)- l.GG/y(/ - 

2) - 1.13//(t- 3) - ().G9y(f ~ 4)}. This system has complexity 
(1, 4), and we consider reduction to complexity (1, 2). The 
observation w 6 B consisls of two components u anti //. where 
u is a unit pulse at time / = 0 and y is the corresponding 
response. 

We apply the GTLS algorithm, starting in a randomly 
chosen model. When the decrease in the misfit has become 
sufficiently small, below 1() _J0 , the iterations arc stopped. This 
occurs after a few hundred iterations. The final model # fi u H is 
compared in Tabic VI with the balanced reduction Si M i and 
the Hankel norm reduction Shank- 

The 1‘2-trroT in the impulse response in B^\* is somewhat 
larger than that in By m \ and #hank- If one is interested in 
this response then one should prcvenl an approximation of 
the input, so that an optimal approximation of the output 
becomes the criterion. This is achieved by taking the norm 
||m>|| 2 : = ar 2 ||ii || 2 ± ||?/|| 2 with a sufficiently large. The effect 
of increasing nr is given in Table VII, This shows that for large 
at the method determines better approximations of the impulse 
response. 

This also gives bounds for the minimally achievable I 2 - 
error, which we denote by e*. Let B n be the GTLS model 
for || - ||c, and let y n be the impulse response of B n ; then 
it is easily checked that d(w, B Lk ) < e* < \\y - ?/ rt ||. By 
increasing a we can obtain an arbitrarily accurate estimate of 
e*. This gives an iterative solution method for the / 2 -oplimal 
impulse response approximation problem. For a = 100 we 
obtain e* = 0.4476; see Table VII. The corresponding model 
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Fig. 3 The impulse response (solid line) and its 1 2 -optimal approximation Fig. 4 The Mexican hat. 
of second order (dashed line) 
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is given by the equation y(t) = -1.35//(/-l)-0.78t/(/-2)-h 
0.50ti(/) + 0.11a(/ - l)-0.21u(/- 2). The impulse response 
is depicted in Fig. 3. 

B Noncausal Systems 

We consider the following noncausal system, the "Mexican 
hat/’ 

n’ 20 ) = -^j J <p(r)i‘n(t - 2 -)dr| (24) 

where <p is the standard normal density <p( r) = 

_1 _ 1 r 2 

(271) 'ie 2* 1 . In the simulations wc consider a discrete-time 
version ^(0 — ~j) Wlth N - ^ and limc 

steps of size 0.2. Note that W 2 is not a causal output, as the 
transfer function from in i to W 2 is not proper. 

First we apply our procedure to the impulse response 
observation, i.e., w\ is a unit pulse at time / = 0 and W 2 is the 
corresponding response; see Fig. 4. The misfits of the optimal 
models of orders 2, 4, and 6 are given in Table VIII. They 
are compared with the optimal Hankel norm approximations 
of orders 2, 4, and 6. These are obtained by approximations of 
orders 1, 2, and 3 of the causal part of the impulse response 
and using the symmetry of the Mexican hat to estimate the 
anticausal part. Analogously we determined approximations 
by balanced reduction. In Table VIII we also list the error 
in the impulse response of these models, i.e., the ^-distance 
between the systems impulse response and the Mexican hat 
w 2 . This error should be compared with the magnitude of the 
response, given by ||w 2 || = 0.35. 

Hankel norm reduction and especially balancing give rather 
good results. They can only be used, however, when a causal 
impulse response is available. The GTLS method makes no 
use of the symmetry of the observed signals, but this property 
is preserved well in the identified models. This is illustrated 
in Figs. 5 and 6, which contain the optimal approximations of 
orders 2 and 4. 

We also apply the GTLS method to data w n consisting of 
two noisy steps for the input and the corresponding system 
output. These data and the optimal approximation of order 4 


TABLh Vlll 
Model Quaiiiy 


model 

Bph 

B„al 

&hank 

n=2 

misfit 

0.17 

0.18 

0.32 


ciroi in impulse res p on bp 

0.21 

0 20 

0 34 

n=4 

misfit 

0 046 

0.047 

0.052 


error in impulse response 

0.051 

0.050 

0.055 

n=6 

misfit 

0.0070 

0.0071 

0.0078 


error in impulse response 

0.0070 

0 0075 

0.0081 


0 8 1 

1 

0 6 

0 4 

0 2 


° 2 40 10 20 0) 0 10 20 U) 40 

Fig. 5 The pulse w j (solid line), the first component of the 1 2 -optimal 
approximation of second order (dashed line) and fourth order (dash-dolled 
line) 

are given in Fig. 7. We should mention that the approximation 
error in the input is so small that it is nearly invisible in this 
figure. 

The misfit of this model and the error in its impulse response 
are listed in the first column of Table IX. In view of the 
results for ?i = 4 in Table VIII, this shows that the identified 
model is a rather accurate approximation of the Mexican hat. 
Depending on the choice of an initial model, it typically takes 
a few hundred iterations to obtain convergence, and sometimes 
convergence to a local optimum occurred. 
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Fig, 6. The Mexican hat W 2 (solid line), the second component of the 
f?-optimal approximation oi second order (dashed line) and fourth order 
(dash-dotted line). 



Fig. 7. Noisy step measurement w T , (solid lines) and its /■_>-optimal approx¬ 
imation of fourth order (dashed lines). 


TABLE IX 

Local vs Global Miihod 


method 

Global TLS Local TLS 

misfit with respect to w n 
error in impulse response 

0.11 

0.066 

1.17 

0.522 


Finally, as mentioned at the end of Section VI, we will once 
mrire consider the local total least squares method described 
in Section III. The results for the fourth order model are given 
in the second column in Table IX, This clearly shows that, 
perhaps not surprisingly, the local method gives poor results 
with respect to the global total least squares criterion. For 
example, the error in the impulse response of the local model 
is even larger than the Mexican hat itself, which has norm 
II wall = 0.35. 

C. A System with Multiple Outputs 

In this experiment we consider a system with multiple 
outputs, so that a single difference equation does not suffice 
to describe the system. For simplicity we consider a system 


TABLE X 

Selection of Rank and Decree 


model 

n = 0 

71= l 

71 — 2 

n = 3 

71 — 4 

m = 1 

3.33 

1.45 

0.93 

0.89 

0.88 

TO = 2 

1.30 

0.61 

0.56 

0.52 

0.46 


with one input and two outputs. The data are generated as 
w = w' + r, where w f E B 3,1,2 satisfies the equations 

w' 2 {t) = w' 2 {i -l) + u>i(0 

u^t) = w' 2 {i)+ w\ (t-1). (25) 

For we take white noise with unit variance, and for c 
a three-dimensional white noise process with independent 
components and variance 0.01. The observation interval has 
length 50. Outside this interval we define w(t) = 0. The GTLS 
model of rank one and degree two is 

w 2 (t) = 0.95 w 2 {t - 1) - 0.04 w 2 (t - 2 ) + 1.08u/,(0 
+ 0.07ti/i(f - l) + 0Arnv 1 (t - 2) 
w 3 (t) = 0.95 u>- A (t - 1) - 0.04m/;, (t - 2) + 1.12m»i(/) 

4- 1.08m/ 1 (/ - 1) - 0.89ii/i(f - 2). (26) 

Transforming model equation (25) to the form 

«4(0 - w 2 (t - i) = w\(i) 

■<‘>*(1) - «£(* - 1) = «/, (0 + «>', (t - 1) - «/,(# - 2) (27) 

shows that the original model equations are estimated rather 
accurately. We compare the misfils of GTLS models of various 
complexity in Table X. 

For rank one, the misfit hardly decreases for orders above 
two. This could be expected, as the regular part of the data 
belongs to a system of order two. For rank two the results 
suggest to take the order one. Comparing the complexities 
(m. n) = (1,2) and (2, 1), the first one of course leads to 
a larger misfit, as it imposes more restrictions. The misfit is 
still relatively small, however, when compared to the norm of 
the data ||m’|| - 6.10. 

X. Conclusion 

In this paper we investigated the modeling of vector time 
series by means of difference equations, using the global total 
least squares criterion. Distinctive features of our approach 
are that no decomposition into inputs and outputs is required 
and chat the criterion measures the global misfit in a nonpara- 
metric way. The misfit of a given system is evaluated by a 
dynamic projection algorithm formulated in terms of isometric 
state representations. We developed an iterative algorithm for 
constructing optimal models and gave a characterization of 
stationary points of the GTLS criterion. The method was 
illustrated by some simulation experiments. 

The results of this paper can be extended in several direc¬ 
tions, e.g., time series on a finite time interval, time varying 
systems, and time-varying norms for the misfit. Further re¬ 
search will be concerned with statistical properties and the 
development of faster algorithms and recursive methods. 
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Appendix A 
fa- Systems 

By l 2 we denote the set of (/-dimensional square sum- 
inable time series over time axis Z, i.e., l\\- {w: Z —► 
R°; !£t=:-oo w (t) Tw (t) < We define Z 2 -systems as 

follows. By a we denote the time shift defined by (aw)(t): = 
w(t 4 1), t £ Z. 

Definition A.I (fa-systems): An ^-system is a subset B of 
/!j that is: 

1) linear, i.e., for w, w' £ B and o, fi £ R there holds 
aw + fiw' £ B; 

2) shift-invariant, i.e., aB = B; 

3) complete, i.e., if w £ l\ satisfies w\ T £ B\ T for all finite 
T C Z, then 7/; £ B. 

This class of systems can be represented by linear, time- 
invariant difference equations of finite lag. Let R p * 9 [.s] denote 
the set of p x q matrices with polynomial entries. A set of p 
difference equations corresponds to a polynomial matrix in the 
shift (T, i.e., R(a) £ R pX9 [(j]. 

Proposition A.2: A set B £ 1% is an fa -system if and only 
if B = {w £ R(a)w = 0} for some R £ R p * fjf [.s] and 
p e N. 

Proof: The if-part is trivial, and the other part is proved 
by construction; see [ 10, Theorem 51. □ 

The complexity of a system is measured in terms of its 
dimension. Considered as a linear space, every nonzero I 2 - 
syslem has infinite dimension. Therefore we consider its 
dimension on finite time intervals. 

Definition A 3 ■ For a given system B let B°: = {u 1 £ 
B; w{1) = 0 for / < ()}, and let B[] be the restriction of 
B° to time / = 0. 

1) The rank of B is defined as m(B): = dim(Bn). 

2) The degree of B is defined as n(B): = dim (B|[ 0 cXJ ) 

mod S |[0,^))- 

The rank and degree determine the dimension of an l 2 - 
systems on finite intervals. To be specific, if B is a system 
with rank rn and degree 7/, then dim(B| 7 ) = mN 4 n for 
intervals T C Z of length N > n. The rank and degree have 
the following interpretation. The rank is the number of degrees 
of freedom for a system at each time instant, given the past. 
This is equal to the number of inputs in the system. The degree 
measures the remaining freedom due to initial conditions. This 
is equal to the number of states. This is made precise in the 
following result. 

Proposition A A: The rank and degree of a system equal, 
respectively, the number of auxiliary inputs and the number 
of states in a minimal state representation. 

Proof: For a proof we refer to 110. Theorem 9]. □ 

Appendix B 
Proofs 

Proof of Proposition 4.3: 

Necessity of I: Let 72: = im[B AD A T1_1 B], then 
A7? c 72 and imB C 72. If (A, D) is not controllable there 
is a choice of basis for the state space such that with respect 
to this basis A and B take the form ^ ^ j and [*o ]■ 


So in a corresponding partition ^ of the stale ».t v * 
holds that (tt 2 = A 2 x 2 . As this admits only the /eio * M 
in fa , x 2 can be removed. 

Necessity of 2: Suppose there exists an f such ih.i, 1 i 
BF , C 4 DF) ir not observable. Then the umihv * lt |»| L 
components of the state can be removed, as in the 
part. 

Necessity of 3: An obvious condition is that [{.* has lull 
column rank, so it suffices to prove that ker f) c kei H 
Suppose kcrD\kerB ^ { 0 }, then there exists an invertible 
R e R mxm such thal = [»', *] with 0 i I, t 

R n . Then B is represented by o.i = Ax 4 B'r' | (>z 
and w = Cx 4 BV. As z influences w only with a 
delay, we remove its direct influence on aj by defining 
ax': — ax - bz. This gives ax' = Ax' 4 BV 4 Aba’ l z 
and w = Cx' 4 D'o' 4 Cba~ l z. Hence (A, B, (\ D) is 
equivalent to (A, [B' Aft], (\ [D f ( ‘ft]). From Proposition 4.2 
it follows that this is equivalent to (A 4 Aft/, [B f Aft], C 4 
Cbf, [D ' Cft]), for all / £ R lxn . As ft / 0, / can be chosen 
such that I n 4 bf is singular, by taking / - -b 1 /\\b\\ 2 . It 
is easily verified that then (A 4 Aft/, C 4 Cbf) - (A(I fi 4 
ft/), ('(I n 4 ft/)) is not observable. From the Necessity of 2 
it follows that the state dimension can be reduced. 

Sufficiency of the Conditions: Let (A, B, (\ D) be an SR 
with m auxiliary inputs and n slates that satisfies Conditions 

1) , 2), and 3). From controllability and Proposition 4.2 it 
follows that without loss of generality we may assume that 
A is asymptotically stable. We prove that the rank of B equals 
rn, and its degree equals n. Then the result follows from 
Proposition A.4. 

Concerning the rank, consider B° as introduced in Definition 
A.3. Observability implies that trajectories w £ B° have state 
zero at time t = 0. So Bfj = imB, and from Condition 3) it 
follows thal dim Bo = rankB — rn. Concerning the degree, 
we consider the space B|[o |OU ) modBj| 0 ^ y cf. Definition A.3- 

2) . This space can be parameterized by the initial stale Jo. 

Observability implies thal this parameterization is injective, 
from which the result follows. □ 

Proof of Proposition 5.2: Write (A', B', C', D') \ - 
(S(A4BF)5“ 1 , SBB, (C + DF)S~\ DR). Then 

(A' B'\ (S 0 \ (A B\( S~ l <)\ 

\C f D')~{ 0 I m )\C DJ\FS - 1 It)' 

Equation (13) for (A', B', C\ I)') gives, with K ~ S 1 S 


(A r C 1 \ (K 0 \ (A B \ 
D T D 1 ){ 0 I m ){c V) 



(S T -F r R~ 1 \ f S 0 \ 

Vo R~ J )\-R-'F R~ l ) 

K 4 F T (RR T )~ l F -F 1 (RR T y l \ 

-(RR J y 1 F ( RR T y l y 

Now verification of (16), (17), and (14) is straightforward. For 
a proof of the uniqueness of K we refer to (8]. □ 
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Proof of Proposition 5.3: From (13) it follows that A T A = 
In - C^C, hence |Ac| 2 = |z| 2 - \Cx\ 2 < |x| 2 . So A is 
stable. We prove that the representation is not minimal if .4 
is not asymptotically stable. In that case A has an eigenvalue 
A with |A|s ( = 1. Let x denote a corresponding eigenvector, 
and x* its complex conjugate. Then ICaj 2 = x*C T Cx = 
x*x - x*A t Ax = |z| 2 - |Ar| 2 = 0. This implies that 
CA k x ss 0, k > 0, so that (.4, C) is not observable and 
hence not minimal. 

From the proof of Proposition 5.2 it follows that (15)—(17) 
are necessary conditions. Equation (15) determines S mod¬ 
ulo a left unitaiy factor, corresponding to U. Equation (16) 
determines R modulo a right unitary factor, corresponding to 
V. □ 

Proof of Theorem 6.2: From Definition 5.1 it follows 
that (A, B, C. D) as defined in part 4) is an ISR. As A is 
asymptotically stable (see Proposition 5.3-1 )| the correspond¬ 
ing image representation G: l’ - '" il is well defined. We 
first prove that 


GG* + GG* = J H (28) 


Define 77 = [B B] and D = [D D], ThenJ.4. B. C\ D) 
is also an isometric state representation. Let G(a): 1% —> l\ 
denote the corresponding image representation. Then G = 
[G G], so GG* -I- GG* = GG*. It remains to prove that 
GG* = , or equivalently, that G* = 7? v G T 7£ y is isometric. 

—T 

The operator 7 Z q is clearly isometric, and because G is the 
image representation induced by the ISR (A 7 \ C l , B T , l ) T ), 
it is also isometric. This proves (28). 

1) From Theorem 6.1 it follows that GG*w is the optimal 

approximation of w ip B(A, B , G. D). From (28) we 
obtain {w e B{A, B, G. D)} & [GG*w = w\ 
{GG*w = 0} {?/> G B x }. So imG = B x , which 

proves 1). 

2) Equation (28) shows that every w G l 7 can be decom¬ 
posed into a part contained in B and one in fi x . 

3) As (A, /), G, D) is a slate representation of B x , it 
follows that m(0 x ) < r/-m and n(B x ) < tt. From pan 
2) it follows that m(B x ) > q — tp, so m(B x ) =r q — m. 
As (B 1 ) 1 = 0, there holds that v — n((B x ) J ) < 
n(B x ), so that n(B x ) = n. This proves 3). 

4) That (A, B, G, D) is an ISR of B l was proved in 1), 
and minimality follows from 3) and Proposition A.4. 

5) Equation (28) implies that w = w — w = w — GG*w — 

GG*w, and the result follows from Theorem 6.1. □ 

Proof of Theorem 8.1; It suffices to prove the theorem 

for a minimal isometric SR. This can be seen as follows. 
If condition 2) holds for an arbitrary minimal SR, then it 
holds for all equivalent minimal SR's, as in the transformation 
(S{A + BF)S~ [ , STIR , (G 4- DF)S~\ DR), the auxiliary 
input and state are linearly transformed to R~ l (v ~ Ft) and 
S,v: see the proof of Proposition 4.2. 

So let (A. B, G, D) be a-minimal ISR of B, and let B. i), 
be defined as in Theorem 6,2-4), In the proof we will make use 
of the following relations, which follow from the projection 


scheme in Section VI; see Fig. 2. 


fort): x = A T (tx 4- C T w, 
for?): x = A T ax 4- C T w , 
for ?/): ax = Ax 4- Bv , 
for?//: ax = Ax 4 -Bl), 

Further, from (13) we obtain 

x = A r a.r 4 - C 7 w, 
0 = B r a:v 4- T) T w 
x = A r ax 4 - C T w, 
v = B T ax 4 - f) T w. 


v = B T ax 4- D T w (29) 
v = B 1 ax 4- D T w (30) 
w = Cx + Di) (31) 

w — Cx 4- bv. (32) 

v = B f ax 4- D T w, 

(33) 

0 = B T ax 4- D T w, 

(34) 


We first describe those optimality conditions that can be 
derived straightforwardly from the model improvement con¬ 
structions in Theorem 7,2. 

Lemma B.l: 

1) Construction 1 gives no improvement if and only if 

cov([f, ?;], ?i)) — 0. 

2) Construction 2 gives no improvement if and only if 
cov( b, [a.r, u;]) = 0. 

3) If Construction 3 gives no improvement, then 
cov(t), u) ~ o. 

Proof of the Lemma: We use the notation of Theorem 7.2. 

1) As w = (>,v 4- Dv, clearly w G f. Construction 1 does 
not yield an improvement iff the projection of w onto f 
is zero, from which the result follows. 

2) Construction 2 gives no improvement iff the projection 

of w onto F is zero, cf. part 1). This is equivalent to 
(w, (C(al - A)- l B' + D 9 )v) = 0 for all B', //. From 
(34) this is equivalent to (w, CA k ~ 1 B'rr - k v f 

D’v) = (££=! B rl (A T ) k - 1 C T a i tl + D' t uui>) - 
(B ,T ax+D ,T Wf o) = 0, from which the result follows. 

3) If cov(f), v) ^ 0, then construction 3 decreases \\v\\ — 

n«'ii- □ 

Reluming to Theorem 8.1, we first prove part 2. From 
the lemma it follows that cov(?), v) = 0, and cov(r, x) = 
cov(f;, A T ax 4- C T w) - 0. As b is injective, cov(.r, u) — 0 
if and only if cov(.r, bv) = 0, which is equivalent to 
cov(:r, w - Cx) ~ — cov(.r, G.r) — (). So to prove 2 
it remains to show that cov(.r, i) = 0. As (A T , B 1 ) is 
observable, this is equivalent to cov(rr.r, B T A T ax) = 0 
for all h > 0. We prove this by induction. For k = 0, the 
lemma shows that cov(a.r, B T ax) - cov(rr.r, -D T w) = 0. 
Now suppose that cov(ax, B 1 A T ax) — 0 for k < N. Then 
cov(/rr, B t A t " ¥ 1 ax) - cov(A.r 4 Bv, B l A T " + l ax) — 
COv( A.r, B T A 7 ^ "* * ax) = cov(A.r, B 1 A r " (.r - C T u>)) - 
Acov(x, B r A TN .r) = 0. This proves Theorem 8.1.2. Con¬ 
cerning the equivalence of l, 2, and 3, the implications 
3 =» 2 => 1 are trivial. Further, if 1 holds then cov(i. .}•) = 
cov(tfi*, ax) = cov(Ai; 4- Bit, Ax 4- Bv) = cov(Ai‘, Ax) = 
A cov(i', x)A T . As A is asymptotically stable, it follows that 
cov(f, x) = 0, so 1 implies 2. Finally, 3 is easily derived 
from 2 by using (31) and (32). □ 

Proof of Theorem 8.2: Let w be a given observation 
and let w be its optimal approximation in B. Further let 
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(. A , B , C\ D) be a minimal SR of B with A asymptotically 
stable, and write the approximation error as w: = w - w - 
ui — ( C(crl — A)~ l B + D)v. We have to prove that B satisfies 
the optimality conditions if and only if the derivative of ||i2>|| 
with respect to the parameters in A } £, (7, Z), and v is zero. 
First we analyze the tangent space of w with respect to these 
parameters. Let £ and T be defined as in Theorem 7.2, and let 
G: = {177 £ l\\ 3 H £ R" xn such that Hi = C{al - A^Hx} 
with x the state corresponding to w. 

Lemma B.2: The tangent space X of w = w - ( C((tI - 
A)~ l B + D)v is given byT = B + £ + F + g. 

Proof of the Lemma: The tangent space is defined as the 
smallest closed subspace of I 2 containing all partial deriva¬ 
tives. Note that w is linear in it, in B , in C and in D. A 
change of v corresponds to adding Hi £ B to w, a change 
of C and D to adding w £ £ to w and a change of B 
and D to adding w £ T to w. It remains to prove that the 
derivatives of w with respect to the parameters in A span 
the space Q. For H £ R t,xn let j' be defined by ax' = 
(A - H)x f + Bv and let w f : = Cx' + Do. The corresponding 
error is w f := w — so that the change in w is given by 
Hi = til' - w = w — w f = C(x — x f ) = Os for x: = x — x f . As 
ers = Ax-(A-H)s f - Ax-\-Hx ' = Ax+Hx-Hx, ignoring 
the second-order term Hx for small H gives the result. □ 

We will next prove the theorem by showing that the 
optimality conditions and the stationarity condition are both 
equivalent to w J_ X. 

Stationarity is equivalent to the condition that lim^o^ -1 
{||u; + #ul|| - ||mj||} = 0 for all Hi £ X. It is easily verified that 
this limit equals (</;, i77)/||w||, so stationarity is equivalent to 
w _L X. 

Finally we show that w 1 X is equivalent to the optimality 
conditions. First, suppose that the optimality conditions hold. 
As w is an optimal approximation within B thee holds that 
w 1 B. Further, Theorem 8.1-3) slates that cov([u. f], ui) = 
0, so that ut J_ £, and eov(f\ w]) = 0, so that the 
proof of Lemma B.l-2) shows that w _L T. Finally, for 
Hi £ Q given by w = YlkLi CA K ~ [ we obtain 

by using (34) that (Hi, w) = CA k ~ l HfT~ k j\ w) = 

(x, H 1 A q k 'C 7 cr k w) = (x, H 7 crx) = 0, so w _L Q. 

From Lemma B.2 it follows that w 1 X. 

Second, supposing that it; 1 X we prove the optimality 
conditions. The fact that w 1 £ + T implies the conditions 
in Lemma B.l-2) and B.l-2), cf. the proof of that lemma. 
The condition in Lemma B.l-3) follows from that in B.l-2) 
by using (34). Further, the optimality conditions were derived 
from these conditions in the proof of Theorem 8.1. □ 

Proof of Proposition 84: First we prove that Z is a 
linear space. Observe that for every z £ B ± the optimal 
approximation of w 4- z within B is given by w. Let ( v , x) 
be the auxiliary input and state for w in a minimal SR of 
0, and let (v z , x z ) be defined analogously for z in B L . As 
(f;, x) is fixed, the condition in Theorem 8.1-2) for stationarity 
of B with respect to w -I- z consists of linear restrictions on 
(i) z , x z ). As x z is a linear function of v z , these conditions can 
be expressed as linear restrictions on v z alone. This shows 
that B is a stationary point for w + z if and only if the 


auxiliary input it 3 is restricted to a lincm subspa o f * 
from which the linearity of Z follows. Next we show , > m, 
minimum in (22) is achieved by taking tr„ = u \ s 
w “ = w + vt - (ti> - w') = ut + w' and ir # e I u u*r« * v 

that B is stationary for w - «J 0 , by definition ol ‘ fn,,] K - 
HJ £ B 1 is such that B is stationary for w - TO \n 1 „ 
if and only if z: = w -w £ Z . Now the norm of u - , f 
is minimized by taking z = w\ the orthogonal projection tn 
tv on Z y hence TTJn — w - mV. p 
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Control of Chained Systems 
Application to Path Following and Time-Varying 
Point-Stabilization of Mobile Robots 

Claude Samson 


Abstract — Chain form systems have recently been introduced 
to model the kinematics of a class of nonholonomic mechanical 
systems. The first part of the study is centered on control design 
and analysis for nonlinear systems which can be converted to 
the chain form. Solutions to various control problems (open- 
loop steering, partial or complete state feedback stabilization) are 
either recalled, generalized, or developed. In particular, globally 
stabilizing time-varying feedbacks are derived, and a discussion 
of their convergence properties is provided. Application to the 
control of nonholonomic wheeled mobile robots is described in the 
second part of the study by considering the case of a car-pulling 
trailers. 


1. Introduction 

F ROM a theorem due to Brockett 15], it is known that 
nonholonomic wheeled mobile robots with restricted mo¬ 
bility (such as unicycle-type and car-like vehicles) cannot be 
stabilized to a desired configuration (or posture) via differ¬ 
entiable, or even continuous, pure-state feedback [3], 124], 
[1]. Nonsmooth feedback has been proposed as an alternative 
solution (see |4], 16], [32|, for example). Another alterna¬ 
tive, first pointed out by the author in [25], consists of 
using smooth time-varying feedbacks, i.c., feedbacks which 
explicitly depend on the tithe variable. Such feedbacks had 
previously been very little studied in Control Theory. The 
result given in [251, for a unicycle-rype vehicle, has subse¬ 
quently motivated research work to explore the potentialities 
of time-varying feedbacks [8], [9], [11], derive explicit design 
methods [19], [34], and extend their use in robotics applica¬ 
tions [26], [29]. 

The possibility of modeling the kinematic equations of 
wheeled mobile robots by so-called canonical chain form 
equations (a particular class of nonlinear nilpotent systems) 
has been pointed out in [17] when treating the case of car-like 
vehicles. It was known before that the equations of unicycle- 
type vehicles (a simpler case) could be written in this form, 
but this had not been used explicitly at the control design level. 
More recently, it has been shown [31] that the equations of 
vehicles with trailers could also be locally converted into a 
chain form. 
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In [17], the authors aimed essentially at exploring methods 
for open-loop steering of nonholonomic systems by using 
sinusoidal inputs. More recently, the authors of |34] have 
realized that chained systems could also be pul under another 
canonical form, called “power form,” and that power form 
systems belonged to the class of systems considered in [19] 
for which explicit smooth time-varying stabilizing feedbacks 
can be derived. The method proposed in [34], for deriving 
such controls, is applied to the problem of locally stabilizing a 
car-like system to a desired posture. A global solution to this 
problem had previously been given by the author in [26], [27] 
by using another approach. 

Since the power form is mathematically equivalent to the 
chain form, one may question the interest of using one 
form rather than the other one when dealing with practical 
applications and mobile robot control in particular. To the 
author's knowledge, no single and definitive answer can 
be brought to this question because the two forms present 
complementary advantages and drawbacks. For example, it 
seems that model equations of nonholonomic vehicles using 
physical coordinates, such as Cartesian coordinates and angles 
between articulated bodies, are naturally closer to the chain 
form than to the power form. In this respect, the logic 
of using the chain form when addressing the problem of 
path following should be evident from the present paper. 
Concerning the theoretically more difficult problem of point 
stabilization, the answer is not as clear. As shown in this 
paper, a rather simple solution to smooth feedback stabilization 
can be obtained by complementing path following solutions, 
thus making the chain form intuitively attractive in this case, 
Smooth time-varying stabilization, however, does not seem to 
be compatible with fast convergence [llj. For this reason, 
recent studies on point stabilization have focused on the 
possibility of achieving faster (exponential) convergence by 
using nonsmooth feedback: M’Closkey and Murray have been 
working with the power form (see [14] for example), with 
the probable reason that their approach based on the use of 
homogeneous system coordinates does not apply well to the 
chain form, while S0rdalcn has proposed an original solution, 
mixing open-loop and feedback control strategies, by using 
the chain form [32]. The question is therefore far from being 
settled at the time being and will certainly motivate future 
developments. 

This paper is organized as follows. Section II focuses on 
the control of chain form systems. After pointing out some 
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Tacts about these systems and recalling some results about 
the open-loop steering problem, it is shown that chain form 
systems can themselves be converted into a slightly different 
orm, named here skew-symmetric chain form, particularly 
well adapted to subsequent Lyapunov design and analysis 
of globally stabilizing time-varying feedbacks. Whenever this 
is possible, as it is the case here, finding adequate Lya¬ 
punov functions at an early stage of the analysis presents 
some advantages in terms of simplicity of the control design 
and stability proofs. A comparison with Center Manifold 
techniques, as used in [34] for example, is in this respect 
illustrative. In the process of deriving smooth feedback con¬ 
trol laws for chained systems, useful connections with more 
classical linear control techniques are carried out. Several 
solutions to the point-stabilization problem are then tentatively 
compared by analyzing the type of stability associated with 
each of them. This motivates a short discussion about the 
practical relevance of the notion of asymptotical rate of 
convergence. 

In Section Ill, the results of Section II are applied to a 
car pulling -n- trailers, seen as an extension of the unicycle 
and car cases. The same approach as in [29], in which 
stabilization to a desired configuration is treated as an ex¬ 
tension of the path following problem, is considered. This 
approach involves a specific parameterization of the vehicle’s 
posture which facilitates the decoupling of the path following 
problem from translational velocity control. The interest of 
this parameterization for solving path planning issues is also 
pointed out in connection with the approach developed in 
|21|. Finally, a modification in the modeling of the system's 
equations is proposed so as to broaden the control stability 
domain. 


II. Control of Chain Form Systems 


A. About Chained Systems 

Let us consider a chain form system which may be written as 

■h = «i 
d’2 = v 1^3 
= v\x 4 


Tn—1 — Mil'll 
•I n ~ 


or, equivalently 


X = h\(X)‘U] + h 2 fi2 h]{X) — 


1 

T3 

.t 4 


h 2 = 


( 1 ) 


.( 2 ) 


LO J 


With respect to the notations used in 117], the components 
of the state vector have just been ordered differently. 


In the second part of the study, where system < i , 
used to model the kinematic equations o( wheeled i >NL 
robots, x i will represent the distance covered by ik uLu W 
along a path to be followed, and ./ 2 will rcpiesent tin* U: t A \ 
distance between the vehicle and the path. Path iollowng 
will therefore mainly consist in regulating t 2 to /cm, in¬ 
dependently of the values taken by x\ (and thus a ») while 
stabilization to a desired configuration will turlher involve the 
regulation of .Ti to zero by utilizing also the input u { . 'rite 
possibility of directly relating the first state components of the 
chained form to physical euclidean-type coordinates justifies 
the preference given here to this iorm over the theoretically 
equivalent power form introduced in |34). 

It is worth noting that a chained system like (1), although 
it is nonlinear, has a strong underlying linear structure. This 
clearly appears when u\ is taken as a function of time and no 
longer as a control variable. In this case, the system becomes a 
single-input time-varying linear system which may be written 
as 


./'i = 0 


x 2 


■o 

Ul(1) 

0 

o ■ 


■()" 

0 

0 

«i(0 

0 

*2 f 

0 

0 



7!l(0 0 

0 

0 



0 Ui(0 


0 

.0 

0 


0 . 




with 


x i = xi - / Mi(r)dr and A'J = [.r 2 , .r 3 , ■ ■ ■,:r„]. 

Jo 


Putting a two-input nonlinear system in the chain form, 
when it is possible, is thus equivalent to linearizing this 
system with respect to one of its inputs. Since the chained 
system is controllable, controllability of the original system is 
a necessary condition. A necessary and sufficient condition is 
given in [181- 

When the input </i is taken as a function of time, the system 
is clearly no longer controllable due to the first equation. Under 
certain conditions upon the choice of u\(t)< however, the 
second part of the system involving A' 2 remains controllable. 

This type of property is very useful. It is used further in 
the study for the derivation of smooth time-varying feedbacks 
which asymptotically stabilize the point A’ = 0. It can also 
be utilized to solve the open-loop steering problem, i.e., the 
problem of determining open-loop control inputs that steer the 
system to a desired configuration AcieMredi chosen equal to 
zero without loss of generality. The method basically consists 
of two steps: i) choose an integrable function u\(t) which 
ensures controllability of the second part of the system, and 
determine a control ?/ 2 (0 which drives X 2 (t) to zero in finite 
time (usually done by integrating the system's equations on 
some time interval and solving a set of algebraic equations), 
and ii) once X 2 is at zero, keep v 2 equal to zero so as to 
leave X 2 unchanged, and determine u\(i) so as to drive X\(L) 
to zero in finite time. 
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This method has been used in [16], with u\(t) and 112 (f) 
chosen as piecewise constant inputs. In this case, the first 
step corresponds to discretizing the system's equations with 
u\ being kept constant (and different from zero) over n - 1 
sampling time intervals A and applying a dead-beat control 
strategy (the poles of the controlled discretized system are 
set equal to zero) to determine the values of U 2 on the 
time intervals [A A, (A + 1)A] (0 < k < n - 2). At time 
t » (n - 1)A, X .2 has reached zero and u\ may then be 
chosen equal to -.Ti((n - 1)A)/A so as to have x\ equal 
to zero at time t = nA. Note that by working more on 
the choice of u\ , feedback versions of this technique can be 
obtained. It may also be shown, as a complement to [16], that 
multiplying the piecewise constant inputs by (1 - cos(wO), 
with uj = (27t/A), does not change the values of X at the 
sampling instants. In this way piecewise constant inputs are 
transformed into time-continuous inputs that achieve the same 
result. 

The solution proposed in [17], with u\(t) and 112 (f) being 
composed of sinusoids at integrally related frequencies, may 
also be viewed as a variant of this method. Obviously, the same 
method applies to other inputs. A more geometrical method 
for open-loop steering of nonholonomic vehicles will also be 
pointed out further in Section III-B. 

When ?ii is constant and different from zero, the above 
system becomes time-invariant and the second part of the 
system is clearly controllable. By applying classical linear 
control techniques, it is then possible to derive linear feedbacks 
U 2 (A 2 ) which stabilize the origin X 2 = 0 exponentially. 

In fact, even if 1 ii(t) is not constant but only piecewise 
continuous, bounded, and strictly positive (or negative), it 
is quite simple to derive stabilizing feedbacks 'u 2 (X 2 ) for 
the second part of the system. Indeed, since xi(t) varies 
monotonically with time, differentiation with respect to time 
can be replaced by differentiation with respect to .r^ From 
now on we will refer to this change of variable as the u\- 
time-scaling procedure. Then, the second part of the system 
may equivalently be written 

.r^ - sign(ui).r a 

J-\ :) = sign(i<i)j’4 

'nil = sign(»/i )x„ 

.x^ = sign(u])« 2 (4) 

with 

■f! ,) = s'gn(«i)TrT and '"2 = «2/«i(0- 

US | 

This is the equation of a linear invariant system, an equiv¬ 
alent input-output representation of which is 

4" _1> = sign(i/i)" -1 V 2 . (5) 


One falls upon a controllable invariant linear system which 
admits exponentially stabilizing linear feedbacks in the form 

1=71—1 

v-i{X 2 ) = —sign(u])" -1 (p, > 0, Vi) (6) 

i=i 

the control gains g % being chosen so as to satisfy the classi¬ 
cal Routh-Hurwitz stability criterion (the positivity of g, is 
necessary but not sufficient). 

Hence, the time-varying control 

u 2 (X 2 't) = ui{l)v 2 (X 2 ) (7) 

globally asymptotically stabilizes the origin X 2 = 0 in this 
case. Moreover, the trajectories followed by the system’s 
solutions are invariant with respect to variations of 

This “feedback linearization” technique, associated with vq- 
time-scaling, has in fact been used by other authors working 
on mobile robot control. For example, Sampei et ai [22| have 
applied it to the problem of following a straight line in the 
case of a car pulling a single trailer. Their solution differs, 
however, from the one given further in the article in that they 
took the car’s steering wheel angle as a control, instead of the 
angle’s velocity. 

In the earlier work of Dickmanns and Zapp [10|. on vision- 
based roadline following, U] -time-scaling is also implicitly 
used together with tangent linearization of the system’s equa¬ 
tions, instead of exact feedback linearization. In their work, u j 
has the physical meaning of the car’s translational velocity. 

Extension of the path following problem to the point- 
stabilization problem to achieve smooth time-varying feedback 
stabilization of a unicycle-type vehicle to a given posture, 
based on u\ -time-scaling, has been first proposed in [29]. The 
present study may be seen as a generalization of the results 
described in this paper. 

B. Skew-Symmetric Chain Form and Lyapunov Control Design 

We show next, by introducing the skew-symmetric chain 
form evoked before, and via a Lyapunov-like analysis, that 
control (7) globally stabilizes the origin — 0 for the second 
part of the chained system, provided that \ui(f)\ and |t/i(/)| 
are bounded, and ui(/) does not asymptotically tend to zero. 
An important difference with the result stated previously is 
that u,i(l) is now allowed to pass through zero. 

From there it will be simple to complement the analysis and 
derive smooth time-varying feedbacks which globally stabilize 
the origin X = 0 of the complete system. 

To this purpose, let us consider the following change of 
coordinates </>j : X h-> Z in /?” 

Zi = x \ 

Z 2 = J’2 
= X* 

Z,+ 3 = kjZj +1 + Li„Zj +2 i < ./ < ri - 3 (8) 



SAMSON: CONTROL OF CHAINED SYSTEMS 


• kj (1 < j < n - 3) is a real positive number; 

• Ln x Zj = the Lie derivative of Zj along /i x ; 

• : the Lie differentiation operator of order 
k along h\. 

One easily verifies that the Jacobian matrix (dfa/dX) is a 
constant lower triangular matrix with ones on the diagonal. It 
is therefore a regular linear change of coordinates in R n . 
Moreover: L hj z, = 0 (1 < i < n - 1), and L\ l2 z n = L 
Taking the time derivative of z J+3 and using 2 

• _ 3 y 

,+ " 3_ ox x 


= (I'h l Zj+ 3)l‘l + (L tl2 Zj +3 )u 2 . 


Also, from 8 


Lhi z l +3 - -fcj+1^,42 + -,44- 


Hence 


Moreover ui(f)V(Z 2 (0) asympioikallj u>m', •, , , 0 

Therefore, if uj (f) does not asymptotically inui i.,, P u 
V Uni = 0 and the manifold Z 2 = 0 is glohalK u'wu/,, 
stable. 

The proof ot this proposition, as of subsequent )>, up. i ,■■■>.,- 
uses an extended version of Barbalat's Lemma sunup ilui . t , 
given differentiable function f(x) from IV to H coiiveigcs to 
some limit value when .r tends to infinity, and il the denvahw 
(df/dx)(x) of this function is the sum of two terms, one bemi: 
uniformly continuous and the other one tending to zero when .< 
tends to infinity, then (df/dx)(x) tends to zero when ,i tends 
to infinity. 

Proof of Proposition 2.1: Taking the time derivative of V 
and using the system’s (rt - 1) last equations, one obtains 


V= 1 / 


a ‘4 


Sj+3 = -fcj + ]Ui2 J+2 4 V\Zj + .l (0 <j<U- 4) (11) 


Z Tl — Lf l{ Zn(I\ 4 ^ 2 - 


Thus, if control (15) is used 


v =- n k J (<“)• 


The original chained system has thus been converted to the 
following skew-symmetric chained system 

ii = Vi 
z 2 - U\Z A 

i 3 = -kiU]Z'2 4 U]Z.i 

ij+i = -k,+iuiz t¥ 2 4 wiz ;+4 (0 < j < n - 4) 

-tt = —k u -2 u \ z n~ 1 4 V’2 (13) 

with 

w 2 = (A: ii _2-,,-j 4L/i 1 z 7 |)mj 4'</2- (14) 

The interest of this form is that it naturally lends itself to 
Lyapunov control design and analysis, as illustrated by the 
following proposition. 

Proposition 2.1: Assume that and |tti(/)| are 

bounded, and consider the control 

w 2 = -k W2 (ui)z n (15) 

where A u ,,( ) is a continuous application strictly positive on 
R - {0}. If this control is applied to system (13), then the 
positive function 

V(Z 2 ) = 1/2 (zj + (1/*, )zl + (l/hhtf + ■ ■ • 

(|6> 

is nonincreasing along the closed-loop system’s solutions, and 
asymptotically converges to some limit value Vj im (which a 
priori depends on the initial conditions). 



The considered Lyapunov-like function is thus non¬ 
increasing. 

This in turn implies that ||^2(0I| is bounded, uniformly with 
respect to the initial conditions. Existence and uniqueness of 
the system’s solutions over /?+ also follows. 

Now, since V is nonincreasing, K(/) converges to some 
limit value Vi irn (> 0). Since k w A-) is continuous and since 
M0| and |i>i(/)| are bounded, k Wi (u}(t)) is uniformly con¬ 
tinuous. Hence, the right-hand side member of equality (18) 
is uniformly continuous along any system's solution, and, by 
application of Barbalat’s lemma, V(/) tends to zero. Therefore, 
k u , 2 (ni(l))z u (t) tends to zero. This in turn implies, using 
the properties of the function k W2 (-) and the boundedness of 
|u 3 (0| and |z n (/)|, that u\(t)z n (t) tends to zero. 

From now on, the time index will often be omitted to 
simplify the notations. Taking the time derivative of u\z n and 
using the convergence of U]Z u to zero, gives 

4(«?*") = with lilll o(t) = 0 (19) 

(It f— 

„i is uniformly continuous along a system’s solution 
since ils lime derivative is bounded. Therefore, in view of 
(19) and since ufz n tends to zero, (d/dt)(a iZ n ) also tends to 
zero (by application of the aforementioned extended version 
of Barbalat’s lemma). Hence, and thus u\z n ~\* tend 

to zero. 

Taking the lime derivative of u\zj and repeating the above 
procedure iteratively, one obtains that u\z 3 tends to zero for 
2 < j < v. In view of the system’s equations, we note that 
this in turn implies the convergence of to zero. 

Summing up the squared values of u\(t)z } {l), for 2 < 
j < n, it appears that v\(t) 2 V(t) tends to zero. And so does 
ui(0 2 Viin. (from the already established convergence of V(t) 

to V Um ). 0 
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Remarks: 

• One can verify that, with the particular choice k W2 (u\) = 
k' Wi |wi |, the set of controls u 2 given by (14) and (15) with 

> 0 (i = 1, * ■ • , n - 2) and k' u „ 2 > 0, coincides with 
the set of stable linear controls (7) previously associated 
with the linear invariant system (4), More precisely, there 
is a one-to-one correspondence between the elements of 
the two sets. One can thus apply classical linear control 
design methods to determine adequate values for the 
parameters k t (i = 1, • ■ *, w - 2) and k f W2 and optimize 
the control performance near Z 2 = 0, as illustrated in 
[29], This correspondence also underlies the connection 
existing between the Routh-Hurwitz criterion and the 
operation consisting in transforming the original chain 
form into a skew-symmetric chain form. 

* Nonconvergence of v\{t) to zero, under the assumption 
that |*i(t)| * s bounded, implies that /J \u\(r)\dr tends 
to infinity with t. Divergence of this integral is in fact 
necessary to the asymptotical convergence of ||Z 2 (/)|| to 
zero, when using control (15) with k W2 (u\) = A*' UF Ju]|. 
This appears clearly when interpreting this control as a 
stabilizing linear control for the linear invariant system (4) 
obtained by replacing the time variable by the aforemen¬ 
tioned integral. This integral may still diverge, however, 

when u\(t) tends to zero slowly “enough” (like /”2, for 
example). This indicates that ||Z 2 (t)l| may still converge 
to zero when u\(t) does. 

Proposition 2.1 is not only of interest in solving the path 
following problem for mobile robots, it also suggests a way 
of determining smooth time-varying feedbacks which globally 
asymptotically stabilize the origin Z = ,0 (or X = 0) of 
the whole system. In this case, t/j is used as a control the 
role of which is to complement the action of the control v 2 
(or u/ 2 ) in order to also obtain asymptotical convergence of 
Z\ (or Xi) to zero. Since chained systems like (1) cannot be 
asymptotically stabilized by using smooth pure state feedbacks 
(by application of a Brocketss theorem (51), smooth feedback 
stabilization can only be achieved by using another type of 
control. A time-varying control law will be considered in the 
present case. 

Proposition 2.2: Consider the same control as in Proposi¬ 
tion 2.1 

w 2 = -k Wj (u 1 )z n (20) 

complemented with the following time-varying control 

ui = -k U] z ] +h(Z 2 , l) (21) 

where 

• k Ul is a positive number; 

• h(Z 2 . t) is a function of class C p + l (/> > 1), uniformly 
bounded with respect to t , with all successive partial 
derivatives also uniformly bounded with respect to 
and such that CV h(i), t) ~ 0, Vf C 2 \ There is a time- 


diverging sequence and a positive continuous 

function nr(-) such that 

\m >l> 0 => g(|y 2> o) > a(l) > 0, V». 

Controls (20) and (21) globally asymptotically stabiliz- 
able the origin Z = 0. 

Proof of Proposition 2.2: It has already been shown that 
the positive function V(Z 2 ) used in Proposition 2.1 is non¬ 
increasing along the closed-loop system's solutions, implying 
that || #2(Oil ls bounded uniformly with respect to initial 
conditions. Note that the boundedness of \ui(t)\ and 
is not needed to prove this fact. 

The first equation of the controlled system is 

zi = -k Ul zi + h(Z 2 , f). (22) 

This is the equation of a stable linear system subjected to the 
bounded additive perturbation h(Z 2 (t ), /). Therefore, |-i(0| 
is also bounded uniformly with respect to the initial conditions. 

Existence and uniqueness of the solutions over /?+ is thus 
ensured. 

From the expression of ii\. it is then found that ui(1) (taken 
as a function of time along a system’s solution) is bounded. 
And so is its first derivative [by using the regularity properties 
imposed upon h(Z 2 , /)]. 

Proposition 2.1 thus applies. In particular, V(Z 2 (/)) tends 
to some limit value V\ im (> 0), ||Z 2 (0II tends to zero, and 
Z 2 (t) tends to zero if u } (t) does not. 

We now proceed by contradiction. 

Assume that u\(t) docs not tend to zero. Then, ||Z 2 (/)|| 
tends to zero. By uniform continuity and since /;((), /) = 0 
(condition ), h(Z 2 (f ), /) also tends to zero. Equation (22) 
then becomes the equation of a stable linear system subjected 
to an additive perturbation which asymptotically vanishes. As 
a consequence, z\[i) tends to zero. From the expression of 
i/i, this in turn implies that u^(t) tends to zero, yielding a 
contradiction. 

Therefore, must asymptotically tend to zero. 

Differentiating the expression of u\ with respect to time and 
using the convergence of wi (/) and ||Z 2 (0ll to zero, we get 

'iii(i) = “ t *( Z 2 (/), t) -I- o(t) with lim o(t) = 0. (23) 

ut ^ 

Since (0h/0l)(Z 2 (t,), t) is unifonnly continuous (its time 
derivative is bounded), iii(t ), and thus (Oh/dt)(Z 2 (t), t), 
tend to zero (Barbalat's Lemma). 

By using similar arguments, one obtains that the time- 
derivalive of (i)h/dt)(Z 2 (t), t) and (i) 2 h/dt 2 )(Z 2 (t)< 1) tend 
to zero. 

By repeating the same procedure as many times as neces¬ 
sary, we show that (< i) l hldt J )(Z 2 {t ), /) tends to zero, (1 < 
j < p)- Therefore 

j=i ' 


( 24 ) 
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Assume now that V^n, is different from zero. This implies 
that ||^2(0II remains larger than some positive real number l 
(which can be calculated from Vn m ). The previous conver¬ 
gence result is then not compatible with the condition C 2 
imposed on the function /i(Z 2 , /)■ 

Therefore, Vj im is equal to zero, and Z 2 (/) asymptotically 
converges to zero. Then, by uniform continuity and using 
condition C\ , h(Z 2 {t), t) tends to zero. 

In view of the expression of wi, asymptotical convergence 
of z\(t) to zero readily follows. 0 

Remark: Dependence of the function h(Z 2 , t) upon the last 
state variable z n is not required when the function fe Wj (-) is 
strictly positive on R. The reason is that z n (t) unconditionally 
converges to zero in this case, due to the convergence of 
V(Z 2 (0) to zero (cf. proof of Proposition 2.1). 

It can be noted that only the control input '</] depends 
on time explicitly via the function h(Z 2) /). We will refer 
to this function as the heat-function, to establish a parallel 
with well-known probabilistic global minimization methods 
and underline the primary role of this term in the control, i.e., 
forcing “motion” as long as the system has not reached the 
desired equilibrium point, thus preventing the system’s state 
from converging to other equilibrium points. 

According to Proposition 2.1, when one is only interested 
in the regulation of Z 2 (as in the case of mobile robot path 
following), any sufficiently regular input ii\(t), which does 
not asymptotically tend to zero, can be used. This leaves the 
user with some freedom concerning the choice of this input. 
For instance, uniform exponential convergence of ||Z 2 (/)|| 
to zero is obtained when \ii[(t)\ remains larger than some 
positive number. Other sufficient conditions for exponential 
convergence ol ||Z‘2(0|| to zero, which do not require u\(t) to 
have always the same sign, may also be derived. For example, 
if 1 1/1 (/)| is bounded, it is sufficient to have |ji] (/)| periodically 
larger than some positive number. 

If the application further requires the regulation of z\ (stabi¬ 
lization of a mobile robot about a fixed desired configuration, 
for example), then Proposition 2.2 suggests implementing a 
time-varying feedback </]. In both cases, the same control law 
v 2 (or w 2 ) based on u\ -time-scaling can be used. 

The conditions imposed by Theorem 2.2 upon the heat- 
function are not severe and can easily be met. For example, 
the following three functions 


h(Z 2 , 0 = lWfcin(0 

/=n - 2 

h(Z 2 , t)= a J sin (/V) 2 2+j 

j=0 


HZ 2 , t) = 




j=o 


exp(M 2 +j) ~ 1 

c\p(bjz 2+j ) + ] 


sin(/i,/) 


(25) 

(26) 

(27) 


(with a.j ^ 0, bj ^ 0, / 0, and (i, ^ when i ^ j) 

satisfy these conditions. For the first function, this is obvious. 
For the second function, the proof is given in [26], The same 
proof basically applies to the third function which presents the 
additional feature of being uniformly bounded with respect to 
all its arguments. It can be noted that it is not necessary to use 
sinusoids at integrally related frequencies, as opposed to the 


solution proposed in [34]. In fact, the iheoiem m 4 n. 
it is not even necessary to use time-periodic tunaum* 
assumed in most time-varying feedback siahili/jn >,<>11 v , 

For practical purposes, the choice of the heal Imi im, M 
portant because the overall control performance t ,*i ■ v k 
convergence rate, time needed to enter a small hall un 0,1 i »\ 
zero, sensitivity with respect to perturbations, etc ) uuu slu 
depends upon this choice. This has been checked by ihe autj -n 
in simulation. By performing a complementary analysis in 
three-dimensional case, based on Center Manifold techniques, 
it is also possible to explain why the functions (26) and (27) 
arc better than (25) with respect to the induced asymptotical 
convergence rate. For the last two functions, the parameters 
n) and b,, which characterize the “slope” of h(Z 2 , /) near 
Z 2 = 0 have been found to have much influence on the 
transient time needed for the system's solutions to get close 
to zero. Basically, the larger these parameters arc, the shorter 
the transient lime is. 

C Stability and Asymptotical Rate of Convergence 

From Section 11-B, we already know that, when using the 
smooth lime-varying control law (20), (21) with k Wj (u\) — 
k' Wi \ui \ (k f W2 > 0), the convergence of ||Z(0|| to zero cannot 
be exponential. Indeed, would otherwise converge to 

zero exponentially and the integral J‘*\u[(t)\(It would not 
diverge. This would be in contradiction with the fact (pointed 
out earlier) that divergence of this integral is necessary to the 
asymptotical convergence of ||Z 2 (/)|| to zero. 

From the simulation of a smooth time-varying feedback 
control applied to a unicycle-lype vehicle, it has also been 
observed in [28] that the norm of the state vector did not 

converge to zero faster than t~ 2 for most initial configurations. 
This is much slower than the uniform exponential rate of con¬ 
vergence that can be obtained in the case of nonlinear systems 
the linear tangent approximation of which is controllable, It is 
claimed in [11] that it is not possible 1o achieve exponential 
stability for nonholonomic systems by using smooth (differ¬ 
ential everywhere) time-periodic feedbacks. In mathematical 
terms, this means that the system’s trajectories cannot satisfy 
the following inequality 

\\X(t)\\ <An|A(0)||ex P (-A/) 

VA r (0) in some open ball centered on zero (28) 

for some positive real numbers K and A. 

The practical significance of this relation, when it is satis¬ 
fied, is two-fold: i) the ratio ||A'(£)||/||A’’(())|| between transient 
and initial errors is uniformly bounded, and ii) all solutions end 
up converging to zero exponentially. 

It is worth noting that these two properties, regrouped under 
the strong concept of exponential stability, do not necessarily 
hold together. 

For example, the piecewise-continuous time-invariant feed¬ 
back law proposed by Canudas and S0rdalen in [6], for posture 
stabilization of a unicycle, only yields the following result 

\\X(t)\\ < {Ki + ^ 2 ||X(0)||)exp(-A0. (29) 
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Each solution converges to zero exponentially, but die 
slightest initial error, or perturbation, may produce transient 
deviations the size of which is larger than some constant. Note 
that this sensitivity to small perturbations, observed for this 
particular example, is not necessarily indicative of nonsmooth 
feedbacks as a whole. 

In [32], Sprdalen proposes another interesting control which 
may be seen as a time-varying mix of open-loop and feedback 
strategies, continuous with respect to time, but nonsmooth in 
the state vector. He shows that this control, which applies to 
any chain form system, yields the following property 

||X(0|| < ff (||X(0)||)exp(-Af) (30) 

where g(-) is a class JC-function (i.e., strictly increasing and 
such that #(0) = 0) which is not Lipschitz around zero. 
Precisely, the derivative of g(x) tends to infinity when x tends 
to zero. 

This property has been called ^-exponential stability. It 
is weaker than the usual exponential stability notion in the 
sense that the ratio between transient and initial deviations is 
not uniformly bounded. Nevertheless, it is better than (29) in 
the sense that the deviations are not lowerbounded by some 
positive constant. As in the previous case, all solutions tend 
to zero exponentially in the absence of perturbations acting 
on the system. 

By using the properties of homogeneous systems [12|, time- 
periodic feedbacks which are continuous with respect to both 
time and state, but not differentiable at the origin, have been 
proposed in [14], [20], [30] and have been shown to achieve 
the same type of exponential stability. 

Concerning the case of smooth time-varying feedbacks, such 
as the ones derived in the Section II-B, it is simple to verify 
that we have 


Pf(f)||<A'Pf(0)|| (31) 

for some positive constant K the size of which may be taken 
as close to one as desired via a suitable choice of the control 
parameters. 

Smooth time-varying feedbacks thus are, in some sense, less 
sensitive to initial errors than the aforementioned nonsmooth 
feedbacks. The “price 1 ’ paid for this type of robustness is 
that the system’s solutions do not converge to zero as fast 
as exponentially. In fact, in view of the above discussion, one 
can only expect to have 


controlled nonholonomic mobile robots (with restricted mo¬ 
bility) in the case of point-stabilization and to point out the 
difficulty in objectively comparing smooth and nonsmooth 
feedback solutions. So far, exponential stability, in the usual 
sense, has not been obtained and is most likely out of reach. 
Exponential convergence of the solutions to zero, in the 
absence of perturbations and modeling errors, is possible, 
however, by using either piecewise-continuous time-invariant 
feedbacks or continuous time-varying feedbacks which are 
not differentiable at zero. Smooth time-varying feedbacks are 
less efficient in terms of asymptotical rate of convergence, 
but are also potentially not very sensitive to initial conditions 
and perturbations in the vicinity of zero. A slow asymptot¬ 
ical convergence rate still does not mean that the system's 
solutions cannot be rapidly steered to an arbitrarily small 
neighborhood of zero, as pointed out in [34] (for example) 
and illustrated by simulation results in 129]. Nevertheless, this 
type of performance has not been obtained for small values of 
K . This again reflects the apparently unavoidable compromise 
between performance and robustness. It should also be noted 
that perturbations acting on nonholonomic systems are not of 
equal importance depending on the state component which is 
primarily affected: a deviation in a direction compatible with 
the vehicle’s mobility is clearly not as severe as a deviation 
which violates one of the system's kinematic constraints 
(lateral skidding of a car, for example). 

Further clarification of these issues is thus needed in relation 
to a rather fundamental question, seldom addressed in the 
control literature: Is the asymptotical rate of convergence 
a good measurement of the overall control performance? 
Answering this question is not simple, knowing that regulation 
errors are physically unavoidable and that what often really 
matters in practice is to keep these errors as small as possible 
under realistic adverse experimental conditions. While con¬ 
nections between robustness issues and asymptotical rate of 
convergence of the controlled system have been much studied 
in the case of linear systems (or nonlinear systems that can 
be approximated by controllable linear systems), they are still 
not well understood in other cases. 

To illustrate the difficulty with a concrete example, a control 
law similar to (20)—(21) has been simulated for a three- 
dimensional unicycle-type vehicle with the following non- 
smooth heat-function 


»<*.. i)={r 10 


>f :r 2 + U/ fr l) J 3 > <' 
if*2 + U/ fc l) J 3 < < 2 - 


(33) 


ii*(oii < *ii*(o)ii/(0; m = i. , Km m =o 

(32) 

where /(/.) is a decreasing function which does not tend to zero 

as fast as exponentially. For example: f(t) = (1 + t)~ 2 , in the 
case of the control considered in [28], as it may be rigorously 
established either by applying Center Manifold techniques [7] 
or by invoking two-time scale techniques, as done in [14]. 

The purpose of the above discussion was to summarize 
our actual knowledge concerning the stability properties of 


Note that this function does not satisfies the conditions im¬ 
posed in Proposition 2.2. The corresponding feedback control 
is time-varying, but not even continuous. 

The x 2 (t)) Cartesian position of the vehicle is 

represented in Fig. 1(a). 

The time-evolution of x*i(f), and the vehicle’s orien¬ 
tation angle 0(0 (« .r 3 (0) is shown in Fig. 1(b). 

After 25 seconds, all variables “seem” to have converged 
to zero. In reality, it is possible to show that xi(t) and 
both control inputs converge to zero exponentially, while 
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* 



Xt 


Fig. 1. 


*2 . Th«ta 

(b) 


2V(Jf 2 (0) = .r 2 (0 2 + ( 1 /A;i)j 3 ( 0 2 can only be shown to 
become smaller than t 2 (= 10~ c , in the simulation) after a 
finite time. 

This control therefore does not asymptotically stabilize the 
system to the desired equilibrium, so that the notion of asymp¬ 
totical rate of convergence does not even apply here. Is this 
fact sufficient in itself to assert that this is not a good control? 



Fig. 2 

III. Application to the Control 
op a Car with -n- Trailers 

A Modeling Equations and Notations 

We consider a car with -n- trailers as represented in Fig. 2. 
The system is assumed to move on flat ground. The wheels 
are allowed to roll and spin, but not slip. 

The vehicle counted first is the last trailer and the following 
notations are used: 

• / 7 is the distance between P, and r ,+1 

• O', (1 < / < n) is the angle between | P, and P t P,+ 1 
which characterizes the orientation of the vehicle (i + 1) 
with respect to the previous vehicle 

• nro gives the orientation of the first vehicle with respect 
to a fixed frame. For instance, we may choose: oo ~ 
angle(7(), P 0 A) 

• (v n + 1 is the angle of the car’s driving front wheel with 
respect to the car’s body. 

• v i (0 < / < 7i 4-1) is the intensity ol the velocity of the 
point P t . This is the translational velocity of the (i + 1) 
vehicle. 

• r is the radius of the car’s front steering wheel, and u the 
angular velocity of this wheel about its horizontal axis so 
that = ru. 

In what follows, only velocity control is considered, and u> 
and (d/dt)a 7l +i are chosen as control variables. 

Kinematic equations of this system, with respect to a fixed 
frame, have been derived by various authors. Sec [13], [221, 
135], for example. We will work here with a more general set 
of equations expressing the system's motion relatively to an 
arbitrary given path. 

A first set of equations is simply obtained by using the 
classical identity 

^P , + 1 = ^ P i + w, A P,P X +1 forO < I <ri (34) 
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where <2 t is the angular velocity about the vertical axis of the 
tth vehicle’s body. 

This yields the following equations 


7;*+i = rw 

~ Vi+i cos(o # +i) 

lan(nf!) 

/i 

( tan(a l4 .i) 


ao = Vq" 


a, = v t 




+i 


(0 < i < n) 

(!<.<»). (35) 


We note that the set of variables a, entirely characterizes the 
relative positioning of each vehicle with respect to the others. 

The remaining equations must describe the motion of one of 
the vehicles with respect to the path we would like this vehicle 
to follow. To this purpose, we choose the first vehicle (i.e., the 
last trailer) and the position coordinates of the point /V 

The path to be followed by this point is denoted as (C). 
For the sake of simplicity, we consider a smooth simple curve 
defined by one of its point, the unitary tangent vector at this 
point, and its curvature curv(s), with s being the curvilinear 
coordinate along the curve. Moreover, it is assumed that 

• curves) is differentiable (n + 1) times. This is necessary 
for P 0 to be able to remain on (C) without stopping, as 
it will later appear. 

• The radius of any circle tangenting (C) at two or more 
points and the interior of which does not contain any 
point of the curve is lowerbounded by some positive real 
number denoted as r mm , The set of the circles’ centers 
so defined is the Voronoi diagram associated with the 
curve [33]. This assumption implies in particular that 
|curv(s)| < l/r IM1I1 , Vs. For example, if (C) is a straight 
line, then r, mn = +dl and curv(.s) = 0. If (C) is a circle, 
then 7 inln is the circle’s radius and curves) = l/r nnI1 . 

Under these assumptions, if the distance between Pq and 
(C) is smaller than r mun there is a unique point on ((7), 
denoted as Po iPro ,, so that ||PoPo,proj|| is equal to this distance 
(see Fig. 2). 

Let a denote the curvilinear coordinate at the point Po, P i OP 
and (/ J o,proj‘» L f?) the Frenet frame on the curve at this point. 
The position of Pu in the plane is completely characterized 
by the pair of variables («, y) where y is the intensity of the 
vector Po.prajPu, i.e„ 


^0,proj P q — UN* 


(36) 


• 9 = ao - 0 t the angle between the first vehicle’s body 
and the curve’s tangent vector f. When the first vehicle 
follows (f f ) exactly, with a nonzero translational velocity, 
9 can only take values equal to ten (k E Z). Without loss 
of generality one may assume that the desired value for 
the angle 9 is zero. 

The following equations for the first vehicle (see also [29]) 
are then easily derived 

cos(0) 

tS = 1-TT" 

1 - curv(.s)iy 

i) = ”o sin {9) 

9i — curv(.s).s 

curv(.>»)cos(0) 

— <'()";-TT - ■ (3/) 

1 - c\irv(s)y 

By regrouping (35) and (37) one obtains the following 
control system 

A — v {) 4- Ui (38) 


with 


,s 


y 

9 



dim(.Y) — // 4- 1 


Ur n +i J 

J=?<+4 

"(I = ruj cos(j-,) 
4 

>‘■1 = -j'li+4 


cos(.r t ) 


ffl.l(A) = 

1 — curv(.j i )j- 2 

9i, a(A') = 

tan(./j) curv(j'i)co.s(.rj) 

) = ---7-7-7- 

U 1 - cur V(T\)J 2 

tlr \ 1 /' tan ( J 's) wn(- *4) 

’“W'zs&rA— - i 


“) 


, v ._l_ ( tan(j-^i) sin(.p) \ 

,lj n!:i c ^,)V l >- 2 ) 


Note that in the particular case where (C) is a straight 
line, & and y coincide with classical Cartesian coordinates. 
For other curves, one of the control objectives will be to keep 
the coordinate y smaller than r mlll all the time so as to avoid 
any ambiguity when using the parameterization (a, y). 

This parameterization has previously been proposed in [29] 
for the control of a unicycle-type vehicle. While it is primarily 
used here for feedback control purposes, it is also related to 
the approach developed in [21] for path planning, as shown 
in the next section. 

Let: 

• 9 t denote the angle between to and f(*) 


9 1, n +4 ( A ) = 0 

"O' 

• 92= 0 • 

. 1 . 

This control system characterizes our mechanical system, 
as long as A* belongs to the set 

= *x] - r min ,+r lllin [x7? x (] - |, + ^[)" +1 . (39) 
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Although controllability of the mechanical system in R 3 x 
(i.e., the possibility of steering the system between 
my two configurations in finite time) has been theoretically 
established (see |13J), for example), the control design and 
analysis is hereafter limited to the set SI. In particular, the 
angles n, (1 < i < n + \) are bound to stay in the 
interval ] - 7r/2, +7 t/2 [. For most practical purposes, this is 
not restrictive. Moreover, undesirable “jack-knife" effects will 
systematically be avoided in this way. 


B . About Path Planning 

As already mentioned, the angle 0 must be equal to zero (or 
7 r) when I\) moves along the desired path. This also means 
that the time-derivative of this angle must be equal to zero. 
Thus, according to the third equation of the above system, and 
since y is also equal to zero along the path, one must have 


tan(rvj 


f+/i curv(.s) if0 = () 
(-/icurv(.s) if 0 — 7r 


(40) 


It may also be noted that there is an ahuudan h » 
dealing with this type of geometrical problem Solan , 
posed in this domain, such a$ widely used Bezier's polui -, h 
curves (splines) for example [2], could thus lx oi mtnrsi i ( 
people working on mobile robot path planning pioblcm> :m,' 
yield methods complementary to those already oxptoiol 


C. Smooth Feedback Controls for Path Following and 
Stabilization about a Fixed Desired Configuration 

To apply the results of Section 11, system (38) should lust 
be converted into the chain form (1), or, equivalently, lo ihe 
skew-symmetric chain form (13). 

We first rewrite the original system as follows 

X = f\(X)u\ +//2*i'„+4 (42) 


with 


Ml 


'M() 


cos(.rj) 
1 - cur 


By taking the time derivative of this last equation and 
comparing the result with the fourth system’s equation, one 
obtains after simple calculations 


tan(r* 2 ) = 


/. 

curv?7pTi 
if 9 = 0 
_ 

if curv(s)'+i 

if 0 = 7T 


^curv(.s) + 
fcurv(.s) - 


/! curv (l, (») \ 
7fcurv(.-o-’+i ) 

1 1 curv (1) (s) \ 
ij curv(«.) J Yi ) 


(41) 


/il = 1 


f \ ,2 {X ) 
f>AX) 
/uffl 


1 - curv(./*i).r -2 
cos (./;*) 

1 - curv(./ , l ).r 2 
cos (d\\) 

1 - curv(.ri).r 2 
cos(j-j) 


fh. j(T), .r 2 , .r.i, 

fl\ 4(j’l- -Ch) 


.r 4 ) 


where curv^^(.s) is the first derivative (with respect to the 
curvilinear coordinate) of the path's curvature. 

Repeating the above procedure (n - J) times, one would 
obtain that, along the desired path and for each of the two 
possible values of 0, the angles tv, (1 < / < n + 1) are 
functions of the path’s curvature and its successive derivatives 
up to the order (/ - 1). Moreover, one may verify that the 
correspondence from (]-7 t/2, +tt/ 2[) m+1 onto /i m + 1 between 
the set of angles {r*i}i<i<rr/+i and the set {curv^(.*)}o<i< r » 
is one-to-one. 

A direct consequence of this fact is that the problem of 
steering the system between any two configurations (satisfying 
the aforementioned condition imposed on the range of the 
angles a,) can be addressed as a purely geometrical problem 
consisting of finding a planar path of class f?" -1 " 3 which 
connects two given points (corresponding to the initial and 
final position of the point I\ b), with given tangents at these 
points (corresponding to initial and final values of rvo), and 
conditions imposed on the curvature and its n successive 
derivatives at both extremities of the path (corresponding to 
initial and final values of angles nr, (1 < i < n + 1)). Since 
such a path obviously exists, one finds again in this way that 
the system is controllable in il. 

The possibility of parameterizing the vehicle’s motion by 
the curvature, and its derivatives, of the path drawn by the 
first vehicle has been here derived from the system’s error 
equation (38). In (21 J, this possibility, which yields the above 
formulation of the path planning problem, is presented as a 
more general consequence of the system’s flatness. 


1 -curv(ji)j‘ 2 , . 

/l.«+j(^ ) = -TT^-//l,n+j(- r 4< 1 ' ’ 


cos(.rj) 


/],«+4 = 0 - 


This control system is equivalent to the original one within 
the reduced set 

^rndurerl — It*] ~ 1 mini "f M Illin [X (] - “i ^) - 

In particular, due to the choice of the input v\_. the variable 
.r a (i.e., the orientation error 0) has to be kept in the interval 

l-f+ft 

This control system is directly converted into a skew- 
symmetric chain form via the change of coordinates (j> 2 ‘ X h-v 
Z, with 


Zi = .1’! 

Z 2 — 

Zi = /l,2(.r 1, X 2 , 

UX 2 

Z4 = k\Z2 + L fi Z.i 


Zj +3 — kjZ J+ i + Lf l z J+ 2 (j < 2 < n + 1) 


(43) 

where h v (x 2 ) is a smooth monotonic function which maps 
] - 7'nuit! + f, inm[ onto R. with first derivative strictly larger 
than a positive real number, and such that /i„(0) = 0. For 
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example, h y {xt) = (2r min /7r)tan(7rj//2r m ; n ) is a possible 
candidate when r min < +oo. If r nil „ = +oc, the simplest 
choice is h v (x2) = # 2 - This function is introduced here to 
force |#(t)| (— |x 2 (tf)|) to remain smaller than r m i n in the 
subsequent control analysis. 

One can verify that the Jacobian matrix (Ofc/OX) is a 
lower triangular'matrix with nonzero components d,on the 
diagonal given by 


Di = 1 



1 - curv(.fi).X '2 dh y 
cos(x 3 ) 2 0x 2 
(1 - curv(:i ]).r 2 ) 2 Oh u 
/lC0s(T3) 3 C0S(X4) 2 dX2 


_ (1 - curv(.r 1 ).r 2 ) , ' +2 dh t 

n+4 ~ ^ 

All other components are well defined in ftredu«rd- This 
matrix is thus defined and nonsingular on IJ r educ^i- Moreover, 
02(^rf»duted) = Thus, according to the theorem 0.5 in 

[23, p, 13], </)2 induces a diffeomorphism of class (7 71+] ( 7i + l 
being the degree of differentiability of curv^)) between 

^reduced 

Then, by using 

Lg 2 L J ft z t +.\ = 0 (0 < / < n, 0 < jf < ti - i) (44) 


and 

+i, (1-curv(.r I ).r 2 ) n+2 Oh . 

° h (n::? +1 iixnj:r 4 ^s(x ) r+«-^^ 2 ( ' 2) 

(45) 

it is easy to verify that the control system, with Z as state 
vector, has the form (13), with the auxiliary control input w 2 
defined by 


w 2 = (A:,,+ 2 Z 71+3 4- Lf x z n +i)ui + (L g2 L r }+ 1 z$)v2. (46) 

Once the system has been put into the skew-symmetric 
chain form, there only remains to apply Proposition 2.1 to 
determine a control input w 2 which stabilizes the point Z 2 =■ 0 
and thus make the mechanical system follow the path (C). 
One can also apply Proposition 2,2 to determine smooth time- 
varying feedbacks which make the system converge to a given 
configuration on the path. 

Note that stability is only local in this case since the state 
vector X must belong to f2 rt , r i ucec j. This implies that the angles 
0 and a, (1 < n 4-1) must have initial values in ] - f, 4-f [, 
and that the initial distance \y(ti)\ must be smaller than r mux . 
Remark: The method described above allows asymptoti¬ 
cally stabilizing the mechanical system to any configuration 
so that |r*i| < (7 t/2) (1 < n + !)■ In other studies ([22], 


[32]), for example), only configurations with zero angles a t 
(all trailers aligned) have been considered. 


D. A Modification to Broaden the Stability Domain 

A practical shortcoming of the above control design method 
is the necessity of starting with an orientation error |0| smaller 
than 7 t/2. 

This limitation can be removed by considering a more global 
change of coordinates which converts the initial control system 
to a modified chain form. 

This modification was implicitly used in [29] in the particu¬ 
lar case of a unicycle-type vehicle, and simulation results can 
be found in the same reference. 

The new transformation </m: X •-> Z that is considered is 
the following 


Z! = ,r i 
*2 = K{x 2 ) 


zj = 

. sin (j' A )dh U/ , . 

z 4 = A*] - — J* 2 )z 2 4- tfi a(JCi, ./' 2 , ./’j, r 4 ) 

.rj dx 2 

2;, = k 2 z 4 4- L qi z 4 


— k )4 .\z j4 . 2 4" L (Ji S i ^ n ) 


(47) 


where 

• (X) is the vector field involved in the system’s repre¬ 
sentation (38) 

• Aj ( 1 < j < n 4- 1) arc positive real numbers 

• hi/U'o) is the monotonic function introduced before. 
Remark: Instead of = .v^, one may also take cj = 

//£i(.j’ 3 ), with being a smooth monotonic function, alike 

h y (j- 2 )< which maps an open interval containing ] - i r, 4 -tt[ 
into B. For example, he(j^) = tan(fcrj) with k < (i/2) 
can be used. In this case the coordinate z 4 becomes Z[ — 
A 1 1 (sin(.i\i)//i«(./’j))(c)/i v / fir 2 )(x* 2 ) 2 2 4- -J‘ 2 . Jr 4 ), 

and all subsequent coordinates are modified accordingly. 

One can verify that <f> j(S2) = 7? ,H4 and that 
the Jacobian matrix (i)fa/dX) is a lower-triangular 
matrix with nonzero components on the diagonal 
equal to 1, (t>/i v /ftr 2 )(x 2 ). 1. l/(/i cos(.r 4 ) 2 ), 

i nhh cos(x 4 ) : ‘cos(.r 5 ) 2 ), ■ ■., i/((n::; ,+1 m oi^r 1 

cos (■/■j) 7l+0 ”- y ))- 

The change of coordinates 03 thus induces a diffeomor¬ 
phism of class (7 n4_1 between 11 and /? y,+4 . 

Then, by using 

L gj Li n z 1+3 = 0 (0 < / < n, 0 < j < n - 7 ) (48) 


J9iL ” aiZA 


(49) 


with the following auxiliary control input w 2 


to 2 — {k n4 . 2 z f)4 .^ 4“ Ijy x z u4 . 4 )vq 4* (50) 
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,t is simple to verify that the control system, expressed in terms 
.if the new coordinates, has the following skew-symmetric 
chain form 


_ „ cos^aj 

1 0 1 - curv(2] )a‘2 

sin(* 3 ) dh y , x 

z 2 = v 0 -„ 12Z3 

z-i dx 2 

sin( 2 3 ) dh y s , 

2 3 = —K\Vo - ~(j-'l)Z‘2 + V 0 Z4 

Z 3 0 x 2 

i 4 = -k 2 voZ3 + v 0 Z5 


Z,+4 — ~k l +2 ll 0 z j + :i + "O-J+5 (1 < ,1 < tl — 1) 


k] (1 < :I < 11 + 2) can be determined by comiiinng. 1 
the neighborhood of Z 2 = 0, the control u 2 ptovidcd in tii, 
proposition and relation (50), with the lincari/mg IcedbaA (7) 
Since these gains do not a priori depend on the path's shape 
one may also use, for this comparison, a snnplci iui *ar control 
calculated from the system’s tangent linear appumimation 
about the equilibrium (Z 2 = 0, u 2 - 0). assuming that the 
path to be followed is a straight line and that the velocity i> 0 
is constant. 

The problem of stabilizing the system to a ttsed desired 
configuration requires asymptotical convergence of the full 
state vector Z to zero. A smooth time-varying feedback 
solution is given in the next complementary proposition. 

Proposition 3.2. Consider the same control as in Proposi¬ 
tion 3.1 


i„ H = -fc„ + 2'l'0Zn + S + W 2 (51) 

with J '2 = hy X {z 2 ), and v u = ru> cos(ir,). 

Although this system is not exactly the same as the skew- 
symmetric chained system (13), a result very similar to Propo¬ 
sition 2.1 can be derived for path following. 

Proposition 3.1: If |no(0| and IM0I are bounded, and if 
the control 


n't = -k Wi (v„)z l ,+4 

(A 11 , J (): continuous application strictly positive on/? {()}) 

(52) 


is applied to system (51), then the positive function 


V(Z 2 ) = 1/2 (zj + (l/krfzj + (\/hh)z 2 4 

J=n+'l 

+■ ■ ■ + v n 


z n+l 


(53) 


is nonincreasing along the system’s solutions, and thus asymp¬ 
totically converges to some limit value Vj mi (which a priori 
depends on the initial conditions). 

Moreover, ih)(t)V (Z 2 {t)) asymptotically converges to zero. 

Therefore, if />o(0 does not converge to zero, then Vj im = 0. 
The submanifold Z 2 = 0 is thus globally asymptotically 
stabilized in this case. 

Proof of Proposition 3 I ■ The proof is quite similar to the 
proof of Proposition 2.1 except that one has to show at some 
point that the convergence of vo(s\n(z-s) / zz)(dh v / d:r 2 )(.r 2 )z 2 
and vqZj to zero yields the convergence of 1 * 0*2 to zero. 

Since |2 2 (0I is bounded (from the boundedness of the 
Lyapunov function), (dh y /dx 2 ){x2{1)) also upperbounded, 
and voz^{dh y /dx 2 ){.r 2 )z 2 thus tends to zero. Therefore, 
^(((sin(z : j)/2 3 )) 2 4- z$)((dh y /dx 2 )(x 2 )) 2 z$ also tends to 
zero. 

By assumption, {Oh y /dx 2 )(x 2 ) is bounded from below by a 
positive real number. Moreover, the function ((sin(z 3 )/z 3 )) 2 + 
z\ is itself larger than some positive real number. Along a 
system’s solution, it is also bounded from above, since |z 3 (/)| 
is bounded. Using these bounds in the previous convergence 
result, it is found that tends to zero. 0 

The remarks made after Proposition 2.1 also hold in this 
case. In particular, adequate values for the control “gains” 


w 2 = (54) 

complemented with the following time-varying control 

4* h(% 2 ' t) (55) 


where: 

• k V{) is a positive real number. 

• /»*,(■) is a function of class C 1 which maps /? into a 
hounded interval of /?, and such that: i) h s ( 0) = 0. ii) 0 < 
h[ l \x) < +00, V.r, and lii) |//, 2 ) (.r)| < 40c, Take 
for example, the sigmoid function: h H (:r) = (exp(uj) - 
l/exp(«.r) + I) (a > 0 ). 

• h(Z 2 ,1) is a function with the same properties as in 
Proposition 2.2. 

This control globally asymptotically stabilizes the origin 
Z = 0 of the system (51). 

Proof of Theorem 3.2: The first part of the proof consists in 
showing that vq(1 ), and its time derivative are bounded along 
any system’s solution. 

Since ||Z 2 (0ll * s bounded (due to the boundedness of the 
Lyapunov function considered in Theorem 3.1), it is clear, 
from the expression of the control and the properties of the 
functions //,(*i) and h{Z 2 , /), that \oo(t)\ is bounded. As a 
consequence, ||2r(f)|| is also bounded. 

Taking the time derivative of the 1/0 control law expression 


Vi) - 




Oh 

0Z 2 


■ Oh 

/j + m 


(56) 


which, in view of the boundedness of \\Z 2 {t )|| and ||Z(f)||> 
implies that \vo{t)\ is bounded. 

Although 7/0 is not, strictly speaking, a function of time only 
(since it is a feedback control), it can be viewed as such along 
any system's solution, and the results of Proposition 3.1 do 
apply. 

In particular, il vq(L) does not tend to zero, then ||#2(0|l 
tends to zero. In this case, h(Z 2 (t), t) tends to zero (from 
condition C\ and by uniform continuity). From the first 
system’s equation, we also have 


Z[(i) - -k^h^ziit)) 4- o(t) willHhn o{t) = 0. (57) 
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Using the properties of the function the equation 

implies that the following proposition is true 

Ve > 0, 3r) > 0, 3t 0 : (t > t 0 wd\zi{t)\ > e) 

=> (zi(t)i\(t) < -r/). (58) 

Since z\{t) cannot remain larger than t 2 with a negative 
derivative smaller than —27/, there is a time t\ (> tu) such 
that |«i(^i)| < c. Moreover, after the time Zi, |^i(Z)| remains 
smaller than e (since c 2 cannot be reached from below by z\(t) 
with a negative derivative). The above proposition thus implies 

Vc>0,3Z 1 :(/>Z 1 )=>(h(f)|<e). (59) 

This is a characterization of the convergence of z\(t) to 
zero. In view of the expression of this in turn implies that 
vo(t) tends to zero (contradiction). 

Therefore vo(t) must tend to zero, implying in turn that 
ti/2(0 and z(t) tend to zero. Now, in view of (56) 

i’o(t) t) + o(t). 

Since (c)h/dt)(Z 2 (t), t) is uniformly continuous (its time 
derivative is bounded), vo(t) tends to zero (Barbalat’s lemma). 
Therefore, (0h/dt)(Z2(t), t) also tends to zero. From there, 
the proof goes on like the proof of Proposition 2.2. 0 

IV. Conclusion 

New results about feedback control of chained systems and 
their application to path following and point stabilization of 
nonholonomic mobile robots have been presented. 

Throughout the paper, feedback control design and analysis 
have been performed via explicit Lyapunov techniques which 
apply naturally once the origirial chain form has been trans¬ 
formed into an equivalent form termed here skew-symmetric 
chain form. Asymptotical stabilization of the origin of a 71- 
dimensional chained system has been achieved via a two-step 
approach which allows path following and point stabilization 
of mobile robots to be treated within the same framework. This 
approach yields a simple way of determining globally stabiliz¬ 
ing smooth time-varying feedbacks the underlying structure of 
which is easily interpreted. It also applies to the determination 
of Holder continuous time-periodic feedbacks which ensure 
/C-exponential stabilization of the origin, as illustrated in [30] 
in the three-dimensional case. Several smooth and nonsmooth 
solutions to the point stabilization problem have then been 
tentatively compared by recalling and commenting upon the 
type of stability associated with each of them. 

Application to path following and point stabilization of a 
car pulling trailers has been addressed in the second part 
of the study, based on a specific parameterization of the 
system’s configuration which facilitates the decoupling of the 
path following problem from translational velocity control and 
yields general model equations in the chain form. Finally, it 
has been shown how, by an adequate choice of the system’s 
state coordinates and a slight modification of the original chain 
form, it is possible to derive feedback control laws endowed 
with a larger stability domain. 
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Technical Notes and Correspondence. 


Input-Output Robust Tracking Control 
Design for Flexible Joint Robots 

Zhihua Qu 

Atofrttcf—This paper presents the first input-output robust control 
design for the trajectory following problem of a flexible-joint robot 
manipulator. The proposed design provides a class of controllers which 
only require position and velocity feedback and ensure global stability. 
The resulting stability is that the tracking error can be made to be smaller 
than a design parameter arbitrarily chosen by the designer. The proposed 
control is robust since it guarantees tracking performance in the presence 
of high-order nonlinear uncertainties including unknown joint elasticity, 
unknown parameters, load variation, and disturbances. Practically, it is 
more Important that no measurement on acceleration, jerk, or position 
and velocity deformation is required. 

1. Introduction 

Control design for robot manipulators with joint flexibility has 
attracted much attention from control researchers in recent years. The 
main reason is that, as shown by experimental study in [241, joint 
flexibility must be taken into account in both modeling and control 
design to achieve high tracking performance. Common sources of 
joint flexibility are gear elasticity (for example, harmonic drives), 
shaft windup, etc. One of the models of flexible joint robots was 
presented in [201 in which some energy terms are neglected. Never¬ 
theless, most researchers have adopted this approximate model, and 
we choose to use this model as well in this paper. 

Several methods, such as feedback linearization, observer design, 
adaptive control, singular perturbation, and robust control, have been 
investigated to design effective conrrol for flexible joint robots. 
A list of references on these Results can be found in the survey 
paper [221; we shall only give a brief synopsis of the most recent 
developments and provide a comparison of different approaches. In 
feedback linearization method [20, 10], control is designed based on 
feedback linearization transformation which requires the knowledge 
of the dynamics and acceleration and jerk measurements. Under 
the assumption that joint flexibility is “small," there are two time 
scales in system dynamics, and a control can be designed using 
singular perturbation technique. Adaptive version of this slow/fast 
control was studied in [5H71. The resulting control is intuitively 
simple since it usually contains two pans; a control for rigid body 
and a corrective control; the control, however, does not guarantee 
global stability. The most recent adaptive control scheme proposed 
in [II| guarantees global stability under the standard assumption 
that dynamics can be fully parameterized. On the from of ro¬ 
bust control design, local stability was shown [19] which can be 
viewed as an extension of robust control results for rigid robots 
[14], [16). Later, a robust control guaranteeing global stability was 
presented in 131; this full state feedback robust control design is 
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an application of the result in [18] and an extension of the robust 
control law [2| for rigid robots. Hybrid control, or combined robust 
and adaptive control, of flexible joint robots has also been studied 
[15], [4], 

So far all existing control laws for elastic robots require full 
state feedback. There are some results [121, [25] on designing stale 
observer for flexible joint robots and on the subject of how to 
close the control loop, that is, how to design a control based on 
the estimates of the states [13]. The result in [13] combines high- 
gain observer with high-gain controller to guarantee local stability. 
Since the separation principle does not hold, in general, for non¬ 
linear systems, it appears unlikely that full slate feedback control 
can be changed to an estimate-based control to guarantee global 
stability. 

In the present paper we consider for the first lime the input-output 
(I/O) control problem of flexible joint manipulators. The control 
objective is to design a robust control which guarantees global 
stability and good tracking performance but requires only feedback of 
link position and velocity. The proposed design method is inspired by 
the recent result [17] on robust input-output design of mainly linear 
systems. The proposed control design is not only more attractive 
in practice since it requires less feedback information, but also 
allows the presence of significant but bounded nonlinear uncertainties 
including fast time varying parameters, modeling errors, load change, 
and unknown flexibility. 

The paper is organized as follows. In Section II, dynamics of 
flexible joint robots and its properties arc presented, and necessary 
assumptions are introduced. In Section III, input-output robusL control 
design is proceeded. Simulation results are presented in Section IV. 
Finally, some conclusions are made in Section V. 

II. Problem Formulation 

We shall consider the dynamics of a robot with flexible joints to 
be described by the following nonlinear differential equations 

0 = M{q\ )(j { + A T (i/i. (Ji ) + I\(q\ - q>)- 0) 

Jij 2 = J\(qi - qi) - Dq> + T + P(q ]t q { ) ( 2 ) 

where q\ and qi represent n x J vectors of joint angles and motor 
angles, respectively, r is n x 1 the control vector of motor output 
torques, M(q\) is an n x » inertia matrix (symmetric and positive 
definite) for the rigid links, and I\ is a diagonal matrix representing 
the joint stiffness. In (I), N(q\> <h) = q { ) ( h + G[q\) + 

F,iq i -I- Ftiq i) -I- r t/ , V„,(r/i, (j \) is an n x u matrix of centripetal 
and Coriolis terms, G(q\) is an // x 1 vector of gravity terms, 
Fd is an n x n diagonal matrix of dynamic friction coefficients, 
FAq i) stands for static friction, and T,i denotes the lumped sum 
of all bounded disturbances and is in general a unknown nonlinear 
functional of q \, <]\. In (2). J is a diagonal matrix of actuator 
inertias reflected to the link side of The gears, D is also a diagonal 
matrix of torsional damping coefficients, and P(q}, qv ) denotes any 
possible nonlinearity whose bounding function depends only on qi 
and q\ . 

Let qi characterize the desired, smooth trajectory that the robot 
should track. The assumption of q'l being smooth implies that q'{ and 
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derivatives up to the second order are continuous and bounded 

ictions of time. The following ure the other important properties 

,ind assumptions on robot dynamics that will be used in this paper. 

P.l) As shown in [1], the inertia matrix satisfies that, for all r/, € 
W", MI„ < W(</i) < 777(fy,)/ (l some for known constant 
m_ and nonnegativc function Tf/(ryj), where I„ E 9V' y " is 
the identity matrix. Function 777(r/j) reduces to a constant lor 
robots with only revolule joints. 

P.2) It has been shown in [1] that Y,„[tn. q\) is a function 
of at most of first order in q\ and qi . Therefore, there 
exist constants ,i\ and h such that ||V, M (r/j, q\)\\ <J| + 

• h\U'\ || = f >\( -r j ), where || • || denotes Euclidean norm, .n is 
the output vector defined by ,rj = \t { / {] r , and r i is an n x 1 
vector representing the trajectory error, i.e., r i = </'/ — q\. 

A.l) Unknown functionals in dynamics equations (I) and (2) 
include dynamic and static friction, bounded disturbances, 
etc. It is assumed that they be bounded by known functions 
as ||G*(</i) + F ( /r/i *f I'\ ((/ 1 )|| < U 4- )i||.i‘i|| = ), 

||T f #|| < /m(■*’], /), ||F(ryi. ()\ )|| < !>\(.11 ), and \\M(u i )|| < 

f> r A -n )• 

A.2) The feedback information for control design are only the 
measurements of joint position and velocity, i.e., q\ and 
cyi, The position and velocity deformations qj — q\ and 
qi - <ji are unknown, but initial delormations arc bounded, 
that is, ||.o(fn) - .i*i (fo)|| < ft i, or ||.i j(/o)|| < for some 
constant /y, or /> 7 , where ./■_» is the internal state defined by 
■ r -’ = ['1j 'Ij 1 1 ■ 

A.3) It is physically guaranteed by the property ot matrices 
./. 73. f\ > () that the fictitious system 

./;■' + Dr 4 A'l- = /+2f + f (3) 

is unilormly asymptotically stable |23|, | 8 ] under zero fic¬ 
titious input /(/) = 0. That is, there exist constants -> n , 
■ 1 , and such that the state transition matrix /») 
associated with (3) satisfies the inequality ||i»(/. tu )|| < 
-,n fi{t) + m ,\< ~' jil 1 01 for all t > /n, where fi{t) is the 
standard impulse function. 

The parameter matrices A’, ./, and 73 are unknown and lime varying, 
their values are limited within compact sets as 0 < A_ < A < h\ 
0 < 7 < ,/ ^ .7, 0 < D < I) < 73. and their denvalives up to the 
third order are assumed to be bounded. 

It is worth making several remarks here about the properties 
and assumptions listed above. First, for notalional simplicity, we 
consider here the worst situation that no information on dynamics 
except their bounding functions is known. In many applications in 
which part of the dynamics are known, only the unknown dynamics 
need to be bounded since known dynamics can be compensated 
directly by some nominal (or standard) control law and since only 
the unknown dynamics have to be hounded and consequently com¬ 
pensated by robust control. The second remark is about Assumption 
A.3). If the matrices ,/, 73, and K are constants, a lower bound 
on convergence rale, -, 2 , can be determined from their upper and 
lower bounds. If the matrices arc slowly time varying (that is, 
time varying parts belong to some space such as Li or L >), 
can also be found by invoking the results in |9|. Only in the 
case that the matrices arc fast time varying, yj has to be assumed 
since there is no closed-form solution. ->2 can be estimated in 
this case, however, because system (3) is always stable. In the 
event thal matrices .7, 73, and K arc known and that there is 
no uncertainty in ( 2 ), the motor angle qi can be estimated using 
linear observer, and I/O control problem reduces to state feedback 
control problem. The assumption D > 0 may always be ensured 


physically; although this assumption is needed to sU > 
stability under I/O control, the control to be piopo^i 
damping and in turn achieves stability of output luiU 
stability imposes no restriction on the value of /). 1 

be shown later that Assumption A.2) can be rcmo\ed so ii. „ ,, 
speaking, I/O control becomes globally stabilizing tin Hu 
system. 

The control problem addressed in this paper is u> v\r\,\ l{ ,, 
input-output robust control design procedure foi flexible |oun 
The main difference between this work and previous work uu mini 1 
control in [3], |4| is that the control to he designed require onu Hi 
input and output feedback information, i.e., r, and 7, Measuununi 
of acceleration, or jerk, or deformation is not required The coiiuol 
is called to be robust since it requires no exact knowledge of the 
systems, that is, nothing beyond the properties listed above 'The 
control objective is to make the robot track any given uaieciory 
with arbitrarily selected accuracy despite of unknown but bounded 
nonlinear dynamics, 

Wc can rewrite system dynamics (2) into the following state-space 
representation 


■i‘> = Aj.r: 4 B>[T + 7w/j F(q \, tj\ )]. qj t ( \>,r 
where 

_ ^ 


(4) 


A> = 


0 /„ 

-.7 1 A' -.7 7; 


D) - 


.7 


r, = [l„ ()]. rj!]'. 


In the general case that the matrices and lh are tune varying, the 
solution to system (4) can be expressed in terms of transition matrix 
4 , (/. t {] ) defined in Assumption A.3) as 

qA f ) ~ U) ) » 1 , <o) + [t ■+ qi + P(qi • <ji)\ (3) 

where initial conditions are assumed to be zero, and * denotes the 
convolution operation. 


111. Robust I/O Control 

To design control without feedback of motoi angle q,>, consider the 
following linear time-varying system 

F(s. t )ti = r + </ (b) 

where y, i\ and <1 are column vectors of n dimension, y is the 
output, r is the corresponding input, d is disturbance, and F(s. t) -- 
Js 2 4 73s + K is the so-called time-varying differential operator 
[231, 128]. Thai is, system ( 6 ) can be rewritten in terms ot differential 
equation as .7 ij -f Dy + Ky = r 4 d. 

The intermediate problem to be studied is under what input r the 
closed-loop system becomes 

F tJ (ji)J„ i ,7iy = r 4 - d' (7) 

where /3 fJ , and K„ represent any choice of nominal values 
of ,7. 73, and 1\ respectively, r is any reference input, d' is the 
equivalent disturbance, and differential operator F t ,{ *) is given by 
F 0 (.s) = 4 73 (J .s -I- K „. A solution to this problem is provided 

by the following lemma. Its proof can be found in the Appendix. 
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Lemma 1: Choose the input v to be 

= G.\y + r + tyi + n a (8) 

where Cm, G 2 , and G,\ are diagonal matrices, and q\ and //? are 
auxiliary signals defined by 

ill = -91 4* Gn\ 1/2 - -i/a + G 2 //■ (9) 

Then, there are unique choices for matrices G, such that, under the 
input ( 8 ), system ( 6 ) is translated to (7) with </' = (*+ 1 )' 1 (*7„ 4 * 
/n — Gi )tf. 

The above lemma provides us a mean of designing I/O robust 
control. Now, reconsider dynamic equation (2) by rewriting it as 

+ Din + Ken = r + Kq x + P(q 1 , q 1 ) 

= [G: t c/2 + t + r/1 + 1/j] + [A’r/1 + P(r/1, </)) 

“ G:iqi - i/} - i} 2 ] = i» + rf 

where //i And //2 are given by (9) with /■ = r and // = </ 2 , and (7, are 
given by the solutions in the proof of Lemma 1 [i.e., (18)]. It follows 
from the discussion from Lemma 1 that dynamics (2) becomes 

i'll + «A7 ‘ DJh + .7,7 1 K t ,Hi ~ r 4- P'(q\, c/ 1 . O (10) 

where r/i = J</ 2 , Ci = * r, C 2 = f * Cm <:» = r “' * O 2 , 

C = [Ci T d cn 7 '. and 

91 , C) = (« + l) -, (»7» + A. - Gi) 

■ [K(ji + P( f/i .</,) — Ch<]i ~ >li ~ »/a]. 

For now, the equivalent uncertainty P f (qi, q 1 , O contains the un¬ 
measured state variable q 2 . Later, qi will be eliminated using solution 
(5), and then P ( {qi< <j\ , C) will be bounded by a known bounding 
function only of qi. q \. 0 i for designing robust control. 

It follows from ( 10 ) that the Solution for q 2 is given by 

<Mt) = .r 'G 2 < Ai ° , / 2 (t„) + r'T 

+ J-'Cir^IhoJv /*'(«.-91, o (in 

where = [9a (^o)./(^o) </■]'■/((.>) + 9 a' (A> )Mto )] r , . 4 *,, and 

Din are the matrices A 2 and Di, respectively, with .7 = 7v = 7v„, 
and D = D„, and 7, a filtered version of r, is defined and can be 
calculated by 

? = <?*:. ; = [;?' ] 7 , i = Ja»-. + /Ja..J.r. (12) 


Lyapunov function candidate, a positive definite function with respect 
to .ri and - - if, to be 

V = V 1 +V 2 . Y\ = if / A‘,,fi + i(nri+f'i ) 7 Af(#/i)(nfi+f i ), 

= i(i - i/) f (i - u) (14) 

where // is the vector to be chosen later, and a > 0 is a constant 
scalar. The following lemma illustrates the property of sub-Lyapunov 
function Vi (. 14 ) along the trajectory of subsystem (13), and its proof 
is included in Appendix. 

Lemma 2: Consider systems ( 1 ) and (2) under Properties P.l) and 
P,2) and Assumptions A.l), A.2), and A.3). The time derivative of 
the sub-Lyapunov function ^ j (.r j) satisfies the following inequality 
along the trajectory of system (13) 

Vi(.ri) < -A 1 ||fi|| i 4*||H , (.i , i )||v?,Ui. (■**])A' J' 1 T (15) 

where Ai = oA„ llu (7i yl ), A U1111 (-), and A IIIliN ( ) represent the opera- 
tion of taking the minimum and maximum eigenvalues, respectively, 
*/ (]) = [r*7„ 7„].ri = nr i + r i, and 00 is a well-defined 

function only of .c\ and 0 i. 

Note that bounding function CO does not depend on 

internal state and only requires an estimate (or bound) on the 
initial condition of .r 2 . In » Ci)- only one of bounding terms 
on initial conditions is included and is denoted by that associated 
with p 7 . In the resl of the proofs of Lemmas I and 2, wc choose 
to leave the terms of initial conditions out of bounding functions 
for simplicity, and we can remove the term containing />- as well. 
Although the system is highly nonlinear, this treatment is legitimate 
since nonzero initial conditions obtained in inpul-output modeling 
come from exponentially stable linear systems such as (II) and 
(9). Consequently, nonzero initial conditions contribute in V only 
exponentially decaying time functions multiplied by terms that are 
linearly proportional in \ r to the norm of the state. Note that Lyapunov 
function and its time derivative still have quadratic-like expressions. 
Thus, the terms associated with nonzero initial conditions can be 
combined with negative definite quadratic terms using the inequality 
-rr-t-2|r/| \b\ < b* to conclude the exactly same stability result. That 
is, Assumption A.2) can be removed, and nonzero initial conditions 
need not be considered in stability analysis. The trade-off of assuming 
zero initial conditions is that the terms bounding initial conditions 
may converge slow and therefore dominate the behavior of the overall 
system. By compensating nonzero initial conditions, convergence rate 
is what is designed and found in stability proof. 

Upon having Lemma 1, we can proceed input-output robust control 
design, the proposed I/O controller is generated by the following 
recursive mapping 


Substituting solution (11) into (1) yields the output error system 
.i*i = .4 1.r i 4- D\(AA — .7 1 7 + r/i ) (13) 


where 


-4, s 


0 

— 37~ 1 (qv )Kfj 


In 

0 


Di 


0 


■^-4 = A'~'A'|,n + A' 1 )ij'{ + A'*'JV(9i, </,) - 

J-'C 2 </„) - J-'Ctr'**' * B a „JoP'(gi. 91. o. and 

K P > 0 is a diagonal gain matrix chosen by the designer. 

Stability analysis and control design will be done using Lyapunov's 
direct method. To apply the Lyapunov technique, let us choose 


!' = [«'/ "It] 1 ' 


If I = If I I + 1/12- Hi = H'JI 4* ff‘22 - 


// ] 1 = r».7J£ 1 rtf ( q i )«'( V 2 ). 


-J J /|(.!*1, O-/)||//iUm Cj- t)P , 


U 22 


If 21 = Mill ~ ~~ I )■ 

= ga(£i> Ct. f)|l// 2 (.n-M,G-Cw 0 ir j 
ii//2 ( *r j. “-j. Oa f)ir j+l 
■ !J‘i (■*'i - ~i. Cl- f)- 

r = T\ + Ti. t\ = fUl2 + if] + (.Ar’A’u - 7„Ui 

+ (.7,7* Do - o7„ 

//;i(/’i- C- f) 


r-i = 


//■■ i(* r i ^ U* Oil + e. 


* flu (■/* i - -- CO 


(16) 
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nere a > 0 and \*j > 0, A- = 1.2.3 and j = 1,2, are 
nstants, the constants Vj arc,chosen such that the first-order 
irtial derivatives of u with respect to their variables are well 
/fined, and /h(jvi,C*- 0 - w (-« i )y?t(-n, £.*<<:*• 0 = 

|wi|4* mil *11^11' ll ,r (- r l)|| (■»'!, = ("l — -I )<u 

II.Cl XiX’ut). Ha = IIIAalll. /mUi. 00 = (m - 

(-n. 

The recursive mapping basically involves finding bounding func¬ 
tions of ||f/j|| and || #V- 2 1| for obtaining u 2 and r, respectively. These 
bounding functions should not be functions of the state derivatives 
but only of the system state. To this end, the terms ||<f||| and 
||/'/ill can be bounded by first developing bounds for the first-order 
partial derivatives of u, with respect to its variables and then by 
determining the bounds for the first-order time derivatives of its 
variables using dynamic equations (12) and (13). The bounding 
functions are usually obtained by taking the Euclidean norm. This 
design process yields that, since u \ is a function of r t and Ci» Hi (■) 
and therefore //j are functions of .ri, O, C< and :i, and that */»(■) 
and therefore t are functions of r \, c,' and As a result, the control 
design procedure is well defined. There have been several ways to 
guarantee differentiability of robust control [17). In the case where 
there arc known dynamics in the system, «i i should be modified to 
compensate. Similarly, 1 and t\ can be redesigned since not every 
term in ii\ and ii 2 is uncertain. For simplicity, we leave the details 
to the reader. 

Wc are now in a position to slate the following theorem on 
control (16) which requires only input-output measurements and 
is robust with respect to any bounded parameters and uncertainties 
lor manipulators with flexible joints. Its proof can be found in the 
Appendix. 

Theorem ■ Consider system (1) and (2) under Properties P.l) and 
P.2) and Assumptions A.I), A.2), and A.3). It I/O robust controller 
t is chosen to be the outcome of the mapping procedure (16), then, 
while all internal state variables are uniformly bounded, the trajectory 
tracking error a \ (both t and /) is globally and uniformly ultimately 
bounded. That is, as lime approaches infinity, the magnitude of the 
tracking error characterized by \ \ (•. t ) becomes no larger than the 
design parameter t* where 


2n min 


1 


/ ^ m i ii ( p ) 

\ ^ 111 ix ( h p ) 



Furthermore, the robust control r is continuous and globally, uni¬ 
formly bounded. 

It should be noted that, for good transient performance under 
nonzero initial conditions, f, should be chosen to be proportional 
to the initial condition ol measurable sub-state, .ri{tu). In this case, 
the stability of uniform ultimate boundedness becomes Lyapunov 
stability, and the stability is global since .n(/o) is available as pari 
of state feedback information. Under zero conditions, f, should be 
chosen to be any small constant. 


IV. Simulation 

To illustrate the effectiveness of the proposed input-output robust 
control scheme, consider a two-link flexible joint robot. An I/O 
robust controller will be designed for the robot using the nonlinear 
mapping (16). Each step in mapping (16) requires to find a bounding 
function for the time derivative of the previous fictitious controller 
and, if necessary, this derivation procedure can be automated using 
symbolic manipulation package. Note that a sophisticated nonlinear 
function of some variables can always be bounded by a simpler but 



MliMUu arror« 



higher order polynomial functions of the same variables. Since the 
proposed control has complete robustness, even a control with over¬ 
simplified expressions works very well. Based on this observation, 
we choose robust controllers u i. i/j, and r m the lorm ot (16) 
as follows: - (1 4- ||jt||^)/ 1U, mlyi) = 2, f i ~ tj = 

M = 1.0. n = 33, A /f =. /, IK"' ■“ 57. i/i - pj =■ 2, 

D„ = 0.3 1, K„ = 90/, J„ — /, hi = [1 + (y?i T + ||»i - 

-ill 2 + Ill’ll 2 ]/ 10 - (h = [1 + Hi + II "i “ -;|| J ]/10. The auxiliary 

state : is defined by (12) using the nominal parameters D „, 

and K„. 

For a two-link flexible joint robot, the expressions ol M{q) and 
X(q* i)) in the dynamic equation (1) can be found in |l|. For 
the simulation purpose, we made the following choices: ni\ = 
in. = l, /i /j =. 1, F, t = F\ = 0, ./ = /,/)=?/, 
K[t) = 100+90 sin (0.3/), T fl (j) sin (0.2ry, (/)) + ().!cos(/), and 
Pi<]i- 7i)(.y) = cos(||(/i|| J + f /2 0)), where j - 1. 2. The desired 
trajectory for |oint angle to track is <j l = [1 - cox(/)]7. 

The lobusl control was simulated using SIMNON^ 0 , and the 
simulation results are shown in the figures. Further research is needed 
to determine bounding functions such that robust control requires less 
control magnitude. 


V. Conclusion 

A robust control design method was presented in this paper for 
robot manipulators with flexible joints. In contrast to existing robust 
control results and to other design techniques such as adaptive 
control and singular perturbation, the proposed control guarantees 
global stability while using only measurement of joint position 
and velocity. Because acceleration and jerk measurement is no 
longer required, the proposed control is more practical and attractive. 
Furthermore, the proposed control works robustly to compensate 
any unknown but bounded uncertainties including unknown flexi¬ 
bility. (time-varying) parameters, load variation, and disturbances. 
Regardless of uncertainties, position and velocity tracking errors 
have been shown to satisfy any arbitrarily small tracking accuracy 
requirement. In return of achieving such a complete robustness, 
the designer is required off-line to perform differentiation and to 
determine bounding functions in order to find robust controller 
through the proposed nonlinear mapping. The off-line calculation 
can be automated using a software package with the capability of 
symbolic manipulations. 
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Appendix 

Proof of Lemma J; Utilizing differential operator, we cap rewrite 
the input v as 

25 Gaj/ 4 r + (* 4 1) 1 [Gi v + Gzu] 

— (*/« 4 I„ — CTr'i) 1 {[(•** 4 1)Cm 4 4 (* 4- l )r}. 

Substituting the above equation into ( 6 ) yields 
[(rt/ii + In — Ctj )(j/ 4 Os, H-A) — (jhtH-1 )G% — G'f]y 

= ( 8 4- 1) v “I" ( rti I, 4 In — G I )ft. 

It then follows that the closed-loop system is given by (7) with 
c1 ' = (* 4 1) _ 1 (*A ( 4 I n - Gi )d iff the following identity holds 

[(sl tt 4 Ai — Gi )(Jh 1 + Ds 4- A ) - ( * 4 - 1 ) 6 \ — Gf\ 

= (*41 HA,/ 4 4- A\,)J“ 1 A (17) 

Since all matrices are diagonal, the above equality can be rewritten as 

[(* 4-1 — Gi i)(A* 2 + D,s 4- Ai) — (.v 4 l)Gi, — (^ 2 ,] 

= (* 4“ 1)(A,i/ 4- 0 (3 ,* 4- A) -j- 

where subscript i denotes the /th element on the diagonal. It is easy 
to verify that the above equality holds if G \, GY and G\ are chosen 
such that 

r 1 9 j ' 

G Xi = — - 2j - —, 

(j i, — /), 4" A, 4* (1 ■“ G \,) D, — 3,7, — 2,7, 

eyOn, j I\-oi , D ot - 

Z7 1Z 17, 

Gi, = A', + (1 - Gu)K, - Gu - 

- f 1 + 1~ V, - — T — j. ~ (18) 

Solution (18) guarantees identity (17) which in turn guarantees that 
the closed-loop system is internally stable. The above relations will be 
used to determine known bounds on G [t G 2 , and GY These bounds 
are required in designing robust control. □ 

Proof of Lemma 2. It follows from (13) and (14) that 

Vi(.n) = rf K t ,i j 4 - -i/ (.11 nV/U/i )*/'( n ) 

+ «' 7 ('*’i )(«n 4“ m ) 

= A',,r 1 -w l (,n )7 v\7 _i t4</ 1 Ui (0-5A/(</i) 1 / (.r t ) 

4 r\M{q \)('i 4 7i A4 4- 7vY]. 

Thus, the prool can be completed it the uncertainty Q.5Al(qi )</'(,*’1 )4 
<\M(q\)i] 4 K A4 4 h'm can be bounded in euclidean norm by a 
well-defined function q?\ (q \, < 71 , Ci). 

It tollows from (9) and ( 8 ) that (si,, 4 /« - G\ )i/i = G\ GV/j 4 
Git 4 G\i)j and (#* 4 l)»/j “ GVjj. Hence, wc have 

= 7wji 4 P((H*q\) - Gnq 2 - (« 4 IP 1 
’ [Gi A qi 4 Gi P((/i,r/i) 4 Gjfji] — (h 4 1) 1 Gi r 

= h q\ 4/%i <<i \) —(* 4 l)Gj —— 7 j4G 1 1 1 

*41 •* 4 1 

- (« 4 l)“'[G y i A ‘71 4 GiP{q\*(j \)] - Go —qi 

■s 4 1 

4* —— Gj -—“72 - GjCi 4 (« 4 1) 3 Gi(i 
m 41 « 41 

1 ■ 1 

5 = 7l q\ +P(qi, fji )-(s 4 1 )Gi j (J 2 4Gi <li 


- (s 4 1)” J [Gj Kq\ 4 Gi Pfr/], fji)] - G* 72 

+ ~TT^ ——(/2 
x4 1 4 1 

- (* 4 1 ) J [GiCi — 3(« 4 1)~'GiCi 4 3(s 4 1)" J 
• GiCi -(.*> + 1) 1 

in which the last two steps are derived by integration by part. 
Such steps are necessary to find a well-defined robust control r by 
guaranteeing that the second-order time derivative of the bounding 
function to be determined does not depend on r explicitly. 

The bounding function <^i(.rj,CO that bounds uncertainties in \\ 
can then be chosen to be 

(•»’!, Ci) 

= || A|| \\qi || 4 ||A>, || 4 77%, JHtfll 4 /11 (.11 )|| q\ || 

4 ) 4 pi(-ri) 4 ^ II 1 )pft(.ri) 4 nm(r/i )||r 1 1| 

+ Ill’ll Hill" l tl|CV ’ J ' 'II • 11(7 + || J||)||/,t + |||G, HI ■ Holl 

+ /'|K' i( ^<'-^,„J„||p'||||r yi || + , t(Jl ) 

Jin 


4 / 

Jin 

+ (|||G,||| + |||6', 
I ] I . 


+ 3|||G’i III • Iki|| 


fjc'i< , ^'-’ , Bi..j..|| nit;, hi 

ClII <l» + 

Jin 


+ lll^'IH- 

+ /'iir.^^-’^.iKiiriMiiG.HMi, 

Ji u 

■ /»,( j !) + IIKVjHI ■ 7^-^ + ‘^r 


+ Gi 


where ||| ■ ||| denotes a known bound or bounding function on (he 
norm ot the argument 

0 /„ 0 
A , = 0 0 /„ 

-J„ 1 A., -J - 1 (A„ + - J - 1 D „_ 

C‘, = [/„ - .7,7 1 A„ 21 „ — .7,7 1 D„\. 

c" = [/„ /„]. 0, = [() 0 

c\ = (/„ 0 0 ] 

—r 

and .7 denotes the upper bound on J. Although qi is not measured, 
a bounding function of measured variables can be determined from 
(5) for ||(l/« 4 1 )</- 2 II a* 

-J-tK <('hi«M + 1 ir" >,(, ' ,0) ) 

*41 

■ *[IKi|| + f " ( '"' o) * 'O) „ (117,11 + *>,(.!-,))] = —j <u . 

Hence, v?i (.r,, CO is a well-defined function only of .ri and Ci> □ 
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Proof of Theotem Taking time derivative of \ 2 along the tra- 
jory of the system given by (12) yields \ 2 = (7 - it i) 7 U_- 
+ („j — " 2 ) 1 ( — 7 _1 A 0 ^j — 1 H- r - t/J Jt follows from 

i \ i + 1 j and from Lemma 2 that 


\>M 2 + ll«('i)t|yj(i. U-n'lnlAr'f 

- (+-«l)'(-2 - Ml) 

+ ( j - m) 1 (-J -1 Ac-i - J-'D i + r- iiJ 

- \i IK i II* + II« (' i )||ri(>i O) - m 1 (i i )A 1 'i/i 

+ (_| - Ml ) / [M2 -m 1 - / 1 A H (11)] 

4( - hj) 7 (-) - m - I 1 A i - 7 '£> + t — iij) 

i -A) ||f 1 11^ - <\w(rji )||n (i j )||‘ - n( i - m ) J 

- »( i - H )’ 

+ ||l<( 1 )||f-1 ( I I ) - « ( ( I I )A / ‘llu + ( l-lll)'ll2J 


+ 11 1 - Mil 

[III M I III 


A ' " ( i i ) ] + ( - ii r. + || . - li 


v — viIK iII - 'iii'(i/i )l|» ( 11 )|| 

— n ( i — il i ) 2 — n ( — it ) + <] + ^ 2 *f f \ 

^ -\M 4 \V 


where A* = nnn {2Ai/A x (7v 7 ) in ( md z* — (o + *- + fi)/V* 

Solving the above hist Older diflerential equation yields 


1(0 < < K " ' M U, ) + <*(l-i ' " a s/— x. 

IhcrcfoiL \ and the output tracking error n (1 e < and <) are 
umfoimly ullimalely hounded and their convergence is exponential 
I lit uniform boundedness ol all internal variables can be estah 
lishcd as lollows bust 1 1 being bounded implies that both i - i/} 
and - it ate umloimly bounded Second it follows from 1 1 being 
boundtd that <j\ is bounded and hence i/jj is uniformly bounded It 
follows from (16) that || it i || < ll\ 1 y i ( 1 1 u) which shows that 
//1 is umfoimly bounded it t,i is bounded It follows from 92) that 

U = [A/ + D,Ji ~ r(,/l ' Vl) J 

(l +1) 

Note that i i (and therefore q\ and q \) is bounded Even if matrices 
J D [ are time varying we can show using integration by part 
(as did in the pioof of I emma 1) that C\ is a function of signal 
(l/s +1)( i 2 and its tillered versions through stable transfer functions 
On tht othei hand it follows from (1) that 

—}-r >/. = -" IA 1 [1/('/i )'/i + A (</i '/i) + A<yi]} 

The terms on the right hand side of the above equation can be 
shown again using integration by part to be functions of qi q\ 
and then hltered versions lhus we conclude from the above two 
equations that (,* and "1 are globally and umlormly bounded Now 
one can conclude boundedness of r i since m is bounded The 
boundedness of then implies boundedness of (, which in turn 
shows boundedness of u and therefore Finally boundedness of 
2 is used to establish in turn boundedness of (,i t\ t 2 t and 1 2 □ 
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*4 

The Use of Symbolic Computation 
in Nonlinear Control: Is It Viable? 

Bram dc Jager 

Abstract —To Help along the analysis and design of nonlinear control 
systems the NonLinCon package, an acronym for Nonlinear Control, was 
developed. This note addresses the usefulness of symbolic computation, 
and of the NonLinCon package in particular, for the symbolic analysis 
and design of nonlinear control systems. The symbolic computation 
program Maple Is used as computing substratum. Textbook problems 
show that the NonLinCon package can be used successfully. A larger scale 
problem Is too complex, however to be solved with the current versions 
of NonLinCon and Maple. The conclusion is that symbolic computation 
is a viable approach for textbook problems but not yet for more complex 
ones, 

I. Introduction 

The use of symbolic computation for control purposes is inves¬ 
tigated by several researchers. Zcitz et al [1] applies the program 
MACNON, based on Macsyma, to analyze observability and reach¬ 
ability, and to design observers and controllers for nonlinear systems. 
Blankenship [2\ also used Macsyma to solve some control problems 
with his implementation CONDENS, The use of Maple for several 
control problems is reported in [3J. Some problems reported in 
this note, e.g., with solving partial differential equations, are partly 
resolved in [41. They describe a Maple package, here called NonCon, 
that can compute, e.g., the zero dynamics and provide solutions for 
exact linearization problems, In this note we illustrate the use of 
this package by applying it to some textbook problems and a more 
complex one. 

The main goals and contributions oi this note are 

• a proof of the viability of symbolic computation for some 
problems in the analysis and design of nonlinear control systems 

• to show the characteristics of a prototype implementation 

• to give some examples and to document some applications 

• to discuss directions for future research 

• to familiarize a larger audience in the control community with 
the use of symbolic computation. 

The note is structured as follows. First, Section 11 formulates the con¬ 
trol problems and presents the algorithms used. The implementation 
in NonLinCon of these algorithms is treated in Section III. Section IV 
presents textbook examples for some problem areas. A more complex 
example is treated in Section V. Section VI closes with conclusions 
and gives directions for future research. 

II. The Problems 

Of several areas in nonlinear control, where symbolic computation 
is likely to be of some profit, we discuss the computation of the 
normal form, the zero dynamics, and the input-output and state-space 
exact linearization. In the presentation of these problems, we follow 
Isidori [5|. 

A. Preliminaries 

We start with a nonlinear model of a plant and issume that it can 
be described adequately by a set of nonlinear differential equations, 
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affine in the input v, and without direct feed-through from input to 
output 

x = f[x) + g(x)u % y = hU) (1) 

with state vector .r £ R", containing all necessary information of the 
plant, input vector u £ R m , and output vector y € R m . The number 
of inputs is equal to the number of outputs, i.e., the plant is square. 
This assumption is for convenience only. Part of the theoi^ can also 
be derived if the number of inputs is not equal to the number of 
outputs, The vector field / is smooth, g has in columns g , of smooth 
vector fields, and h is a column of m scalar-valued smooth functions 
h,. 

The type of control law chosen is static state-feedback. Therefore, 
the value of the input vector u(t) depends on the state jr(t) and a 
new reference input vector e(t) as 

» = rtU) + (2) 

where the components n, and ii tJ are smooth functions. 

For nonlinear models it is appropriate to allow for a nonlinear 
change of coordinates 

- = (3) 

It is required that Ihe Jacobian d^/D.r of the transformation is, at 
least locally, invertible for $ to qualify as a change of coordinates. 

B. Relative Degree 

The nonlinear model (1) is said to have a vector relative degree 
{/‘i, ■ ■ ■. r„ ( ) at .r = .r" if 

1) L t]j L } h ,(.!■) = 0 for k — 1, - ■ ■, r, -2 (?, j = 1. ■, m) and 

all .r in a neighborhood of r", 

2) the following m x m matrix is nonsingular at .r" 

[ L, n Ly ^M.r) ■” 1 /i j (.r) 

A{r) = 

Li L, h 1 l Li m h„,{.r )J 

Here L k f h,(.r) means the Ath successive Lie derivative of the scalar 
function h,(.r) in the direction of the vector field /, e.g., Ljh,(.v) = 
{dh t (j m )/dx)f(A m ). The matrix .4 is sometimes called the decoupling 
matrix. The relative degree can also be interpreted as the number of 
times the outputs have to be differentiated before the input explicitly 
appears. 

The models we consider do not necessarily have a relative degree, 
either because the first condition cannot be satisfied or because the 
matrix .4 is singular at ,r". 

C\ Normal Form 

When the model has a well-defined relative degree we can use a 
change of coordinates (3), with i = (£. »/), to transform (1) to the 
normal form 

Ih = /j,U) = f, 1 

£ = £ 

C = Mf. »/) + 53 n .j(f- for ' = 

j -i 

»)< = '/,(£, '/) + ;>.(£, n)» for/ = r + l, -.n (4) 
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ith r = E',=i r > and 

>1 ) = I-q, h,w 1 ((.* I /)) 

'/) = L'f’h ,($ _1 ({. //)) for /. j = 1. • ■ •. /». 

iic terms are the entries of .4 and we can compactly write |with 
mly equations containing the input u in (4)] 

=!>((.. 'I) + A(t. .;)« (5) 

V = </(f. v) + p(f- 

where £ ( ' * contains 1 , > = 1,• • ■. m. The equation for t) is called 
ihe internal dynamics. Because .4 is nonsingular if the relative degree 
in well defined, the control 

u = A~ l (v - h) (6) 

with the new input i», is properly defined and linearizes the part of 
model (5) that is visible at the output. 


D Zero Dynamics 

The zero dynamics problem is: obtain the dynamics of the model 
when the output }/ is required to be zero for all f, by a proper choice 
ot initial state .»■(()) and control input it* (1). Here we have to employ 
an appropriate static state feedback and use proper initial conditions. 
More specific: we are looking for the locally maximal output zeroing 
submanifold and its associated dynamics. 

When the model has a well-defined relative degree, the zero 
dynamics follows from the normal form, by substitution of the output 
nulling input a* and using the property that the states £, can be set 
to zero in the internal dynamics. 

For models without a relative degree the zero dynamics can be 
computed by using the Zero Dynamics Algorithm. The way this 
algorithm works is by considering a sequence of nested submanifolds 
U,, with M, D \f, 4 i and A/n ~ /i _l (0), i.e., the first submanifold 
is the inverse image ol the point ij — t). When some conditions 
are fulfilled this sequence converges to the locally maximal output 
zeroing submanifold Z* in some neighborhood of i " and there exists 
a mapping u* such that /*(.i) = f[.r) + (f(.r)iF {.i ) is tangent to 
Z*, The pair (Z*. f A ) is called the zero dynamics of the model. 
When the mapping FT (a ) is defined in a neighborhood l of by 
Z* fl r = \.r G l’: //(./ ) = 0} the input iG can be computed as 
the solution ol L\U{.r ) -f L,, H (.r) u* = 0. For further details ot this 
algorithm and the conditions to be imposed to guarantee convergence 
of the sequence to Z\ we refer to [5, Section 6.1). 


E. Input-Output Exact Linearization 


The input-output exact linearization problem is: find out if it is 
possible to transform model (1) to a linear one by stale feedback 
(2) and compute this feedback. The linearity properly should be 
established between the new input c and the output ,</. Formally, we 
are looking for a neighborhood V of .r" and a static state feedback 
such that for all k > 0 and all 1 < /. j < in the expression 
L {l , 1)t L k f f 1/n h,(.v) is independent of .r on l\ 

For models with a well-defined relative degree, the input-oulput 
exact linearizing feedback follows from the normal form and is given 
by (6). 

For models without a relative degree, the static state feedback that 
solves the problem can be computed with the Structure Algorithm. 
This algorithm uses a sequence of Toeplitz matrices 
|To(.r) ■■■ 7 \ (r) 1 


MiU)=- 


0 T 0 (.r) 


where 




T-jh i (*r) 
L L< n L j h ,,, (,r) 


L, h} I i h 
L'ln, I fl* , \ I 


for 0 < k < 2 n - 1. The conditions for existence ^in W , , 
in rank conditions on A/*(.!■). Within the Structure Wyo , ♦ 
feedback functions n(.r) and 4(.r) are constructed. Foi fuitlu, .u ■ 
of this algorithm wc refer to [5, Section 3.41. 


F. State-Space Exact Linearization 

The state-space exact linearization problem is: under which ton 
ditions is it possible to transform the model (I) to a Imcai and 
controllable one by state feedback (2) and a change ol com dilutes 
(3)? The linearity property should be established between the new 
input v and the transformed state :. 

This problem has been solved. The solution is only valid for models 
with a well-defined relative degree and requires the existence of (syn- 
thelic) outputs for which the model has a lull order relative degree, 
r — n. When this cannot be obtained, it is sometimes convenient 
to strive after a maximal relative degree. Then the corresponding 
input-output linearizing state feedback realizes a minimal dimension 
ol the internal dynamics. 

Using the Lie product 


[/■- 




define the adjoint ad recursively as ad*</, — [/.ad* 1 r/,) with 
adThen define the distributions 

G, = span {ad/f/yi 0 < k < i. I < j < m} for ()</<.»- 1. 


We now state the conditions for a solution [5, Theorem 3.2.4). 
Theorem I. Given the model 


= /(.r) 4- cy(j-)M. Jr G R". u F R’" 


with rank <r/( .r") = in . There exists a solution for the state-space exact 
linearization problem it and only if 

1) C» # has constant dimension near ,r" for each 0 < / ^ n — 1 

2) G u -\ has dimension n 

3) G, is involutive for each 0 <?<»/— 2. 

Here, involutive means that the distribution is closed under the Lie 
product, i.e., the dimension of the distribution ft, does not change 
when a vector field, generated by the Lie product of each combination 
of two of the vector fields in C»',. is added to the distribution. 

When the given conditions arc fulfilled, there exist solutions A,(./■), 
/ = 1, • • -. m, for the following partial differential equations 

L, h L) A,(.i ) = 0. for 0 < k < r, -2 and 1 < j < *"• 

Also r, — n, where the set of integers (rj. is Ihe 

relative degree vector. The w functions A, can be computed, based on 
a constructive proof of Theorem I. Using the functions A, the change 
of coordinates ( T> and state feedback a = n(.r) 4- 4(.r)r follow. 


III. Solution with Symboik' Combutaiion 

An analysis of the algorithms shows that symbolic compulation 
programs should be able to compute the Lie derivative and Lie 
product, the Jacobian, the rank, the Gauss Jordan decomposition, the 
inverse of a matrix, and the determinant of a matrix; to do matrix- 
vector and matrix-matrix multiplication: and to lest involutiveness. 
More mundane facilities like symbolic substitution are needed also. 
The main problems are the computation of solutions for sets of 
nonlinear equations, e.g., to compute the kernel of a mapping, and 



to 
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Fig, 1. Structure of NonLinCon. 



Fig. 2. Robot with two rcvolutc joints. 


for sets of partial differential equations. These problems present the 
major bottlenecks for the symbolic solution of our problems. 

Solutions for the problems described in the previous section 

1) the normal form 

2) the zero dynamics 

3) the input-output exact linearization 

4) the state-space exact linearization 

are included in NonLinCon. For problems 1) and 4) the model should 
have a well-defined relative degree; for 2) and 3) this is not necessary. 
Implementations of other algorithms, e.g., the Dynamic Extension 
Algorithm and a method to solve partial differential equations, are 
included in this package also. The structure of the implementation is 
sketched in Fig. 1. For a thorough discussion of the implementation, 
see [6], 

Above we noted that a main problem was to compute solutions 
for sets of nonlinear equations. In Maple a facility is available to 
solve sets of nonlinear equations, namely the solve function, but 
this facility is not always sufficient. We did not try to improve this. 

We also noted that another main problem was to solve sets of partial 
differential equations. In Maple almost no facilities are available for 
their solution. To improve this, the following route was chosen. 

The partial differential equations we have to solve are from the 
“completely tntegrable” type; jn other words, based on Frobenius’ 
Theorem, we know that a solution for the partial differential equations 
exists, To compute the solutions Frobenius’ Theorem itself is of 
no help. This problem was solved by computing the solutions 
with an algorithm based on a constructive proof of Frobenius’ 
Theorem, The procedure is as follows. The solution of the partial 
differential equation can be constructed from the solutions of related 
sets of ordinary differential equations. Because Maple provides some 
facilities for this type of problems, the dsolve command, the 
problem seems solved. The dsolve command is not very powerful, 
however, and is often unable to present a solution, although this 
solution is known to exist. Therefore the dsolve procedure was 
extended in an ad hoc way, to handle a larger class of problems, by 
setting up a recursive procedure to solve sets of differential equations, 
starting from the “shortest” (assumed to be the simplest) equation, 
substituting the solution in the remaining equations, and so on. No 
effort was spent in trying to detect a (block) triangular dependency 
structure in the set of differential equations, which would be a more 
rigorous option. Despite this extension the solution of the partial 
differential equations is often unsuccessful, so NonLinCon cannot 
complete the computations. 

A minor problem was that some standard Maple linalg func¬ 
tions are only suitable for rational polynomials. This was too limited 
for our purposes. Therefore, the rank and implicitly the gausse- 
llm and gauss jord procedures, were extended, so a larger class 
of problems could be handled. This resulted in the new functions 
extrank, extgausselim, and extgaussjord. 


IV. Textbook Problems 

To illustrate the use of NonLinCon we consider several examples. 
The first example computes the zero dynamics of a single-input 
single-output model with a well-defined relative degree. For the sec¬ 
ond example the zero dynamics is computed with the Zero Dynamics 
Algorithm, For the last example the input-output linearizing state 
feedback is computed with the Structure Algorithm. The last two 
examples are based on models that do not have a well-defined relative 
degree. All examples aer contrived ones. The first example is taken 
from [7, Example 12.431 and the other two from 15, Examples 6.1.2 
and 5.4.1 J. 

Example I: The model for a robot with two revolule joints (see 
Fig. 2) can be derived from the kinetic and potential energy T and \ r 

2T = m j ll 0? 4- H- /jj (0i H- Hy ) 2 

+ 21 1 1 2 (0 1 -F il 2 ) cos 0 2 ) 

V = —f/{(m i 4- nu)l i cos0i 4- nuh cos(0i 4- 0j)(. 

The inputs ii j and it 2 are the joint torques and the outputs ij\ and 
y -2 the joint positions. The state ,r of the model corresponds with 
the dcgrecs-of-freedom and their derivatives in the following way 
.v 1 = [0i 02 0] 0_>). The aim is to derive the dynamics of the 
model when it is constrained. There are two cases, the first one with 
y = I/, 0 1 constrained to 0 and = 0, the second one with 

y = if 2 = 02 constrained to 0 and u\ = 0. According to |7] the zero 
dynamics are given by (with all model parameters set to one) 

jj — .r 4 .i ’4 = (f sin .r -2 

respectively 

= j i i, = (3/5)i/sin.!■]. 

The following log of a NonLinCon session shows that these results 
can be reproduced. Two functions are used: normform to compute 
the zero dynamics and transform for the inverse transformation. 
The log for the first case, where y = 0i, is found at the bottom of 
the next page (as marked by (x)). The second case for y = 0^ is 
found at the bottom of the next page, as marked by (y). The results 
coincide with the ones given above. 

Example 2: The model of the system is 



' .I’; ‘ 


■ 1 

o - 




= 

X\ 

■I’1.1-1 

.l’ r , 

4- 

.r.i 

0 

J‘2 

1 

.r-2 
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•n 

/a 


- .11 . 


.1 
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This model has no well-defined relative degree at .v° = 0 because 











lhbfc TRANSACTIONS ON AUTOMATIC CONTROL VOl 40 NO I JANUARY IW 


singular for t j = 0 The zero dynamics is given in 15] by 
- m and the zeroing input by 

‘ 0 
— i 

! ht next edited log of a NonLinCon session shows that the zero 
'ynimics can be computed From the functions supplied we only 
med e>tnormform that implements the Zero Dynamics Algorithm 
is shown at the bottom ol the page (as indicated by (z)) The term 
omputed last (for n) represents the zero dynamics The results 
coincide with the ones given above 
Example 1 The model of the system is 

»i + 1 1 

'Hi 

I — * i + n + a // = 

0 

+ ' 1 J 

This model has no well defined relative degree because 



is singular tor all i According to [^] the necessary feedback foi 
input-output linearization is i/ = n( i) + i( i )i wilh 

-i I + M 

! t i + 2 r j i + t i i i 

The following shows that NonLinCon can compute the input-output 
linearizing state feedback From the turn.lions supplied we only need 
i ut In that implements the Structure Algorithm which is found 
it the bottom ol the next page The computed slate feedback agrees 
with the previous lesult 



Fig 3 One Lrack model ol a vehicle 


V Vt HIC 1 b SlMUl AHON PROH1F M 

This more complex problem is derived from an inverse simulation 
problem in multibody dynamics A vehicle is required to perform 
a standardized maneuver To simulate the maneuver the inputs to 
the model must be known Normally the maneuver is such that 
a suitable selected output of the model tan be set to zero Often 
this output does not fully determine the behavior of Iht model and 
the remaining freedom represents exactly the zero dynamics If this 
dynamics is stable then the simulation is also stable if it is unstable 
then additional measures are needed for a stable simulation 
As an example we use a simple two dimensional one track model 
ol a vehicle with traction and cornering forces acting on the lires It 
looks like the model of a bike because it only considers (he center 
line of a motor vehicle (sec Fig 1) 

The required maneuvci is steady state turning the longitudinal 
speed ol the center of mass U is constant and a point P on the 


i normierm+ 
p r <_ lyr am j c i ° 

L r t a [ 1 ] ck t eta Ml, Pt a [ ]d t qnv am (et a 11 ] ) 1 
output nulln j mj-ut r 

2 (x) 

[ in(utd[l]) ctj[;] + ccs(eta[l|) qrav sm(tLa[l]) 1 

* t idn^fm m + 

inver < transformation 

{/[ 1 '[-1] ota[>], x[l] retail!, x t 0 zeta[2]} 

* nc tmform * 
zero dynamic- 1 

[eta[]]dot - eta M M ctaM|dct_ - 3/^ qrav in(«.ta[]])] 

output nulling input 1 [ L/ r qrav sm (etdll] ) ] (y) 

+ ti in^icrm + 

inverse t r insform at mn 

{<[4] zet a[?] t x[3] etj[2], / [2]-zeta[1], x[1] eta[l]} 

+ p\tnoimform * 

1st step, output zeroing submanifold 
[\[2] - 0, x[l]- 0] 

Pnd step, output zproing oubmamtold 

MMJ - 0, x[l] - 0, x[4] - 0] 

3rd step, output zeroing submanifold 
[ x [ 2 ] - 0, \\l] 0, x f 4 ] - 0, x [5] - 0] 

3rd r tep, matrix L_gH has full rank m 
the zero dynamics (zerodyn) 

{x[1]dot-0 r a[ 2]dot-0,x[3]dot—x[^1, x[4]dot-0,x[5]dol-0} 
the zeroing input (uzero) [ 0, - Ml J 
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center line should describe a circle of specified radius. The specific 
problem is for which distances p, from P to M t the zero-dynamics 
is stable or unstable. 

The equations of motion of the model are 

mx ni = '-F/ sin (A + 0) — F r sin0 + Ft, cos0 
rojf*, = +F/ cos(A + 0) + F, cos0 4* F, sin0 
jti - oFj cos A - bF, 

with inputs Fu (traction force) and A (steering angle). The three 
degrees-of-frcedom j-, in y ln , and 0, are, respectively, the coordinates 
of M and the orientation of the vehicle with respect to a fixed 
reference frame (r*?, ^), indicated by the superscript 0. The steering 
angle A and the drift angles a f and ns are given with respect to a 
body fixed reference frame. The vehicle has mass w and moment 
of inertia J with respect to A/. The forces acting on the vehicle are 
the traction force F t , and the lateral tire forces F/, F,. The last two 
forces can be expressed in the drift angle and the normal tire force 
F„ by the so-called magic formula for pure slip [8] 

F(F, m a) = Z?(F V ,)sin(C&tan(£( - F(£$ - atan(£()))) 0 ) 

with f, = n + Sk. The dependency holds for both front and rear. The 
parameters in this formula have to be fitted to experimental data. The 
drift angles can be expressed in the degrees-of-frcedom and steering 
angle by 

a/ = A - atanlr/'y/t'J'j ), n, = -atan( rjy/r/,) 
with 

r) = B 1 (Z + Hi) ). r! = R l (.r", + ml ) 

and 

«COS0 -sintf 
^“[sinfl cos 0 


where the superscript 1 indicates the coordinates with respect to the 
body fixed reference frame (fj, rj,). The dependency of the lateral 
tire forces on the degrees-of-freedom and the steering angle make 
this model quite nonlinear and difficult to analyze. 


By choosing the outputs as 

y I = ■»*,» cos 0 + j/m sin 0 - 1 rf 

U2 = (j’im + j)cos0) 2 + (ym + psin0) a — 

with R,i the desired radius of the circle and 1"/ the desired longitu¬ 
dinal speed, the required maneuver corresponds with a zero output. 
The output i /2 depends on the distance p from P to M. To solve 
the problem, first compute the zero dynamics and then find out its 
stability as a function of />. 

The first problem encountered in computing the zero dynamics is 
the nonaffine character of the equations of motion with respect (o 
the steering angle A. To overcome this problem an integrator can be 
added to the model and A can be regarded as a new input. Then the 
model has a singular decoupling matrix A , however, which makes 
the analysis more difficult. The extnormform function can be used, 
but this leads to insurmountable problems in the computation due to 
huge memory needs. Another solution is to add another integrator 
and use F ( , as a new input, making the decoupling matrix regular 
by “delaying” the input Fu also. The extension with two integrators 
allows the use of the standard transformation to the normal form 
with the function normform, although for a larger model. Due to 
the larger number of states, again problems arise with the memory 
needed. 

To simplify the problem another “saturating” function was used 
in (7), \f instead of sin(alan(■)). Now, the transformation can 
be computed, but the inverse transformation cannot, because ol an 
artificial limit in the size of objects allowed by the software, putting an 
untimely end to the computation. Results are therefore only available 
in terms of the old coordinates .r, but not in o. 

Another attempt to solve the problem of nonaffine input is not to 
extend the model, but to simplify it so that the input A appears linear 
in the equations. To this end, the assumption that A is small has to 
be adopted. Then, by using a Taylor series expansion, the system of 
equations can become linear in A if the dependency on A of the Iron! 
lateral tire force is dropped, so A should also be small compared with 
uj , an unrealistic assumption. This time the transformation to the 
normal form can be computed, but the set of nonlinear equations, 
needed to compute the inverse transformation, could not be solved, 
not due to memory constraints but due to limitations in the solve 
function of MAPLE. Further model simplifications are, of course, 
possible and may remedy this. 


* moutJ in * 

the i/o linearized system dynamics: (fl, gl) 

2 3 

[ x[2] + x[l], -2x[1]x[2] - 2x11], 0, x[1]- x[31, 

2 

x [5] + x [3] ¥ x [ 1 ] x [ 2 ] - x [2] x [ 3] J 
[00] 

10 1 ] 

[10) 

[10] 
f x[2] 0 ] 

the explicit feedback: (alpha, beta) 

3 

[ x[1] - x [3], - 2 x11] xf2] - 2 x[l]- x[l] x[3J ] 

[10] 

[01] 
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Still another possibility is to use an algorithm that does not require 
model with affine inputs, see [7] This has not been implemented 

i 

oncluding several attempts to completely solve this problem were 
vain due to 

• limits on the maximal object size 

• limitations of the solve function 

• limits of the available physical (or virtual) memory 

I he first two problems should he resolved by the Mafif developers 
I he last problem is not so easy to solve because the memory 
uquircments are likely to be exponential or double exponential 
with respect to the problem size [9) Moie efficient algorithms are 
necessary but some problems will still be computationally intractable 
Numerics can be helpful e g, to solve sets of nonlinear (partial 
differential) equations Foi the vehicle problem we resorted to a 
purely numeric approach to find out the stability of the /ero dynamics 
We propose to use this problem as a benchmark or yardstick tor the 
viability of symbolic computation programs in nonlinear control 

VI Conclusion and Discussion 

rhi computation ol the normal form ol the zero dynamics ol the 
solution of the inpul-output and the state space exact linearization 
problems can be automated by using symbolic computation c g 
by using the Non! inCon package At the moment the computations 
c miiot bt done foi complex models due to the limited capability 
to solve sets ol nonlinear (ditleiential) equations Thcrelore the 
designers ol control systems cannot yet routinely compute solutions 
toi Ihese problems using tools based on symbolic computation 
programs fo icmedy this wc recommend extending the capabilities 
of symbolic computation programs for solving large and intricate sets 
of nonlinear (differential) equations It is also necessary to use more 
efficient algonthms Future lesearch will rhcietore aim at 

■ deusing new or modifying existing algorithms to be more 
efficient in space and time 

• implementing the algorithms in a more efficient way especially 
with icgaid to computci memoiy requirements 

• investigating a merge ol symbolic wilh numeric computation lor 
a hybrid solution of tht problems 

• solving more snmll and larget scale problems, to further guide 
in the selection of pressing lines of research 

lo come back to the title ol the nole at the moment we cannol 
deny noi confirm the unqualified viability of this approach It is 
expected that within this decade the capability and efficiency of 
symbolic computation programs aie enhanced and that the increasing 
computing powcr/cost ratio of computers will enable the solution of 
the benchmark and larger problems at reasonable costs 

AcKNowLrnc.MFNr 

H van Essen implemented most of the algonthms in Mapll 
I van dc Broek supplied the vehicle simulation problem 
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Note on Decentralized Adaptive Controller Design 

Joon l you 


Abstract — This note extends the decentralized adaptive controller 
design of Gavel und Siljak [II to the case when the relative order of 
each isolated subsystem does not exceed two 


I Iniroduc 11ON 

Recently Gavel and Siljak |1] presented a stable adaptive de 
centralized design in the face of unknown interconnection strengths 
and under some structural conditions placed on the interconnections 
As a mam result they suggested a decentralized high gain feedback 
scheme toi model reference adaptive control (MRAC) ol an uncertain 
interconnected system which is composed ol single input single 
output subsystems and in each subsystem of which either incoming 
interactions enter through the control channel (Class I) or outgoing 
interactions pass thiough the measurement channel (Class 0) How 
evei the design technique for the Class 1 has the disadvantage of 
disconnecting the interconnection signal from the input of the plant 
And moreovei the mam result is restricted to the case when the 
relative order of each subsystem (n*) is equal to one (Case 1) 

This note is concerned with developing a decentralized MRAC 
design lor an uncertain interconnected sysiem which belongs to the 
Class 0 and allows lor the iclaxed assumption that n* is less than and 
equal to two Since in case that n* — 2 (Case 2) it is not possible lo 
choose the reference model strictly positive real any longci the Case 
2 needs to be treated in a different way from the C ase I 
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For later comparison, the main result of [1] for the Class 0 and 
Case 1 is summarized in the following. Each subsystem controller 
consists of a precompensator Cf* and a feedback compensator Cf 
leading to the open-loop subsystem description 

Si: = A,j\ -I- b,n, + /,(L 11) 



(1) 

Cf: i,„ = F,, i lt , + g,u, 

(2) 

('!’■ if. = F, ; ,, + (/,i/,, 

(3) 


for / = 1. 2, • ■.. .V, 

Where j*, 6 R u '. E and if, E I ?" l ~ 1 are the states of 

5 m Cj\ and C /, u, and y, are the scalar input and the scalar output 
of the subsystem S, , and y -- [1/1 . </ 2 - ■ ■ ■. y x] 1 - Also, /, is a vector 
of output-dependent disturbances affecting S, and satisfies the conic 
sector bound expressed by 

/\ 

//)|l < (4) 

J -1 

where C/s are unknown but positive constants. 

The local adaptive controllers arc given by 

ii ( = ()f v i% / = 1, 2. ■ ■ ■ ,A\ (5) 

where r # ] 1 is a vector of available signals, and 

0, = [///,, r/ lf , c J f , A’,,,] 7 is a vector of adaptation gains. Also, /■, is 
the reference input, and y, = y, - y ini is the model output following 
error with y,,,, being the model output. 0,(f) in (5) is automatically 
adjusted by the adaptation law to levels that assure stability of the 
overall system. 

0, = + v,ih)* (f») 

where T, is a symmetric positive definite weighting matrix and <r is 
a positive decaying constant. 

II. Main Result 

To handle the Case 2, the basic control structure is modified so that 
the strictly positive realness of the closed-loop isolated subsystem be 
preserved, and the following local adaptive controllers are devised 
based on the design concept of [2]. 

u< = 0/ v, 4- 0,* #i, - sgn(</, )(p,\y,\ 4- ) (7) 

O. = -T ,(<r0, + Vui .) 

P, = («) 

'J'li = + v t 

*2, = -/l,^ 2, + |r.|. (9) 

where h , is a positive constant, - M is a positive weighting factor, 
and sgn(-) means the signum function. In (7), p,(t) is introduced 
to override some destabilizing effects by the interconnections from 
other subsystems to the / the subsystem. 

Applying (7) to the nonminimal representation (1), (2), and (3), 
the closed-loop interconnected system becomes 

5: .r, = A,(0:).r, 4- b^! i/ f + (J/ - (9,° # / f ) 

- sgn(i), )h,((i,\y,\ -f- - p*|y f |) 

- sgn(,</, )b,f>*\ih\ + Mv>« “ Mill/'"' +/i 

= .4,(0*),r, + - sgn(y,)A,/' h02i9ji 

- 4-M*,i\ - +/ 

=s r/.r,, i = 1, 2.■ ■ ■. ,Y. 


where 

_ f-T JT . 7 iT 

i ~ l*' i ’ -pi« -//J i 

u+ft.^cr i,c;j 6,pjn 
.4, (0 = ftrfj.r? F, + (),(■*! g.rf 
g, i", r 0 F, 

i>, = nr. 9 ?, 0 ] r . = [c-; r . o. 0 ] 

/ = [/, T .o.o] r , = 

<t>i, = p'> o, <j>i, = n, -1>:, (id 

and is a vector satisfying {r/ («/ t - .4*)“ 1 6, }(a 4- /*,) = 
{r/ (.si, -.4, ) _l /?,/-/?}. Note that for 0 , = &\ t or 02 ,, (rf+/i,)o,(*4- 
/»,)-' is functionally equivalent to { 0 , 4- 0,(* + />,)”'} [2J. In 
(11), and (>*, are unknown parameter vector and unknown positive 
constant to be defined later for which 0,(0 and />,(0 are estimated, 
respectively. 

Corresponding to the plant (10), the reference model chosen to 
be stable and of minimum phase, and such that 11 * - 2 can be 
expressed in a nonminimal representation fl]. That is, the model 
transfer functions y \„,(.s), i = 1. 2, ■ • ■, X can be realized as 

M: = -4,(0* +h,k*'r t 

i = 1, 2. -, V. (12) 

Defining the tracking error v, = .r, - the closed-loop error 
equations are derived by subtracting (12) from (10) 

5,: 0 = -4,(0*)e, + i< - Sgn(//, )i ( /*«0 2 .#2. 

- sgn(y,)b,i'Hf>*ty 2 i ~ b.d},y nn + /, 

!/, = (■! i = 1, (13) 

where the transfer functions {<’■/ («/, — -4,)~ 1 ft,/*/,) = y,„,(.s + 
h,) for all /, are stricily positive real. Hence, from the Kalman- 
Yacubovich Lemma [I], it follows that (-4,. b,rH< <‘! ) satisfies the 
equations 

AjH.+d.A, = -(A’ + 2)7, 

H,b,j>Ft = f‘i. (14) 

for symmetric positive definite matrix H, and the identity matrix 
The stability of the overall error system (13), 0i, = 0, and 
02 , = p» in (8) is then established through the following theorem. 

Theorem; The solutions (c m <£i,.02i) {f; 0 \,(t 0 ), 

0*21 (ft')}. t = L 2. • • •, X of the system S, are globally uniformly 
bounded. 

Proof: Let us choose a Lyapunov function candidate as 

yv 

V(f, 01 , 0 2 ) = H,<, +^i,rr , 0 1 , + -,,“’(02, ) 2 }- (15) 

,=i 

Evaluating the time derivative of V along (13), (8), and (14), and 
then taking the norm operation, we use (4) to get 

V ' * - 2 INI* - 2/»riy.|®a. - 2rr||o 1 .|| a 

+ 2||f ( ||||j l i.|||rfJ ( ||y 1111 | 

r- • 


( 10 ) 


( 16 ) 
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Fill,. 1. Model output following errors. 


Neglecting the negative term from (17), we ohmin 

t , N 

V < £HM a - <t||<?i.||' - 

■ +/C + HKir + i r j . 

Hence, V < 0 outside some compact region whose size depend on 
the strength of the interconnections, local design parameters, and the 
maximum values of the model outputs. Therefore, all signals arc 
bounded according lo Theorem 2.24 of [3|. 

111. Numerical Example 

Consider the unstable linear constant interconnected system with 
unknown system parameters described by 



Pig. 2. Adapted gains for the subsystem 2. 


Note that sgn( fn U 1 , H-,b,pu — sgn(f/,)i/, = |//,|. Completing 
squares. (16) is modified as 


•n = 


0 1 
-4 0 


J’j + 


tt i + 


//. = [4 2 ].!■, 

fO 1 


2 0 


■r j + 


v 2 + 


1 

-1 

0.2 

- 0.2 


ih 


y-2 + 


fj2 = [2 0].r ±. 


Note that v\ = 1 and = 2. Let us choose the reference model as 


y’n.lf") = 


(« + 4) 

(7+2)*;. 




I 

( 7 + 2 )*' 


where reference inputs arc square waves of height 5 and period 10 s. 

Now, using the proposed decentralized udaptive scheme, computer 
simulations are carried out for MRAC of the example plant. The 
results with design parameters in (2), (3), (6), (8). and (9) 

F, = -4. !}l = l. F, = /,. / — 1, 2 


i'-<t -{tdl.-.H-II^IIC.Iwl)*} 


+ 2, < ;|y ( -|*-2 / .;|^|#*.-||r i || a - < r||o„||* 

- <r|KII* - (llr.-ll - „,) 2 ; + //? - <r(||«..|l - IKllr 
•+HK||--«T(| f >,,|- / »*) 2 + rr(K) 2 ]. (17) 

where p* and //y are finite positive constants given by 

,i =i 


l‘, = |r/} ( |su])|,y„„| + ( 18 ) 


-,2 = 2. /|,.= 1. (T = 0.01. 

are presented in Figs. I and 2. Due to lack of space, adapted gains are 
displayed only for the subsystem 2. As can be seen in the figures, the 
output of each subsystem closely tracks the model output after a finite 
interval and all the adjustable parameters are bounded for all time. 

IV. Conclusion 

Decentralized adaptive controllers have been developed for a class 
of interconnected systems with unknown interconnection strengths as 
well as uncertainties in subsystem dynamics. The local controllers 
essentially have output error feedback terms to override destabilizing 
effects by the output-dependent interactions. Specifically when the 
relative order of each subsystem is equal to two, each subsystem 
controller is reinforced such that the transfer function of the closed- 
loop isolated subsystem be strictly positive real. 


In deriving Eili Ej“=i ll^il'tfjKI' = 

Ell, EyL-i II# j II' & 'S use d. The term (-2p*|;/, | #*;) 

in (17) plays the same role as the term (—2/>,<*/ r,c \= 2p,\y 2 |) 
in (5.27) of 11J. To override the destabilizing term (2/»*|ii| 2 ) 
which arises after aggregation of all the interactions affecting 
the z'th subsystem, must be positive and $21 > |;/.| for 
all /. This is the rationale in adopting the signum and absolute 
functions contrary to [ I J. 
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Characterization of Zeros in Two-Frequency-Scale Systems 

Hossein M. Oloomi and Mahmoud E. Sawan 

Abstract —We study the asymptotic behavior of the zeros of a two- 
frequency-scale transfer function In terms of the zeros of its slow and 
fast transfer functions. By Introducing the notion of type index, we will 
show that all zeros of a two-frequency-scale transfer function with the 
type Index of either if> 0 or i/= - I can be completely characterized in 
a manner that Is quite similar to the characterization of the poles. We 
also give a partial characterization for all other zeros in terms of the 
cardinality of their zero sets. 

I. Introduction 

Single parameter standard singularity perturbed systems have been 
traditionally studied in the time domain [1]. However, an equivalent 
frequency domain characterization now exists which allows these 
systems to be studied entirely in the frequency domain [2]-[3], Using 
this characterization, many important frequency domain problems 
including the two-frequency-scale (TFS) compensation design [4], 
the model matching problem [5]-[6], the control theory [7]-[8], 
and the Hankel norm approximation problem [9] have been solved 
recently. 

Study of the zeros of TFS systems, or equivalently the zeros of 
singularly perturbed systems, has not been vigorously pursued in 
the past. The obvious reason for this is that most often a property 
pertaining to singularly perturbed systems is ascertained in terms 
of ascertaining the same property for two lower order subsystems. 
However, when the property is taken to be the asymptotic behavior 
of the zeros, this deductive technique of proof seems to break down. 
The discouraging results in the following example demonstrate this 
point. However, despite the hopeless appearance of the results, we 
will show that much useful information can be obtained regarding 
the zeros of ft(.s, f) from the knowledge of the zeros of fts(*) and 
Hy(es). 

Example 1; Consider 


This is an example of a TFS transfer function with the slow and fast 
transfer functions H s(*) = 1 /(* -h 1) and Hri f*) = f*/(f.s 4-1), 
respectively. Zeros of ft(w, 0 are at ±j/>/f. However, the slow 
transfer function has a zero at infinity and the fast transfer function 
has a zero at the origin. In contrast to what one can say about the 
poles, these zeros do not seem to approximate zeros of H{s. f) in 
any meaningful sense. 

From the computational point of view, until now the Newton 
polygon test has been the only available tool for the study of the 
zeros of TFS transfer functions 110]-[111. However, this lest does not 
take advantage of the zeros of the slow and fast transfer functions. 
In contrast, in this paper the behavior of the zeros of ft(*, e) are 
studied in terms of the behavior of the zeros of ft s(*) and Hy{es). 
Since these two transfer functions have fewer zeros than ft(.s, f), 
one always gains a computational advantage by taking our approach. 
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Additionally, we believe that our results will shed light on the 
performance analysis of schemes such as TFS LQG/LTR [12] and 
TFS ftcx. 113]-[14] theories since performance of these schemes as 
well as many others are ultimately tied to the location of the system 
zeros in the complex plane [151. 

We will use the standard notation N . Z, Q, and ft to denote the 
set of natural, integer, rational, and real numbers, respectively. The 
superscript “4-” will be used to mean “non-negative.” For example, 
Z + = N U {()}. A\D denotes the complement of B in A , and 
Card {A) denotes the cardinality of the finite set .4. We use deg[.] 
and <*>[,] to designate the degree functions for polynomials and rational 
functions, respectively. The set of all TFS transfer functions is 
denoted by T,. When ft(.s, f) E T, we denote its slow part by 
fts(.s) and its fast part by Hf. (p) where p = f.s. 

We recall from Part 4 of Definition 1 in [2] that poles of a TFS 
transfer function are separated into two distinct groups. The first 
group of poles are of type 0{f r ) for some v E Q + and are called 
the slow poles. The second group are of type ()( I/O and are called 
the fast poles. However, as Example 1 shows, zeros of TFS systems 
can have more general expansions. To further explore this idea, let 
ft. denote the ring of functions of f that are analytic at t — 0 and 
let ft, [*] denote the set of all polynomials in * with coefficients in 
ft [*]. The following theorem characterizes the roots of a polynomial 
in ft M 121, [10|, [16]. 

Theorem 1: Let p(.* s, f) E ft[.s]. Then 

1) Each of the n roots of p(s, r) can be expanded about f = 0 as 

, i < # <», (i) 

where h tJ E C for each / and j, k E ft, V £ Z. and f ,/a is 
a branch of = f. 

2) 11 one root has an expansion as in (1) with k > 1, then there 
will be k - 1 other roots having expansions with the same 
coefficients but using different branches of — f. 

It is important to note that only a finite number of terms with 
negative exponents appear in the Laurent expansion (1). We will use 
Theorem 1 to classify the roots of a polynomial in ft, [a] according 
to their Laurent expansions as given by (1). 

Definition 1: A root *,(0 of /;(*, f) E ft. [*] is said to be ot type 
C)(f r ) with v = X/k, if expansion of *,(0. as given by (1), has 
h, \ ^ 0 and b n = 0 for j < X. If b,, = 0 for all j. then v is taken 
to be zero, u is called the type index of the root .s,(f). 

Note that, roots that are identically zero are classified as 0(1) 
type roots. 

We will also need the concept of lost poles. Lost poles were 
introduced in [2] in terms of the polynomial system matrix. They 
were later given a state space interpretation m [3]. The following 
lemma is a direct consequence of Definition 2.1 in [3]. 

Lemma I: Let H (.s. f) e T, have n j slow poles and nj fast poles. 
Then //(*. t) will have /i slow lost poles iff A[fts(s)] = n] - / 1 , 
I 2 fast lost poles iff fi\Hj ip)] = - /j, and / lost poles iff it has 

h slow losl poles and li fast lost poles with / = /1 4 - h- 

In the next section we will study the zeros of ft( s, f) e T, under 
the following two assumptions: 

Al) ft(.s, f) has at least one slow pole and one fast pole. 

A2) H(s . f) has no lost poles. 

Note that assumption (Al) excludes the single-frequency-scale 
case. This is no loss in generality since Ihe single-frequency-scale 
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case can be treated as a special case of the TFS case. Assumption 
(A2), on the other hand, is not unique to this paper and in fact it is 
the standing assumption in the previously published works [3]-[9]. 
The following lemma is a direct consequence of these assumptions. 

Lenuna 2; Under assumptions (A1MA2), the transfer functions 
and Hf(p) are nonconstant. In particular, they are nonzero. 

Proof: By (A2), l = 0. This implies h = l 2 = 0. By (Al), 
n\ > 1 and n 2 > 1. Hence, fi[Hs(s)] = n 1 - /1 = //1 > 1 and 
Mffi- , (F)] = »2-/a = i»2 > 1. Q.E.D. 


II. Asymptotic Behavior of the Zeros 

Let F(.s, f) G T, . Then //(«, f) can be written in the fractional 
form 


//(*, f) = 


MM 

A 7 (-s f)" 


( 2 ) 


where M(s, f). A r (,s, e) G [*] are relatively prime. We arc 
interested in studying the asymptotic behavior of the zeros of //(*. r). 
Definition 2: Define the following sets. 

■ The sets of zeros: Let Z l) H and Zf) denote the set of all zeros 
of //( s. f) at the origin and at infinity, respectively, and let 


Z//: = {s£C\H(s. r) = 0|. 

Zn{i' 1. i'j ): = {.s G Zji | s = Off"). u\ < p < Vi }, 
ZuM: = {* e Z,J | .s = Off")). 


where #(f) is analytic at f = 0. and A r i(.s, f ), aiKi s ( 

are monic polynomials (in *) with roots that have ivpe indium n ; 
sets Qp(0, oc), Qpi&). Qp(- 1). respectively. To obtain a j l; . 
useful representation for A~(*, f). let f‘'J M ,(f) denoie the ,[\\ u a 
of A r (.v, f) among those roots that have type index r. Assumims 
of such roots, we define the following two families of' polynomials 


L 


</..(«. f):= n<" ” 

J=r 1 

>' e 

14 t 

/ — 1 

^(s. f):= e~ l ' l ' / q„(s. f). 


( 5 ) 


We observe that f ) € R. [*] whenever if G and f,[s. f) G 
R t [#] whenever v G Qp\Q p. Use of (4M5) allows us to write 


.V,f)= JJ ' 

. 0. 

«eQj»( 0 ,nc.) 

hi 11 DO 

.V 2 (.v, 0 = qa(s. O = «"»» 

(* ■” Ty.n(f)) 


./- * 

:=.s""°.V 2 o(*. O. 


A',(«. r) = f) = 

(-S, f ) 


n 1: ~ /-1, 


(6) 


dj,o(0 7^11 

(7) 

( 8 ) 


where = deg[A’ ; (.s, *)] Tor j =1, 2, 4, and by definition 
Tj _i(0) ^ () [2|. Thus, A r (.s. f) can be factored alternatively as 


with the multiplicities included in each set. 
• The sets of zero indices: Let 


A'(.s\ t)~-f "V/(f)[A T i(.s, r).s” 0 ,, ]A : 2o(.s'. OA 7 i(a, <)■ ( 9 ) 


Q:r- -- {v | s G Zn k * = 0(f")}. 

Vi)'.— {v | * £ Z// cV.s = Off 1 '). "1 < 1/ < 1*2}■ 

• The sets of poles: Let P\\ denote the set of all poles of H (.s, <■) at 
the origin and Pn: -- {* G C | A r (.s*. e) = 0}. Let Ph(v 1 , 7 / 2 ) 
and Ph(p) be defined similar to Zn{i'\,u 2 ) and Z/-/(zs). 
respectively, but with Z// replacing Pji . Let multiplicities be 
included in each of these sets. 

• The sets of pole indices: Let Qp and Qp(v j, 1 * 2 ) be defined as 
Qz and Q/Ao 1 , 7*2 )■ respectively but with Zy/ replacing Ph- 

• For the slow and the fast transfer functions, define the set of 
zeros and the set of poles exactly as above but with Hs and 
Hr replacing H. respectively. 

We are now in a position to present our main theorems. Theorem 
2 shows how zeros of the slow transfer function relate to the zeros 
of the TFS transfer function //(*, f). 

Theorem 2: Let H(s.f) G T, satisfy assumptions (AIHA2). 
Then 

1) Zj[ s = Zy/ S (0), 

2) C(ird{Zi /,.) = Ctrrd(Zj](if )), 

3) Card(Z? ls ) = Card(Z% ) -f Citrd(Zn( 0. oc )), 

4) Every zero -(f) G Z/y(//) with G can be approximated 
as -(f) = ; + r{f) where - G Zn s and r(r) —► 0 as f —► 0. 

Proof: We first write //(*. f) in the fractional form (2). Our 
immediate goal is to express both 3 /(.s, f) and A r (.s. f) in terms of 
their primitive factors. Since A^(.s, r) has only slow and fast roots, 
its factorization is simpler. So, we proceed with A r (*, f) first. By 
definition, we have the disjoint union Qp — Qp{ 0, oc-)(J{()}U {— 1}. 
Therefore, we can factor S{s, e) uniquely as 


Next, we obtain a factorization for A/(x, f). Since each root of 
M{s. f) has an expansion of the form (1) given by Theorem I, we 
factor A/(*, f) as 

no) 

where /(f) is analytic at t = 0. and M,{s< r) for j — 1. -.5 

are monic polynomials (in s) and have roots with type indices in 
the sets QAl )._oc), {()}. Q z {- U)). {-1}. and Q/A-'X, -I), 
respectively. Since these sets are pairwise disjoint and have union 
equal to Qz . this factorization is unique. An alternative representation 
for A/(.s‘, f) is obtained by introducing the following two families of 
polynomials 

k t , 

p,Af )■ = ” f^, ; >(f))^ v G (11) 

f): = P * )* (!2) 

where f'^Vj .^r) denotes the jth root of A/(«, f) among those that 
have type index v . and denotes the number of such roots. We 
observe that f) G /!,.[«] whenever v G Qj. and y)^(«. f) E 
/?. [«] whenever 7 / G Q/\Q% ■ Use of ( 1 1)—(1 2 ) allows us to write 

A/i(.s\ () = f). 

vi-Q'/A i] ^) 

— <lf 00 

3/2(77, f) = />o(*. f) = A ' 1 " 00 (.H - a y .i)(f)), o 7 .u(f) ^ 0 

s uiw >M 20 (m, f), (14) 

f) = /vK f) — JJ P'^P'A#. O 

"Ggat-i.o) 

J] y),(-s. f): = r' , l A/;,(^f). (15) 

‘'€Qz(-i.0) 


N{*, f) = ^y(f )A 7 j (.s. f )A' 2 (-s. f)A" t (w. f). 


(3) 


: = f 
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M*[». 0 = p-i(n, t) = f):= t e), 

rn 4 : s !, (16) 

AA(j*. f) = p u (n. r)=f J| f) 

«'€^/(-oc -l) veQz(-o. ,-1) 

:sf' M JJ $*(«■ f). (17) 

«^€Qz( — fx< - 1 ) 

where Wj = deg[JV/j (/», *)] for j - 1, -*,5, and //a, //r, E AT by 
Part 2 df Theorem 1. Hence, A/(s, e) can be factored alternatively as 

A/(#, 0 = f‘” ( ' , ’ 3+,r,4+ ^°/(0[M I (.s, r)* r,,wi ] 

* f)A7t(*, f)A/ 4 (s, f)A/r,(.S, F), (18) 

and using (9) and (18) we can express tf (*, f) as 

f) = 

/(f) A/i(jr, Q* mufl A/ 2 o(*, f)Ai(a, QAA^ f) 

0(0 A r ](*, f)* u ooA r jo(.s. f )Ni(». t) 

where 

“>s —rti — (/I i + hi i + /in). (20) 


We are now in a position to analyze the limit of H( .s, f ) as f -> 0. We 
observe that as * approaches zero, each of the polynomials f), 

AAi(*\ f), and AAi(*, 0 approaches a finite constant limit, say K r 
j =3, 4, 5, respectively. Since H{s , #) has no lost poles, these con¬ 
stants are nonzero according to Lemma 2. Since H(s , 0 ) is defined, 
the polynomial /V 4 (.s, f) must also approach a finite nonzero limit, 
say A’'. Moreover, A/i(w, 0 -♦ n m », Jl/ a o(.s. f) —♦ A/ 2 o(». 0) = 

n;"« i r , " # ° ( * “ "*o(0)) where a j o ( 0 ) £ 0, AM*. 0 - a" 1 , and 

A^oK f) -*■ A t 20 (^, 0) = n”ir T,0 °( w - ^yn(fi)) where /*, 0 (0) £ 
0, Therefore, 

A/t(*. f)*"' 00 u 

—-—► s 

Al (,s, f)j? ,l oo 

where. 

As = (moo + hi i) — (nun 4 iii )■ (21) 


Finally, since Jf(w, 0) is defined, the quantity f^ /(0/flf(0 must 
approach a finite limit, say AMv'/n^i Aj, an d the light 
Lemma 2, this limit must be nonzero. Hence, 

„(». 0) 


ffs(*|:=J?K0) = A\‘ 


A 2 o( 0) 


( 22 ) 


Since all zeros of H s(0 are 0(1) type zeros. Part 1 of the theorem 
follows immediately. To prove Part 3, first note from (21) that 
although the quantities (moo 4 m i) and (iioo 4*»i) are both positive, 
they cannot be simultaneously strictly positive since otherwise, there 
would be at least one pole-zero cancellation at the origin and this 
contradicts (A2). Using this fact we observe that As <0 iff 
(noo 4 Hi) > 0 and (moo 4 mi) = 0. Hence, Card(Z^ lti ) = 
moo 4 mi = 0 and the result holds trivially in this case. On the 
other hand, As > 0 iff (moo 4 m i) > 0 and (iiuo 4 in ) = 0, and 
in this case 


C(trd[Zu s ) = m 0 u 4 mi 

= m 0 o 4 deg[A/i(*, r)J 
= CardiZu) 4 Card{Zn(0, x:)). 


This completes the proof of Part 3. To prove Part 2, we note that 

Car(HZn,) = Cardiff,) + deg[Jl/«,(*. 0)] 

= (hi oo 4 mi) 4 (m 2 — moo) 

= mi 4 m 2 
= £ C,ird{Z „(«/)). 

*Qt 

Finally, suppose -(f) £ Zn(v) for some v £ Q"p. Then i(f) is 
a root of the polynomial A/i(.s, OA/zt*, 0- Since ~(0 has type 
index v > 0, ;(f) is analytic at t = 0. Hence, -„(0 = ~(0) 4 r(r) 
where r(f) —* 0 as f —> 0. Our foregoing analysis shows that 
A/i(*, OA/ 2 (*, f) approaches s T,,1+fMl00 A/ 20 (*. 0), which is the zero 
polynomial of H s(a). Hence, c(0) e ffs(x). This proves the 
last part of the theorem. Q.E.D. 

The next theorem relates the zeros of the last transfer function to 
the zeros of the TFS transfer function H(s , r). 

Theorem 3: Let H(u, f) E T, satisfy assumptions (A1)-(A2). 
Then 

1) Zu h ~ Z'j iy U Z„ h {-1). 

2) Card(Zfi, ) = i n. in<-i \ <.'nrd(Z„{v)) 

— 1 ■») C(trd{Pn{v)), 

3) C<nd{Zl t ) = E„ e y,(-i x) CanUZuM) - 

L f g,,(-i Card(PnU')). 

4) Every zero -(r) E Z/y(-l) can be approximated as :(r) = 

;4r(f) where ; € (-1) and tr[f) —► 0. 

Frrw/'- Due to the space limitation, we only outline the proof. 
Using the notation and results from the previous theorem, we first 
express f) as shown by (23), found at the bottom of the 

page, where 

V = (>U 4 "2 4 n\) - (mi 4 m 2 4 ou 4 hi 1 4 \ 1 )- (24) 

By the repeated use of Part 2 of Theorem I, we observe thal as 
f — ♦ 0. f ,n 1 A fj(p/f< r) — ► i>" ,J and t n 1 A" y (/>/r, e) —> />" » for 7 — I, 
2, O - //"L O - ~ -i(0)) 

where rv 7 _ 1 ( 0 ) / 0 . A/i(/>/r. r) —► A’%, and Vi(/>/f, f) 
n*’ii(/ J - —1 (0)) where 7 ^ 0. Thus, taking the limit 

in (23) as f —» 0, we obtain 


M 

f) 

n(f) 

'=0 A 1 (^- 

v «=0 


where, 


A/. = {nii 4 nij 4 m j ) — (#11 4 )■ (26) 


Since //(*, f) has no lost poles, A\ ^ 0 by Lemma 2. Additionally, 
since Hi (p) is defined, c lh f[e)/(j(f) must approach a finite nonzero 
constant, say A'/ /A\. Hence, we can represent Hi ip) as 


/// (/») = K 1 



(27) 





f 1# 


. /(r) p” 11 AT, (f.0] P"' 1 01 1 X/.(f-»)]^ (f.»)A]r»(f. 0 

f ) [♦" ■> .v, (?. f) ] [f 1 - ■> .V 2 (E. f) ] .V, (f. f) 


(23) 
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To prove Part 1 of the theorem, we must hrst show that A/. >0 
T )i that, suppose to the contrary A/ < 0 Since H(<t f) E T ( , 
bv definition Hj ip) cannot have a pole at the origin Therefore, 
see from (27) that M 4 ^, f)| _ 0 must have \/ roots at the 
igin to cancel these poles But this leads to a contradiction since 
i(0) ^ 0 Hence, Aj >0 ic all zeros of Hi (f*<) are either at 
he origin or have type the index v = -*] To prove Part 2, we see that 

y C m t1(£n{v)) - ^2 C m (1 (Ph{i')) 

fcC?/(-1 1) » CQ, ( 1 x) 

"• -51" 

1 1 

= \i 4- in 1 

= A/ + deg e) 

- ( m d{Zu t ) 

Part T follows easily by subtracting in 1 = C anl(Zn(- 1)) from 
both sides of the above equation Finally let (f) E Z>y(~ 1) Then 
(f) is a root of \/i(s e) Since .(e) has the type index v = -1 
r (e) is analytic at f = 0 and we can write t (e)= 1 +/ 1 (r) where 
1 is a constant and 1 i(f) —> 0 as r —> 0 Part 4 now follows by 
setting / (i) = m(OA =“ iA and noting that 1 is a root of 
V,(pA Ol-n Q FD 

By studying Part 4 of Theorems 2 and T we see that the asymptotic 
behavior ol the two polynomials Ui(s t)M (s f) and ’lAfs f) 
ire completely chaiacteri/ed in terms of the zeios of the slow and 
the last tiansltr functions icspectivcly However to completely 
chaiacteri/e the asymptotic behavior of all zeros of H{ s t) the 
asymptotic behavior of the two polynomials l/|(s *) and U (s r) 
in (IS) should also bt studied in terms of the behavior of the 
zeros of the slow and fast tianster functions Although a complete 
characterization cannot be given due to the vanishing behavior of 
these two polynomials some partial information can still be obtained 
as the following two theorems show To stale the hrst theoicm note 
that \f t (s t-) m (15) can be written as 

U,(s t) =rt' S * + n/(s 0 r,( f) - U|(() 0 (28) 

whete m/(s f) E H fs] has degree less than in 1 and < »(()) ^ 0 
Theorem 4 shows that in \ /m and < 1 (0) can be deteimined horn 
the knowledge of the slow and fast transfer functions 
Theoicm 4 Under assumptions (A1MA2) 

1) /M = C nnl(7 ( l, i ) 

2) mu = C <n<l(7„(-l 0)) = C«irf(Z,J ) + 

a n 1 ,< ' ii 

M It mi / 0 ()(()) = ; [ j-j j ! ( - — Otherwise < t(0) = 

ttt n,'.(- >> i(o» 

Pioof In the course of pioving Theorems 2 and 3 we proved 
that f and f 1 /(*-)A /(0 approach some finite nonzero 

constants This implies that Ihus by equating (20) and 

(24), we obtain 

in = mi + n/> + m» - (/»i + u 2 ) (29) 

On the other hand Part 1 of Theorem T shows that 

C tii iliZn, ) = i»i ■+■ ni2 4 - in 1 — (n 1 -f* n*) (TO) 

C ompaiing (29) and (TO) proves Part I of the theorem For Pait 2 we 
rearrange (29) as in j = [(n 1 + « 2 ) - (m 1 4- m 2 )] + p 1 and identify 
the term in the bracket with C at <l{7^ ) The last part follows from 
(22) and (27) noting that h i = r «(0) Q E D 



The next theorem shows that m- = deg[T/ (s hi un \ 
determined Noting that nir = Cant(Zn{- x - 1 ,) vu h 
Theoiem 5 Let if (#, r) satisfy assumptions (AI > ( \ ^ > l , 

Caul(Zn(-~\ , -1)) - C hi ) - ( 1 / /( 


Pioof Wc have 

C tndiZfi) = C anl{7n{v)) 

• tg/ 

= 51 f <"<1(7n[n)) 

• c gj 


+ 


5 ^ ( nn1{/uU )) 

/tg/t 1 x) ( i) 

y c<iui(Zu(i')) 

« eg ( 1 > 1 

y Ctmi(/u( 10) 


c g /■ ( 1 u) 


+ y C onH7n(i')) 
. eg/i 1 > ) 


Wc now replace the hrst and (he fourth summations with the quantities 
given by Part 2 ol Theorem 2 and Part 2 of Theorem 4 respectively 
We also replace the summation in the bracket with the quantity 
obtained after subtracting Part 1 from Part 2 in Theorem T After 
simplifying the result we obtain 


C ai tl{7n ) — ( ai il(7u ) + C aitl(/u, ) + ( in ) 
+ C m d{Zn{-^ - ])) 


A further simplification is made upon substituting ( anliPn ) - 
( <ud(/n ) for the quantity (anl(/i) ) and noting that 
( 11 HPi 1 ) “ ( difl(Pn) - ( tinHPii, ) This yields 

Cm J(7y/(-x — L)) [( (utl{Pn, ) - ( )] 

- [( m <1{ Pn ) - C m ri(/n )] 

The theorem now follows by identifying the terms in the brackets 
with C ukH/i), ) and C nul{/^) icspectivcly 0 T D 

It is important to note that in determining the exact value of m 
the iclative degree of 11 {s t) should be known in gencial In other 
words knowing only the fast transfer function does not help one to 
determine in in general as staled by Theorem 5 However if the 
relative degree ol H (s f ) is not known a pi ion an upper bound lor 
m can slill be obtained as the following coiollary shows 

Corollat\ 1 Let //(«*) satisfy assumptions (AI)-(A2) Then 

( — 1)) < C <u </(///, ) 

There is of course a special case where the knowledge of the fast 
transfer function is sufficient to detcimine in This special cast is 
treated in the following corollary 

Cotollaix 2 Let H{s t) satisfy assumptions (AI) (A2) If 
Hh ip) is not strictly proper, then 

1) Cm (l{Zfi )-() le H{ s f) is not strictly proper either 

2) Cm<l(Zn{- v -1)) = 0 

Proof Since Hi (j/) is proper but not strictly proper, ( mili/fi, ) 
= 0 By Theorem S this implies (rnr/(7//(-x -1)) - 

—C md(7i) ) The results now follow QF D 

Theorem 5 can also be used to prove a scalar version of l^cmma 
2 5 in 14] 

Corollary ? Let H{ s r) satisfy assumptions (AI) (A2) Then 
H~ l {*< f) € T iff //s( ~>c) ^ 0 and Hi (*x) / () 
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Proof: (s»): Since Hs{«) and (H~ J (w. f))s = (tfs(*r))~ l are 
proper, Card(Zf} s ) = 0. On the other hand, propemess of e) 
and imply that Cnrd(Zff) - 0. Since H~ l (s, r) € T r , 

we also have 0/rrf(Zw(-x:-. -1)) = 0 and Crtrr/(Z>/(-l, ())) = 
0, Hence, by Theorem 5, Card{Z^ F ) - 0. 

(<=): Since' ^ 0, Part 1 of Corollary 2 shows that 

Can\(ZJj ) = 0, Hence, _ 1 (.*«, *=) is proper. Part 2 of Corollary 
2 shows that Card{Zn{-n> c, -1)) = 0. Hence, for O to 

be in T,, it suffices to show that Card{Zti{-l< 0)) = 0. But this 
follows since Hr( 0) = Hs(^c) £ 0 implies Card{Zu t ) ~ 0 and 
Card(Zff s ) = 0. So, by Theorem 4, //* = m,» = 0. Q.E.D. 

Example 2: Reconsider Example 1. Suppose that H^(s) and 
Ht> (p) are given. Suppose it is also known that H(i s, r) has no lost 
poles. Since the slow transfer function has no finite zeros, Parts 1 and 
4 of Theorem 2 are vacuously true. Part 2 of that theorem shows that 
ff(*. r) has no zeros with type index v > 0. In particular, H («. r) 
has no zeros at the origin. Since Z\i y (-1) = 0, Part 4 of Theorem 3 
is vacuously true. Part 2 of Theorem 3 shows that tf (*, f) must have 
two zeros with type indices in the interval (-1. x). Since H(i s, O 
has no zeros with type index v > U, we conclude that //( «, r) must 
have two zeros with type indices in the interval (-1. 0). This result 
can also be verified from Theorem 4 which gives m i = 1. v\ — 2, 
and rS(0) — 1. Moreover, Corollary 2 applies to the present case. 
Therefore, we conclude that 7f(.s\ e) has a zero polynomial of the 
form rs 2 + r,w(*. f) where rw(*. () £ R,[s] has degree of either 
zero or one and / a/(U, 0) = J. 

Remark J: The results obtained in this paper are also useful when 
studying the asymptotic behavior of the poles of a certain class 
of singularly perturbed system which arc more general than TFS 
systems [17], To see this, let //(*, f) be the transfer function of a 
(not necessarily standard) singularly perturbed system. Assume that 
f) € T,. In particular, this means that //(*, r) is invert¬ 
ible. From (31, we know' that H~ l (.s. O has a standard singularly 
perturbed state space realization 




B , ' 


f Bj 


K'i 
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where all matrices arc functions of * which are analytic at < =0 and 
A'n( 0) is invertible. The original system H(. s. t) has the realization 




' D,D 1 ' 


f DiD-' 





where A*: = A tJ - D t D’ 'C t for j = 1, 2. Hence, when A.* 2 (0) 
is invertible, the original system is a TFS system and its poles have 
the usual TFS behavior. However, when A* L ,(()) is not invertible, 
one gets more complicated asymptotic behavior as described in this 
paper. Since poles of the original system are zeros of the inverse 
system, the asymptotic behavior of poles of H(h, f) can be studied 
in terms of the asymptotic behavior of the zeros of 77" 1 (*. r) using 
the results obtained in this paper. 
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Andrew R. Teel 


Abstract —An Hoo -based Lyapunov proof is provided for a result 
recently established by Lin and Saberi: if a linear system is asymptotically 
null controllable with bounded controls then, when subject to input 
saturation, it is semi-globally stabilizable by linear state feedback. A 
new result is that if the system is also detectable then it is semi-global 
stabilizable by completely linear output feedback. Further, an extension 
which relaxes the requirements on the input characteristic is obtained. 


I. Introduction 

There has been much interest in the last few years concerning 
stabilization of linear systems subject to actuator saturation. Several 
important papers have focused on systems which are null controllable 
with bounded controls. A linear system is said to be asymptotically 
null controllable with bounded controls if any initial condition can be 
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jnven to the origin, perhaps in the limit as / —► x. using arbitrarily 
small controls. It is well known that asymptotic null controllability 
vutli bounded controls is equivalent to the usual notion of linear 
i tbilizahilhy plus the added condition (hat the open loop eigenvalues 
h.ivc nonpositive real part. For a thorough treatment of this concept 
see [7J or 181. It is also well known that, in general, a linear feedback 
, .mnot be used to globally asymptotically stabilize the origin of a 
linear system null controllable with bounded controls if the system 
has input constraints. This was first pointed out m |3| and elaborated 
on in [10|. On the other hand, it was established in [9| that, for 
this same class of systems, the origin is globally asymptotically 
stabilizable by nonlinear feedback. A particular algorithm using 
multiple saturation functions was initialed in |13] and completed 
infill. 

On a slightly different front, there has been renewed interest in 
the domains of attraction that can be achieved using linear feedback. 
In a pair of recept papers, Lin and Saberi have shown that, both for 
discrete time |4] and continuous time [5|, the domain of attraction tor 
a linear system null controllable with bounded controls and subject 
to input saturation can be made arbitraril) large using appropriately 
luned linear feedback. The purpose of this note is to provide a simple 
Lyapunov proof for the result of [5] which, at the same lime, allows 
for some extensions, In addition, it is illustrated with an example that 
the result of this note is useful for semi-global stabilization ol some 
nonlinear systems that appear unrelated to the problem of stabilization 
with input saturation. 

The particular choice ol feedback that is used to achieve the 
extensions of this note is based on the solution to a family of 
“full information” H^ control problems |2|. The solution to this 
problem provides a family of arbitrarily small feedback gains and 
a corresponding family of Lyapunov functions. Given a prescribed 
compact set, a member ol this feedback I amity is selected which 
exhibits stability robust to input nonlmcanties lor initial conditions 
in this compact set. 


We make the following assumptions: 

Assumption 1: The linear approximation of (I) exists and 
controllable with bounded controls, i.e., the pair \ 1. n'((i j ,, ( * M| 
lizable, and the eigenvalues of A have nonpositive real pan 

Assumption 2. The pair (C\ 4) is delectable. 

The next section is dedicated to a Lyapunov proof for the lotto \ \u } > 
result: 

Theorem 1: If Assumption I is satisfied then the origin ol (\) is 
semi-globally stabilizable by linear stale feedback. It Assumptions I 
and 2 are satisfied then the origin of (1) is semi-globall> siabih/able 
by linear dynamic output'feedback. 

Remark 2.1 Lin and Saberi have proved this result in |3| tmdei 
the additional assumptions that rr(u) = where (l B) is 

stabilizable, the /th entry of the vector ?f( u ) depends only on //, and 
each entry is linear near the origin, and the nonhncai element n is 
used in the compensator (2). 

To motivate the solution to this problem, we note that, for each 
/) > (), 3A > 0 such that 

|rr(i,)-V(())i/| < ,,|„| Vi/ 6 <A|. (3) 

This bound can be exploited if |//| can be kept sufficiently small. 1-or 
a linear system of the form 

./* = A.r + o'(())u (4) 

satisfying Assumption 1, for any compact set ol initial condi¬ 
tions, a linear stabilizing feedback can be chosen which makes 
sup,, x \ u[t)\ arbitrarily small. This was shown in [5| for example. 
The contribution of this note is that leedbacks with this same 
property can be constructed from the solution to an appropriate full 
information B . control problem, thereby achieving stability robust 
to perturbations of the form cr{u) - fr'{i))u satisfying the bound (3) 
for () sufficiently small. 


II. PROBIXM SfATLMlNT 
We initially consider systems ol the form: 

./ “ A.r 4 <t{u) 

H ^ ( '■/. (I) 

where ./■ e R". // 6 R m . // 6 R' J and cr: R'" -* / C R" 
where P is some bounded neighborhood of the origin and rr(0) — 0. 
The objective is to find a family of linear controllers that can be 
used to make the domain of attraction for the point r — f) arbitrarily 
large. This problem is referred to as the semi-global stabilization 
problem: 

Definition 2.1 ■ The point./ — 0 of (I) is semi-globally stabilizable 
by linear state feedback if, for each compact set J\ there exists a 
feedback // — A.r such that the origin is a locally asymptotically 
stable equilibrium with domain of attraction containing (\ 

Definition 22 ■ The point .»■ = 0 of (I) is semi-globally stabilizable 
by linear dynamic output feedback if. for each compact set V. there 
exist a dynamic output feedback of the form 


lit. PkOOh of Thforim I 

We will solve a more general problem than the one considered in 
Section II. Consider the system: 

.4./' 4“ B\ a -f Pj */( a ). (3) 

We recover (1) by taking B\ =■- //(()). U_> — / and //(//) - 
— a' (())//. Then Assumption 1 becomes: 

Assumption lb: The pairs (.4. B \) and (4. B>) are stabilizable 
and the eigenvalues of 1 have nonpositive real pan. 

We will fomi a family of semi-globally stabilizing feedbacks for 
(5) from the following result: 

Lemma JJ. Let Assumption lb hold. If Q : (0. 1] R" *" is a 
continuous, positive definite matrix valued I unction and 

liniA.n.Mt^ff)) = ri (6) 

then there exists a positive real number * such that, lor each 
(■ € (0, 1]. 3P(r) > 0 satisfying 
I) 


k = Pk + On 

a = 1\\ (2) 

and a compact set l\ such that the point (j\ O = (0, 0) is a 
locally asymptotically stable equilibrium with domain of attraction 
containing O x t\. 


.i'P(r) + /VH + P(o(J 7 n i /?j 
2 ) 


W) + C>W) = 0 
(7) 


limA 1Iltls ( P(f)) = 1). 


( 8 ) 
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Proof: Using the results of [2], since Q is positive definite for 
each f €( 0 , 1 ) and (A, D x ) and (4, Bi) are stabilizable pairs, for 
each f 6 ( 0 , 1 ] there exists (O such that, for each 7 > 7 o P i(f), 
there exists P(f< 7 ) > 0 satisfying 

A r P+'PA + P^BtDi -23|Bi r )p + Q(f) = 0. (9) 

In addition, since 

■sup A lll( 1 xKc?( 0 ) < TC ( 10 ) 

•€(0 1 ] 

we can choose 7 to satisfy 

7>T:*= s ni> 7«*,,i(0 (II) 

' 6 ( 0 , 1 ] 

so that, for each f 6 ( 0 , 1 ], there exists P(e) > 0 satisfying (7). 

To establish ( 8 ), we first note that, for each e 6(0, 1], there exists 
X 2 (f) > 0 satisfying 

A 1 .Y„(r) + X 2 (f)A - XB 1 , Xi(f) + Q(t) = 0 ( 12 ) 

and 

liinA Ilm ,(A' 2 (O) = 0. (13) 

This follows from a result of (16] which gives that, since the 
eigenvalues of -4 have nonpositive real part, X 2 = 0 is the unique 
positive semi-definite solution to 

-4 1 An + A :,A - A2-4 - X-iDiDl X, = 0 (14) 

and the result of [15] on continuity of the solution to the algebraic 
Riccati equation ( 12 ) under the condition ( 6 ). Second, using ( 11 ) and 
[ 6 , Theorem 3.1], we have that, for each t 6(0, I], 

P(0 < - il -^jX'Af) (15) 

where P(f) and A’a (0 are the solutions to the Riccati equations (7) 
and (12), respectively. The bound (15) can also be established with 
the tools of [2, Section VI.A, VII.C). Then ( 8 ) follows from (13) 
and (15). 

A. The State Feedback Result 

Assumption J: There exist positive definite matrix valued func¬ 
tions Q:(0, l |—> R"* M and P:( 0 , 1 |—* R M x " and strictly positive 
real numbers 7 and A satisfying ( 6 M 8 ) and 

|(/(i»)| < 1 ' Vu 6 {u 6 R'" : |if| < X). (16) 

Remark 3.1: There is no requirement that g be differentiable at the 
origin. All that is required is that g{ 0) = 0 and g is locally Lipschitz 
at the origin with a sufficiently small Lipschitz constant. 

From (3) and Lemma 3.1, Assumption 3 holds if Assumption l 
holds. Thus, the slate feedback result of Theorem 1 is a consequence 
of the following result: 

Proposition 3.1: If Assumption 3 holds then the family of linear 
stale feedbacks: 

« = -J OlPlf)*. (17) 

parameterized by f, semi-globally stabilizes the origin of (5). 


Proof: Define v = A II>a>( (). Let V be a compact set 
such that .r( 0 ) 6 P. Let c be a strictly positive real number such that 

2 > sup {.r* P(f)u') (18) 

and pick t* 6(0, 1] so that 

A„..,(/*(**)) < • 09) 

Since (8) holds, r and f* exist. Define c = P^[c*).v so that 

. i 

it = —B\ P* From (19) it follows that 

(20) 

In the ~ coordinates, the closed loop is given by 

i = Pi[AP l-.-DiDlr^ + Bj //(«(:))] 

=A-; + PiB>tj( if (--)). (21) 

Note from (8) that 

7 1 i * 1 I . 1 I J 

4 y -Kl = —P * Dj> Bl P - — P % Fi 113\ P — P ±QP 

^ (22) 

Let I (t) = : 7 : and consider \ on the set 

D = {:: L(v) <i J \. (23) 

From (16) and (20) it follows that 

|ff(«(:))| < ||h(:)| V:GP. (24) 

'Therefore, V: 6 P. 

,( l L ; 1 I , i i i\ 

i = IfjP^B.B! I’* + PiB t n , l i , ‘ -+ r ;Qi’->y. 

+ 2 : / Pi D ></(«(;)) 

< -\\B\Pi -,\ J - I b! r'i :|- - p- 7-I' 

1 

+ *\Blri-.\\Dln-\ 

<-W57»-K| a . (25) 

Since Q is positive definite, (25) and the definition of V imply that 
every initial condition in the set P converges exponentially to the 
origin. From (18) and (23), .r 6 P => : 6 P. 


B. The Output Feedback Result 

Assumption 3b: There exist positive definite matrix valued func¬ 
tions C?:(0, !]—♦ R ,,x " and P:{ 0, 1]^ R" x ”. strictly positive real 
numbers ) and A and a matrix of gains L such that 

1) (6)-(8) are satisfied, 

2) .4 -I- LC is Hurwitz 

3) defining: 

a) 3 = A IIIIIH (P„) where (A +LC)* P„ +P„(A +LC) = 

b) r = s/xZJbTdT). 

c) U — SUp, Jj A 1IU , x ( p{ f ) ), 
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d) /i = \/6«nax{/?(2+ 4 i' 2 k 2 ), 7}, 

e) we have 

|/;(«)| < Jh v« e {u 6 R'" : |«| < A). (26) 

Frofn (3) and Lemma 3.1, Assumption 3b holds if Assumptions 1 
and 2 hold. Thus, the output feedback result of Theorem 1 follows 
from the result: 

Proposition 3.2: If Assumption 3b holds then the family of dy¬ 
namic, linear output feedbacks: 

x = Ax + B\ v + L(Cx - y) 
a = -nfP(f).r (27) 

parameterized by f , x initialized in a compact subset of R 7 ', semi- 
globally stabilizes the origin of (5). 

Proof: With P v . v and n defined in Assumption 3b, define 

A,. (PA 

r = 2 + 4i-V. (28) 

Let V and f 7 ., be compact sets such that (t(0). .< (())) 6 f x U, and 
let v be a strictly positive real number such that 

c~ > sup { x 1 P(f)x + t( x - x) 1 P () (x - .!•)). (29) 

Pick r* E(0, I] so that 

\„u.s(P(P)) < min^ (30) 

Define c — x - x and * — P 7 i(r*)x. Observe that 

u = -B] Ph + BlPe (31) 

and, from the definition of v in Assumption 3b and (30), it follows 
that 

I "I < |-(|-l+ v^M>- 02) 

In the coordinates <. : the dynamics are given by 
( = + B'2[{( ii ) 

i = A. r. + P-2 B\D]Pc + f»5 (33) 

where .4. is defined in (21). Choose the Lyapunov function candidate 
ru. (■) = • + r. ' />,.( (34) 

and consider V on the set 

P= {(:, <•): ! (-, r) < r 2 ) (35) 

with r defined in (29). Observe that 
max (l ., )eP |i| = r 

maX( ; , ( xepH - r( s/ra)~ l . (36) 

Then, from (26) and (32), 

|,'/(»)| < - V(i.r)g P. (37) 

/' 

Also, from the definition of v and h in Assumption 3b, 

\Dj Pr\ < (38) 


Therefore, V(c, e) 6 p. 

V = —r|f | 2 + 2t( * Pt,D‘2[j(n ) 

- ^Lp5B a B.]'P5 +P3B 1 B, r pi + p !,),■ ' \ 

+ 2;' J 'P5 BiBj'Pc + 2 z r P*B-,y(n) 

- pL|* - |BfpL| a - |Q 5 P ~5 :f 

+ 2|B, r r5i|(«'KH) + + H'-IMW 

Using, for all real numbers a and h. 

(lb < Jjd- + ±lr (40) 

we get 

J'M + <’*H) 

2\B’f P$ i|() 

< 5 | 2 ?rrL| a +VK a |r|* 

< ~\dIp-2^ + i|B?'rhf + (4i) 

It follows that, V( ■. c) 6 P. 

v < -riel* + + 4<'V|cf - ~\ B.]>3 :|‘ 

+ (42) 

Then, from (2S) and the definition of // in Assumption 3b, 

V' < -|<f - V(~. r) € P. (43) 

Since (2 is positive definite, (43) and (34) imply that every initial 
condition in the set P converges exponentially to the origin. Finally, 
from (29) and (35), (x. x) C U x Uj => U. < ) £ P. 

IV. Application: A System Nonlinear in the State 
T he result of this note can be used as a tool for semi-globally 
stabilizing the origin of the system 

«h = r-2 + A 

X/ = I'A 

.i‘;i = w (44) 

for example, using linear feedback. A nonlinear globally stabilizing 
solution is given in [14]. Indeed, semi-global stabilization by linear 
feedback can be accomplished in two steps. First, using the results 
described here, the origin of 

•h = ,rj + i' 2 

= v (45) 

can be semi-globally stabilized using linear state feedback, r = 

— P 2 \ (f).*’i - P 22 (()x 2 - Moreover, a Lyapunov function that demon¬ 
strates the result is available. Then, using the semi-global backstcp- 
ping tool of [12], the origin of (44) can be semi-globally stabilized 
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with a linear state feedback of the form 

u = -P2i(f)-ri +Prj(r)s2). (46) 

Likewise, with the output feedback result of this note, the origin of 
(44) is semi-globally stabilizable by linear dynamic feedback with 
;n and J'a as 'measurements. 

V. Conclusion 

A solution to the problem of semi-global stabilization by linear 
feedback for linear systems with input saturation has been proposed 
which is an alternative to the solution given in [5]. The approach used 
is based on the solution to a family of Riccati equations. As com¬ 
pared with the result in [5], the scmi-global result is achieved while 
significantly relaxing the requirements on the input characteristic. 
Moreover, an output feedback solution is derived which is completely 
linear and hence does not explicitly use the input characteristic. On 
the other hand, we have not attempted to make any comparison with 
[5] regarding numerical and implementation issues. 

It has also been demonstrated that this result is useful for semi- 
global stabilization of some nonlinear systems that would otherwise 
seem unrelated to the problem of stabilization with input saturation. 
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Stability and Exponential Stability of an Adaptive 
Control Scheme for Plants of any Relative Degree 

Giorgio Bartolini, Antonella Ferrara, and Alexander A. Stotsky 


Abstract —Relying on a new adaptive pole assignment control scheme 
for plants with unknown relative degree and uncertain order which 
can be made minimum phase by means of a relative-degree-one first 
order parallel compensator, a new adaptation mechanism is proposed 
in this note in order to tackle with the problem of the attainment 
of an exponentially stable behavior of the output error. In particular, 
the connection between exponential stability and persistent excitation is 
outlined, and a complete stability analysis is performed. 


I. Introduction 

Starting from the works by Landau. Monopoli, and Narendra 
[11, [2], [31, the research activity relevant to the model reference 
approach to the adaptive control of LTI SISO systems has been 
mainly oriented towards both the simplification of the basic schemes 
and the modification of the adaptation mechanisms to cope with the 
nonstandard cases of unmodeled dynamics and bounded disturbances 
[4]-|7|. The common target has been however represented by the 
necessity of assuring, at least, the boundedness of tracking and 
parameter errors. 

Recently, the attention has been focused on the development of 
adaptive schemes for plants with unknown relative degree. In [7|, 
for instance, the relative degree is uncertain but its uncertainly is 
tied to the uncertainty on the plant description which is expressed in 
multiplicative as well as additive form however required lo be small 
enough. While in [8] and [9], a simplified adaptive control scheme 
has been presented, which performs the regulation of uncertain plants 
via pole assignment without requiring the perfect knowledge of the 
relative degree of the plant to be controlled and independently of the 
magnitude of the unmodeled dynamics. Yet, in this simplified scheme, 
traditional integral adaptation mechanisms have been adopted. Con¬ 
sequently, no great emphasis has been placed on the analysis of 
transient behaviors, as usually happens when only asymptotic stability 
is ensured. 
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As a matter of fact, the convergence rate of the adjustable param¬ 
eters when using traditional integral adaptation mechanism can be 
slow even under the condition of persistency of excitation and can be 
improved by the introduction of a prediction error into the adaptation 
law. Controllers composed of the terms driven by both the tracking 
and the prediction error for robot manipulators were proposed by 
Slotine and Li [10]. To avoid the measurement of the derivatives of 
the tracking error when getting the prediction error, various types 
of filtering have been proposed [6], 111], 112]. However, in previous 
works, the term which is responsible for filter design was not included 
in the Lyapunov function. 

The design procedure for constructing a prediction error used in 
this note is based on the Lyapunov function method and allows one 
to perform the filter design simultaneously with algorithm design, 
to connect the parameters of filter with algorithm parameters and to 
include a relay term driven by the estimate of the prediction error in 
the filter for robustness enhancement, in accordance with [13|. 

The main contribution of this note is the introduction of a new 
combined direct and indirect adaptive algorithm, driven by both the 
tracking and the prediction error, which does not require the perfect 
knowledge of the relative degree of the plant. Moreover, exponential 
stability of both the tracking and the parameter error is established 
when some persistent excitation requirements are met. 

The present work is organized as follows. In the next section the 
existence of a control structure characterized by a complexity which 
is independent of both the relative degree and the precise knowledge 
of the order is first discussed. Section HI is devoted to the construction 
of a suitable error model relevant to the adaptive case. Finally, the 
description of the new adaptation mechanism and the analysis of 
the stability properties of the overall adaptive scheme is provided in 
Section IV. 


11. The Underlying Linear Structure oi- the Controller 
We consider the following LIT SISO plant 

.*'/.( f) — ) + b v u r (t) 

«,(/) = /</•<•, ,(o 0) 


where r(f) is a suitable reference input which is jssumiil -o 
bounded, and A,,(s) is a polynomial with degree equ.o u> , i„ i,, l(U 
to a set of arbitrarily chosen Hurwitz polynomials \u;h d i 
between n i and w 2 (in practice, a pole assignment imam! on,!-', 
has to be solved in spite of the uncertainty we jssnni* ^ j .v t , W1 
the plant). 

To face the control problem above introduced, the UL 
parameterization of the control law u P {t) can be chosen 

Y(f). (4) 

where X'(t) = [.,■{ (/)«„(/).!•.] (f) r[t).r{ (f)] is the vectoi contain, 
ing the signals of interest. Among them, the vector signals .r,(f) are 
the states of three state variable filters described by the following 
differential equations 

hit) = A.r, (t) + b f u r (t) 
h (0 = A.rj ( t) + It / i/„ ( t) 

.h(0 = Aj-,(/) +ft,!■(/), (5) 

where A is a ui x n* stable matrix, and (t) is an auxiliary signal 

obtained as y„(t) = y,,(1) -I- y,(t), y>(i) being the output signal 
generated by a rational feedforward compensator, namely, 

= -ay, it) + At (6) 

(k\ and (\ positive constants). Hence, i/„ (f) can be regarded as the 
output of the parallel between the plant and a first order relative- 
degrcc-one filler. Finally, (-)* is a constant vector of suitable dimen¬ 
sion, containing the parameters which solve the control problem. 
The introduction of the parallel compensator (6) is motivated by 
the following considerations. The relative degiec of the parallel 
connection of the plant and the first order filler turns out to be one 
regardless of the relative degree and of the order of the true plant. 
Indeed, 


Halt) = 

.-l(.s) * + II 




A'i -4(.s) + h l ,D(s){s -F o 
4(s)(.s + «) 




(7) 


where .v Jt (t) € R'\ with ;/i < q < /# 2 , n f ,(f) € R\ A,, is a matrix 
of suitable dimension, and, using the operational notation s =, d/dt, 

u,At) = - A ,,r\ = *> — »,.(/). (2) 

< *» t s ) 

This plant is assumed to satisfy the following assumptions; a) 
A(s), B{s) are coprimc not necessarily Hurwitz polynomials; b) 
deg (/?(*)) < q — 1; c) the parameters of A,., h n , h v are supposed 
to be unknown; d) only the input u v (t) and the output y Jt {t) are 
assumed available; e) the sign of the gain is known. Note that 
this set of assumptions allows us to deal with the case in which the 
reference model order is lower than the plant order. In particular, 
to simplify the treatment, we shall make use of a first order strictly 
proper model, although the plant relative degree and order are not 
exactly known as happens in any general representation of systems 
with unmodeled dynamics [6]. 

The control problem we are dealing with can be stated as follows. 

Problem J: Find a control law = »,,(/. y v (t), r[t)) such that 
for any possible value of the plant order q 

*/,.(') = *>? ( * r(M. (3) 


Note that in this work we assume that a known parallel compen¬ 
sator H{s) = X, [s)/D, (.s), with relative degree equal to one, such 
that X(.s)A(s) -F k r D, (*)B{s) is Hurwitz exists. This means that 
the class of systems to which the proposed approach is applicable is 
enlarged with respect to classical MR AC schemes to contain those 
nonminimum phase systems whose zeros can be robustly located in 
the complex l.h.p. by the parallel connection with a known relative- 
degree-one compensator [14]. In particular, in 114] the conditions 
under which the problem is solvable by means of a first order parallel 
compensator are reported. While, with reference to systems for which 
the above robust stabilization problem cannot be solvable, a possible 
solution to the control problem has been proposed in [151. However, 
this topic is still under investigation (see (16] for some preliminary 
results). 

More precisely, in writing (6)-{7), for the sake of simplicity we are 
implicitly assuming that H(s) = ]/(* + «), (i.c., k\ = 1) suffices to 
satisfy the minimum phase requirement for the roots of the numerator 
of the transfer function in (7). On the basis of these considerations, 
it is possible to claim that, by means of the parallel compensator, 
an auxiliary minimum phase and relativc-degrec-one plant has been 
obtained. Then, we are in a position to cope with the relevant control 
problem previously stated. To this end, let us introduce the following 
results. 
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Theorem 1: For any minimum phase and rdasive-degrec-one plant 
represented by the input output pair {MM* MM} whose order can 
range between two known values |iij, n 2 ], there exists a parameter 
vector pair 0J 6 , of fixed dimensions, and a scalar value 0Jo, such 
that, by using a control law of the type 

i 

«;<n = -6>; 7 \r, (t) - tr/.rAt) - fio Ua(t) + «■*(/). (8) 

where .ri(/), MM € R"*, and u'*(l) is a signal depending on the 
reference input, as will be specified below, a unique transfer function 
TM*) exists such that the following condition is satisfied 

.MM = Un(t) = T,u(ti)w*(t) (9) 


Proof: The transfer function between u*(t) and y P (t), due to 
the choice of the control law indicated in ( 8 ), results 


MM = T,„{s 


k r B(*)(H + a) 
D[m) ‘ 


W*(t)n 


(15) 


where, for the sake of simplicity, T„,(*) = D n Ae)/Ao,{n) can be 
chosen as r ir ,(*) = l/(* 4- M i.e,, A w (s) = (s + a)D m (-*)■ By 
posing «>*(M = (F!j(s)/D(#))r(/). where the coefficients of poly¬ 
nomial F\(s) coincide with the components of the vector [0J O 0 *] 1 * 
we obtain 


MM ~ 


k } ,B(s)FAs) v{{] 

D(s)D(s) 


(16) 


To,(a) being a strictly positive real (SPR) transfer function. 

Proof: If D(a) is the arbitrarily chosen characteristic poly¬ 
nomial of degree equal to n 2 associated with matrix A in (5), 
and Fi(s), F 2 (») are the numerators of M(.s/ - A )~ l hj and 
02 7 (a/ - A )” 1 bf 4- 0Jo, respectively, denoting with B(s)/A {*) the 
transfer function of the parallel structure (auxiliary plant) indicated 
in (7), then the transfer function between ?/•*(/) and </„( t ) results 




D(s)D(s) 

A(fi)iD(s) - F\(h)) - F 2 {s)B(#) 


( 10 ) 


The model transfer function T m (f* ) needs to be independent of the 
considered auxiliary plant B(s)/A(s). Keeping in mind this fact, ihe 
proof can be developed by introducing the following factorization 

d(k) = n; ,j (.s + a,) = n:i, + .<" + rf.>n:..i<* + «M = 

D'(*)£>"( *) and choosing 

F\(s) - D{fi) = D'(s)B(s) 


FA«) = D'(s)(M»)- A.is)). (II) 

where the monic polynomial A q (s), with the same degree as A(.s), 
can be expressed as A,,(s) =* {* 4 - a)A q (s), and A,,(s) belongs 
to a fixed set of polynomials of the type |/U(.s); Ak + i(s) = 
(* + A k)Ak{h), k = Hi.-'-./Ja}, with ,4 Jfl («) and fA /l } chosen 
so that for any 7 , A q (») is the relevant characteristic polynomial to 
be assigned to the plant. 

From ( 10 ) and (II), results equal to D ,, (s)/A, i (s)> and can 

be invariant for any q only if D " (*) and .4 (/ ( s ) have common factors. 
Due to the arbitrarily of the choice of D(s) and A* (*), we can always 
assume that this is the case. Therefore, T m (is) can be rewritten as 


Tm(ti) 


Dm{») 
A, tl {s) 


( 12 ) 


where A m (n): — A n , (»). Note that in ( 12 ) the following factorization 
has been used 


Hence, choosing 

F\(s) = B(s)D , {s) t 

where D’{s) has been specified above, it results 

m k t ,B(s) 

• f ' (t)= DZw r(t) - 


(17) 


(18) 


Taking into account (I0)-(11), the definition of 4 (/ («) and the 
choice above made for T ni (s), one has T, „(s) = D"{. s)/[(.s + 
r/)i 4 y (.s)] = 1 /(-s* -f «). Therefore. D H 0 s) — .4 r/ (.s) which is 
the relevant characteristic polynomial to be assigned. So the pole 
assignment requirement is fulfilled and Theorem 2 is proved. □ 
It should be explicitly noted that the number of parameter sets to 
be tuned is only two (namely, the sets of coefficients oi polynomials 
F\{s) and F 2 (*)), because of the relationship between /’i(.s) and 
/n ( s ) expressed by (11) and (17). 

Indeed, 


F\(s) 

D{*) 


= Vn> + 


FM - DO S) 

D(, 


(19) 


(0*,n = 1, since F .\( s ) and D (.s) are monic), and F\(s) - D(s) = 
F\{s), so thut 


0; = 0;+iZ 


( 20 ) 


(]_ being the vector of the known coefficients of D(.s). Moreover, with 
respect to classical adaptive control schemes, the present one solves a 
trivial Bezout equation which does not require any matrix inversion. 


111. The Adaptive Case,: Constrijciion op the Error Model 

Let us now consider the ideal control law expressed by (8), (II), 
(14), and (17). When the plant is unknown, it is possible to choose 
a control law with the same structure, that is 

«,,(*) = {t)J"At)-# 2 aU)uM) + U'(t) 

>r(t) = #j (t)., At) + <•(/). (21) 


j,(*) = .4„,(*)D" , (*) D"(.A = D m (x)D , "(,) (13) 

Hence, the same T, u («) is obtained starting from any admissible plant, 
and polynomials F\ (*), F 2 (h), and D(h) are of constant degree (so 
that 0 [ and [0J O 0J 7 ] y , which contain the coefficients of F\(s) and 
Fif.v), are of fixed dimensions) even if the degree of .4(*) ranges in 
the interval l»»i 4-1, 4-1], which completes the proof. □ 

Theorem 2: In the same conditions us in Theorem 1 , there exists 
a parameter vector [ 0 *o 0 * f ] , , such that the choice 

w , {t) = »'/rAt) + 0' M r(t) (14) 

ensures the complete satisfaction of the control object ( 3 ), for any 
plant belonging to the set of systems fulfilling assumptions a)-d). 


Defining, as usual in the MRAC context, 

MM = MM - ",*(M (22) 

where u*(t) is the ideal control law determined from (8) and (II). 
one has 

«,.(/) = -(<(M - f>; T ).r, (/) - (# 20 (0 - 0w)vAi) 

- (f>i (f) - #I 7 ).r 2 (/) + It) - ).n (t). (23) 

Thus, the auxiliary plant state equation can be rewritten as follows 
./■„(/) = A4- h„ ) 

= A„j‘ (l (0 4- K(u*,{t) 4- MM) 

».,(o = -At) 


( 24 ) 
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where .r«(0 € and A„ is a (r/4 1) x (q + 1) matrix Taking 
into account (8), the first of (24) becomes 

, (/)= n(f) + 0i" r 2 (f) 

+ «jo»u(/)) + 6 1 I («*(f) + ii p (/)) (25) 

This equation is identical to the state equation of the ideal auxiliary 
plant (i e., the auxiliary plant in the known parameters case), with an 
input signal (u *(04u P (0) instead of simply u *{i) From Theorems 
l and 2 we know that the zero state response of </,,(/) is expressible 
as the solution of the following first order differential equation 

!Ja{t ) = 4- 0* *(0 4 u,At)) (26) 

Therefore, by introducing a parallel model of the type 
ia(t) = -<i-,(0 4 it (t) 

= -<> .(/) + #{ (/)• ,(0 + »m. (27) 

ihe final error cqualion, due to the error on the control signal a,At), 
results 

i'(t)= y,(f)-~.(t) 

I'(t) = 11 (t)-nl 

- - rt»/(0 + «'(/)\(0 ( 28 ) 

where 


Theorem 3 Consider the error model (28) with adjusting 
(29H32) If the reference input /(0 is bounded then ilu u 
error i/(t) and the |!)iediction error estimate t,ih / , 

asymptotically converge to zero and all the traicuoiies m th m *m 

remain bounded Moreover, if \ (0 is persistently e\c muj. It u u k 
ing error v(t ) and the parameter error 0(f) are globally \p<m< n M L 
convergent 

Proof The stability of the system can be studied b\ lkiiu? \>\ k 
following Lyapunov function 

r= j^(o+^(»'(n-f(f)-/(/)(-)(0) 7 +’i|«(Mii, 1(0 (i4» 
whose derivative along the solutions of (28), (29)-(32) is 
1 — v(—au 4 0 1 \ ) 4 (v - f — y 0){-<ir 

4 \ 1 0 4 <it - T(0[A/' 4 - 0 4 y'l ] - i 

4 0 - \ 1 0 4 y 1 r(/)[\/> 4 <v('' -0 4> i i ]} 

- (V [\ V 4 - 04^1+ “ 0 ; yV 7 

“ jlWt J U)-/„ /A„| 

= —av 2 — ri(iv — ^ B) -2 - (// - 

4 0 7 y?i “ n0 / y 1 ! ,y " f ) ~ 0 ' r i 

+ ^ - ^IMIn I,,, , /*„, 05) 


(-)'(f) = (t)-t)2n{t) -#{(/)] 

V'(0 - [»f (/) v. (0 < i (0] 

Thus the adaptive control problem has been reduced to the standard 
foim, le, u(t) = /,„(*.)(-)' (M \(t) with /'„(•>) = ]/(s + n|€ 
SPR, and j/( 0 playing the role ol tracking error 


IV lur Adapiivl Casl Introduction of a 
COMBINLD DlRLtr AND lNDIRl CT ADAPTIVL LAW 

Let us consider the ideal control law expressed by (8), and the 
error model previously dcnved, i c v{t ) -=■ - m f (1) 4 0 7 V (/) with 
0(0 € R 1 ", i'(t) € R [ We propose the following adjustment Jaw 

0(0 = -T(0[\(0i/(04 r/y'(0(//(0-f(0)4^(0< (v 0] (29) 

where r (0 E R ,n , *-(t) € R 1 , T(f) is a m x m matrix, and they 
are adjusted as follows 

^(0 = -M0 4 \ (t) (30) 

<(t) = -«<•(/) + y ' (t)l (0[\(0"<M 

+ «y-(0(*'0)-f(0) + ^(/)< (»- 0] + i(/'. 0 01) 

i (/) = -«V{t) r v)A (t)r(t) + «{Ut) + i r (t)T(i)/k„).w) 

where 0 < T(0) £ l {l J„ k u >0 being an upperbound on the gam 
matrix, and i (i/. f) E R l satishes the following inequality 

1,(1' O(i'(0-'(0)>O (33) 

Note that (/'(0 - 00) is playing the role ol the prediction er¬ 
ror estimate The function i, (/', O can he chosen as v(»a 0 = 

11 sign (/^(f) - 00), * i > 0 The introduction ol the relay term 

t O', f) driven by the estimate of the prediction error is essential for 
robustness enhancement, when bounded disturbances are acting on 
the plant [13] 

Moreover, it is worth remarking that the least squares gain update 
(32) is a special case of the modification of least squares update law 
proposed by Lozano et at in [17) 

By taking into account the combined adaptive algorithm (29H32) 
we can prove the following result 


(note that tor the sake of simplicity the dependence of f, H on 
time has be omitted) 

Taking into account (31) and decomposing the second term we 
have 


l < ~m> 1 - £(/' 


y? 7 (-))' 

£(■' - f)“ - ^IK-)|ln i„, i /a, i (56) 


From this we conclude that u(t) (i/(/) - >(I) - ^(fjBU)) are 
bounded and square-integrable, 0(f) is bounded and (/'(f) - r(t)) 
is square-integrable Now, let us examine the boundedness of i'(0 
Since r(t) is bounded, i i{t) is bounded as well (see (5)) From (27) 
we conclude that the boundedness of /(/), 0(f) and i At) implies 
the boundedness of ^,(t) and in turn the boundedness ol i/,(/) since 
i'(/) is bounded Prom (S) we conclude that i >(/) is bounded since 
if, is bounded Examining Ihc boundedness ol u,(/), from (7) we 
see that n, (/) is bounded since the auxiliary plant transfer function 
has zeros in the open left half plane and according to, loi instance. 
Lemma 4 in [3|, its input cannot grow faster than its output, namely 
tj At), which is bounded From the boundedness ol u,{t) we conclude 
the boundedness of ii(f), and that of the output of ihc parallel 
compensator i/ ( (f), as well as, in turn, the boundedness of the plant 
output iy, (/), since y„{t) is bounded From the boundedness of \ (/) 
it follows the boundedness of j{t) and > (f) The boundedness ol 
ip(t), r (/). \(t), v(t) and !•(/) implies the boundedness of <{t) and 
f(t) and hence we conclude the boundedness of <r){t) Since o(t) 
and (i'(f) - f(t)) are square-integrable and u[t) and (r'(f) - 
are bounded, they both converge to zero 

Now consider the case when \(/) is persistently exciting The 
excitation of the regressor A ( /) is connected with the excitation 
strength of the reference input i{t) [18] If \{t) is persistently 
exciting then y?( t ) is exciting as well Note that the modified least 
squares estimator (32) presents the following properties a) 1(0 < 
A(j/„, for any t > 0, b) if ^(0 is persistently exciting, i e , there exist 
strictly positive constants <*>, f, t\ such that 


r 1 ** , 

/,„ > > II,„ 


(37) 
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then, there exist Ai > 0 and A? > 0 such that - 

it */fco) > Ai/ rn and T~ x (t) < A 2 iui for any t > 0. Indeed, 
define :(f) = jr 7 r~ 1 ( 0 *J\ where Jr € is any constant unitary 
vector. Differentiating i{t) and taking into account that r~ J (t) = 
-r^*(t)t(t)r~ x (t) we have i(t) = ax r yp(t)^UtU - <r4t) + 
a/ko > ~az(t) + a/fa which implies that F(0 < A*o/ TII . Integrating 
i(f) and taking into account (37), we obtain :(f + h) - z(t) > mi¬ 
tt f t H * ~(r)<lT+ab/h. This, in turn implies that there exists Ai > 0 
such that - 1,,,/k 0 ) > Ai. Using similar arguments one 

can show that there exists A a > 0 such thal r -, (f) < A 2 /„. for 
any t > 0. 

Then, by using the above properties of the estimator (32) we can 
write (35) as 

V < - £(" »>* - 5~ll»llr- (38) 

Let f/| be a positive constant such that <\\ — min[n. «Ai/Aa]- Then 
V < -a iV which implies the exponential convergence of 
(v(t) — e(t) - ^ r (f)0(O) and 0(f) to zero, □ 

To conclude the treatment, it should be noted that no direct 
adaptation of the parameter vector 0j(f) is performed, since this 
vector does not affect the output error v(t). Yet, the parameters 
03 (f) can be indirectly adjusted on the basis of (20). Since the 
exponential convergence of 0i(f), MO, # 2 n (0 to their ideal values 
has been ensured, also the convergence of MO to 0* is guaranteed. 
This corresponds to the attainment of the control objective (3) in 
Problem 1. 


V. Conclusions 

In this note, an alternative adaptive control approach is presented 
so as to deal with plants with unknown relative degree and un¬ 
certain order which can be made minimum phase by means of a 
relative-degree-one first order parallel compensator. Apart from the 
fundamental feature of having a complexity which turns out to be 
independent of the relative degree of the plant, the proposed control 
scheme is characterized by the use of a nonstandard adaptation 
mechanism which includes a time-varying gain and can be viewed 
as a combined direcl/indircct mechanism, being based on the use 
of a prediction error estimate. The introduction of this modified 
adaptation mechanism allows us to prove that the tracking error 
asymptotically converges to zero and that all the trajectories of the 
system remain bounded. Moreover, under a suitable assumption of 
persistent excitation, the tracking error and the parameter error turn 
out to be globally exponentially convergent. This latter property is 
indirectly used to complete the design of the control scheme so as to 
assign the desired dynamics to the true plant output. 
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Preservation of Reachability and Observability 
under Sampling with a First-Order Hold 

Tomomichi Hagiwara 


Abstract —This paper gives the necessary and sufficient condition for the 
reachability of the sampletl-data system S i obtained by the discretization 
of a linear time-invariant continuous-time system with a first-order hold. 
Equivalence of the reachability and controllability of S\ is also shown. 
Similar results are given also for observability and reconstructlbility. 
It turns out that S i is reachable only if So is reachable, while S i is 
observable if and only if S\l() Is observable, where So is the sampled- 
data system obtained by the discretization with a zero-order hold of the 
same sampling period. 


I. Introduction 

In sampled-data control, hold circuits are used to convert the 
discrete-time signals from digital compensators into the continuous- 
time signals to be applied to the continuous-time systems. Hold cir¬ 
cuits can be viewed also as filters which attenuate the high-frequency 
alias spectra generated by sampling continuous-time signals. Typical 
hold circuits are a zero-order hold and a first-order hold [1], but the 
former seems to be particularly popular in industrial applications. The 
primary reason for this is that a zero-order hold can be implemented 
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quite easily by using the function of D/A converters while a first-order 
hold can be implemented only with the aid of some additional analog 
circuits. Another reason might be that, when viewed as continuous¬ 
time filters, the phase lag of a first-order hold is greater than that 
of a zero-order hold for high-frequency ranges, which seems to 
be a disadvantage from the point of view of closed-loop stability. 
However, the latter reason seems to apply mainly in the case when 
a digital compensator is obtained by a digital redesign method [2] 
of a continuous-time compensator and closed-loop stability is noi 
necessarily assured theoretically. If we could use a first-order hold 
in such a way that closed-loop stability can be assured, then it might 
provide some advantages over a zero-order hold, such as reduction 
of the intersample ripple of the response. 

Based upon the above consideration, the aim of this paper is to 
provide a basis for the use of a first-order hold in the context of the 
state-space approach of control system design. For this purpose, we 
give the necessary and sufficient condition for the reachability of the 
sampled-data system obtained by the discretization of a linear time- 
invariant continuous-time system with a first-order hold. In addition, 
we show the equivalence of the reachability and controllability of 
this sampled-data system. Furthermore, we give similar results for 
observability and reconstructibility. (For the standard definitions of 
these concepts, see [3).) 


II. Discretization with a First-Order Hold 
We consider the system given by 
d.v 


dt 


= .4, ,r + /?,//. y = C.\\ 


(1) 


where 4 f £ R"*\ B, £ R" y, \ and C £ R ,,y ". Suppose a 
first-order hold is connected to the input. Then, i i(t) is given by 

/JX t j , "(AT) - u(k - 1 T) IT , V 
u(1) = u{kT) 4-- # - (t - AT) 


(AT < t < k + 17), (2) 


where 7 denotes the sampling period (//(AT) stands for //(AT4-0)). 
ft should be noted that there is a built-in constraint that the input 
it(t) (AT < t < k IT) depends not only on ji(AT) but also on 
i/(A- — 17), which shows sharp contrast with a zero-order hold. In 
particular, v{t) (0 < t < T) depends on //(-7), which has been 
determined before t = 0 and cannot be changed by the compensator 
at t — 0. 

The resulting sampled-data system can be described by the equation 
(see [41, |51) 

.r(F+T T) - .4.1'( kT) + B + it(kT) + B~u(k -IT). 

V(kT) = C'.r(AT) (3) 


where, 


.4 = exp(-4, T), D + = ^2 - Cj exp (A,t)B, 


dt. 


(4) 


We denote the system (3) by , which can be rewritten in the form 
of the ordinary discrete-lime state equation as 


■r(A-+J7) 

"(AT) 


4 zr 
o o 


■/'(AT) 
if (A’ - 17) 


4* 



//(AT), 


(5) 


[ .«‘(AT) 
"(A - 17) 


( 6 ) 


III. Controllability and Rha( ilABiim s 

A. Definitions of Controllability and Reachabthn 
and Their Equivalence 

In this subsection, we first give the definitions ol the lonim'U c > 
and reachability of Si. In view of the discrctc-lime stau 1 equ iiu-n , ( 
let us adopt the following definition. 

Definition 1: S j is controllable if the pair ( 4]. JI i) is mnnoi 
lable, where 


'4 B ’ 


B 1 ' 

0 0 


./« . 


Now, let us verify that the above formal definition matches oui 
practical control purposes in spite of the built-in constraint of a 
first-order hold. 

If we regard S i simply as an ordinary discrete-limc system, then its 
controllability might be defined as the property that, given any initial 
condition .r((i ), there exists a sequence //(AT) (A- = 0. ■ • •, 1 Y - 1) 
such that .r(XT) = 0. However, this is not appropriate, because 
this properly does not reflect real purposes of control. Namely, this 
definition docs not always imply the property that there exists a 
sequence //(AT) (A = 0, ■ ■ ■, X - 1, A’. ■■■) such that ./■(/) - () 
( v f > XT), because of the built-in constraint of a first-order hold as 
discussed in the previous section. (This can be easily understood if we 
notice that //(/) = 0 (AT < 1 < X + 17 ) implies //(A' - 17) — 
//(AT) = 0 from (2).) Therefore, to define controllability, we must 
require that there exists a sequence //(AT) (k = 0.■ ■ ■, 1 V - 2) 
such that this together with //(X - 17) — 0 implies ,r(XT) = 0. 
Likewise, as discussed in the previous section, //(/) (0 < / < T) 
is constrained by the unpresentable value //(-7). Despite this 
constraint, j-(XT) is required to be made 0. 

From the above consideration, the controllability of S\ should be 
defined as the property that, given any initial conditions .r(0) and 
i/(-74, there exists a sequence u(kT) (k = 0, ■ ■ ■, \Y -2) such that 
this together with u[X - 17) = 0 implies ./( NT) = 0. Obviously, 
this definition is equivalent to the controllability of the discrete-time 
system (5). Thus, validity of the above definition is assured. 

Similarly, we are led to the following definition (see 10] for details). 

Definition 2: Si is reachable if the pair (4 1 . D\) is reachable. 

Now, in spite of the singularity of 4i, we can establish the 
following result (the straightforward proof [6] is omitted here). 

Theorem J. Si is reachable if and only if it is controllable. 


H. Condition for Preservation of Reachability 


In this subsection, we study the necessary and sufficient condition 
for the reachability ol S i in terms ol A,, B, and 7\ From Definition 
2, it is reachable if and only if 




B' 


-- n 4 /// ( V : € C) 


( 8 ) 


where B:= B + 4- B~ = f 0 * exp( 4 ,t)B, dt. (Note that (4. D) 
is nothing but the pair of the sampled-data system obtained by the 
discretization of (I) with a zero-order hold, which we denote by Sn.) 
The condition (8) is nothing but the reachability condition for the pair 


(4 ; 


i* /?„):= ( 


.4 B " 


B + 

o r,„ 


1,„ . 


(91 


This pair can be regarded as the pair obtained by the discretization 
of the fictitious 7-dcpendent continuous-time pair 


(4*. B h 


H 


4, 

0 


with a zero-order hold, because 

Ji!‘ exp(.4a, t)Bj, /It. 


I), 

0 

.4a 


' n, 

1,,,/T 


) 


= exp(.4j, T), 


( 10 ) 


B. = 


v (kT) = [C 0] 
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Remark I: (4a,, ) is reachable if and only if a) (4,, B ,) is 

reachable and b) -4, does not have the eigenvalue ~l/7\ 

Since the eigenvalues and left eigenvectors of 4»c are not depen¬ 
dent on 7\ we can apply the necessary and sufficient condition T7] 
for the reachability of So to the pair (4*,, B;<). Then, the following 
theorem is obtained (see Appendix for proof). 

Theorem 2: S\ is reachable if and only if the following iwo 
conditions are satisfied. 

a) 5 g is reachable. 

b) A, does not have the eigenvalue -\/T. 

Remark 2: Suppose that we define the stabilizability of S\ by the 
stabilizability of the pair (A\ , B \). Then, the necessary and sufficient 
condition for the stabilizability of Si is given by the stabilizability 
of So (the condition b) can be dropped). Although we can rewrite 
the reachability/slabilizability of S 0 as the conditions on 4, , B t , 
and T (see [7|, and Theorem A. I in Appendix), we did not do this 
because the importance of the theorem seems to be much clearer in 
the present form of the statement. 

IV. Observability and Reconstructibility of Si 

As in the preceding section, let us consider how to define the 
observability and reconstructibility of Sj, taking account of practical 
purposes. 

If we regard Si simply as an ordinary discrete-time system, then 
its observability might be defined as the property that its initial 
state ./ (()) can be uniquely determined from the input data u{kT) 
(k = 0, ■ ■ ■, d t V - 1) and the output data //( kT) (A- = ().■■*. JV). 
However, this is not appropriate, because n(t) (0 < t < T) cannot 
be known completely from the knowledge of the above input data, 
as discussed in Section II, and it is clearly impossible to determine 
,r(0) under this lack of knowledge. Therefore, to define observability, 
we must assume that u{t) (0 < 1 < T) is also known. This 
assumption is equivalent to the assumption that u{-T) as well as 
the above input and output data can be used. Noting that ,r(0) can 
be determined uniquely if and only if [,i-(()) 7 . //(— 7") 7 ] y can be 
determined uniquely (if we know u{ ~T)), we arc led to the following 
definition. 

Definition 3: S i is observable if the pair (G, A \) is observable, 
where 


Similarly, we are led to the following definition (see 16] for details). 
Definition 4: S i is reconstructive if the pair (G. A \) is recon¬ 
structive, where 


Cr.= [C 0). 


( 12 ) 


(G.4i) is reconstructive if and only if (C\. A j) is recon¬ 
structive. Furthermore, in spite of the singularity of 4i, we can 
readily show that ((V 4]) is reconstructive if and only if it is 
observable. Thus we obtain the following result. 

Theorem 3: Si is observable if and only if it is reconstructive. 

Now, from Definition 3, S i is observable if and only if 


rank 


.4 - * J„ B ~ 

C 0 
0 


= a + in ( ; £ C). 


(13) 


Since ((', .4) 'is the pair of So, we readily obtain the following 
theorem. 

Theorem 4 ■ S\ is observable if and only if So is observable. 


Remark 3: Suppose that we define the detectability of S\ by the 
detectability of the pair (G, 4i). Then, the necessary and sufficient 
condition for the detectability of S\ is given by the detectability of 
So. The condition for observability/detectability of S () in terms of G 
At and T is given by |7J (see also Remark A.l in Appendix). 

V. Conclusion 

In this paper, we studied the use of a first-order hold in the 
context of the state-space approach of control system design. We 
first studied how to define the controllability and reachability for the 
sampled-data system S\ obtained by the discretization of a linear 
time-invariant continuous-time system with a first-order hold, taking 
account of the built-in constraint of a first order hold. Next, we 
showed the equivalence of these two concepts for S i. Then, we 
studied the necessary and sufficienl condition for the reachability 
of Sj in terms of the parameters of the continuous-time system and 
the sampling period. We also gave similar results for observability 
and reconstructibility. In particular, it turned out that Si is reachable 
only if So is reachable, while S’i is observable if and only if So 
is observable, where So is the sampled-data system for the zero- 
order hold case. The compensator design problem under the use ot 
a firsl-order hold is also studied in [61. 


Appendix 

Proof of Theorem 2 

Before proving Theorem 2. we give a more comprehensible 
statement of the necessary and sufficient condition lor the reachability 
of So derived in [7|. 

Let A(.4, ) denole the set of the eigenvalues of .4,. For each 
A, £ A( 4,), we define 

A(A,):= (A | A 6 A(4, ). Re (A) = Re (A,). 

1m(A) - Ini (A,) - 2 *tt/7 (A- = 0, ±1, ±2. ■■■)}■ (14) 


Note that A, £ \(A,), and that A(A f ) — A(A,) if A, 6 A(A f ). 
Our interest is only in the sets A (A,) which have at least two 
elements. Ler \i (J ~ 1 ,■•■,£) be such distinct sets, where we 
assume that A / (/ = l.--,L + (< /.)) are the sets corresponding 
to the eigenvalues with nonnegative real parts {L and L + might be 
zero). We denole the elements of A/ by A/a (A 1 — L ■ • •. A"/). 

Next, for each A, £ A(4,), wc define 


T (A,): ~ 


»;/i 




(15) 


where v, denotes the geometric multiplicity of the eigenvalue A, 
and //j, (A 1 = 1 ,---,*/,) the corresponding linearly independent 
left eigenvectors. That is to say, all the linearly independent left 
eigenvectors of A t corresponding to the eigenvalue A, form the rows 
of T(A,). We further define 


r< Af): = 


f T(A/i) 1 


|jnA/ a, ) 


(16) 


for / = 1,■■■.£. That is to say, all the linearly independent left 
eigenvectors of ,4, corresponding to the eigenvalues in the set A; 
form the rows of T(A/). 

Now, we obtain the following theorem, which is merely a restate¬ 
ment of Theorem 2 of [7]. 
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Theorem A J' So is reachable (respectively, stabilizable) if and 
)nly if the following three conditions hold: 

a) (-4,, B, ) is reachable (respectively, stabilizable). 

b) A , does not have the nonzero eigenvalue 

j2kn/T (k = ±1, ±2, ■ ■ ■), (17) 


c) T(At)B t has full row rank for / = (respectively, 

i = 

Remark A. 1: The conditions for the observability and the de¬ 
tectability of So are given by the dual of the conditions a) and c) 
of the above theorem. (This is a restatement of Theorem 3 of [71.) 

Remark A2: Note that T( A/) never contains the left eigenvectors 
lor the zero eigenvalue of A , if the condition b) holds. 

Now, we give the proof of Theorem 2. 

Proof of Theorem 2: Without loss of generality, we assume that 


A, = 


A f 

0 


0 

z 



(18) 


where A, is nonsingular and all the eigenvalues of Z are zero. Then, 
applying the similarity transformation by the matrix 


I 0 —.4, 'D (l 
0 1 0 

0 0 T 


(19) 


to the pair ( 1 0), wc obtain the pair 


f\A, 0 0 

0 ZB,, 
\ [ I) O 0 


U+ A7 l /T)B n 


d i2 

I/T 



( 20 ) 


Applying Theorem A.l to the pair (10) for the conditions a) and 
b), and to the pair (20) for the condition c), and taking Remarks 
1 and A.2 into account, we can obtain the following necessary and 
sufficient condition for the reachability of S\\ 

A1) (A, * D f ) is reachable. 

A2) .4, does not have the eigenvalue -1 /T. 

B) .4, does not have the nonzero eigenvalue 


flktr/T (k = ±1, ±2, ■ ■ ■)■ (21) 

C) r (A /) (/ + A7 1 /T)B, i has full row rank for / = 1 

where A/. F(A/) and L are defined for .4, in a consistent way 
with the above definitions of Aj, T(A/) and L . 

Since the rows of F( A /) are the left eigenvectors of .4, by 
definition, under condition A2) the condition C) is equivalent to 


C') f (A/)Zf, i has full row rank for / = 1, ■, L. 


In view of the form of (18), the conditions B) and C’) arc equivalent 
to the conditions b) and c) of Theorem A.l. Since the condition A1) is 
the same as the condition a) of Theorem A.l, and since the condition 
(A2) is the same as the condition b) of Theorem 2, the proof has 
become complete. Q.E.D. 
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Abstract — The robust asymptotic tracking problem Is analyzed in this 
paper relative to unstructured perturbations on each coprime factor of the 
transfer functions from the control input to the measured and controlled 
outputs. In each case, necessary and sufficient conditions for the existence 
of solutions arc presented which are explicitly given in terms of problem 
data. Under such conditions, explicit parameterizations are given of all 
controllers which achieve robust asymptotic tracking, in terms of free, 
rational proper, and stable matrices. 

I. Nomenclature 

C Sel of complex numbers. 

R,, Set of real-rational and proper functions from C to 

<L\ 

S Ring of real-rational, stable, and proper functions 

from C to C. 

M (A') Sei of matrices with entries in V. 

GLCD{A, 73} Greatest left common divisor of A and B. 

A Column vector obtained by slacking the rows of .4. 

-4 B Kronecker product of A and B. 

RCP Right coprime. 

LCP Left coprime. 

ESP Externally skewprime. 
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11. Introduction 

The robust asymptotic tracking problem (RATP) can be roughly 
staled as that of finding a stabilizing controller which ensures 
asymptotic tracking of a given class of reference signals for any plant 
model P “sufficiently close” to a nominal model Pq. Early references 
on this problem include [1H41. The emergence of factorization 
methods led to fresh approaches to this problem, among which 
one could mention [5J-[7], In contrast with previous work which 
considered unity feedback systems, [8]—[10] addressed the RATP 
for plant-sensor systems and two-degree-of-freedom controllers. In 
particular, [101 considered perturbations in the plant model (and 
not in the sensor) and presented a parameterization of the set 
of solutions where the “free” parameter must satisfy a “skew” 
diophantine equation. 

The RATP for four-block multivariable linear systems (follow¬ 
ing fill) was considered in [12] and [13]. The former consid¬ 
ered the robust asymptotic tracking problem relative to perturba¬ 
tions only in the numerator of the transfer function relating the 
control a to the measured output ~ and, separately, relative to 
perturbations only in the denominator of the same transfer func¬ 
tion—essentially, the existence of solutions to the RATP relative 
to perturbations on each coprime factor is shown to be equiva¬ 
lent to the existence of a stabilizing controller which satisfies an 
internal model condition on its corresponding coprime factor. The 
latter considered the RATP relative to perturbations constrained by 
a special linear relation involving the numerators of the transfer 
functions from u to the controlled and measured outputs (y and 
respectively). 

The present note considers perturbed model classes obtained by 
allowing a single coprime factor of the transfer functions from u to // 
and - to vary while the others are held fixed, with the following aims: 
to give explicit necessary and sufficient conditions on problem data 
for the existence of solutions to the RATP relative to such classes; 
to give an explicit parameterization of all controllers which solve the 
RATP relative to such model perturbation classes. 

This note is organized as fqllows. In Section II, the RATP is pre¬ 
cisely formulated and the required background material is concisely 
described. In Sections III, IV, and V, the RATP is considered for 
perturbed model classes where, respectively, only the numerator of 
the transfer function from u to its denominator, and the numerator 
or denominator of the transfer function from w to y are allowed to 
vary. The Lemmas required in the main text can be found in the 
Appendix; proofs omitted here are to be found in |14]. Notation is 
essentially the same as in [7|. 


III. Problem Statement 
Consider linear closed-loop systems defined by 


V 

— p 

1/ 


Pi 

1 

U 


- 4 

(1 


Pi 

Pi_ 

(I 


A = C 


= [C, Ci\ 


(i s if + II'. ; = ^ -f r, 

where P and C are, respectively, the plant and controller transfer 
functions, P,. P,. C. / = 1, 2, are real-rational and proper matrices, 
«\ r denote disturbances, r the reference signal, u the control 
vanable, // the controlled output, and 4 the measurement output. The 
closed-loop transfer matrix from r to y is P\Q\ , where Q\ = (/ - 

or.r'c,. 


Let a class of reference signals be given by S, = {/* = 
vr 1 /'*: € A4(S) and is strictly proper), where i 1 , is a given 

biproper, square, and nonsingular matrix in A4(S) such that the zeroes 
of its determinant have nonnegative real parts. 

Given P and i,\. the so-called asymptotic tracking problem is to 
find a stabilizing controller C' such that the error r ~ r - y — 
(/ - PiQ[ H = (I - P\Q\ H r V, goes to zero as time goes to 
infinity for all strictly proper //, in ,V4(S) (hence, in particular, 
for all strictly proper //, in .Vf(S) such that // f and <\ are left 
coprime). By Lemma 5,7.6 [7|, this is achieved iff (/- P\ Q\ H f ' € 
M($). 

Accordingly, given To and i,\. the so-called RATP can be roughly 
stated as that of finding a controller that solves the asymptotic tracking 
problem for P and , for all P “sufficiently close" to TV Before 
introducing a precise definition for the RATP for Pq and i \. and 
since, in any case, it involves closed-loop stabilization, the results 
of the latLer problem which are required below are collected in the 
following theorem (see |11|). 

Theorem 1: If P £ .Vf(R,.) is stabilizable then: 

i) for P[ =■ JYi D\\Pi — A jD, 1 RCP factorizations, there 
exists M £ M(S) such that D> = D\ M. 

ii) The set of all stabilizing controllers is given by: 


Sr= \C=.(h'\> + Y) 1 [/? : l\f) 2 - A ]: IT /v'G.M(S). 

del {(h Y_» -+ 1 )( x)\ ^0} 


where X . V £ „Vf(S) are such that A'.Yj + YD* - /. P_> - 
Dj 1 A?, AA and D> LCP. The corresponding sel of closed- 

loop transfer matrices from r to (//. ;) is {[A’, r : A'/ ] 1 U: li £ 
.Vf(S)J. where Y, = Vi M. 

A natural way of considering plant models close to the nominal 
one (Pm. Pjo) would be to consider small perturbations on the 
RCP factors (A^n, /)jo). (A m« D m) ol Pm — A m. Pjq — 
Vju DJn . However, condition (I.l) of Theorem I implies that 

Pm — Am 771o* = (Am }f)D 2 o - U)D 2 q ■ 


As a result, one could consider perturbations on ( Yjo. D>q. Ym) as 
the general setup where the RATP would be precisely formulated. 
To this effect, define, for t > 0 


V(*. P)) = {(Pl A)€-Vf(R |f ) a : 

Pj = A j P J 1 . Pi = A'i D 2 1 . for some A’i . Yj. D* 

such that A’i, D 2 are RCP and ||[( A'a — Xw ) 1 . (D>-D> o) ; . ( AA — 

Am ) 7 ] 7 II 

The RATP for a given pair U’,, Pq) relative to a model class 
which contains (Pm, Pju) can now be precisely defined. 

Definition 1: Let a class of models V C be given where 

(Pm, Pin) € V. A controller C £ .Vf(R,,) is said to solve the RATP 
for U\, Pm. P 20 ) relative to V if and only if there exists fu > 0 
such that, for any (Pj. Pj) in VD V(n>. Pn),C stabilizes (Pi, P 2 ) 
and (/- JW,)rr‘ € .Vf(S). 

Consider a doubly toprime factorization (A'jo. Dm. A’ju, D> u . 
A'o. V 0 , A'u. Vo) of Pm (sec |7, p. 48|), i.e.. Pm = A 'mDJ tl [ = 
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nrjN'ia with 


-Ao 

Vo ' 

—A '20 A'o 


77ao 

7V a0 . 

Dao lo 



-A'q Vo 
D> o A 20 


— A r 20 A o 

Dm Yo 


The following proposition gives necessary and sufficient conditions 
lor a controller C to achieve robust asymptotic tracking. 

Proposition]: Let (t/>,, Pin, Pm) be given, An = Nmd; 
RCP, Pm = NioDio- Let V C .Vt(R,,) be such that (P l0 , Pm) £ 
V. A controller solves the RATP for (*'>,, Pm, P 20 ) relative to V 

iff C is given by C(R, K) = 1 (A')[R:AV• (A")], .\> • (A') = 

AT)-io — A'o. f)r(K) = KNm + Vo. for some /?. A' in ,Vf($) such 
that det {(A’.V* + I o)(x ) | ^ 0 and 

a) (/ — A'io 7?H’i 1 £ .VI(S): 

b) there exists ft, > 0 such that for any (7 J i, Pi ) £ V fl V( fo. Pu) 

(A.V, + .V,„(AY(A-)A.V, - f), (A’)AD a )] 

■(7 - .Yc( A'JAA’i + i?< (K)±Di)- \'r' e.Vf(S). 


Theorem 2; Let (i",. Pio. Pjo) be as above 

a) The set Pi of all controllers which solve the I; \! 

(V.. Pio. Pi o) relative to A" is nonempty ill ' , ln j 

ESP and and Dm are ESP. 

b) In this case. C\ is given by 

Pi = {(A'A '20 + 1 o) ’[R: KDm — .Yo]: (/?. h ) ( s, 

del {(A'A’ju f 1„)( v i| x i)[ 

where Si = {(77. A’): R = A7 - D,R K -- ) S X U - 
d^&ar, A) £ .vi(S) x ,vi(s>} : 

M, =GLCD{(iVioi /), {li V', 1 )}. ( Vio /)-- U, .Y in , 

(7 x tf') = M, 

/, Y and 77, are defined by / = 37,/. A'io-Y 4 ~ /. 

and V’i ^Yio = A\ D, 1 (an RCP factorization), the existence 
of /, AW, N, , and D, being a consequence of the externally 
skew-primeness of Yu, and i»,; 

Mw = {GCLD{ (D„. 7). (7 £)]„)}, (D,. I)=M S D V . 

(7 7>i„) = .U%/V -Y„. V N . /) v 


where 7?v, 1 = 1 /? are coprime factorizations, V, = 

Y,o 4 AA'i. A r j = A t 2I , 4 AAY P 2 = Djd 4 AD 2 . 

Remark: Note that condition a) corresponds to the requirement of 
nominal asymptotic tracking and that b) is obtained by taking the 
difference between the perturbed and the nominal ( 1 - P\ Q\ )i/\ _ 1 . 


are defined by An = M\ X o. Y \ 4 7> 2 „Y\ = 7, and 

DjoD^ — D^D' 21 ) which exist since D+ and Did are ESP. 
Proof: In the light of the derivation preceding Theorem 2, only 
the expression for S i needs a proof. For that, note that Lemma 3 
applied to (I) and to ,Y| 0 /?4 HY, - I gives the desired expression. 


IV. Numerator Perturbations on A 

In this section the RATP for U\ . 7%, Ao) is addressed relatively 
lo the model class A" in which Si can vary freely around Y 20 while 
Ai and D> are held fixed, i.e., 

A‘ = {(A r i«7 JjV. A’i D'J ): N> £ ,Vf(S). \i and A>,oRCP}. 

To this effect, note first that C(R. K) solves the RATP for 
U 1 , . Ao, Ao ) relative to A if and only if 7? satisfies condition 
a) and A 1 n A <■• (7v) A A -j (1 — A < (A ) A A 2 ) 1 1 /, * € * Vf (S) for 
all SS '2 such that ||AA T -j|U < f f° r some f > 0 (this follows 
from Proposition 1). The aforementioned condition is equivalent 
to .Y|d A(-(A 1 = A 1 <7(A Dio “ Ao)v n , 1 £ „Vf(S). due to 

Lemma 1, where y 1 , is the greatest invariant factor of r,, hence 
the greatest invariant factor of v, (because Ih,\ ] 77 are 

coprime factorizations). This condition, in turn, is equivalent to 
S((Iv) G .Vf(S), where .YinyY 1 = S^D~\S^ and 
RCP. Due to [7, Lemma 5.7.6], the aforementioned condition is 
equivalent to D~ 1 A7 - ( I \) G -Vf(S). i.e., 

there exists S<' G Mb such that - D^jFc + K'Vm - A’o. (1) 

Now, A Dm — D^N(' — Au implies A A 2 n77 2 u — AfAan = 
A"o*V 2 n and hence (since .Yu iVjo + YoDm = /) (A'A\> 0 + lo )Dm + 
D+(-N('X‘ 2 q) = /• Thus, Condition 1 implies that D^ and 77 2 g are 
ESP. Conversely, suppose that there exists (A. Y) G M{§) x ,Yf(5) 
such that D^X + YDu) — /. Then, ^yA A’o 4- 1 77- 2 oA’t, = 
A‘o. which is equivalent to D^(.YA’o) 4 (1”1 0)^20 = A’o (since 
77 20 A'o = I 0 D 00 ). Thus, if 77^ and D 2 o are ESP, (1) is satisfied by 
A’ = 1T„ and X< = -A'A'o- Finally, whenever D ^ and D 20 
are ESP, the set of A which satisfy Condition I is obtained by 
directly applying Lemma 3 to (1). The following theorem can then 
be stated. 


V. Denominator Perturbation on P> 

In this section, the RATP for (i«,, /V,, P 2 u) is addressed relative 
to the model class P in which D 2 can vary freely around 77 2( , while 
.Yj and A 2 are held fixed, i.e., 

r> i {(.V,-Y w 7), ', £ .VI(R^) x M(R,,):Di £ X(S). 

A 2 d and Di RCP}. 

To this effect, note first that f'(7?, K) solves the RATP for 
(i\, P \0 . Ao) relative lo D if and only if R satisfies condition 
a) and AY A (7i )AD 2 (7 4 77r(7v lAZW'r, 1 G M (S), for all 
XD 2 such that ||AZ7 2 ||^ < t for some f > 0 (this follows from 
Proposition 1). Due to Lemma I, the aforementioned condition is 
equivalent to AjolVl A )^, -1 G M[S). i.e., A\-I7“ I (A Y 2 o4lo) € 
.M(S), which is equivalent lo ZA 1 (A A 20 4 lo) G _Vf (S). i.e., 
there exists 7A G .M(S) such that -D^D< 4 7v Y 20 — -lo. This 
equation is formally identical to (1); thus, the arguments which led 
to Theorem 2 apply mutatis mutandis to it and lead to the following 
theorem. 

Theorem 3: Let (i,\ , P\ 0 , P 20 ) be as above. 

a) The set C 2 of all controllers which solve the RATP for 
(i/\. Ao, Ao) relative to P is nonempty iff A ’10 and r, are 
ESP and D^ and A '20 are ESP. 

b) In this case, C 2 is given hy 

Cl ={(KNrn + Vo, l \H':KDi n - -Y„]: ( II. h ) € .S',. 

del((A'.Y„, + Y»)(x)}/0> 

where Si = {(7?. I\): R = YJ— D, R. K = —1 i>Yn-t)*K . 
(/?. A ) £ .VI(S) x .VI(S)}. the expression for II is the same 
as in Theorem 2, 

Mn =GCLD[(7?^ I). (I A, r o)|. <7^ l) = Mi>D 

(7 1 .x! n ) = Mi,\M. 



no 
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with (B*., T?<t 0 ) 1X2?, To.Yp, and C) y - are defined by Y 0 = 
MpYo 

o • 

+ ^ 2 oF/j = / and ~ B^N-jo, 

■ for some Xp E .M(5).Nao E A4(S). □ 

Remark: If iVm and i/v are ESP and A\ 0 /, T tl } ! are RCP, 

the expression for i? in Theorems 2 and 3 does not require Kronccker 
products (see 114]). Analogously, if D^ I and A D.j 0 . respectively, 
/( j /V./ 0 . are RCP* no Kronecker products are needed in the expression 
for K in Theorem 2, respectively, Theorem 3. □ 

Remark: It is worth noting that since D<- - KX 20 4- Yo and 
Nr = KDi 0 - -Vo are LCP, there is no controller which solves (he 
RATP when free, independent perturbations on both and Dj are 
allowed while tfi is held fixed. 


VI. Perturbations on Dj , 3/, and N\ 

Perturbations are now considered which are defined on the basis 
of the coprime factors of Pi. To this effect, write P\ = X\D } 1 
and P 2 = N 2 Df [ RCP factorizations—due to stabilizability re¬ 
quirements (Theorem 1). Da = DiA/, so that Pi = .V t D> 2 \ 
Ni s -V, A/. 

The RATP for ( tyS, Pm, P 20 ) is now addressed relative to the 
model class Pi, in which Dj can vary freely around Dm while 
JV,, Aa. and M are held fixed, i.e., 

I>, = {(A'i 0 i?r '• A*o(/?i.Ub) _, )€ M(R„) x M(R„): 

£>i 6 M(S), (A’ 10 , D] )RCP,(A'^n. D,.U u )RCP}. 

Note first that, in this case, XX 1 = 0, A Yj = 0 and AD. = 
ADi37 0 ; thus, it follows from Proposition 1 (and [7, Lemma 5.7.6], 
that if C{ P, A ) solves the RATP for (1 \ . Pm. P 2 u ) relative to P 1 
then 

N\ubr(K)±D{M\) 

■ {I + Dr(K)±D t Mo)' V, -1 /? 6 ,M(S) 

•» \ n) D, (K)SD l (J + M n Dr(K)M) l )- 1 
• (A/o77)i/, 1 € .Vf(S). 

Writing (A/o /?) 1 = <.’ M, n (coprime factorizations), i( follows 
from Lemma l that the last condition is verified if and only if 

-N io6i'(A )y?, 1 = A u)( A A an 4- 3 o )*?, 1 E .V((S) 

(recall that undci" the nominal tracking condition (A/oi?) and i\ arc 
RCP, so that the greatest invariant factors of </’,n.v. and </\ are 
the same y?,). This is the same condition found in Section IV, so that 
Theorem 3 applies here with P replaced by Pi. 

Consider now perturbed model classes Pm in which M can vary 
freely around Mu while Aj. AV and D\ are held fixed, i.c., 

Pm = {(A'mDm . .Vjo(DmA/) -1 ) E -Vf(R 7 ,) x .Vf(R y ,): 

-UE.Vl(S), (A 20 . D, 0 A/)RCP}. 

C(P, A*) solves the RATP for (r, , Pio, Pju) relative to Vm iff A 
satisfies condition a) of Proposition 1 and 

(A'loA.U - .Vio3/ 0 Dr(A')PioAJl/)(/+ Dr(A‘)Dii,AA/r l 

■tV‘ € -Vf(S) for all A M 


such that ||DmAA/|| < f, for some f > 0 (this is due to 
Proposition 1). The last condition is equivalent, by Lemma 1, to 
-Vio(/ - 3/oDr(A’)D,u)y?r 1 6 .Vf(S). Let (A r „, D*) RCP be 
defined by Am^r 1 = A V 'D^ J ; then the condition above holds iff 

there exist A* and Z in .Vf(S) such that A/ () Dr ■( A)D U ) 

+D*Z = 7. (2) 

This, in turn, implies that D v ,, A/o LCP and D y -, Dm ESP, which by 
Lemma 2, are equivalent to the existence of LY. V) E A1(S) x 
,W(S) such that A/n.YDm 4- D y -V = /. Conversely, suppose 
that there exists such (.V. V): then M Q XDu\(I - A/oYoDm) 4- 
D^[V(/-AMoDn))] = I-MuYul)io or, equivalently, A/oADm- 
A/o A (DioA/olo) 4* DyjZo — l — A/oloDm. where Zo = V(/ — 
A/oVoDjo). Now, Dio3/()lo = DjoVj = / — loAao so thal 

A/o A Dm- A/o A* D, 0 + Mo X Yo X 20 D m + D y Zo = / - A/o Vo D 10 

or, equivalently, 

A/o (A Vo) A 20 Dm 4 - D y Zu =■ / — Afolo Dio, 


A/d[(A V))A i! o + V)]Dio + D^Zo = /. 

It has thus been established that **3(A\ Z) E .Vd(S) x .VI(S) such 
that A/o Pr(K )Dm 4- D,Z = / iff 3(A, V) E .W(S) x .Vt(S) 
such thal \I 0 XD \0 4- D y V — By Lemma 2, it then follows thal 
there exists such (h\ Z) if an only if Mu and D r arc LCP and D v 
and Dio are ESP. 

The following theorem can then be stated. 

Theorem 4: Let (r,. PmP*o) be as above. 

a) The set (\ of all controllers which solve the RATP for 
(r,. Pm, P 20 ) relative to Pm, is nonempty iff and Mu arc 
LCP and D^ and Dm are ESP. 

b) In this case. C\ is given by 

C\ = {(A A 20 + 3 0 ) 1 [P! A D*_»n — A 0 ]: (P, f\ ) £ S\. 

del ] (AWjo 4 Vo)( x )] ? 0} 

where Si = {(/?. A ) € .Vf(S) x .y«S): H = X7 - D,R. 
K = Y\tXo — D^\K for some (P. A ) E -V1(S) x ,VI($)}, the 
expression for P is the same as in Theorem 2, 

Mu = GLCD {{l\. /). (-U„ Dln.vfn)}. 

(D y - I ) — A / u D v 1. (A/o D10 AVo ) = A/u A20 

(which implies D v -, A'jo LCP), A’o,V\;, and D y j are defined by 
/—(A/o D/o )V 0 = M\jXu, D*\m 4- A 20 A m = /. 

and 

A t 2 oD^i = D v -i A 20 for some Ax/ E X(S). .V 20 E ,V1(S). 

Proof: The proof is exactly like the proof of Theorem 2 when 
and the equation “A D 20 - D^7V ( > f ; — AV’ are replaced, 
respectively, by Pm and (2), i.e., “A/o A'A^oDm 4 - D^Z =/- 

A/o) 0 D 10 ”. □ 

Remark: As in the last Remark in Section IV, there is no controller 
which solves the RATP when free independent perturbations on M 
and Di are allowed while A'j is held fixed. 
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Remark It can be shown that if D* and M 0 are LCP, then D v and 
[) u) are ESP if and only if D ^ and D^o are ESP (the last condition 
tlso appears in Theorem 2) 

Finally, perturbations on are considered with Di, and M 
neld hxed In this case, arbitrary perturbations on N\ lead to RATPS 
with no solutions and the most general perturbed model class tor 
which the RATP has a solution is given by 

V" (/?o) = {(JV,i>ro\ ^oDJu') V, = V 10 + S4 m ,. S € ,V<(S)J 

where i'' i o W 1 = M," 1 1 ,il, nRa = J?o t\ , for some J?o satisfy- 
mg condition a) of Proposition 1 (see [14]), where all controllers 
that solve the RATP for (i /,, Pio, Pj o) relative to V\(i?o) are 
explicitly parameterized An entirely similar situation occurs when 
perturbations on N] are considered with \ 2 and are held fixed 
(cf [14]) 

Vll Concluding Rfmarks 

In this note, the robust asymptotic tracking problem was addressed 
for model perturbation classes obtained by allowing one of the 
coprime factors of the tiansfer functions from u to i/ and to 
vary while the others are held fixed In each such a case, explicit, 
necessary, and sufficient conditions on the problem data are given 
for the existence of solutions, and, under these conditions, an explicit 
parameterization of all solutions to each of these problems has been 
obtained 

Appi ndix 

Lemma1 1 et 4 EC MJj i be matrices in A4($), t, non 

singular biproper, and let y be the greatest invariant facloi of i 
Then 4(/JA\ + C A/))(J + WA A + QSDy £ Vf(S) for 

any (A A A D) such that ||[AA 1 SI) 1 ] 1 \\^ * f for some f > 0 
if and only if 4 D^~ ] £ M( S) 4CV7 1 € .Vf($) If B C are LCP, 
the condition above is equivalent to l*r 1 £ W(§) 

Lemma 2 Let 4 B C be matrices in ,V4(§) Then there exist \ 
and 4 (matrices in A4($)) such that 4 XB 4- 4 ( — I if and only 
il t C are ESP and 13X arc RCP 

Lemma 3 Let 4, P,C D, and Q be matrices in >f(S) The 
equation 4 A ZJ+ 774 C = Q has a solution ( \, 4 ) £ A4(S)x,Vf($) 
if and only if M =GCLD{( 4 B l ). {D C J )} is a left divisor 
of Q (the column form of Q), i e , there is a S-matnx () such lhat 
Q = MQ In this case, writing (4 B r ) = M 4, (D C 1 ) — MC 
then 1 C are LCP and all pairs (V 4 ) in the sel defined by 
X = \Q + CK Y = yQ - 4 K,Vh £ M( S), aie solutions 
of 4 \ B + D4 ( = Q where \ , 4 and C are S-matnces such 

that 4 4 +cy =■ 1 4( = C 4 (which exist by the lcft-coprimeness 
of 4. C) Moieover, if M is nonsingular this is indeed the set of 
all solutions 

Rfpercnces 

[ 11 L J Davison “The output control of linear time invariant multivan 
able systems with unmeasurable arbitrary disturbances IEEE Trans 
Automat Conn , vol 17, pp 621-629 1972 

12] __ ‘The robust control of a servomechanism for a linear time 

invariant multivariable system ' It EE Trans Automat Conn vol 21, 
pp 25-44 1976 

14| W M Wonham Ltneai Multi)unable Conttol New York Springer 

1974 

14] B A Francis and W M Wonham, “The internal model principle for 
linear multivariable systems,” Appl Math Optimiz vol 2, pp 170-194 

1975 


[5| B A Francis and Vidyasagar “Algebraic and topoU. ,| ^ 

regulator problem foi lumped lineai systems in u i 
87-90, 1984 

(61 C A Desoerand A N Gundcs Algebraic desman* t K i i 
able leedback systems " in Pim IMSI V5 1985 

|71 M Vidyasagar C mtml System Svnthi w ( ambn i \\ Mi 
Press, 1985 

[8] T Sugie and T Yoshikawa ‘Gencial solution of robust i » w t 
lem in iwo degret of freedom control systems II / / / i lt „ \ , 

Conn vol 4] pp 552-554 1986 

|9) S Hara ‘Parametii/ation of stabilizing controller*- tor nmlti\ tmhl 
servo systems with two degrees of freedom bu ) t mn \ol ir in 
779 790 1986 

110] S Hara and T Sugie ‘Independent parameterization ui i\u> dtgui ol 
freedom compensator in geneial lobusi tracking sjsinns // / / Hwn 
Automat Conti vol 44 pp 59-67 1988 
|ll] C N Nett Algebiaic aspects of linear control system slahihi\ IIII 
Tians Automat Conti vol 44 pp 1169-1172 1986 
[12] P G Ferreira, ‘lour input four output feedback systems Robust as\mp 
totiL behavior ’ ITtl Tians Automat (onti vol 44 pp 1169 1172 
1988 

114] I Sugie and M Vidyasagar further results on the lobusi tracking 
problem in two degree of-freedom control systems S \st Conn Ltli 
vol 14 pp 101-108 1989 

[ 14] G O Correa and M A da Silveua On robust asymptotic tracking 
Perturbations on coprime factois and paramelnzalion ol all solutions’ 
Tech Rep 41/92 l NCC CNPq Rio dc Janeiro 1992 


A Version of Hautus' Test for Tandem 
Connection of Linear Systems 

Andrea Bacuotti and Gianmna Beccari 


Abstract —In Ihis nole, we consider systems which can be decomposed 
as a cascade connection ot linear subsystems. The second subsystem is 
assumed to have the controller canonical lorm. The main result is a 
version of Hautus 7 test, which allows us to check controllability and 
stabiliaubllity properties ol these systems by means «r reduced-order rank 
computations. 

I Introduction 

The study of structural properties of interconnected systems is a 
classical subject (see, for instance, |2] |4] [6| [II |) which has been 
recently revitalized ([I], [8|, [10]) In this note we are interested in 
autonomous linear control systems of the form 

( j 

U = y +r.« 

wheie iy £ R 1 £ W denoie the stale variables and u £ R' is the 
input (m < (j) They can be considered as the tandem (or cascade) 
connection ot the two subsystems 

l/ = A,/ + ^ (la) 

(with input i £ R 1 ) and 

= A. + Gu Ub) 

Manuscript received June 24 1994 revised December 8 1991 and Maich 
18 1994 
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the connection being established setting i> = z. 

In the sequel, we denote by R" ^ (respectively, C u *') the space 
of real (respectively, complex) matrices with u rows fi|nd /< Columns. 
Accordingly, A E R' J D ER 1 ' ^F E H* 4 ' 1 . and G E R q 
It is well known that if (1) is controllable [stabilizable] then (la) 
and (lb) arc separately controllable [stabilizable] (see [4)). However, 
the converse is false in general (see again [4] and the simple example 
in Section II). Thus, it is natural to ask what kind of condition should 
be imposed on (la) and (lb) in order to guarantee controllability 
[stabilizability] of the overall system (1). 

It is easy to see that if </ = m and C is not singular, then 
(1) is controllable [stabilizable] if and only if (la) is controllable 
[stabilizable]. However, this assumption is too conservative to be 
interesting. From a more general point of view, the problem is 
addressed in [6], [2| and especially in [4] using a state-space 
description similar to (1), and in [11], [3| using differential operator 
representation and coprime factorization. We note that the necessary 
and sufficient conditions obtained m |4| require some restrictions 
about the Jordan canonical form of matrices ,4 and F, For instance, 
it is assumed that there is only one Jordan block for each possible 
common eigenvalue of .4 and F. 

In this note, we assume that 


F = diag(Ci,- 

■. c,„ 

). 

G = diag (»/i, • ■ ■. <h „) 

where for each i 

= 1,- 

■ •, m 





/« 

1 

0 

... ih 



0 

0 

1 

... () 


C, = 



... 


€ R*' *' 


0 

0 

0 

... i 



Vo 

0 

0 

■ - ■ 0 / 



and 


<h - 


/"\ 

n 

W 


e R * 1 ' 


(3) 


(A'i + ■ • + Am, = (j). In other words, we assume that (lb) is in 
Brunowski's canonical form (see [9, p. I39|). From a practical point 
of view, we can think of (1), (2), and (3) as a plant represented by the 
linear system (la) and coupled with w parallel chains of integrators 
of (possibly) different lengths. 

Chains of integrators are easy to implement. A typical situation 
occurs when (la) is given and one has to design a dynamical 
compensator. Usually, in these cases only the first component of 
the slate vector : affects directly the first subsystem. Moreover, the 
form (I), (2), (3) incorporates systems whose behavior depends on 
the derivatives of the input map. These possible applications will be 
shortly discussed in Section IV. 

When (lb) is in Brunowski’s canonical form, then F has zero as 
a unique eigenvalue. Since m > 1 and zero is allowed to be an 
eigenvalue of ,4 as well, our result applies also to cases not covered 
by 14). Finally, we note that if (lb) is controllable, then the structure 
(2) and (3) can be recovered up to a linear change of coordinates 


Of course, we can think of system (1), (2), (3) as a whole system. 
To this purpose, we set n = p + q. s = (y< -) 

M 3 ?)■ 4 -( 2 ) 

where we denote by 0 some matrices with zero entries. Accordingly, 
its properties can be investigated with the aid of some classical 
criterion. For instance, Hautus 1 test (see 19, p. 93, 153]) states that 
the system is controllable if and only if 

rank [.4 — A/ fl . D] = n (6) 

for each A E C, and stabilizable if and only if (6) holds for each 
A E Cf = {: 6 f:Rc: > ()}. Note that the matrix in (6) has n 
rows and n + hi columns. 

However, as remarked in [4|, with this approach, it is difficult to 
state the conditions in terms of the subsystems. Moreover, disregard¬ 
ing the structure, there is no gain in the process insight. 

The main contribution of this note is a version of Hautus’ test 
which, taking into account the form of the system, allows us to check 
the controllability and stabilizabilily properties by computing the rank 
of a reduced-order matrix, namely a matrix with p rows and p 4- w 
columns. The result is stated in Section II and proved in Section III. 
Additional comments and remarks arc finally provided in Section IV. 

II. Thl Result 

As already mentioned, if (la) and (lb) are both controllable 
[stabilizable], (1) is not necessarily controllable |stabilizable). 
Example: Consider the following system with p = \.q = 2: 

( A - </ + -j - ^ 

-w 

Both subsystems are controllable, but the overall system is not 
even stabilizable. 

In the following theorem a special role is played by the columns 
of D. We introduce some notation. Let us write B = {B !,■■■. B , u ). 
where for each i = 1. ■ ■ •, m 

D, = [h, fr. *.] e R 1 ' \ 

Moreover, let 

/i, (A) = b , i j ■+■ Xb, > 4- A'7>, * 4- • - ■ 4- A^' 1 b, *, 


and 


H(\) = [/ii(A), ■ ■ ■. 7i rll (A)] E (L' r 


Now, the main result of this note can be staled. 

Theorem: Let .4, B be given by (5), (2), (3), and let A E <L\ Then, 
rank [.4 - Af„, XJ] = n = p 4- q. i.e., maximal, if and only if 

rank [.4 - A T r . if (A)] = p. (7) 



(where T E W f/ ). a preliminary linear feedback transformation and 
the elimination of possible redundant input channels (19)), Thus, the 
assumption that (lb) is in Brunowski's canonical form turns out to be 
restrictive only when we are interested in the stabilizabilily problem. 


i.e., maximal. 

It follows that System (1), (2), (3) is controllable [stabilizable| if 
and only if (7) holds for each A E C |for each A E C t ‘J"]. As explained 
in the Introduction, we have so obtained a criterion which allows us 
to investigate the controllability and stabilizability properties of the 
overall system by means of reduced-order matrix rank computations. 
This is illustrated by the following example. 
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Example: Let us consider a system of the form (1), (2), (3) with 
■<-(4 ?) - B '(] 4 J) 


,vhere k is a real parameter 

. Let 

w 

= 1, so that 



( 3 

3 

1 

k 

!\ 


f°\ 


4 

T 

2 

4 

0 


0 

A = 

0 

0 

0 

1 

0 

and D — 

0 


0 

0 

0 

0 

1 


0 


\0 

0 

0 

0 

0/ 


\\I 


In order to apply Haunts’ test to the pair (A. D). we should 
compute the rank of (A - A Jr,, D) for each eigenvalue A of A 
(actually, it is not difficult to see that we can limit ourselves to the 
eigenvalues of ,4). The eigenvalues of A are. in this case, A = 1 and 
A = 9). Let us take for instance A = 1 . The matrix to be considered is 
fl 3 1 k 1 0\ 

4 C 2 4 0 0 

0 0 -1 1 0 0. 

0 0 0 -1 1 0 

\() 0 0 0 -1 1/ 

Clearly, the rank of this matrix is maximal if and only if the 4 x 1 
matrix obtained by deleting the first column, the last column, and the 
last row is nonsingular. The problem is so reduced to the (borrowing) 
computation of the determinant of a fourth-order matrix. 

Using our criterion, one is led to consider the simpler matrix 

(.4-/.»„+*,+M-(:| i 2 +‘). 

It is immediate to see that its rank is maximal if and only if A ^ 1. 

111. The Proof 

Lei V = [A - A. D]. In order to compute its rank, tl is 
convenient to multiply V on the right by an invertible matrix Q. 
The construction of Q will be accomplished in three steps. 

In what follows, we denote by ()„ ,, the matrix with /' rows, p 
columns, and zero entries (to simplify the notation, the subscripts 
v . ii will be suppressed whenever the matrix dimensions are clear 
from the context). 

Step I: It is convenient to look at V as a block-matrix of the form 
[-Vo, Mi. ■■■. V,„. .Vj.---.Xu], 


Step 2: Let us multiple MQ l on the right by anoihei nv, Ul » i 
matrix Q 2 . In order to describe Q 1 , it is convenient toiinW, oi ' < 
as a block-matrix in a different way 



where 

A' = A- A/,,. 

= and D; = [fl (( 0 Jll |tr*‘ t, t 

y = diag(c < ;.---.c’;„) 


and 





/-A 

1 0 

... |) 

0 \ 

a = \c 

. - A/|L r , <J, 

= 

0 

-A 1 

... 0 

0 




U 

0 n 

... _,\ 

1 / 

6 C l 

i A , 4- 1 






The matrix Q 2 has the form 







-(! 

") 



where A = 

diag (A i, • • • 

■ A„ 

) € 

(p'Z { w r/4 >n 

and 



1 0 


0 

■ ()\ 




A 1 


0 

... () 



A, = 

A 2 A 


i 

... i) 

6C mi 

U-H 


K\ kl A*' 

A 

k, 

::: vJ 



As a consequence 







MQ'Q 


D’A) 

:<> F'Aj 

1 


where 







I! 

Cl 

9 ■ ‘ ■ 

.o', 

Aim] = [B'l 

, • • ■. /»"]. 


n 


// 

J 1 ' 


i) 


, 


where 




D, \ 


O,. 1 \ 

<>h , 


M. = 

C.-Xh, 

6 C" *' and .V, = 


6 <T 


(hi *, ) 


Ul . ) 



(/ = 1, ■ • ■, m ). We perform a reordering of the blocks of V, by 
multiplying on the right by a suitable invertible matrix Q x . It is 
convenient to define Q l as a block matrix. More precisely, we set 
Q l = (Q, 1 j ML j = 0, 1, ■ ■ - ,2m) where 

Qo O = I,: 

Ql. 2 ,-i=h, and + ■_>, = 1 for r. * = 1. • • •. m 

Ql j - () otherwise. 

As a result, we obtain MQ X = [Wo, M \. -Vi. ■ • •. M,„. A’,,,]- 


h" , = b , t , + Xh, , +i 4- X 2 b, ,+2 + " 

• + A A '-‘/», 

and 


F'A = diag ■/„, -l<>k, 

]6C‘ 


The result of this second transformation is that the dependence on 
A has been eliminated in the second row of (R). 

Step J: Finally, let us rewrite MQ ] Q 2 as a block matrix with 
one row 

MQ l Q* = [Vo. Hu JvX ■■■.#,„. AX] 

with 



h, (A) — b, i -f A h, -2 + A 2 6 ( , ,j + - ■ - 4- A h, * 1 

and 
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The purpose of the third transformation is to provide a further 
reordering of the columns of MQ l Q 2 . Setting « (y; J ,)(/'♦ j = 
0, where 


Oo.o*/p 


# 2 ,-i.i-l, and C?L. ■+ m = h, for r, a = 1 , ■■■,/»!. 

Q * j = O otherwise, 

we obtain MQ l Q 2 Q :i = [ A/ 0 , H ,, ■ ■ ■, Jf (>l , A\, ■■■. A\„]. 

We are now ready to get the conclusion. Lei Q = Q'Q'*Q A . ll 
is clear that 


{A-\h,B]Q=(^ 0 ^" 


H(\) 

in 


U A) \ 

L J 


As already recalled in the Introduction, there are other ways to 
study system (1). For instance, in [4] a change of coordinates 

CMS ")(0 

(more general than (4)) is used in order to put .4 and F in Jordan 
canonical form. From a numerical point of view, the difficulties 
encountered in obtaining Brunowski’s canonical form are comparable 
to those encountered in obtaining Jordan form. Thus, the approach 
followed in the present note should be considered as an alternative 
to the approach of [4). 

Sometimes, systems involving an input map and its derivatives are 
considered in the control theory literature (see, for instance, 15], |7|). 
These systems have the form 

ij = Ay 4 An/’ -1- 7J| /• 4 • ■ ■ 4- (10) 

where v E R IM and B, E C v "'(/ = 0. ■■•,/). Setting 


where L{\) is some matrix which we need not specify. As an 
immediate consequence, we see that 

rank [A - XI ,,, /J] = rank [.4 - A B](J = u 

if and only if rank[-4 - A/,,, /7(A)] = p. 

We note that in the case m = 1, we have Q 1 = Q 1 — /„. In other 
words, if w = 1 no reordering is needed. 

IV. Final Remarks 
Condition (7) implies, in particular, that 


= i>, - 2 = r, • * •, ;i, i -= r (/l . and r (/+,, = u 
(10) can be equivalently rewritten as 

r y - Ay + Boii 4-f Bm+ j 

(note that here -,+ i E R ,H for each / = 0, ■ -./). Up to a reordering 
of the variables, (10) appears to be a particular case of (1), (2), (3) and 
its controllability properties can be tested by means ol the Theorem 
of Section II. It turns out that (10) is controllable |stabili/able| il 
and only if 


rank [A - A/,,. B] = p 


rank [A - XI,.. flu + A «, + • ■ • + A'fl,] = 


and so we recover the well-known fact that controllability [stabiliz- 
ability) of (la) is necessary for controllability [stabilizability] of (1). 
More precisely, let us define the deficiency of -4 relative to a set 
5 C C as 

<7(.4, S) = p — min rank [A - A J t ,] m 

According to our theorem, a necessary condition for controllability 
Isiahilizability] of (l) is that 

d{A* C) < rn. revSpectively t/(.4. C r J ) < in. 

This conclusion is not surprising. Indeed, if we want to control (la) 
and the control inputs are implemented through (lb), then it should 
be actually possible to control (la) by means of ni < q independent 
inputs. 

Sometimes, systems of the form 

(i) = Ay + hi ! 



where h E U 1 ’ and :i « E R, have been considered in the 

literature ([8J), They represent a situation where the second subsystem 
is a chain of integrators with output -i, and the first subsystem 
depends only on ci. System (9) is a particular case of System (I), 
(2), (3), with in = 1. Hence, it follows from our theorem that (9) is 
controllable [stabilizable] if and only if the pair (.4, b) is controllable 
(stabilizable). 


for each A E <T [for each A E C ( ||. 

A different criterion for controllability ot (10) can be lound in |5|. 
It is actually an extension of well-known Kalman's controllability 
matrix rank condition. 
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Poisson LQR Design for 
Asynchronous Muliirate Controllers 

Robert P. Leland 

Abstract —We derive the optimal LQR controller for asynchronous 
muliirate digital control. Independent processors operating without syn¬ 
chronization update different components of the control vector at Poisson 
arrival times. An Iterative algorithm to compute the steady-state control 
law and optimal cost is described and its convergence is demonstrated. 

I. Introduction 

For complex systems, such as aircraft, it can be desirable to 
distribute the control task among several processors, rather than 
using a single central computer. If these processors are not triggered 
by a common clock pulse, and their computation, sampling, and 
hold activities are not synchronized, we call them asynchronous 
controllers. In addition, these processors need not operate with the 
same sampling rate. Muliirate sampling in control systems has been 
of interest since the 1950s. The sampling rates of the controllers 
arc typically assumed to be integrally proportional, and sampling is 
synchronized to make the sampling process periodic, with a period 
equal to an integral multiple of the largest sampling period ([1], 
|2], [5], [7] and others). We will allow arbitrary sampling rates, and 
random sampling limes for each processor, so sampling is no longer 
periodic. 

We wish to control the continuous-time system 

.»'(/) = A.r(f) + 0ii(/) (I) 

to minimize the cost functional 

J = E I [ Qj{t ). ,r(0] + [flf/(0, »(0]'/L Q > O.i? > 0. 

Jo 

( 2 ) 

We partition the control vector u(t) € 7?"' into n < m (possibly 
vector) components: 


and assign an independent processor to each of the components 
*/,(/). Processor i updates o,(t) at discrete times, with u,{t) constant 
between updates. The processors operate asynchronously, hence the 
update times of the processors are not coordinated in any way. We 
also assume the update times of processor i occur randomly as the 
arrival times of a Poisson process X,{t). where X\ (t) * ■ ■ A\,(f) 
are independent Poisson processes with arrival rates Ai ■ ■ ■ A,,. The 
output of processor / is described by the stochastic differential 
equation 

r/i/,(f) = ^Vi(0(t\(0-«*(f)). (4) 

If r is an arrival time of .Y # (f), r,(r) is the control update computed 
by processor /. 

We consider such asynchronous LQR controllers, assuming each 
processor has access to the entire state and all control signals at the 
time the control signal is calculated. 

Manuscript received February 22, 1993; revised January 12, 1994. 
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The Poisson process approach, although allowing .uldiiinii.il m,,,, 
tainty in sampling times, removes the need to syikhioni', vmip'nu. 
and permits multirate sampling with sampling rak* lKir 

integral multiples. 

Poisson processes have appeared in control problem m .o\jul 
contexts. A large body of literature exists on the toniml ni jump 
parameter linear systems, in which the system parameters v k-nge at 
random times, e.g., [6], |12J, and 1151. The LQR problem !w such 
systems was solved in 18]. 113], and [17], and the LQCi coniiollct 
is derived in [6], 

Stability and control of systems effected by Poisson noise are 
treated in [9| and |J7]. Malliavin calculus for systems driven by 
Poisson processes is described in [3]. 

In the asynchronous control problem, jumps occur in the stale, 
rather than in the parameters. The LQR controllei for systems with 
jumps in the state due to compound Poisson noise was derived 
in [17], An LQR controller for systems with jumps in the state 
due to switching between linearized system models was derived in 
[14], Existence of the optimal control and continuity ol the optimal 
cost for diffusion processes with jumps were shown in 111 J. In 
each case the Poisson processes represented noise, rather than a 
control update, and the control signal could be updated in continuous 
lime. 

Optimal impulsive control, where control decisions are only made 
at discrete Poisson arrival times, is treated in [4| for a class of 
problems in operations research, and existence of the optimal control 
is demonstrated using likelihood ratios. In f 101, the existence of an 
optimal impulsive control for Markov processes is shown, and the 
cost-to-go, or value function, is shown to he a function ol the current 
state and control values. 

In this note, wc formulate and solve the LQR problem for asyn¬ 
chronous multirate controllers using stochastic differential equations. 
We assume the control signal, or pan of it, can only be up¬ 
dated at Poisson arrival times, as in the impulse control problem, 
treated in the general case in |4] and [ 101- We consider the spe¬ 
cific case of the LQR problem, and obtain explicit expressions 
for the optimal control and cost. Our problem also differs slightly 
from that of [10], in that the control task is distributed among 
several asynchronous processors, and only part o! the control vec¬ 
tor can be updated at a single arrival time. When a steady-state 
controller exists, we describe an iterative algorithm to compute 
the control gain, and demonstrate its convergence. We compute 
I he control gain for a system of three masses with springs and 
dampers, and compare the cost to that for a continuous time LQR 
controller. 

II. System Mopei 

The controller described by (4) can be rewritten in vector form as 

= ,lS(t)(r(t)~ u{t)) ( 5 ) 

where r(t) = [i'i(f)* ■ • ■ r„(f)*]* is a vector of potential control 
signal updates 

7, is the m x m projection matrix that projects the control input 
space B 1 " onto the subspace of B"' updated by processor /'. Thus. 
I,u(t) = [0 • ■ ■ 0m,(M* 0- ■ ■ 0)*, where </,(/) is Ihe (possibly vcclor) 
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Mrt of n(t) updated by processor i. As an example, if m = 7 and 
i - 3, 
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'0 

0 o' 


'0 

0 

0‘ 

h = 

0 

I 0 

. h = 

0 

0 

0 


0 

0 0 


0 

0 

I 


A necessary condition for optimality is given by the following 
Hamilton-Jacobi-Bellman equation: 

0 = (A. t) + min{£„V(A. t) + [CA, A']} 

at i- 

where £ v is the generator for the stochastic differential equation (8): 

CC(A-, = VG(A>XV + |;a.{g(.V-[J J]x 

+ ° ")- GI VI } 


The stochastic differential equation (5) is understood as the limit of where wc denote 
u(t + h) - u(t) = ( N(t + h ) - N{t)){v{t) - u(t)) 


A = 


A = £a,/„ 


A (t) = 


At) 

«(f) 


dX(1) = 


A dt D dt 
0 -<IX(t) 


A(f) + 


0 

JN(t) 


r(t). 


HP) = Y. ! ' PT ‘- 


'A 13 
0 0 


as h —* 0. Thus, each arrival of N t ( t) causes an update of component 
«,(#), where n,(t) is set to v;(t) % newly calculated by processor /, 
it u(t) is replaced by u(t) + /,(<'(/) - u(t)). 

We define the Poisson rate matrix A as 


( 6 ) 


We define an augmented state vector for this system by adding 
“hold” states 


We assume a solution of the form 

V(X.t) = [M(T~t)X. A']. 
Let 

TA/. A/a 
|MS A/a. 

Then £ V [M A', A] can be expressed as 
£,\MX. X] = [(AM 4- A m M)X. X] 


(7) 


Then the entire system can be described by the following srochastic 
differential equation: 


( 8 ) 


HiK v 
Hi: 

{ 


0 0 

0 I, 


x + 

(MA + XM) - M 


;:]")■ 
)] - [l/A, .V]} 


0 0 
0 A 


Since X(t) is a Markov process, the control updates = 

1 • ■ ■II, will only depend on A’(f). and not on the time since the 
last update. 

Wc define a matrix function fi(-) on in x in square matrices as 


0 0 ‘ 


0 

0 

1 .. ,-l 

0 A 

M + 

0 




(9) 


+ [A[A/; ( \Ia - A( j|/ ; » ))]A\ r] 

+ [!■- A[A/; (A/,, -MA/n))].V] 

+ [AA(A/ : ,)r. r]. 

This is minimized by a choice of 

V = (A/, - .M.'/:.))]A 


The function keeps only the block diagonal terms of matrix P.b(P) 
can be computed easily using an element-by-element multiplication 
by a matrix of ones and zeros. A matrix P is suitably block diagonal 
if b(P) = P. The following properties hold for such suitably block 
diagonal matrices: i) I,P = PI,, ii) P~ l = £(P~ ] ) if P _1 exists, 
iii) A P = PA, iv) If P > 0, then A(P) > 0. 


HI. Optimal Poisson LQR Controller 


We derive an optimal Poisson LQR controller using dynamic 
programming. We assume up-to-date analog measurements of the 
entire state vector and all control signals are available to each 
controller at that controller’s update times. In the augmented state- 
space formulation, the cost function (2) may be rewritten as 


.7 = 


E f r [QX(t). X(t)]dt 
Jo 



0 

/? 


To apply dynamic programming, let the cost-to-go be 


V*(A\ t) = E 


j' [QX(t), X(T)](lr\X[t) = X 


which is the optimal linear feedback control law. Then 
min C,.[MX, X] = HMA + A"M)X. .V] 

r 

A<S(;\/:,r'[0 I]MX. X . 

Thus, the Hamilton-Jacobi-Bellman equation is satisfied if \J{T — t) 
satisfies 

0= 0-AI(T - f) + A*M{T - t) + M(T - t)A + Q 
at 

~M(T - t)B\HAh[T- t)r‘lS\M{T - t ) 
with endpoint condition A/(0) = 0. where 



The optimal control is 

«'(/) = -HMAT-t)r'[MAT-ty (A/a (T - t) 

-KMAT -t)))]X(t) (10) 

and the optimal cost is 

V(A'(0), 0) = [M(T)X(0l A”(0)]. (11) 

Note that v(t) depends on both ;r(f) and w(f), however r,(f). the 
potential update for controller i depends only on the stale .r(t) and 
the other control signals in (t)* k £ i. 
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IV. Steady-State Controller 

Ii is difficult to determine the existence of a steady-state controller 
>ecause the system contains a random matrix multiplied by the state. 
' (owever, given a solution to the steady-state Riccati equation, we 
uan show the existence of a steady-state control. 

Theorem 4.1: Suppose that the steady-state Riccati equation 
(SSRE) 

0 = A*Af + MA + C - MBAH Ahr'B* At (12) 

has a nonnegative solution At'. Then as T —» -x.AJ(T) converges 
monotonically upward to a limit A/", where A/" satisfies the SSRE 
(12), and At" < A/'. 

Proof: M(T) is monotonically nondecreasing in 7\ since 
[A/(r).Y(0),-Y(0)] is the optimal cost for the T time-horizon 
problem. The minimal cost for the cost functional 

./' = E [QW), X(t)\dt + [M'X(T). X(T)\ 

is J' = [A/'A(0). A'(0)], since A/' is a constant solution to the 
Riccati equation with initial condition A/((J) = A/'. Since J 1 > 
we must have A/' > M(T) for all T, Since A/(T) is bounded 
above and nondecrcasing. M(T) converges to a matrix A/". and 
At" < A/'. □ 

By iteratively solving a sequence of steady-stale Riccati equations, 
wc can calculate matrix At" whenever it exists. 

Theorem 4.2: Suppose that (A< D) is controllable, {Q. A) is 
observable, and /? > 0 is suitably block diagonal i? = h(P). 
Suppose also that there exists a nonnegative solution A/' to the 
SSRE (12). Then the steady-stale solution to the Riccati equation 
M" =lim 7 - ^ M(T) can be calculated by iteratively solving the 
Riccati equation 

'tf\U (,, + " (13) 

with M\ n ) = A" 1 II, and the sequence .A/*'' 1 converges to M" 
as n —> tc. 

Proof: Denote by M = ${P) the solution to the Riccati 
equation 


Then Af {l) = 4*(A l /?), and AA/ = \P {) - \/ u 1 t q, 

0 = {A-BAB*)*SA1 + SAf[A . 

- aa / ba / t 1 \L?AM 



Equation (15) has unique nonnegative definite solution A \l mu, t 
(A - BAB*, B) is controllable (implied by ( 4. B) controllable 
and (Oo. A - BAB*) is detectable (implied by ((f 1 ) observable) 
Hence A/< n > A/ (n) and h(Af\ n ) > HAt\ l "). 

Performing the next iteration yields: M (l) = 4*(< s n/'/')) > 
♦W-Uj 01 )) = M {l K and HU\ 2) ) > By induction both 

A/ ( '° and h(At\ n] ) are nondecreasing sequences. 

To show convergence, we must show that A/ (M| is uniformly 
bounded above. By Theorem 4.1, AJ(T) -♦ M where \[" < M' 
is a solution to the SSRE. Our desired solution can then be written 
as A I" = 4>(*(A/")). 

We claim that At" > A* 1 JR. If ./■(()) = 1), the optimal cost tor the 
infinite tinie-hori/on problem is [A/7 if(0), </(())]. The cost accrued 
prior to the first update of each processor is 

[A -1 /?!/(()), «(())] = j^K'lTU, i/(0). / ( n(0)] 

i- i 

where A' 1 is the expected value of the first update lime of processor 
i. Hence M" > A -1 /? since Al" describes the cost for a longer 
time interval. Hence also, A( A/| (>) ) ~ A ~ l R <s h{Af"). Therefore. 

< HU") for all n. since h(Af^) < and 4^ is 

a nondecreasing function. Hence M {,,) < A l" for al) i». and A/ (n) 
converges. Denote At 1 " =lim„ . x A/ (m) . Then At" 1 < M". Tlie op¬ 
timal cost using a controller based on At" is [ W"\Y(0). .Y(())]. But 
At'" > Al”. since At" yields the minimal cost [A/".Y((I), A(())]. 
Hence, converges, and the limit is hm M - ^ 1 l/ t " , = A/". □ 

We need only iteratively solve for h(Af\ n) ). which reduces the 
number of parameters that need to converge. Let P be the solution 
lo the steady-state Riccati equation 

O-A'P+PA + Q-rnN 'bt. 

A better initial value for h[AI\) can be found as follows. Take 


0 = A* M + MA+Q- At BAP" 1 B* At (14) 

for any positive definite suitably block diagonal w x m matrix P = 
h(P) > 0. Also, for the solution A/, denote />(A/;i) = 4M-P). For 
every P > 0, (14) has a unique positive definite solution At = 4>(F > ), 
since (A. B) is controllable (implied by ( A , D) controllable), and 
(C, A) is observable (implied by (Q, A) observable and P > 0), 
Our iterative algorithm is A/ (,l+1) = $(fl(A/ , , ,,) )). 

If P > P\ then by [16, Theorem H, $(P) > $(P'). and hence 
$*(P) > 4»i( P 1 ). Thus, $ and are both nondecreasing functions. 

We wish to show the sequence A/ (,,) is nondecrcasing if we take 
as our initial condition the following: 


Mi = P + (A'P+ 

\r, = Mi D.\ ' + 4 * 3/i B \ 2 

hm,) - \~'hh+ uin + nwh). 

The compulations can be accelerated by using an intermediate update 
for b{AJ\). Let 

HM\"" /2] ) = A '/•[/? - {M\"' - A( Wl"'))A HM \"')~ 1 

- *()r\" > )) + M!,"" B + IV 

Then M 1 " +1 * i.s obtained by solving the algebraic Riccati equation 




IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. I, JANUARY 1995 


IIS 


V. EXAMPLE: THREE MASSES WITH SPRINGS AND DAMPERS 
A system of three masses and a fixed object linked by springs and 
dampers is given by 

i/i = 2(j/2 - fj \) 4- 10(.i/2 “ y \) + m 
10 j /2 = 2(,(/i — ijt) + 10(i/i - in) + -5 (in — ih) + (j/:i — yi) + 

2j/;i = Mlh - J/.|) + ( H '2 ~ it. i) - 2 ih - Sy, + u* 

where y t is the displacement of mass /. and the control input 
n, is a force on mass i. The augmented state vector is .V = 
[j/i* 2/i , 1 / 2 , y -2 - //a. //a, ii 2 < Hi]*- The control signals i/ l , i/o. <ii 
are assigned to three asynchronous processors with rates A| = 
15. s= 5, -Vi — 3. The optimal steady-state Poisson LQR 
controller for Q = /,/? = / is r{t) = -KX(t) is found in the 
equation at the bottom of the previous page. The diagonal terms in 
the part of K that is multiplied by u(t) are zero, hence the update 
value r,(/) does not depend on u t (t). but may depend on other 
components of u(t). 

If m( 0) = I), the cost is given by [M\. r(0), .»(())]. and Tr.M i 
is a good figure of merit for our design. For the continuous time 
LQR controller, a good figure of merit is 7>./\ where P is the 
solution to the continuous time steady-state Riccati equation. For our 
system, Tr.P — 30.88 and Tv.Mi = 37.Cl, an increase of less 
than 5%. The cost is increased due to sampling and uncertainty in 
the sampling times. 

VI. Conclusion 

We derived the optimal control for a Poisson LQR asynchronous 
multiralc controller. The optimal control for the /th processor in¬ 
volved feedback of both the state and control signals from other 
processors. The optimal control gain can be found by an iterative 
algorithm. 
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On Solving Diophantine Equations 
by Real Matrix Manipulation 

Manabu Yamada, Piao Chung Zun. and Yasuyuki Funahashi 


Abstract — This note presents simple algorithms for obtaining the 
solutions of the Diophantine equation. Our methods can produce classes 
or all solutions with lower degree than a specified number. The previous 
algorithms involve some troublesome computations, e.g., the calculation 
of both the controllability indexes and the observability indexes or the 
solution of a pole assignment problem, etc. . Our contribution is that 
our algorithm requires only basic matrix operations such as addition, 
subtraction, multiplication, and inversion of given real matrices. In 
addition, by solving simple linear equations, the class or all minimum 
degree solutions can be given. Therefore the computational efforts arc- 
reduced compared with previous algorithms. 

I. Introduction 

Consider the following problem: given three polynomial matrices, 
D[») € e and L(s) e Rfind 

polynomial matrices. A’(*) £ 1 and V(s) £ R[^]'" x, . such 

that 

D(s)X(s)+X(s)Y(s) = L(s) ( 1 ) 

where /2 [.n]'' x '“ denotes a set of p x m polynomial matrices with real 
coefficients. We assume that D(-s) and X(s) are left coprime and, 
without loss of generality, D[s) is row-reduced [2, p. 68|. It is well- 
known that the former assumption ensures existence of solutions to 
(1) for arbitrary L(s) [II-[3]. Equation (1) is termed the Diophantine 
equation and plays an important role in many different aspects of 
linear system theory. Our interest is to develop a simple algorithm 
for obtaining solutions of the aforementioned problem, to which 
considerable attention has been paid during the last years [4]—19]. 

The algorithms of [4]-[7| present (he class of all solutions. Those 
of (4) and [9] provide the minimal degree solutions in terms of the 
rows of Y(s) and the columns of [.V( -s ) 7 . V(.s) / ] / , respectively, 
and left coprimeness of D(s) and X(n) is not required. However, 
their algorithms involve some troublesome computational efforts: the 
procedure in 14] requires controllability indexes and observability 
indexes of a realization of D[s) ~ l X(s) and the division of a 
polynomial matrix. The size of the matrix in [9], in which linearly 
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dependent columns are sought by a column-searching algorithm, is 
^ery large. The algorithms in f5] and [6| involve a solution of a 
pole assignment problem and determination of a real number so as 
to make a matrix nonsingular. The synthesis of [7] requires selection 
of some appropriate feedback matrices to make cyclic the realization 
of S{s) and calculation of inversion of a polynomial matrix. 

This note presents a simple algorithm for obtaining the class of 
all solutions with lower degree than a specified number in terms 
of !’(*)■ For the computational aspects, we have some advantages. 
The proposed procedure requires no aforementioned troublesome 
computation. Our algorithm is performed by only basic matrix 
operations such as addition, subtraction, multiplication, and inversion 
of given real matrices. In addition, by solving simple linear equations, 
the class of all solutions with the minimum degree in terms of the 
columns of Y{s) can be also given. Thus, the computational efforts 
are reduced compared with previous algorithms. 

II. Preliminaries 

Let <\,[] denote the /ih row degree, i.e,, ihe highest degree 
of all entries of the /th row vector, of a polynomial matrix and 
let <\,[] denote the /th column degree. Assume that A,,[Z>(.s)] > 
h, /[-V(.s)]. i = 1 ■ ■ ■ j>. This assumption assures that D(.s)“ 1 X (*) is 
strictly proper [1, p. 385). This will be removed in the Appendix in 
a similar way to 17J. Define 

r/, =\,\D(s)]. i = 


HI. Main Result 

Lemma: Let 

S ! s)X(s) + ^(.s-)V(.s) _ 0. 

Then the class of all polynomial solutions \(s) <- U 1 s 11 u ,», 

V(«) € jR[k]" of (11) are given by 

X\s) = CuQ(. s). uJl 

V(s) = -(.s/„ - s) (| 

where Q(s) 6 /2[.s] T,x/ is an arbitrary polynomial matrix. 

Proof: It can be easily checked that ($( s), 4/( *,■ ) j s j e fi co _ 
prime, (T'o. .s/ M - •/) is right eoprime, and 

S{s)(\i = ^(m)(sJ v -J). 

Therefore (12) and (13) hold (|2, p. 186] and [3, p. 46)). □ 

Let 6[-] denote the highest degree of all entries of a polynomial 
matrix. Define 

F{s) = Li - DiDl 1 L/,(.s), 

f = AF{> s). 

Theorem: Given an integer r > max(/. »)„ define 

\y,=[d ad ... j'/j] e/^ x " ,(,+l, 


Sis) - diagK' 1 ./ = 1 ■ ■'/>}, (2) 


<k( s) = block diag f [js' # ‘ 1 ■ ■ • .s I],/ = 1-(3) 

and let Du G R rXi ‘ and D t G R" be the highest row degree 
coefficient matrix and the lower row degree coefficient one of D{. s). 
respectively. From the aforementioned assumption, D{s) and N[m) 
are expressed by 

D[s)= Si*)D h + *is)l)i. (4) 

Y(.s) (,s )Xj (5) 


where Si G When all entries of L(.«s) arc separated into 

polynomials with the lower row degree than that of S{s) and 
the remaining ones, the lower degree polynomial matrix can be 
represented by ^(.s )/„/ and the remaining one by S{s)Lu (.s), i.e., 

L(s) = S(s)L h [s) + <t!(s)Li. (6) 


where L, E R"*‘ and !,,(,)€ Lei 

./ = block diag {./,. -h ■ ■ ■ J r ( 

TO 1 0‘ 




g jr''* 1 ' 1 . i = i 

1 

Lo o. 

Cn = block diag {[1 0 ■■■ 0] 6 R 1 ’. i = 1 •;>}. 


Since the row-reducedness of D(s) means that Du is nonsingular [2, 
p. 70], D{ s )“ 1 _Y(*) can be expressed by the following observer-form 
realization {A. D , C) |l, p. 413]: 

£>(«)■ ‘.V(a) = C(*/- A)~'I3 (7) 


where 


A = J — D,DJ,'C\) 
D = .V, 

C = Dl'C o. 


( 8 ) 

(9) 

( 10 ) 


/’(*) = 6 R[s]" xl (14) 

/ —0 

where F, 6 Then the class of all polynomial solutions 

A'(.s) G R[.s] /,x ' and V(.s) 6 fl[.s] ,,,x ' of (I) with M‘(s) < r are 
given by 

-Y(*)= Dj;'L,,(s) + F[s) + m*)Z' (15) 

V(s) = r(*) + /»(*)Z (16) 

where Z G is an arbitrary real matrix and 

I - I 

(•(.,) = £r,*' G R[>r'- 

t- 0 

i'(*) = 5^i>' e*M ,,,x '- 

i- 0 

r — l 

R(s) = £]/?,*' e RM' ,X ’"" m . 

i=n 

i 

pih) = Yp <>»' 

1-0 


(' ={A’D) r [\V,U:'y'^AT^j eR’"* 1 . 


j = l)---r 


Pj = 


0 /.„ 0 


[ 


L 

j — 0 * r 

ff-J -i 
\ /,-() 


-(.4'R)'(H’,ir/ r'ir, er 1 


i X M|(| +■ 1 ) 


-7-1 


F. +j41 - £ A'BV, + l+ i E R‘ 




J= 0- r- 1 

i - /-j 

Rj = -C" Y, A'BP.+J+I e + 


J = 0 • r-1. 
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Proof: Substituting (4), (5), and (6) into (1) leads to 
D(f»).Y(#)+ N(h\Y(h) - L(s) 

= S(.s){Z?aA'(a) - !>,(*)} 

+ *(h){D/.Y(*) + flr(f.)-Li} 

- 0. 

From the Lemma, it follows that 

A (a) ss D h l {Lf t (*) + CqQ( s)} 

= 1 £/.(*) + 


(,s/ r , - ,4)(J(.s) + £V(.s) = F(.s). 


Letting 


Q(*) = £q,*\ 

i-O 

» 

V(*)s^y lh ', 

<=:0 

and substituting into (19), it follows that 


Q,- 

, + BY, 

= 0 

Q,- 

i + BA ,.. i — AQ ,. i 

= 0 

Qj- 

, + BY, - AQ, 


Qo + by, - AQ , 

= F, 


/?! o — AQo — Fo- 


From (22) 

Qj = £ .4'f, ()+1 - £ .4‘m-, f# +i eir 




' J-l 


()■••/■ - 1. 


Substituting into (23) gives 
H’.[V n f Y>‘ 


v/ i y =E vr - 


(17) 


respectively. [A/]- is termed as the generalized inverse matrix of J\f. 
The solutions |A'(*)< !”(*)} whose = 1 • ■ * / are minimum 

among all solutions of (1) are called the solutions with the minimum 
column degree in terms of V(*). The following corollary provides 
both the minimum possible number of r in the Theorem and the class 
of all solutions with the minimum column degree in terms of Y{s). 

Corollary: Let /, - b fJ F(.s) and 


r* = min | 


k > 0: rank 




= rank 11/ 


(18) 

l 

L 

J 

) 

(19) 




(28) 

Given a set of integers 

{ r ,-J = !■ 

■ ■ 1\ such that r 

, > I-*, the clas 1 


of all polynomial solutions A'(.s) 6 

and V( 

») e R[»]" ,xl of 

(20) 

(1) with b, ,Y{s) < 1 1 

,j = 1 ■■■/ 

are given by 


[*(»)]., = £>,:' [t-i, 

(•')]. j +«’ 

(s) + n] < s)z r 

j = i 

(21) 




(29) 

[V(.s)], ( -„;(*> + 


y = 1 ■ • ■ / 

(30) 


( 22 ) 


(23) 


where z, g 11'"'' 1 \j = ]•■•/ are arbitrary real vectors and 

lllrl't (1, l j )~ 1 

»;<■>)= 53 «>' e 


«!<')=■ 5Z ®*'' 6 

/_0 

Illrl\ (Ij r j )— I 

J?>) = £ fl>' 6 /?[s]' ,x . 

I I) 


(24) 


(25) 


i(/ , ; 0 )'(/ , ;,) / ••■(p;,,) 7 !' = t „ - jh-. j-n-,, 

^ R r "i'' ^ 11 > 1 ‘ * 1 1 


Since (,4, D, C) is minimal [1, p. 439, Theorem 6.5—1 ], H’, H'/ is 
nonsingular. Hence, the general solution is given by 

p;/ V/ ... r , z ] 7 

f 

= u7 or, u7 )'X' 4 ' F +tt.. - «’/ <«’. h 7 r 1 ir, }z 

i-O 

= [r t / r,' r/]'+[p u ' p/ p/]'z (26) 

where Z 6 ,)x< is arbitrary. Therefore, (16) holds. Substi- 

tuting (24) and (26) into (18) gives (15). The proof is completed. 

□ 

Remark: The theorem presents a simple algorithm for obtaining 
the class of all solutions with lower degree than a specified number 
/ + 1 in terms of Y(s). The contribution of this algorithm is that 
the solutions can be computed easily by only basic matrix operations 
such as inversion of U', H r /. whose size is n x », independent of 
a specified number r. 

For a matrix 3/, let [A/], y and [A/]- be the j th column vector of 
M and a matrix satisfying 

A/[A/]_AJ = M (27) 


f \-^\ 


-1 


«;. = q 53 .e[*+*».i.,- 53 eff". 

V * - n AO J 

/ = 0 • ■ ■ max (/ y , - 1, 

1 

K = -c 53 .4*pp;, um <= 4 ". 

A —0 

/ = ()■•* max (/,. f t ) - 1. 

With the choice of {r y = r*.j = 1 ■ ■ ■/}, (29) and (30) provide 
the class of all solutions with the minimum column degree in terms 
of Y(s). 

Proof, Without loss of generality, we consider the equations of 
the jth column in (18) and (19), i.e., 

I-Y(a)],, = D^[L,,(*)],, + C'W(*)].,. (31) 

(»I„ -.4)KM,, + P[V(.s)] (| = [FI*)],,. (32) 
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Letting 


11) MX (Tj , I-j ) — 1 

(0( *)]<•> = £ 9,,"' 




We choose r — u — 1 = 1. Then 



'() 1 
1 1 


-1 - 2 ' 
-3 —3 


From the Theorem, the general solutions of (I) under the 
of £-Y(#) = 0 and *£(#) < 1 are shown in the equation > unui 
the bottom of the page, where Z 6 R' * 2 is arbitrary. 


and substituting into (32), it follows that 

/,-'~L i j -i- i 

ij, = £ A k [F i+k+ ,u - Y aI,b »j.‘+*+i € R 1 

k -0 A-0 

/ = 0 ■ max (/j. r ,) - 1 

and 


(33) 


W’,. J t F J 0 y], »J! ,/«£*[*],; 


(34) 


There exists a [y^y]] ■ ■ • y]] , ] 7 6 satisfying (34) for 

a given number Vj if and only if 


rank 


fj 




= rank [IT, ^ ] 


[10, p. 29J. Then the class of all solutions satisfying (34) is given by 



+ [(/*;,) T (p;,) r ---(p;. J .) T ] 7 '*> (35) 

where z, G R"' (lj + l) . j — 1 ■ ■ / are arbitrary real vectors. Hence, 
(30) holds. Substituting (33) and (35) into (31) leads to (29). □ 

Remark: In [4J and |9], algorithms for the minimum degree 
solutions are proposed. The algorithm of [4] presents a solution in 
which the rows of V (*) has minimum degree. That of [9] results in 
a solution in which the columns of [.Y(#) 7 . y(«) r ] 7 has minimal 
degree. Therefore, our algorithm proposes the minimum degree 
solution in a different sense from those of [4] and [91* 


IV. Numerical Examples 


A. Example 1 

Consider the following polynomial matrices. 
£>(*)- 


+ 1 

» + 2‘ 

/V / w \ — 

0 1 “ 

0 

* + .3 

4 V { s } — 

1 1 


L{a) = I 2 . 

The observer-form realization of D(#)~‘,V(.s) is given by 


.4 = 


-1 -1 
0 -3 


D = 


C = 


1 -1 

0 1 


B. Example 2 


In this example, we compare our algorithm with that of [9| from 
the viewpoint of the minimum degree solution. 

Consider the following polynomial matrices. 


# 2 4-# 4-1 # 

V ( V \ — 

‘.V 4 1 ,s' 2 

# 2 2 

■ \ # ) — 

s .s 4 1 


L(s) 


V +1 # 

3s + 2 s 2 4-1 


Thus, D(h) and ;V(.s) are the same as those of example in [9], By 
using the column searching algorithm of [9|, the minimum degree 
solution is obtained as follows: 


X(m) 


1 3 

# 4- 1 4# 4- 3 


Y(*) 



Second, our algorithm is applied. By our Corollary, we have ~ 0 
and r£ = 1. Then the class of all solutions with the minimum column 
degree in terms of Y(s) is obtained as follows: 

[ -2 -0-3 - 3, 

- 2* - 1 1. Is 1 + 0.7s + 0.8 + {s' 1 -3.s-2)i - 

... . [3 (2.s + 3)(0.]+i) 

[4 —l.l.s — 0.6 - (s — 4)i 

where - G R is arbitrary. 

Therefore, it is seen that the algorithm of [9] can produce the 
solution with lower column degree in terms of [.Y(#) / , l'(.s) / ] / 
than ours and, on the other hand, ours can produce the class of all 
lower column degree solutions in terms of Y(#) than thai of [9|. 


V. Conclusion 

This note presents two results, the Theorem and Corollary. Theo¬ 
rem shows a simple algorithm for obtaining the class of all solutions 
with lower degree than a specified number r 4- 1 in terms of 1 (# K 
where r is an arbitrary integer r > max (/. n). The contribution 
is that the class of solutions can be computed easily by only basic 
matrix operations such as inversion of IVyir/ . whose size is x ik 
independent of a specified number r. Corollary provides the minimum 
possible number of r in the Theorem. So the class of all solutions with 
the minimum column degree in terms of V(.s) can be also obtained. 
The necessary computation is a generalized inverse matrix of W rj in 
the Corollary, that is simple linear equations of (27). 

In also [4] and [9], simple methods are proposed. However, 
the procedure in [4] needs controllability indexes and observability 
indexes of a realization of D(s)~ ] N[s) and the division of a 
polynomial matrix. In [9], it is required to find l linearly dependent 


.5 -0.4 “ 


0.4 -0.1 


0.3 - 0 . 2 ] 




0.3 

4- 

-0.3 -0.3 

- 

0.1 - 0.1 J Z 




0.5# - 0.5 - 

-0.4# 4- 0.3 ‘ 


0.4« + 0.7 

-0.1#+ 0.2 

0.3# + 0.4 

-0.2# - 0.1 ‘ 

■0.5s 4- 0.5 

0.1# - 0.2 

4- 

—0.1 a + 0.2 

0.4# + 0.7 

-0.2# - 0.1 

0.3# + 0.4 
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columns in a real matrix, whose size is (N + K 4- l)p x {(A' 4* 
l)p + (A' + 2)/} , where 

iV = max {/>£)(«), 

l\ = mtixfn, ££(«), ^{/V()adjZ>{«)}] > u. 

Thus, the computational efforts of our algorithm are much reduced 
compared with the previous ones. 

Appendix 

In this Appendix, the assumption of A l( £)(.s) > /», m Y(a) i — 
l- -p, is removed. That is, we consider the case of b,,D(s) < 
for some i. In this case, X{s) can be represented as follows 

[I M3]: 

N(h) = D(*)Qa (*) + Rx{s) 

where Q\>(*) 6 -HI*]*’* R\(s) £ J?[.v] / jX "\ and b,,D(s) > 
^ tR.v(s)J = 1 ■*■]). In particular, if b 1t D(s) > b ,, X(s) for 
i = 1 • ■' p, i.e„ £)(*)™ 1 A’ (*) is proper, then jY( s ) can be represented 
by X h £ W*'" and X t £ fl' ,Xm as follows: 

X{s) = S{s)X h -f ^(h).Y(. 

Hence Q\ and R\(s) can be obtained easily as 

Q\ = D/7 1 A/m 

The theorem can provide polynomial solutions .Y\(s) and Y\[s) 
satisfying 

D(MA v (.s)-Fff\(.s)I\(.s) = £(*). 

Then, by using the .Yv(*) and Y\(s), polynomial solutions A’(.s) 
and V(*) of (1) are given by 

A(M = Av(.s)y v (.s)i'v(.s), 

Y{*) = Y\ («). 

This can be readily verified by substituting into (1) as follows: 
£>(.s)A<*)+ A»l» 

= D(s)\X\{s) - («)Vv(.s)J 

+ { D( h )Q\ (.s) -f /? \ (^) j Y\ (s) 

= I(.s). 
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Nonlinear Versus Linear Control in the 
Absolute Stabilizability of Uncertain 
Systems with Structured Uncertainty 

Audrey V. Savkin and lan R. Petersen 


Abstract —This note considers a stabilization problem for a class of 
uncertain linear systems containing structured uncertainly described by 
a certain Integral Quadratic Constraint. The notion of stabilizability 
considered is that of absolute stabilizability. The main result gives a 
necessary and sufficient condition for the absolute stabilizability of this 
class of uncertain systems in terms of the existence of a solution to 
a corresponding “diagonally scaled” J/'* control problem. It follows 
from this result that absolute stabilizability via nonlinear control implies 
absolute stabilizability via linear control. 


I. Introduction 

In recent years, a number of results have appeared showing that 
certain classes of uncertain linear systems have the property that 
if they can be stabilized via a nonlinear controller, then they can 
also be stabilized via a linear controller; see [IH4J. These results 
are concerned with problems of robust stabilization in which the 
underlying uncertain system contains "unstructured” uncertainty. This 
note establishes a corresponding result for a class of uncertain linear 
system containing structured uncertainty. The uncertain systems 
under consideration allow for nonlinear, time-varying, dynamic, 
noncausal uncertainties. This class of uncertainties is described by 
an "Integral Quadratic Constraint” such as found in the work of 
Yakubovich; c.g.. see |5], f6). The notion of stability considered 
is that of absolute stability which can also be found in the work 
of Yakubovich; e.g., see [5|, [6|. Our main result is a necessary 
and sufficient condition for the absolute stabilizability of such an 
uncertain system. This condition is given in terms of the existence 
of a solution to a corresponding "diagonally sealed” H v control 
problem. A corollary to this result is the fact that if an uncertain 
system is absolutely stabilizable via nonlinear (but time-invariant) 
control, then it will be absolutely stabilizable via linear control. 

The launching point for this note is a recent result developed in |7|. 
This result is a convexity result for a collection of integral quadratic 
forms defined over the space of solutions to a stable linear time- 
invariant system; a related result can also be found in [8]. The result 
of [7] is referred to as an "S-procedure." In this note, we extend the 
S-procedure result of 17] to a collection of integral functionals defined 
over the space of solutions to a nonlinear time-invariant system. This 
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enables us to consider the absolute stabilization problem for the case 
in which nonlinear controllers ‘are allowed. We can summarize our 
approach as a three-step procedure for robust stabilization based on 
the following three steps: 

• the S-procedure theorem; 

• H ^ control; 

• the Kalman-Yakubovich-Popov (KYP) lemma. 

The remaining sections of the note proceed as follows. Section 
II introduces the class of uncertain systems under consideration and 
defines the notion of absolute stabilizability. Section 111 presents our 
“S-procedure” for nonlinear systems. Section IV presents our main 
results. 


Definition 22: The uncertain system. (2 1). x \ , , 

absolutely stabilizable via nonlinear conirol it ihne c\. i ,. , ,i 

of the form (2 4) and a constani r > 0 such that 

i) for any initial condition [./•(0) .i\ (())]=. [.*, , r , , , n|i 

ft 1 ) = [£nH Ck(-)] E x ), the cIos.mI ,i - , lUl 

defined by (2.1) and (2.4) has a unique solution whu. j. 

on [0, x); 

ii) the closed-loop system (2.1), (2.4) with jtn =. u »\n\ n \\< 
unifonnly asymptotically stable; 

iii) for any initial condition [a (ft) .»,(())] _ i, lt ( ) j 

and any admissible fc(-) ~ [to(-) Uirn 

[■<•(•) ■*•,(•) "(■) to(-) (-)] G Lj[i), \ ). (heme. 

t* — x) and 


II. Problem Statement 

We consider a class of uncertain linear systems described by state 
equations of the form 


a 

.;■(/) = + n,<(t) + 

*-0 


A 

y(t) = C.r(t) + Y,H*UD 

•<~n 


-.(f) = K H x(t) + GMt): * = 0. I,---, k (2.1) 

where .r(f) E R" is the state, u(t) E R" is the control input, 
//(/) E R l is the measured output, ^n(/) E R h{) , ■ • ■, ~a (t ) € R'' k 
are the uncertainty outputs, and £n it) E R. U) , - ■ • , £a (t) G R 1 k are 
the uncertainly inputs. 

System Uncertainty: The uncertainly in the above system is de¬ 
scribed by equations of the form 

in (t) = Oo(f. -.«(■)!,'). ii It) = 0,(1. -.1 (•)!('). 

■■■.ti(t) = Oi(t. :*(■)!(',) (2.2) 


where the following Integral Quadratic Constraint is satisfied. This 
uncertainty description allows for nonlinear, time-varying, dynamic, 
noncasual uncertainties. 

Definition 2.1: (Integral Quadratic Constraint; see [5]. [6].) An 
uncertainty of the form (2.2) is an admissible uncertainty for the 
system (2.1) if the following conditions hold. Given any locally 
square integrable control input «(■) and any corresponding solution 
to (2.1), (2.2) defined on an existence interval 1 (0, t+), then there 
exists a sequence {f,} 71 1 and constants r/ ( , > {). <1^ > 0, • • •, dk > 0 
such that t, —► f*, t, > 0 and 




<1t+ds 


Vi and V* = 0, 1, 

(2.3) 


Here || • || denotes the standard Euclidean norm. Also, note that /* 
and t, may be equal to infinity. 

We will consider the problem of absolute stabilization via a 
nonlinear output feedback controller 


= A(.r, (f), !/(t)); u(t) = A(.r f (f), ,/(/)) (2.4) 


where A(-, ■) and A(-. ■) are continuous vector functions. 

Notation: Throughout this note, Li[ 0. x) denotes the Hilbert 
space of square integrable vector valued functions defined on [0, x ). 
Also || • || 2 denotes the corresponding £ 2 [0, x) norm. 

’That is, tm is the upper limit of the time interval over which the solution 
exists. 


A 

lk( n >|| a + lk.(0)||' + £rf. 


; <2.5) 


iv) for any r > 0, there exists a h > l) such that the following 
condition holds: for any vector [.»[, .r! u ]' E [h: ||/i|| < 
and any £(■) = [Co(-) Ci( ) ■■■ (•)] G /^[H. % ), let 

[.r* 1 ’ (t)' .r! 1 ] ( f )']' be the corresponding solution to the closed- 
loop system defined by (2.1) and (2.3) with initial condition 
.r n) (0) = .r«. = .r,(i. Also lei [.r { -'(t)' 

denote the solution to the closed-loop system with initial 
condition .r <J, (0) = 0, .r!‘ ) (0) = 0; then ||.r ,l) (-)-.r u) ( , )|| 2 + 
||.r!‘’(-) - ..•! J) (-)||j +1|«"'(-) - » <JI (-)||i < e. 

Note, condition iv) in the above definition is a technical stability 
condition which is needed in order lo establish the results of this note. 

Observation 2 I: If the uncertain system (2.1), (2.3) is absolutely 
stabilizable, then the corresponding closed-loop uncertain system 
(2.1), (2.3), (2.4) will have the property that ./■(/) —> 0 as f —*■ > for 
any admissible uncertainty {,(•). Indeed, since (.»•(■). </(■)• fc(*)} G 
/^[d. x), wc can conclude from (2.1) that .r(-) E Lj[0. x). 
However, using the fact that .< ( -) E £j[0. x ) and ./’ (■) E [0. x ), 
it now follows that ./■(/)—► 0 as / —> x . Furthermore, if the controller 
(2.4) is linear, then also .r f (/) 0 as t — x. 


III. S-Procedi ire for Nonuni ar Sysiems 

In this section, we present a result which extends the “S-Procedure" 
of [7]. The main result of this section applies to a nonlinear, lime- 
invariant system of the form 

i,(t) = n (h(t), tit)) (3.D 

where h(t) E R r is the state and « (/) E R J is the input. 
Associated with the system (3.1) is the following set of function¬ 
als: /„(/>(•),.,’(■)) = .C !’(■)) = 

). v(t))<lt. 

Assumptions: The system (3.1) and associated set of lunctionals 
satisfy the following assumptions. 

3.1) The function IlK ■) is continuous. 

3.2) For all (/>(•)* r(-)} G L-i\ 0 . x ), the quantities /u(/?(■)* 

''(■)). '■(’)) are finite. 

3.3) Given any t > 0, there exists a constani ^ > 0 such 
that the following condition holds. For any input function 
i/'d ( -) € x) and any /m G [ho G R 1 ‘- ||MI ^ * et 
1u{t) denote the corresponding solution to (3.1) with initial 
condition //(()) — and let /u(f) denote the corresponding 
solution to (3.1) with initial condition /* (0) - 0. Then 



124 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40. NO. L JANUARY 1995 


both and MO are defined on [0, oc) and both func¬ 
tions belong to lu[ 0. -c). Furthermore, |/4M-)- Oo(*)) “ 
fM-h Vo(-))|<^ for * = l.-*-. k. 

Note, Assumption 3.3) is a stability-type assumption on the 
system (3.1). 

Notation: ^or the system (3.1) satisfying the above assumptions, 
we define SI C L^[ 0. x) as follows. SI is the set of {/?(■). ¥'(•)} 
&udi that */'(■)£ -ta[0, x) and //(•) is the corresponding solution to 
(3,1) with hi 0) ~ 0. 

Theorem 3.1: Consider the system (3.1) and associated func¬ 
tionals and suppose the Assumptions 3.1)—3.3) are satisfied. 

If /o( M). !/’(■)) > 11 for all {/#(■)- *,'(■)} £ SI such that 

jfi(*(-)■. «'(•)) > (/#(■). <.■(•)) > 0, then there exist 

constants r n > 0, n > 0, ■ ■ ■. n > 0 such that T « > 0 

and 

ni/o(A(-)% <,’(*)) > Ti/|(/l(«). <.'(•)) + r, /.>(/*( ■), «'(■)) + ■■■ 

+ri/*(fc<-). *'(■)) (3.2) 


for all /i(-). *’(■)} € 12. 

In order to prove this theorem, we will use the following convex 
analysis result (the proof of which was given in the preliminary 
version of this note 19]). However, we first introduce some notation. 

Notation Given S C R" and T C R 1 ' and A E R, then 

S + T: = {.** + y: ,v E S. y E 7 |, AS: — ]A.r: .r E 5}, and cone 

(S):= {o.r: ,r E 5. n > 0}. Also, cl(5) denotes the closure of 
the set 5. 

Lemma 3.1' (See [9] for proof.) Consider a set M C R k + l with 
the property that a + b G cl ( U) for all r/, h G 3/. If ,/o > 0 for 
all vectors [,i*ij ■ ■ ■ .a]' G AA such that .n > 0, -.-u > 0, then 

there exist constants r n > 0. ■ • ■. n >0 such that £j_. (1 t s > 0 and 
rn.ro > ri.n + Tj-rj +-1- T*,a for all [.r () ■ ■ - .r A ]' G A/. 

Proof of Theorem 3 1 Suppose /«(//(-), t (■)) > 0 for all {/>(•). 

C’( •)} £ S2 such that /i (/»(-), i ■( )) > 0. ■ ••. }u {h( ■). *•(■)) > 0 

and let .1/:= {[/„(/,(■)• !•(■))■■ ■/»(/*(■), • ( )>]' 6 R k+l : fft(-). 
i (■)} G 12}. It follows from the assumption on the set 12 that .i*o > 0 
for all vectors [-ni -• ■ -r*]' £ A/ such that . 1 1 > 0.■ -..a > 0. 
Now let {/>„(•). !',.(■)} £ 12 and (M). 1 ■/>(■)} G 12 he given. Since 
/*„(•) G Aj[0, x), there exists a sequence {7',}^-, C R such that 
T, —* x and h„(T,) —* 0 Now consider the corresponding sequence 
{M0- <A(-)},= 1 C 12. where 


r,(t) 


— Ti) 


* < T, 
t > T,. 


We will establish that /,(/»,(■), <’,(•)) — /-(/»„(■)- » ..(■)) + 
/,(M-), 1 ’/,(■)) for s = (). ],-■•. Indeed, let s G {(). 1.--.A} 
be given and fix /. Now suppose ) is the solution to (3.1) with 
input 1 •(*) = <■#,(■) and /»(0) = h„{T,). It follows from the time 
invariance of the system (3.1) that MO = /#(,(/ - T,). Hence, 


c.l-)) 


- f MM/), 

Jn 

Ai 

+ f MMO, i,'hd-T,))<lt 

7i, 


!%(/««(/). + U(f))- 


Using the fact that /< f ,(7A) -♦ 0, Assumption 3.3 implies /„(///,(/). 
« T /»(f)) /-(M-), t>/,(«)) as 1 -♦ x. Also, M/»„(/), K\,it))flt 

^ .MM-), n,(-)). Hence, /.(/i,(-)- r,(-)) -4 /.(/»„(■). *-*„(•)) + 
/«</h(-). M-)). 


From the above, it follows that the set M has the property that 
a + b G cl (A/) for all r/. 6 G A/. Hence, Lemma 3.1 implies that 
there exist constants r 0 > 0. • ■ ■, T| >0 such that r H > 0 
and To.ro > t 1 .r 1 4* ■ •' + ta.ia for all [.rn ■ • • .a]' G A/. Thai is, 
condition (3.2) is satisfied. □ 

Note a feature of the proof of this result is to exploit the “fading 
memory” property of a nonlinear system satisfying Assumptions 
3.D-3.3). 


IV. The Main Result 

Our main results require that the uncertain system (2.1), (2.3) 
satisfies the following assumptions. 

Assumptions. I^et matrices /), A\ E 1 , and Lj be dehned by 
D:= [D a ■■■ D k ], A: = [AY, ■■■ A'i]'. E\ = 

£> = Ylt- o-H’.r/i. Then, we assume 

4.1) The pair (.4. jB) is stabilizable. 

4.2) The pair (3. A ) is observable. 

4.3) Ei > 0. 

4.4) The pair (3. C) is detectable. 

4.5) The pair (.4, D) is controllable. 

4.6) E 2 > 0. 

Assumptions 4.1) and 4.4) are necessary for the “nominal system" 
to be stabilizable and henee can be made without loss ol generality. 
Assumptions 4.2), 4.3), 4.5), and 4.6) are technical assumptions 
required to ensure that the underlying II ^ problem is "nonsingular;" 
e g., sec [1()|. This underlying "diagonally scaled” II' conhol 
problem is defined as follows. Let ti > 0, • • •, n > 0 be given 
consianls and consider the system 

’r(t) = ^4.i (/) -f lhi (t) + Ilu{1) 


,f/(0 = C.r{t) + II it' (t) 


-(/) = A*, i(/) + G'm(/) (4.1) 

where it (t) is the disturbance input. At) is the controlled output, 


‘ Ao ' 


‘ G 0 ' 

v/rTA'i 

G: - 

v/tv G, 

.s/nKi J 




II:=[H 0 -J-H, ... ^JA]. 

D:=[D { ) ■“ ~^rz Dk }- ( 4 -2) 

Also, consider the following induced norm bound condition: 

■I = HU,, < 1. (4.3) 

«< ( )eL 2 (0 0 Jlla 

The /T x control problem corresponding to the system (4.1) is said to 
have a solution if there exists a controller mapping from y to u such 
that the closed-loop system satisfies condition (4.3); e.g., see [10]. 

Theorem 4.1: Consider the uncertain system (2.1), (2.3) and sup¬ 
pose that Assumptions 4.l)-4.6) are satisfied. Then the following 
statements ore equivalent. 

i) The uncertain system (2.1), 1(2.3) is absolutely stabilizable via 
a nonlinear controller of the form (2,4). 
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ii) There exist constants n >().••, n > 0 such that the ff* 
control problem (4.1), (4.3) is solved by a linear controller of 
the form 

*,■(*) = A t ..r,(t) 4 D r y(t); u(t ) = (4.4) 

If condition ii) holds, then the uncertain system (2.1), (2.3) is 
absolutely stabilizable via the linear controller (4.4). 

In order to prove this result, we first establish the following 
preliminary lemma. 

Lemma 4.1: Suppose that the uncertain system (2.1), (2.3) is ab¬ 
solutely stabilizable via a controller of the form (2.4). Then, there 
exist constants t\ >(),-••, n > 0 and A > 0 such that 

||io(-)||'j — IlfoHllis + ^^r,(||i,(-)||a — ||6( - )||i>) 



for all the solutions to the closed-loop syslem with the initial 
condition .r(0) = 0, ,iv(U) = 0. 

Proof: Suppose that the uncertain system (2.0, (2.3) is abso¬ 
lutely stabilizable via a control of the form (2.4) and consider the 
corresponding closed-loop system. This system is described by the 
state equations 

k 

.>■(/) = A.r{t)B\(.c, (t). //(/))+ £A,6(» 

... n 


■iv(f) = A (.r,-(/), i/(/)). (4.6) 

This system can be regarded as a system of the form (3.1) with 
the associations: h{1) ~ [.r{t)‘ .r, (/)']' and !’(/) ~ £U) where 
£(■) “ [£u(-) £*■(■)]• The corresponding set 12 C A[0. -ic) 

defined in Section III is as follows: 12 is the set of vector functions 
such that £(■) € La[0, os) and [.»•(■)' .<’,(■)']' 
is the corresponding solution to (2.1), (2.4) with initial condition 
,r(0) = 0, .r, (0) — 0. Also associated with the above system is 
the following set of integral functionals /o(\). •■■,/*(■): 

/o(.r(-). ./■,(■).£(')) 

:= / (||£o(f)||‘ - \\Ka.r(t) + GnM.rAt). 

Jo 

-*n / (||.r(Oir + lk,(/)||*’ 

Jo 

+ ||A(.r, (/), »(/))|| a + ||Wl| a )rff 

/i(■'■(•). .-•,(•). 60) 

:= (||A’|.r(f) + G, A(j‘.-(/), y(t))\\ 2 - ||Ci(*)|| a )rff 

Jo 

+ 6 r\ |M#)|| a + |k,(*)|| a 

Jo 

+ ||A(x,.(t). »(f))|| a + ||€(M|| a )rff 


A(.r( ). 60) 

:= P(||A,.r(f) + G* A(..v(f). »(t))|| a 

Jo 


+ Mlk(f)ll a + Ik.-(0|| 2 

+ ||A(.r,(f). ir<f)>|| a + |U(f>|| a )rff 


Il6(*)|| 2 )<w 


(4.7) 


where h 0 = 1 /2( A- +■ l)r and <• is as dciineJ „• ; 
Detiniiion 2.2. Thus, using (2.1), we can write 

/o(j( ).- JV< ). 6')) = -(||-o(-)||i’ -||6( jj!,; 

+ *o(|k(0||-j + Ik !-)i; 

+ li6-)iii ! + ii./(-)ii'j)! 

/.(•'•(•)• .<-,(•). 60) = ll--i(■ )||a - Il6(')||a +6(|kl-:||:i 

+ ||.r,-(-)||5 + !U(' )|| > + i|«(-t||;i) 


AWO. •<•. (■). 60) = ll- 2 (-)|| 2 - ||6(-)||H + MkH||; 

+ lk-(-)ll2 + ||6-)||i + ll«l-)||5).(4.8) 

We wish to apply Theorem 3.1 to the system (4.6) and associated 
functionals (4.7). However, we must first show that Assump¬ 
tions 3.1)—3.3) are satisfied. The satisfaction of Assumption 3.1 
follows directly from the continuity of the functions A(>, ■) 
and A(*. ■). Assumption 3.2 follows directly from part iii) of 
Definition 2.2 and equations (4.8). To establish Assumption 3.3, 
let € > 0 be given and consider an uncertainty input £(■) € 
L 2 [0. x>). Let [,r (J) (f) / .ri'VlT be corresponding solu¬ 
tion to the closed-loop syslem (4.6) with [.r (,, (0) .r! l, (0)] = 
[.f*n .r, o], and let [.r (2) (f)' be the solution to (4.6) 

with .r (2) (0) = 0, .r! 2) (0) = 0. Now consider the quan¬ 

tity |/t.(.r u, (-). .r! n (-). £(■)) - /u(.r (, ' , (-), .r!? ) (■).£(■»! where 
the functional / 0 (-, •. •) is defined in (4.7). Using the trian¬ 
gle inequality and the CauchySchwarz inequality, it follows 
that 


l/o(.i ,( 1 ’(•). .i'! 1 ’ (■).«(■)) - /»(■''* 2 '(•), x! a, w. m\ 


< (||.r ( '' - .« t2> () lid -H1 1 * 

(•)-4 , l^ 

+ ||H ,,) 0 

-« ,2) 0||2) + I|:» 2) H 

-4^)111 

< 6(||.r O) (-)-.r ,2) (0l| 2 


+lk! n (-) 

-■'■; 2) (')ii 2 +ii«' (l, (-) - 

-« <2, (-)|| 2 2 ) 

+ l|[Ao 

A.]|| a (||x" , (-)-x (a 

’(•HI? 

+ll» (,, (- 

)-« (a, (-)ii a ) 

(4.9) 


where || [An A)]|| denotes the induced matrix norm of the matrix 
[Ao Aij. However, it follows from part iv) of Definition 2.2 

that given any f > 0, there exists a constant A > 0 such that 

||.f*oH 2 4* ||.iYo||“ < implies ||.r ll) (*) - )\\i 4 Ikl^M “ 

.ri 2) (-)||‘2 4 ||w (l *(-) “ w c2) (-)||2 < (• Now using inequality (4.9), it 

follows that |/o(;r (1) (-)- £(■))- /o(J ,(J) (*L ■r( 2, ( , )« £H)| < 

(A 0 4 ||[A"o Aj]||)f. A similar inequality holds for the quan* 
tities |/i(.i (,) ( ). j-!°, 6-)) - /i(.i' <2) (-). x‘ al (-)- €('))|- — • 
|A(.1 ,(I) (-). .I-! 11 (■).€(■>> -A(.r (2 '(-). •1'! 2, (')-6'))|- Thus, by 
choosing 

and letting equal the corresponding value of ^ ii follows that 
Assumption 3.3 is satisfied. 

To apply Theorem 3.1, we must show that /o(( ■»*(■), »v( -)■£(■)) > 
0 for all {■!■(•), .!■, (•). £(•)} € 12 such that /„((•**(-). ■', (■). £(■)) > 0 
for -s = If this is not true, then there exists a 

{.r°(v), .!■?(•). £°(')} € 12 such that / 0 (.r°(-), .»■?(■). £°(-)) < 0 and 
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/«(-r°(•)• sr?(’), £°(0) > 0 for ** = 1,■■■,&. However, the input 
£°(*) satisfies the integral quadratic constraint (2.3) with t t = oo 
and <h * />d(lk °(-)||2 + Ik?(*)|l 2 + l|£°(’)||s + ll"°(*)lla) for 
h = 0, 1,■ , k. Hence, condition iii) of Definition 2.2 implies that 

||.r°(-)|! a Vjk?(«)ll^+llf 0 (-)|| 3 2 +|!« 0 (-)||^ 

< r^ 0 (||.r°( , )||? + ll/tillS + |K° Oils + II A)ll3) 

<r(* + i)h,(i|r 0 oii:j+||j?oiij+ii^oiiS+||«°t)||!|) 
<j(IU' n (-)|| 3 2 +||.r: , (-)|| 3 2 + ||« 0 (-)|| 2 2 + ||«°(-)||I) 

which yields the desired contradiction. 

We now apply Theorem 3.1 to the system (4.6) and functionals 
(4.7). It follows that there exist r 0 > 0, rj > (),-■■, n > 0 
such that £Lo r* > 0 and r 0 /o(.r(-), .r< (■)< £(•)) > 

Ei =1 r./.(.r( ■).*.<■>.«(■» for all .r. (•). £(•)} 6 tt. 

We now prove that t„ > 0 for all s = 0. 1,If we let 
A: — Th ^ then 

A 

i;r.(||.-.(.)||I-||€.(-)||I) 

<-*^(0III + ||x.(0||I + ||«(0ll-5 + £l|6.(-)||^ (4.10) 

for all {/■(•). -r, (■), £(■)} G SL Now if Tj = 0 for some j £ 
{(), • ■ ■, A’}, then we can let £„(■) s 0 for all * ^ j and choose any 
nonzero £,(■) € La[0, x). However, this leads to a contradiction 
with (4.10) since the left side of (4.10) is nonnegative and the right 
side of (4.10) is negative. Therefore, we must have, t h > 0 for 
s = 0, 1, ■ ■ ■, k. Furthermore, observe that in this case, we may take 
r () = ] without loss of generality. Moreover, with r u = 1, inequality 
(4.10) leads directly to (4.5). □ 

Proof of Theorem 4.1: , i)=>ii) Consider an uncertain system 

(2.1) , (2,3) satisfying Assumptions 4.1 >—4.6) and suppose that 

the system is absolutely stabilizable via a nonlinear controller 
of the form (2.4). It follows from Lemma 4.1 that there exist 
constants tj > 0, t> > 0, - • •, r* > 0 such that inequality 
(4.5) is satisfied. Now the system (2.1) is equivalent to the 
system (4.1) where «■(■) = [£<](*)- v/t7£i(-),■ ■ ■. y/ntk{-)]> 

£(■) = [-()(■)- v/rT^i(-).- ■. and the matrices D, A\ 

(7, and H are defined as in (4.2). It follows from inequality (4.5) 
that there exists a constant A > 0 such that if .r(0) =- 0, then 

- IkHllS = IM-)II3E!!--, - IICo(-)lli - 

ELi r -ll£»( - )ll 2 < -*ELo Ilf.(-)ll2 for all .!■(■) € £ 2 (0, ■%,). 
Hence, there exists a >0 such that if .r(0) = 0, then ||£(*)||'i -- 

IkMlIii < -MI|Gi(-)||a + Et- i T'.IIC.tOlla) = -M|«'(‘)l|a for all 

«'(•) € L 2 [ 0, x). That is, condition (4.3) is satisfied. Therefore, the 
controller (2.4) solves the standard H ^ control problem (4.1), (4.3). 
Furthermore, it follows from Assumptions 4.1)-4.6) that the system 

(4.1) satisfies the assumptions required by Theorems 5.5 and 5.6 of 
110], Hence, using this theorem, it follows that there exists a linear 
controller of the form (4.4) which solves the H x control problem 

(4.1) , (4.3). 

ii)=»i) Consider an uncertain system (2.1), (2.3) satisfying Assump¬ 
tions 4. l)-4*6) and suppose there exist constants n > 0, r 2 > 
(),■■, t* > 0 such that the H no control problem (4.1), (4.3) is 
solved by a linear controller of the form (4.4). The closed-loop system 
defined by (4.1) and (4.4) is described by the slate equations 


where 

DC/ 

a , y 

GC\ ]. 

Since the controller (4.4) solves the N ^ control problem mentioned 
above, the matrix P will be stable; e.g., see [10]. Also, if we apply 
the control (4.4) to the uncertain system (2.1), (2.3), the resulting 
closed-loop uncertain system can be described by the state equations 
(4.11) where w(t) is now interpreted as an uncertainty input and i(t) 
is interpreted as an uncertainty output. 

We will consider an uncertain system described by the state 
equations (4.11) where the uncertainty is required to satisfy the 
following integral quadratic constraint analogous to (2.3): 

f''\\T7(t)\\ 2 dt< f|| t ,(OII^ + „ W. (4.12) 
J u J 0 


Q = 


D 

D,H 


P = 


A 

B,C 


T = [K 


Note that any admissible uncertainty for the closed-loop uncertain 
system (2.1), (2.3), (4.4) will also be an admissible uncertainty for 
the uncertain system (4.11), (4.12) with d = r/ 0 + j 7v/, > 0. 

We will now show that the uncertain system (4.11), (4.12) is 
absolutely stable; see [51, [6] for the definition of absolute stability. 
First, recall that the system (4.11) is such that (4.3) is satisfied. Hence, 
there exists a constant A, > 0 such that if .«•(0) = 0, then ||£(-)||.j - 
||in (-)||‘2 < f° r a H «’(■) G x,). Therefore, if 

. 1 ( 0 ) = 0 , then /„' (||T.r(f)|| 2 - ||«’(/)|| 2 )<H < -to ,/ 0 x ||i«'(f)|| J rff 
for all i/’( ) £ L 2 [ (), x). Furthermore, the stability of the matrix P 
implies that there exists a constant /< > 0 such that if .<■(()) = 0, then 
J?\mrdt<„ ( n x ||«'(/)|| 2 dt for all tr(-) G L 2 [0, x), Hence, 
the constant <*> = /(// -I- 1) > 0 is such that 


r (WTj-inw* 

J u 


ll«’(()ll 2 )^< 



ll«'(f)ll i )</( 

(4.13) 


for all {.!"(•), « (■)} G I /2 [0, x) connected by (4.11) with .r(0) = 0. 
Using a version of the KYP Lemma (Theorem 1 of [6]), condition 
(4.13) and the stability of the matrix P now imply that the uncertain 
system (4.11), (4.12) is absolutely stable. 

We will now show that the closed-loop uncertain system 
(2.1), (2.3), (4.4) satisfies the conditions for absolute stabilizability 
given in Definition 2.2. Condition i) of Definition 2.2 follows directly 
from the absolute stability of the uncertain system (4.11), (4.12). 
Moreover, the absolute stability of this uncertain system implies 
that there exists a constant r {) > 0 such that for any initial 
condition .r(0) = .r c » and any admissible uncertainty input 
«>(■) described by (4.12), then {.»*(■)< »'(■)} £ L2[0, x) and 
lk (-)||'2 + ||«»(-)||2 < co[||. to|| 2 + '/]. Now equation (4.4) implies that 
there exists a constant c\ > 0 such that ||»(f)|| < ri||.r(#)|| for all 
solutions to the closed-loop system (2.1), (4.4). Furthermore, given 
any admissible £(■) = [£o( ) ■ ■ • £a( )] for the closed-loop system 
(2.1), (2.3), (4.4), then w(-) = [£oH, v/n£i (■),■■■- y/nh (•)] 
is an admissible uncertainty input for the uncertain system 
(4.11),(4.12) with d = d Q + r H d„. Hence, we can conclude 
that 




ll* + ilk(- 


+ IlfoHlIa + 



< ro 


k 

lko|| 2 + ^0 4- yV, d M 

ns 1 


7(f) =/^(O + Qu’IO: m = T7(t) 


(4.11) 


(4.14) 
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However, it is straightforward to verify that there exist constants 
ff i > 0 and <T 2 > 0 such that we can write 0 ’i(||h (')||2 + IkHlls + 

ELollU-Jll^ < (l/2n)||.,(-)||3 + i||x(.)||3 + Mo(-)\\l + 
EL, r,|U.(-)||ii, and moreover, + < 1 » + ELi ivM < 

iMIkoll* + EH=o **«]■ Hence, by combining these inequalities with 
(4.14), it follows that ||«(-)||il + ||'(-)||^ + £‘=u ||i,(-)||l < 

(Ta/^j[l|- r ol| a + E,.-o'M- Thai '*>. condition iii) of Definition 
2.2 is satisfied. Conditions ii) and iv) of Definition 2.2 follows 
directly from the stability of the matrix P. Thus, we can now 
conclude that the system (2.1), (2.3) is absolutely stabilizable via the 
control (4.4). □ 

Remark: The above theorem gives a necessary and sufficient 
condition for absolute stabilizability in terms of the solution to a 
corresponding H x control problem. It is well known that such an 
H ' control problem can be solved in terms of two algebraic Riccati 
equations; e.g., see [10|. 

The following corollary is an immediate consequence of the above 
theorem. 

Corollary 4.1: If the uncertain system (2.1). (2.3) satisfies As¬ 
sumptions 4.D-4.6) and is absolutely stabilizable via nonlinear 
control, then it will be absolutely stabilizable via a linear controller 
of the form (4.4). 
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A Linear Algebraic Framework l<;> 
Dynamic Feedback Lineari/utioii 

E. Aranda-Bricaire, C. H. Moog, and J li r 


Abstract —To any accessible nonlinear system we associate i v <» 
infinitesimal Brunovsky form. This gives an algebraic criterion ,( , 
accessibility as well as a generalization of Kranccker control!, diiln. m 
dices. An output function which defines a right-invertible system without 
zero dynamics is shown to exist If and only If the basis ol the Brunov sky 
form can be transformed into a system of exact differential lornis. \ ins 
is equivalent to the system being differentially flat and heitu constitutes 
a necessary and sufficient condition for dynamic feedback lincurirahiliiy. 


1. Introduction 

The problem of exact linearization of a nonlinear systcMn using 
static state feedback was solved in [18| and [22]. It can be shown 
that this problem is linked with the classification of functions (or 
exact one-forms) with respect to their relative degree [8|. When the 
linearization problem can not be solved using static state feedback, it 
is appealing to try to solve it using dynamic state feedback. This note 
emphasizes the links between this new problem and the classification 
of (non necessarily exact) one-forms with respect to their relative 
degree. The dynamic feedback linearization problem was stated in 
its full generality for the first time in |6|: given a nonlinear control 
system 

S: i = ./V) + </(.r)" (1) 

where ,r £ R" . n £ R."\ find a dynamic compensator 

u = £) + iU.tfr 

( = M.r. O+ £)i' 

and an extended set of coordinates : = o(,r. £,) in which the extended 
system reads as a controllable linear one. Relying upon the differential 
geometric approach, sufficient conditions and necessary conditions 
have been given in (7J. In some particular cases, necessary and 
sufficient conditions are given there. A less general formulation of 
the dynamic linearization problem is as follows. Consider a nonlinear 
system where the output p = k(.r), ij £ R"', has been specified. 
If the system is right-invertible, it is always possible to construct 
a dynamic compensator in such a way thut noninteracling control 
is achieved |10], [26|. The standard decoupling feedback provides 
also input-output linearization and if, in addition, the system has 
the property of having no zero dynamics, then it actually solves 
the dynamic linearization problem 119], [21). Thus, the existence 
of such an output function is a sufficient condition for dynamic 
linearizability. 

In 1121, 113], and |241, the notions of linearizing output and 
endogeneous feedback were introduced. A linearizing output is a 
system of functions of r, i/, w, ■■■. which are differentially 
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independent and such that both ac and u can be expressed as functions 
if the <t > t 's and their time derivatives. Existence of a linearizing output 
is called differential flatness of the system. A dynamic compensator 
jf the form (2) such that £ can be expressed as a function of 
jt, w, ii, • ■ •, is called endogeneous. flatness is equivalent to dynamic 
.ineanzability f by endogeneous feedback. Furthermore, recent results 
l 14]* [15] show that no system is linearizable by nonendogeneous 
feedback without being flat. An equivalence relation between systems 
nay be deflned such that differential flatness is equivalence to a 
linear system. This equivalence corresponds to transformations by 
endogeneous feedback. 

In [23], the notion of a dynamic equivalence has been deflned in 
terms of D-algebras. A system is equivalent to a controllable linear 
system if and only if its associated Z)-algebra is free. The set of 
generators of this D-algebra plays the role of linearizing output. 

The goal of the present note is to characterize the existence of 
these linearizing outputs depending on .r, </, w, ■ • ■, deflned in a new 
algebraic approach which can be recovered from the early nolions 
of zero dynamics and infinite zero structure [9], [19], [20), |26]. 
This characterization is equivalent to the system’s property of being 
differentially flat, and hence constitutes, from the results in 114] 
and [15], a necessary and sufficient condition for dynamic feedback 
Unearizability. 

It will be shown that to any nonlinear system, one can associate 
a so-called infinitesimal Brunov sky form which may be viewed 
as a time-varying Brunovsky form of the first-order approximation 
of £ [11]. The construction of this Brunovsky form provides an 
accessibility criterion, as well as a generalization of linear Kronecker 
indices. This form singles out a family of m elements of the formal 
vector space of differential forms, and it is shown that a linearizing 
output exists if, and only if, this family can be transformed into 
a system of exact one-forms via some invertible transformation. A 
preliminary version of this work was presented in [28], where the 
infinitesimal Brunovsky form was defined (it was called nonexact 
instead of infinitesimal), and was shown , to provide a tool to char¬ 
acterize linearizing outputs in the sense of [12], [13], [24|. Let us 
also mention that a more “differential geometric” presentation of the 
present material may be found in [2] and 129], 

Section II is devoted to some preliminaries from [9], and to a 
problem statement of dynamic linearization in terms of the infinite 
zero structure, as in [19]. The infinitesimal Brunovsky form is intro¬ 
duced in Section HI with an algorithmic construction. An accessibility 
criterion is given which involves purely algebraic computations. In 
Section IV, existence of a linearizing output is characterized in terms 
of the infinitesimal Brunovsky form. The above theory is illustrated 
in Section V by the study of various particular cases. Concluding 
remarks are offered in Section VI. 


II. Preliminaries 


A . The Infinite Zero Structure [9], (26] 

Consider the nonlinear control system S. where /(■) and the 
columns of p(-) are meromorphic vector fields. Throughout the 
note it is assumed that rank </(.r) = m. Let K denote the field 
of meromorphic functions of a*, i<, «, ■ ■ ■. The time derivative of a 
function ^ € A, is defined by 









01 r<i> 


Clearly, A.' is closed under time-differentiation. Let £ denote the A - 
vector space spanned by dr, dw, dt/, • * ■. The elements of £ are called 


differential forms of degree one, or simply one-forms, d/dt induces 
a derivation on £ in the following way d/dt: a } dr, 

uj = di>, + a,dv,). The relative degree of a one-foim 

w £ span* {dr] is defined as the smallest integer r such that 
u/’) £ span*- {d.r}. If such an integer does not exist, set r = oo. 

Now, consider the system £ and suppose that the output function 
y = h{j), y £ R u \ has been specified. Introduce the chain of 
subspaces £q C £\ C • ■ ■ C £,> of £, defined by 

fi: = spun v {dr. dj/, • • ■ }. (3) 

The number of zeros at infinity of order less than or equal to A\ for 
1 < A* < »i, is 

ffi = dim (4) 

k - I 

The infinite zero structure can be given either by the list { rr k } or by 
the list {w',} of the orders of the zeros at infinity. £ is said to be 
(right) invertible if a,, = m. Following [9], one has the following. 

Lemma 2.1: Assume that £ is invertible. Let -V: = span* {d.r}, 
>’:= span* {d y {k \ A- > 0}. Then dim(A'n>’) = £,»;. 

Remark 22: Note that although y is, in general, infinite dimen¬ 
sional, the intersection of subspaces A’ D >’ is, at most, of dimension 
ii. The subspace X H 3^ has been first considered in [5] for studying 
minimality in dynamic decoupling. 


B. Dynamic Feedback Linearization Problem Statement 

Any nonlinear system with outputs and which is right-invertible 
can be fully linearized whenever it has no zero dynamics, in the 
sense of the dynamics of the reduced inverse system [201. Thus, 
the absence of zero dynamics is a sufficient condition for dynamic 
feedback linearization [19], [21]. This is equivalent to £,iij — u. 
This yields (more precisely) the following. 

Problem Statement l: Given £, find, if possible, an in- 
dimensional output function y = /i(,r) such that the system is 
right-invertible and £,n' = ?i. ■ 

Solvability of this problem is not necessary for dynamic feedback 
Unearizability. A more general approach to the dynamic feedback 
linearization problem, which originates in [12], 113], [24], where it 
is instrumental to define differential flatness, consists of allowing 
the output function to explicitly depend on the input u as well 
as on a finite number, say, i/ — 1, of its time derivatives. From 
Lemma 2.1, this situation may be stated as the existence of an w- 
dimensional output function y = />(.r, n, • ■ ■. such that the 

system is right-invertible and dim (,V„ H>’) = n 4- ini', where A’„: = 
span* {dr, dn, • ■ ■, dn ( "~ 1 *}. For square invertible systems, one has 
<Y*+y = V-+ y and consequently dim (A' 1/ n>') = dim (A , fl> , )+ mi'. 
So, the more general problem is stated as follows. 

Problem Statement 2: Given £, find, if possible, an integer v and 
an m -dimensional output function y = h(j\ w, • ■ •, l) ) such 
that the system is right-invertible and 

dim (A' H >’) = n. (5) 

If such an output exists, it is called a linearizing output. ■ 


C. Differential Flatness 

In [12], [13], and [24], the notions of linearizing output, differential 
flatness, endogeneous and exogeneous feedback were introduced. In 
[12] and [13], this is done within a differential algebraic framework, 
whereas in [24] the analytic case is also considered. 

Roughly speaking, a linearizing output [12], [13] is a system of 
differentially independent functions & of j\ u, such that .r, i/ 
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tan in lum be expressed as functions of the y s and a finite number 
of then time derivatives This definition is equivalent to the one 
in Problem Statement 2 because condition (S) means that one is 
able to recover the variables i u as tunctions of the outputs iy, 
and their time derivatives* and on the other hand, right invertibility 
ensures that the outputs </ are ditfeientially independent in the 
sense that they do not satisfy any diffeienlial equation independent 
of u We refei the readei to [16] tor an exhaustive discussion of 
differential flatness and its link with an equivalence relation between 
systems 

A system E is said to be diffeientially flat (or simply Hat) it it 
admits a system of linearizing outputs It is proved in [12] [M] 
and 1241 that deferential flatness tor a system 1 is equivalent to 
linearizability via a dynamic compensator C which has the property of 
being endogeneous (this may be defined as the possibility to express 
the vanables as tunctions of / // a ) Recent results [14] | l*)J 
show that flatness is m tact equivalent to dynamic linean/ability 
without any lesirnlion on the natuie of compensator 

III 11Ur Ini ini i lsimai Bri novsio Form 

A A bleu* for the Differential \utor S paee £ 

We shall construct a sequence of subspaccs of £ in the following 
mannci Defint 

H “ span^ {df da) 

h, = [ f Hy ,|~ e Ha 11 n w 

It is cleat that £ D H i ) Hi J H ) and that at the first step 
the above induction yields Hi span A {di } Pmposiiion is d 
sunplt consequence ol the constiuction Ttedhack invanance comes 
from the fact that the rclitive dcgicc ol a out foim is obviously 
invanant undei ngulai static stale feedback bxislencc of the integei 
/ conics (rum the fact that each Ha is a finite dimensional A 
vector space so that at each step either its dimension decreases or 

H mi H a 

Proposition 1 l Ha is the space of one foims which have iclative 
degite gieater than oi equal to A Both the subspaces Ha and 
the integers pi - diniH/ are invanant undei regular static slate 
feedback There exists an integei A* ^ 0 such that Ha +i — 
Ha j — - H 

The lollowing algorithm allows us to explicitly construct bases lor 
the subspaces Ha 

Step I lake {dii di di/| da } {di| di } as 
bases of Ho and Hi 

Step A 4- 1 Suppose that {i/i //,, p\ p, k t , A } {//1 
ip , [ aie bases respectively of Hi i and Ha and let us 
construct a basis tor Hah The elements of Hi +1 are the one 
forms ul 6 Ha such that u. € Ha Let jl = E y A , 1 ), t Ha then 
^ = E y (\,f/, 4 \jif,) It is clear that ^ £ Ha tl and only if 
E y \ ,i/ y € Hh Now note that since »/ / €• Ha i)j must be in Ha i 
so E y \jif / may be written in the following form 



Thus u, e Ha it and only if the coefficients A y satisfy the following 
system of linear equations 

• ^ 

y"V y A y — () 1 < / < in i - pk O) 


This system of equations has p a - p linearly indcp u I i 
being the lank ol the matrix \r ,] 1 hus dim H 
A basis of Hah can be computed as 

f >/ = i _ i 

where (A{ \,' A ) are the pm i independent solutions ^ i ( 

algorithm stops after a finite number A 4 ol steps wku , n in | 
it is not difficult to show that A* < n - m -f 1 

Remettk 12 In general the subspaces Ha nia\ depend m i , 
in the following sense the elements n built using ilu ^ou 
algorithm can be written as hneai combinations ol eh d 
ihe coefficients being functions of i a a Heuveui i i nelul 
inspection of the construction shows that at the / th step these 
coethuents may be chosen to depend at nu>si on a tnd its lust 
A “2 lime derivatives 

B Aeeessrhrlrts C ntaia 

Proposition 1 1 

1 ) H x is the laigest subspace of H > — span A (df da} w hich 
is invariant undet time dillerentiation It is also foi any Is s I the 
largest subspacc ol H i = span* (di da 1 ) 0 * / ^ k] which 
is invariant under lime differentiation 

2) Let |oi n, x } be a basis for H Then the Irohenius 
condition dn AmA A n ^ = 0 1 <_ i p is satisfied 

Piooj Point I is a consequence of the consti uction (it is tltai 
that starting from H / instead ol H in (6) one finds H ifter Is 
steps) The proof of point 2 is given in the Appendix ■ 

In point 2 of Proposition S 1 A indicates the extenor product of 
differential forms These conditions imply intcgrahilit} ol the Piafhan 
system (nj n, ^ } around regulai points accoidmg to the dual 
veision of Frohemus theorem Tht reader who is not familial with 
these matters is referred to II] 

Suhspace H v ma> be interpreted as a codistiihution on II x 
It 1 where A —1 is the maximum number ol input lime dciivatives 
neccssaiy to write a basis of H v Pioposition ^ 1 implies thai this 
codistribution is locally inicgrable around any point where it has 
constant rank I his implies that H is locall) spanned hy p i xact 
one lorms d« i di , where i i i , ^ aic lunctions defined 
amund such legulat points These functions do not depend on a and 
its time derivatives because H, C span A \l\i\ so H x may be 
intcrpietcd as a codistribution on 77 Since lor a I unction f (f) 
df £ H^ is equivalent to i being constant tlong all the vcctoi 
hclds adj /a( ») with / ^ 0 and rp the control vectoi fields (i e to 
be constant along Ihe strong accessibility dislnbution) H^ is around 
legulai points the annihilator of the strong accessibility dislnbution 
I his leads to the following 

Pwposihon 7 4 (At cessihihts C iituia) Ihe following statements 

arc equivalent 

1) System E satisfies the strong accessibility tank condition 

2) Any nonzero one lonn has hmte rclauvt dcgiec 
1) H^ = {()} 

Proposition 1 4 is one ke> result of this note since it allows the 
following construction which we then relate to lineaimng outputs 
if these exist 

C I he Infinitesimal Brunen sAv / orm 

Theorem! 1 S Suppose H = {()} There exists a list of integeis 
{/i i } invariant under regular static stale feedback and m 
onc-lorms -ci u , with relative degrees /1 / such that 
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1) span* 1 < i < w, 0 < j < r, - 1} ss span^ {dz*}; 

2) span* 1 < / < m, 0 < j < r,} = span*; {d.r. di#}; 

3) the forms 1 < i < w, j > 0} are linearly independent. 

In particular, r, = ». 

Proof: Let Wa* be a basis for Ha*. By definition, Wa* and 
Wa - are in Ha* - 1 . Note that Wa ■* and Wa * are linearly independent. 
For, letWA* = {r/ 1 ,- • •. }—then VVa- = {»>i, ■ * ■ o) n * }—and 

suppose that there exist some coefficients {A,, //,} such that 
4- /*,»},) = 0. The linear independence of the s implies 
that not all the //,’s vanish. Now consider the one-form jJ = 
whose time derivative is J = - LA. This implies that 

is in Ha ^ 41 , which is a contradiction. Hence, it is always possible to 
choose a set Wa - -1 (possible empty) such that Wa *, W\ •, Wa - i 
is a basis for Ha*-i. This procedure is repeated k* times, so the 
sequence {'Ha } is shown to have the following structure: 

m = spaiv {W, 0 ’. A < /' < A", « < j < i-k ), 0 < A < A*. 

Hi “ span^ {d.r} and rank ry(.r) =■ w imply that Wu = 

It can be proved by induction that, for 0 < k < A*, the set 
{Wa *, • ■ ■. W^ “ k) , ■ ■ ■, Wa } is linearly independent. Finally, set 
Ja>i. ■ " .^’rn) = {W* «,■■■, Wi }, The invariance of the list of 
relative degrees is rather obvious from the construction. It can also 
be seen from the fact that the number of r, ’s which are equal to k is 
given by «a = card Wa ~ dim Ha/(Ha + i 4 Ha^ i ). ■ 

The following (straightforward) corollary of Theorem 3.5 is the 
reason for the name infinitesimal Brunovsky form: the -c’a's provide 
a basis of one-forms in which the first-order approximation of L 
looks like a linear Brunovsky canonical form [ 111. If these forms 
were integrable, then they would yield a true Brunovsky canonical 
form for system L (for this reason, the term infinitesimal is preferred 
to “non-exact," used in [28]). In any case, the integers r, are nice 
candidates for generalizing to nonlinear systems the notion of linear 
Kronecker controllability indices. 

Corollary 3.6 (The Infinitesimal Brunovsky form): Suppose H x = 
{()}. Then the basis {u>, 1 < / < in. 1 < j < r, } of span* {d.r} 

defined by -j, , = U yields 


; (1 < / < m) 

n hi 

JL 1 , ,, = ,i\A j 4 ,d(ij 

/-i /-i 

where a, j. I>, l € K and [fi /; ] has an inverse in the ring of m x m 
matrices with entries in k. 

IV. Main Results 

A. Some Preliminaries 

An Algebra of Polynomial Operators: Let k[<l/<lt] denote the 
(noncommutative) algebra of polynomials in the operator <l/<lt 
with coefficients in k. The addition and external multiplication 
are the usual ones. The internal multiplication corresponds to 
operators composition: {(l/<11){p) = p(d/dt) 4 p, Vp € K\ The 
only invertible elements in k\<t/dt] are the nonzero elements of k 
(i.e., nonzero polynomials of degree zero). Let k ,u *"'[<! /dt] denote 
the algebra of m x m matrices with entries in k'[d/d1]. Let l'“ 
be the differential A-vector space spanned by m-tuples of one- 
forms. Each P € V" *"'[</////] defines a differential operator in 
F VI : P 12 =r. for all 12 = • .a?,,,) 1 E where 

S2 1 " = and P:= € V'" x,,, [rf/rf/]. 


Invertible elements of AJ r " x \d/dt] play an important role; they are 
elements P 6 k u '* 1n [d/dt] such that there exists Q 6 AJ’" x,,, [r//df] 
such that P ■ Q = Q * P = /„,. 

Definition 4.1 (Structure at Infinity for One-Farms): 12 = (^i, 
) 7 is said to have ok = dim (span^ {d.r, 12, 
i^'j/span^ zeros at infinity of order less 

than or equal to k. 

For exact one-forms = d/*,), Definition 4.1 coincides with 
(3) and (4), since exterior differentiation and time-differentiation 
commute, so that the notations of Section II may be adopted verbatim. 
In particular, Lemma 2.1 is also valid for systems of one-forms. 

Proposition 4.2: Consider the system of m one-forms ih = 
(jji. • • •, jj„, ) 7 , and the polynomial matrix operator P 6 
k' n,y "‘\<1/Jt]. Let P il Then 

dim (.VplTspan x A- > 0}) < dim [X lf Hspan^ {12 U) , k > 0}) 

where v is an integer large enough such that Q and 12 belong to X„. 

Proof: Suppose P has degree a. Straightforward computations 
show that 12 |A \ for k > 0, can be written as a linear combination 
of the following form !2 (A ] = /?o!2 4 P\ 12 4 ■ ■ ■ 4 f?A+r.^ ( * 1 ,} - 
Thus, span^ {!2 a) . k > 0} C span v {12 U) , k > 0} and the result 
follows. ■ 

B. The Results 

Our main result is the following. It is an easy consequence of 
Theorem 3.5 and Proposition 4.2. 

Theorem 4.3 (Problem Statement 2): Suppose H x ~ {()}. There 
exists a system of linearizing outputs if and only if there exists 
an invertible polynomial operator P £ k Xui [d/<lt] such that 
d(/ J !2) = 0, where 12 = (ycj, ■ ■ ■. ) y is a system of one-forms 
characterized by Theorem 3.5. 

Proof: 

Necessity: Suppose y - h(r, «, ■ ■ ■. u {1 ' u ) is a linearizing 
output. Problem Statement 2 implies that f = J. Theorem 3.5 
implies that F = span^ {12 (A L k > 0}. Thus, there exist polynomial 
matrix operators P, Q such that 6y = Pil and 12 = (jdu. Clearly, 
PQ ^ QP — I n , and hence P is invertible. Moreover, d(P12) = 
d(diy) = 0. 

Sufficiency. Let A” = dim(.r,; D span^ {12 (A) . k > ()}), 
y = dim(.V„ H span K {12 (A) . k > ()}), where 12 - Pil. Theorem 
3.5 implies that A r = »4 mu. Existence of the operator P implies 
X < X. Invertibility of P implies the existence of an operator Q 
such that 12 = Qil, i.e., X < X and hence .V = A T . The result 
follows because one can assume, without loss of generality, that 
12 = di *(.r. u, ■ ■. f# (,/ 1 M. i 1 is a linearizing output. ■ 

Theorem 4.3 relates linearizing outputs, if they exist, to the set of 
differential one-forms built in Theorem 3.5 for arbitrary accessible 
systems. It provides an alternative way to tackle the problem of 
deciding whether linearizing outputs exist, i.e., whether a given 
system is linearizable by endogeneous dynamic feedback, by looking 
for an invertible matrix P meeting the above conditions. This does 
not provide a practically checkable criterion because the degree (in 
the operator d/dt) of die matrix P is not known a priori , which 
prevents the condition of the theorem from being finitely checkable. 
By forcing P to have degree zero (i.e., to be an invertible matrix 
with entries in V), the problem is made finite, and one obtains the 
following sufficient condition. 

Corollary 4.4: A sufficient condition for the existence of a sys¬ 
tem of linearizing outputs is that a system of one-forms 12 = 
(jj \, ■ ■ ■, +'n,) 1 satisfying the conditions of Theorem 3.5 satisfy the 
Frobenius condition 

d-t>, A u.’i A • ■ ■ A uJni — 0, l < i < m. (8) 
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The relative degrees of a>i, ■ ■ ■. coincide with the orders of the 
zeros at infinity of the linearizing outputs. 

Proof: The Frobenius condition implies that there exists a basis 
composed of exact one-forms for the codistribution spanned by 
■ ■ *uj„,} ( and hence that there exists an invertible matrix (with 
entries in K) relating this basis to . - ■ ■, }. ■ 

The condition (8) is obviously finitely (and easily) checkable once 
some uVs have been constructed. It is of course not a necessary 
condition, and it should be noted that, in general, it depends on 
the choice of the u/,’s: some systems of one-forms statisfying the 
conditions of Theorem 3.5 may satisfy the Frobenius condition (8), 
whereas some others do not. Even for a linear system, a wrong 
choice of the one-forms -ui. ■ ■ ■, *>,„ prevents condition (K) from 
being satisfied. However, in many practical cases (see the proofs 
of the results in Section V or |28, Section 3.2J), it is not difficult 
to round this difficulty and check whether (8) is met for one of the 
possible choices of the ’s. 

A way for bounding the degree of P is to look for linearizing 
outputs depending on ./■ only, as illustrated by the following result. 

Theorem 4.5 (Problem Statement 1): Suppose = {()}. Then 
there exists a system of linearizing outputs which depend only on 
r if and only if the conditions of Theorem 4.3 are satisfied and, in 
addition, deg ( P,, (d/dt )) < r, - 1, 1 < /, < in. 

Proof: Sufficiency is obvious. Conversely, suppose that one of 
the polynomial elements of the matrix P(d/dt), say P,j((l/(It) has 
degree equal to r,. Thus, d h, contains a term which depends on 
dn and that cannot be eliminated by the remaining terms since, 
by construction, all the ^\ k) are linearly independent. This is a 
contradiction. ■ 

Note that it is very easy to write down some similar criteria for 
the existence of linearizing outputs depending on .r, u. and any 
finite number of time derivatives of u. Such types of conditions as 
the ones given in Theorem 4.5 can be restated as existence of a 
finite number of functions—the coefficients of the polynomial entries 
of P —meeting some differential conditions, namely, d (Pil) = 0 
and P invertible. A possible way to avoid writing the relations on 
the entries of P tor it to be invertible is lo write P as a finite 
product of elementary invertible matrices and taking the coefficients 
of these elementary matrices as unknowns instead of the entries 
of P itself; this is exploited in [27]. Checking whether there exLst 
some linearizing outputs depending on .i\ //, and any finite number 
of time derivatives of it therefore amounts to checking whether a 
finite set of PDF's in a finite number of unknown functions has a 
solution. 

This is not new since it is easy (although tedious) to write down 
the PDE’s which have to be satisfied by the linearizing outputs 
themselves, if they arc restricted to depend on ,r only. This is the 
underlying idea of the characterizations given in some particular 
cases (see, for instance, [25]). We, however, believe that looking 
for the invertible matrix P once the uj* \ have been constructed is 
more natural and more tractable. This is illustrated by the very short 
proofs of the theorems of next section, which arc known but usually 
not so natural to prove, and by results like the ones obtained in [28, 
Section 3.2], [271, [3. Theorem 5.4], which work out some nontrivial 
particular cases. 


V. Particular Cases 

In this section we recover some classical results using the infini¬ 
tesimal Brunovsky form. 

Theorem 5.J (Static State Feedback Linearization)- System E is 
linearizablc by static state feedback if, and only if, Ti^ = (Of and, 
for A* = Hk is completely integrable. 


This is, of course, equivalent lo the early charuciuiMtin,, 
and [22], or to the more recent one given in 117 ', , 

infinitesimal Brunovsky form provides a very short puu l 

Proof: If each Hk is completely integrable, one m o ui 
forms, say* *), = d<A(*0. i = 1. * ■ ■. w in Theorem < s <\ iS 
in functions < ,(.!■) whose relative degrees salisf> ' , , 

whose decoupling matrix [21) is nonsingular because tlu. s uno i 
their time derivatives are linearly independent, this complete \\ K > u 
part. The converse is obvious since the Hk \ are invariant hv sum 
feedback, and are integrable for a linear system ■ 

Theorem 5.2 (Single-Input Systems). Lei I he a smgle-inpui svs 
tern and suppose H * =- (0). Then there is only one diflercniiul form 
in Theorem 3.5, and the following statements me equivalent: 

1) D is linearizable by static stale feedback; 2) \ is lineaii/able 
by dynamic state feedback; 3) da’i A^i = 0. where - i is such that 
H u = span^ {^i}. 

This result (equivalence between 1 and 2) was first obtained in |6| 
and [7], The infinitesimal Brunovsky form—note that um is invariant 
up to a nonzero multiplicative function—allows us to give the simple 
characterization 3 and the following very simple proof. 

Proof: It is obvious from Theorem 5.1 and Theorem 4.3 because 
the only invertible elements of K'[d/dt] are those having degree 0. In 
turn, multiplication by a nonzero function does not change the rank 
of the differential form du;i Aa'i. 

Theorem 5.3 (Systems with in = n — 1 mpul\ 17/)■ A system D 
with in =n — l inputs is linearizable by dynamic stale feedback 
if, and only if, H -v = {0}. 

Proof: H‘i is generated by a single nonzero one-form ai 
which is orthogonal to the distribution spanned by the vector 
fields (j \, • - ■. and thus can be chosen independent of u, 

-t’j■ ■ ■ ■. -1 can be chosen arbitrarily, linearly independent of 

{u'i, lL'i [ and belonging lo span* {d.r}: they can also be chosen 
independent of u. Then, the differential forms d^\ A u.'j A ■ A 
i — 1. ■ • ■, in, are zero because they are (n + 1 )-forms in n variables. 
The converse is obvious. ■ 


VI. Conclusion 

We have built a so-called infinitesimal Brunovsky form which 
exhibits m controllable blocks whose dimensions play the role of 
Kronecker controllability indices in the liner case. This extension to 
the nonlinear case is innovative since, to our best knowledge, it is 
the only available one with the property that the sum of these indices 
equals the dimension of the strong accessibility distribution. This 
result on nonlinear accessibility was used lo derive a necessary and 
sufficient condition for existence of a linearizing output, and the early 
results in (either static or dynamic) feedback linearization have been 
shown to fit naturally in our formalism. Static feedback linearization 
was shown to be a matter of exact one-forms whereas dynamic 
feedback linearization is a matter of possibly nonexact one-forms. 


Appendix 

Proof of Proposition 3.3, Point 2 

Define T* to be the dual vector space of (topology induced by 
\\52a, di',||“ - dr/s taken among the d.r,'s or the dv[ l] ’s, both 
sums are finite), whose elements—“vector fields”—are of the form 


J-t-Er+tP.-sa 

*-1 k — \ /*>Q k 


where h, k are in and 0/0u\ l) are defined by 

{0/O'Vt . d.r,#) = <\ v (d/dit[ J) . 6u[ J , } ) = bj a'» (p/0n[ l) . 
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dx,) « {d/d,r t < dnj^) — 0. Note that the above linear combination 
need not be finite, but does define an element of €* because, for all 
£ f, {-V, w) may be written a 6 a sum with only finitely many 
nonzero terms. Define the “time-derivative” of A € S* by 

(X^)=\X<u>j-{X,l'). (9) 

The interior product (or hook) / A”) is defined, for a differential 
form of degree two // = i A dr,, by i(X)p = 

/((A', dis)drj - (A*. dt^di',). Clearly, taking the time- 
derivative of both sides in this identity yields, from (9), 

i(X)n = i(.Y)/i + (10) 

L£t n i, • * ^ be a basis of H x , and define the following subspace 

of f * 

Qoc * {A € f *| Vu> € , (A, a.-) = 0 and 

ni A ■ ■ ■ A A /(A”)dw = 0} 

whose elements are sometimes called (4j the “Cauchy characteristic 
vector fields” of Tic*. The “characteristic system” (or “retract¬ 
ing space” according to [4]) of H is defined as its annihilator: 
C('H ^) = . For a certain K > 0, the elements r>i, • • ■, of 

the chosen basis for Hr c can be written as linear combinations of 
dxi.■ -.dx,, with coefficients function of x, u, , i/ (h) only, 
hence all the da,’s are linear combinations of elements of the form 
d t'j A dx/ or dx ; A di/J^ with l < A\ so that all the elements 
0/du[ j) are in <?-* for j > I\ + 1, and finally that C(H^ ) is a 
subspace of span* {dx, di/, d<’/, ■ ■ ■. d u {h ] }. Furthermore, we have 
the following. 

Lemma: C{H^) is invariant by time-differentiation. 

From point 1 of Proposition 3.3, this implies C(H^) C Hr*., and 
hence, from standard exterior algebra, dru A A • • ■ A n, JrK = 0 
for / = 1 . * • ■ ,p-x . 

Proof of the Lemma: Consider A’ €£?*,*>€ H x . We have 
[compare (9) and (10)1 

(.V.,) = (M-(-V. J) 

oi A ■ • ■ A a J)lc A i(X) d*s 

* - N 

= n ] A ■ ■ ■ A a Pat A /( X ) du, 1 — o i A ■ ■ ■ A <\ fioc A i(X ) du; — ■ ■ ■ 
- o i A * ■ ■ A A i( X ) d-J - oi A • * ■ A A /(.Y) dJ;. 

Since (X, uc) and m A ■ • • A o,,^ A /(A')d^ are identically zero, 
and and x are in H^ (because H^ is invariant by 

time-differentiation), all the terms on the right-hand sides above are 
zero, which implies {A\ u>) = 0 and ai A ■ * ■ A n, i(XJ A /(A')d-j = 0. 
Hence, X is in Q * because this is true for all a: G H^ . Now, take 7 / 

in C(H*c). For all .Y £ Q (A’, r)) = (A\ ?/) -(A\ //) = 0 where 

(A', v) is zero because (A', if) is identically zero, and (A\ if) is zero 
because .Y is in {?*,. This proves if £ C[7i* ) 7 ) 6 ^(Hr X ), and 

therefore the lemma. ■ 
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Design of a Class of Luenberger and (2) for any given admissible initial condition .>n, , 

Observers for Descriptor Systems unique. By treating y formally as some passive mpuK i M 

proven following the analysis by. for example, Cumim.n ii i 
M. Hou and P. C. Milller Flether [14). 

In the observer design for descriptor systems, the mud \u •, 

Abstract —A new approach to the design of a class of Luenberger 
observers for descriptor systems is presented. The design is performed 
In a straightforward manner. The conditions required in the design are 

given in terms of the original system matrices. The relationship among _ . ^ . 

these and other well-known conditions is highlighted. an ^ A E C, Re(A) > () is needed at any rale This condition 

implies column regularity of 


-A E + .4 
C 


1. Introduction 

The design of Luenberger-type observers lor descriptor systems has 
obtained considerable attention in the past decade, see, e.g. 11H11J. 

Descriptor systems under consideration have the following form: 

Ex = Ax + Du (1) 

y = Cx + Dtt (2) 

where .<■ E R ", u € R\ y E R m are the descripior variables, control, 
and measurement vectors, respectively. E , .4 E D , C\ and 

D are known real matrices of appropriate dimensions, rank E - /•, 
()</■< ii. 

Mostly, the matrices F and .4 are assumed to be square, i.e., q = //, 
and the matrix pencil A E - 4 is required to be regular. However, 
as shown in [3] and [10], concerning observer design this kind of 
assumption is not needed. Therefore, no assumption on A E — .4 is 
made in this note. 

We accept piecewise continuous distributions as solutions 1121. It is 
well known that arbitrary initial conditions £\r(0_) and inputs u(1) 
are admissible for a descriptor system with a regular matrix pencil. 
However, since no assumption on A E — 1 has heen made in (I), 
we should assume that u and a certain but unknown initial condition 
£>(()_) arc admissible, i.e., they do not result in contrary equations 
in (1). More precisely, /?.#•(0 ) and //(/) are said to be admissible 
if they satisfy rank [—a E 4- 4 F.r{i) )BT r (s)] = rank( sE - .4) 
for some » 6 C, where £ T (.s) is the Laplace transformation of u{1). 
Thus, there exists at least one trajectory obeying (1) and (2). It is 
worth pointing out that the admissibility and consistency of the initial 
condition £>(()_ ) are two different concepts. The former implies 
the existence of solution(s), while the latter guarantees further the 
impulse-tree behavior of the solution(s). 

Certainly, if A E - .4 is not of full column rank, i.e., for any 
A E Crank(A£ - .4) < a. then there exist more than one 
trajectory satisfying (1). Here, we emphasize the difference between 
the description of a system, e.g,, (1). and the real system. Although 
the description (1) is of course assumed to be correct, it may be 
incomplete. The column singularity of AE-.4 reflects incompleteness 
of the description. In such a case, one cannot answer the question of 
uniqueness of trajectories only from (1). However, if 


is column regular, i.e., there exists A E C such that this extended 
matrix pencil has rank n, then the solution x(t) satisfying (1) 
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-A£ + /T 
C 

and therefore uniqueness of the trajectory. This means ih:it once an 
observer for a descriptor system is obtained, the obscrvci supplies 
the estimation of a trajectory, and this trajectory of the underlying 
system has to he unique even it uniqueness of the trajectoi) nui> not 
he recognized from (I) alone hetoie. 

Efforts have been made to design observers lor descriptor systems 
under different conditions using various techniques. A series of results 
of Paraskevopoulos ct al. |7], |8], [ 15] devote to the design of a class 
of Luenberger observers of the form 


- + F 2 y + E\u. 

. 6 B’ 

(3) 

x - F \; + Fn y 


(4) 


where .»(/) goes to .v{t) as t tends to infinity for any initial condition 
;(()) and arbitrary admissible £\r(0_). The underlying descriptor 
systems in [7] and [81 are of the form (1), (2) with the assumptions 
|A£ - A\ £ 0 and D = 0. 

The more general Luenberger observer has the same form as (3) 
and (4) except that (4) also contains the inpul term F y u. Generally, 
this kind of observer needs less restrictive conditions than that of the 
above class of Luenberger observers as shown in 116]. The research is 
progressing toward achievement of necessary and sufficient existence 
conditions as well as design procedures of Luenberger observers for 
descriptor systems. Although for the class of Luenberger observers of 
the form (3), (4) an effort has been made in [7], |8) to approach this 
goal, the resulted existence conditions can be proved to be sufficient 
only 110], [16]. On the other hand, the design in [7], |8| is rather 
involved. 

The aim of this note is to propose a straightforward design of the 
observers described by (3) and (4), and to provide the corresponding 
conditions in terms of the original system matrices. The treatment in 
this note permits the general descriptor systems like (1), (2) to be 
taken into consideration. The existence conditions in the simple form 
enable us to compare them to the existing ones. 

Finally, it is worth noting that a natural choice of observer form 
may be 

E.r = Ax + Du + L(y — Cx — Du). (5) 

Thus, the design of Luenberger observers can be addressed as finding 
L such that in the error equation Ev = (A + LC)i , where v = 
.r - x. r is free of impulsive behavior and goes to zero as 1 tends to 
infinity. This goal is nevertheless not reachable if F and .1 have fewer 
rows than columns. Therefore, the form (5) is not recommended lor 
dealing with the observer design for descriptor systems of the type 
(1), (2). 
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n. Existence Conditions and Design Procedure it is easy to verify that the condition (7) holds if and only if A r , has 

The main result of this note is as follows. Its proof provides a column rank n ~ r - 
design procedure. Solving (13) for fa, substituting the solution into (11) and (12), and 

Theorem 2.1: There exists an observer of the type (3), (4) if the pntmulliplying (13) by (I„ - A,At ), where ,i = g + m - ■>, we get 
following two conditions,are fulfilled: ^ = +,£,+£,,„ +G,j/ (16) 

_f X P + ^ 1 _ . u\ ^ n. m i v n /i-\ 


VA € C, Re (A) > 0, 


y = ?)ii+^2i< ( y = Hy) 


E 

A 

B m 

r e 

B' 

0 

C 

D 

= 71 + rank ^ 

D 

0 

E 

0 



Proof: Using the singular value decomposition technique [18], 
one can always find two nonsingular orthogonal matrices P € R qXl1 
and Q € R HXn such that PEQ = diag(£,,0), where S, is 
nonsingular. Thus, by choosing P = diag(Sr‘- ft-r )ft one has 

diag(P,J„,)[J £ diag(Q, Q, I.) 



7, 

0 

.4u 

An 

Du' 

= 

0 

0 

-+21 

An 

Du 


0 

0 

Cu 

Cu 

D 


We can further find an orthogonal matrix ft € f2 (,/ “ ,+ ' r ' )x( ' / 
such that 

p [Ail _ [A* 

1 [ D J “ [ 0 

where Bx € has full row rank with 

3 = rank [f nl- 


.421 -4 22 

D n 

_ I ^ 2 

.4, 

Bi 

C n Ci 2 

D 

a 4 

.4--. 

0 


dJ v 


All A12 
Ax Aj 

A 4 A-, 


E 

A 

B 

0 

C 

D 

0 

E 

0 


r7 

0 

.4,, 

-4,2 

Du 

0 

0 

■42 

.4:, 

Di 

0 

0 

-4i 

-4s 

0 

0 

0 

I, 

0 

0 

.0 

0 

0 

0 

0 


= —AjA 4 £i + Ag ft, 2if 

where Ai = An — AiaAfAi, G’i = AijA+ft.j, 
(A? Ar, J " 1 A<-,, 


(18) 

At = 


_ A 2 — A. 1 A 5 A 4 
1 “ (ft-A,A+)A 4 * 


71 — 

Dl ~ 0 


rr _ ft, 2 “ A 3 Af ft, 2 

(/,*- Ar„4+)ft 2 


Now it is clear if the pair {Ai, (^i} is detectable, one can design 
a full-order observer for the subsystem described by (16), (17) using 
the standard techniques. Then, combining the observer with (18) and 
considering the relation . 1 * = Q£, one finishes the observe design. 
The observer has the form (3), (4) with the order r. 

In the following, we prove that under the condition (7), the pair 
{Ai, Ci} is detectable if and only if the condition (6) holds. 

Consider only 


-A E + A 

C 


.4,2 


-4., 


.4., 

.4,2 


.4, 

4, 

0 


In , 


T 5 = diag(/,, ft)diag(P, ft,) and Q = diag(0, Q< h). In view 
of (8) and (9), we get 


This means that the System (1), (2) can be rewritten as 

£1 = Anti + A 12£2 + Bi\ u — Pi, xy (11) 

0 = A 2 G + An (j + Bxu — ft 2 y (12) 

0 = A 4 £i + Ar £2 * ft ±y (13) 

where C = [{?’ fcf £ 3 r ] T = Q T .r\ [P, r 2 P'h P, r 2 ] r is the last 
(g + "i ) columns of T. Due to the special structure of T. it is easy 
to verify that ft 2 = 0. The row partition of the last (q + m) columns 
of P is obvious according to (11M13). 

Since 


diag(P, ft) 


rp rj-j /? 0 Bn 

D «*iag(<?./,)= 0 0 Di 

L J 0 0 0 


■4. 

-A, 

-XI, + .4,i 

Aj 


-4^.4, /„ ,J 

—XI, +(.4,i —+,2.4^.4,) 0 

A, - .4,.4^4, 0 

4+).4, 0 

M A, I„- 


where the second equality is obtained by premultiplying the line block 
[A 4 A-,] in the first equality by the column nonsingular matrix 

ft-ArsA+1 n 

At J' D 

The above observer is of order r. Noting the special structure of 
Bx, one knows that the second part of the measurement Ij in (17) is 
free of inputs. Hence, it is easy to design a reduced-order observer in 
the form (3), (4) for the subsystem (16), (17) following, for instance a 
similar technique in [23]. The resulting observer will have the order p 

p = r - rank (ft - A*, At )A 4 
= r - {rank [A 4 Ar, ] - rankAr.} 

J \E A B 1 \E B] . 

= , -\ rank 0 C D " rank 0 D 


- (» - r)j 


= n + rank 


which is directly obtainable by using the form (16H18). 

In the next section we shall prove that the conditions (6), (7) and 
the order (21) are equivalent to those given in [8] provided A E - A 
is regular and D = 0. Nevertheless, the condition (7) combining with 
the condition (6) is still only sufficient for the design of observers 
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of the type (3), (4), and the order (21) is not minimal generally 
Ihese can be shown by constructing counterexamples. For instance, 
LxampJc 2 jn |10] serves as a counterexample showing nonnecessity 
)f (7) lor the design of observers ol the type (3), (4) 


HI RFl ATIONSHIP AMONG EXISTING DESIGN CONDITIONS 
In [7) and )8|, the condition coriesponding to (7) is 

rank {[r] 7 '[\]} = ' (22) 

where ^ = rank[F /?], / is the permutation matrix such that 


./(/«r+ A)-'[r n\ = 


r, 


n, 

n>, 


(24) 


where \Ei Di] has lull row rank and 3 is the unique solution 
ol the equation \Eu Bn] — 3 [Li II/], // is an arbitiary numbei 
such that pi + f is nonsingulai Obviously the matrix pencil 
\E — 4 is assumed to be regular Since the condition (7) docs 
not imply this assumption and the matrix 1) in (7) is generally 
iion/cto. the lollowing pioposilion indicates that the condition (7) 
is a geneiali/ation of the condition (22) 

Pioposition 1 I Provided the matnx pencil \F — A is regulai, 
the following conditions are equivalent 


a) lank 

b) rank 

c) rank 


7 /' 7 J - iank [7 D} 


(/i/- + -») [/ n]\r n\ 


}■ 


I i n 
o ( o 
o r o 


— a t* iank [F B ] 


(24) 

ank [ b B] 

(25) 

(2b) 


when* the notation ( ) icpiesents an arbitiary genciali/ed inverse 
of ( ) satisfying ( )( ) ( ) - ( ) 

Pwof 1 irst we prove the equivalence between a) and b) 

Since 

7 

( 


rank 


</»r+ 4) ‘[r 


n\\r d\ } 


iank 


SS 1 n,,L + 4) 


(PI" 

[r D][T B] (,<r+ 4)/' s} (27) 

for any permutation matrix 7 and any nonsingular matrix S, by 
choosing / as in (23) and 


one has 

s _l /(/»r+ it ‘[f d\ - 


0 


I, 


h t 
-4 /, 


l , B, 
r,/ Bn 


Ei 

0 


B, 

0 


(28) 


which leads to 

S- 1 /(///•'+ 4) '[r B}[1 27] (/»/ + 4 )J‘ s 

~ {S 1 /(/if + 4) '[F «]}{S 1 7(/iF+ 4) '[r D]Y 

(29) 

This, in view of (27), confirms the equivalence between a) and b) 


Ei nr 

T, 

27/ ‘ 



0' 

0 0 

» 

0 


0 

0 


Now we prove the equivalence between 1)) win t 
verity 


./(/<£+ 4) 1 

0 

0 ’ 

£ 

4 

B 

7 , ■ 

0 

0 

/, 

0 

( 

(l 

o / 

0 

In 

0 

0 

F 

0 

(i p 





7, 

/ 

0 1 




- 

0 

n 

/ n 





0 

//i 

// n , 


where 


7 ‘ 

j 1 

V 

II2 = 

F 

j' 

i) 

( 


\ 


C 


/ 


Then, due to the full row rank of [F/ Bi] the conclusion is ob\imis 

U 

In ordei to design Luenberger obscivcrs, ditkieni conditions 
arc adopted in various design techniques As shown h\ Dai |S| 
the condition (6) is ncccssaiy (but geneiall) not sulliucnl) loi 
any Luenbeigei type observei which piovides the estimation of the 
descriptor \ector » We need only to compare the conditions which 
are additionally required foi designing Luenbeigei obscivers In the 
following, we outline these existing additional conditions which are 
given in terms of ihe original s>siem matrices and foi releicnce we 
cite some coiresponding works in which these conditions aie used to 
design 1 uenberger observei s 


a) 

b) 


c) 


d) 


iank 


7 

C 


F 


iank 


0 

0 


r r 


rank 


0 


r 


iank 


0 

0 

0 


- n (see |2| |I9| |2I|) 


4 

( 

£ 

4 

( 

E 

4 

r 

0 

0 


B 

0 

0 


— n 4 rank [/ B \ 

(see this note and [71 |H]) 


-nH-iankF (see |4] [<)| |22J) 


B 

0 

0 

0 


n 

4 

( 

F 


— n 4 rank 


I 

0 


4 B‘ 

1 U 

(see 11()|) 


Remit)k 1 / For the convenience of comparison, D - 0 is 
assumed in (2) Thus, the condition (7) now has the loim b) 
Remark 12 In a iccent papei 11 11 the necessary and sufficient 
conditions aie provided lor the Luenbergci observei design in leims 
ol the Weicistrass lorm of a transformed dcscnploi system Therefore, 
the underlying descriptor systems die assumed to he regular As a 
matter of tact, in terms of the Kroncckcr form 113), the same can be 
done for arbitrary descriptor systems 

It is known that, provided the necessaiy condition (ft) holds undei 
either conditions a) or b), the class ol Luenberger observers ol the 
form (3) (4) can be designed But under neither c) noi d) can this 
be guaranteed It is also known that a general Lucnbciger observer 
l e , in (4), the input term may exist, can be designed undei each 
of the conditions a)-d) combining with the condition (ft) Since 
all these conditions a)-d) with (6) are known to be sufficient loi 
the corresponding cases, it is interesting to know what kind of 
relationship exists among these conditions The lollowing pioposilion 
gives the answer 

Proposition 12 There exists the iclation a) b) 4* /) <h 
but, in general, n) <* b) t ) </) Heie, for instance .i) =* b) 

represents that b) tan be deduced tiom a) 
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Proof: On account of (10), without loss of generality, the 
matrices E , .4, B, and C arc assumed to be in the forms 



7 

O' 


'.4 n 

A\2 


() 

0 

A - 

.4, 

A* 


0 

0 


.4, 



B = 


'B n ' 

Du 

0 


r = [ri, c, 3 ] 


(31) 


where B? has full row rank. By using these forms it is easy to prove 
:he conclusion. □ 

Proposition 3.2 shows that d) is the weakest condition among a)-d). 

Finally, we provide a conclusion about the order of the observers. 
This result is parallel to Proposition 3.1, and its proof is analogous 
;o that of Proposition 3.1. 

Proposition 3.3: Provided the matrix pencil A E - .4 is regular 
and D = 0, the reduced order (21) of the designed observer can be 
expressed in the following different ways: 


a) p = rank [E 

b) p = rank [E 


B ] - rank |c./ J [ 7 ^]} 

B] -rank {C(/<E + ,1) _1 [£ 


c) 7 * = rank [E 


B ] - rank 


E 

0 


.4 

C 


n 

o 


+ n. 


(32) 

D][E B)-} 

(33) 

(34) 


Remark 3.3: The expression (32) was given in [8|. 


IV. Conclusion 

A class of Luenberger observers in [7] and [8| can easily be 
designed under two simple conditions. The conditions in terms of 
the original system matrices are shown to be equivalent to those 
in [7] and [81. Furthermore, the present design is available for the 
more general class of linear descriptor systems. However, in contrast 
with the claim in [7] and [8], the design conditions are known to be 
sufficient in usual cases. The exact relationship among the different 
design conditions in literature which are given in terms of the original 
system matrices is shown. 
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Abstract —This correspondence supplements the remarks made In a 
paper by Dahleh <4 al. as to the Popov sector of Lur’e systems with 
interval plants. A numerical example is provided to show that strict 
inequalities hold among three sectors: those obtained by the theorem 
of the paper, 1 by applying Popov theorem to each member system, and 
by applying that to each of the systems with Kharitonov plants. This 
counterexample negates a tempting conjecture on “strong” robust Popov 
criterion. 

In the above paper, 1 Dahleh, Tesi and Vicino derived the robust 
Popov criterion for a Lur’e system including a sector-type nonlin¬ 
earity and a family of interval plants. Hereafter, we mostly employ 
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Fig. 2 Critical Popov lines and corresponding sectors. 


the notations of the paper . 1 In particular, by “sector A,” we mean 
the sector [0. A*]. In short, the criterion says that we have only to 
test absolute stability with Popov theorem for 16 Lur’e systems that 
correspond to Kharitonov plants. The caveat is, however, that we 
should choose a fixed multiplier constant 0 / for all these systems. 
This leads to a lower bound hi for the sector Ag which is obtainable 
by applying the Popov theorem to each member Lur’e system with the 
multiplier depending upon the member plant. If we need to apply the 
Popov theorem to only the 16 extreme systems with possibly different 
multipliers, then wc have the sector A „ which is larger than or equal 
to h\;. Thus, wc have, in general, A/ < A\? £ A „, as remarked in 
the paper . 1 

Here, what we arc intrigued by is the question: can the last 
inequality always be equality? II so, we would be able to obtain the 
“strong” robust Popov criterion, which we can afford most using the 
original Popov theorem. This templing conjecture was also posed in 
11). Unfortunately, a counterexample has been found by an extensive 
search. 

Setting O.s) = 1, as in the paper , 1 we consider a stable interval 
plant, 


Fig. J shows the Popov loci for i/n(s) and ) (using a 
solid curve) and that for (using a dashed curve). We can 

draw the line h touching both of the Kharitonov loci, and this 
determines the sector obtained by “weak” robust Popov criterion. For 
A K\ — 7.3G3- • • (corresponds to -1/A in Fig. 1 ), the criterion 
is no more effective, but we can still draw the lines. In and /n, in 
whose right-hand side the Kharitonov loci are located. However, it is 
impossible to draw the Popov line for </„„i at the point (— 1/A". 0). 
This indicates that the “strong” robust Popov criterion fails to hold. 

In order to obtain Av, we have to resort to partitioning the intervals 
into evenly spaced grids. For each coefficient value at the grid, the 
existence of the multiplier is examined. For this task, a symbolic 
computation approach is availahlc [2]. Fig. 2 illustrates three critical 
Popov lines for the above interval plant. These lines determine the 
Popov sectors: hi — A*/: = L82G---.A\; = A ^: = 5.771- ■, 
and A„ = AV-= 7.3G3 - -. The plant which gives A\; in 

this ligure is slightly different from ry„„ f (.s) in Fig. 1 , in that the 
coefficient 9 of the hist degree term in the denominator in (4) is 
changed to 8. We sec, from this example, that the inequalities among 
the sectors can be actually strict: A i < Av < A l( . 


_0.01 .s +J_ 

■s r ’ + 5.s * 4- a i.s 1 + 11* J -M i * + 1 


( 1 ) 


where a, G [ 8 , 11], a \ G [10, 12]. For this family, we have the 
following two Kharitonov plants: 

fji i ( - s ) = r/n(.s) =■ V] {*)/D l (.S') - N\(s)/D\(s) 


H r ‘ + 5.S* + 10* < + U.«‘ J + 11a, + 1 


012(a) = f/l »(*) = A'l(.*»)//> 2 (*) = A'|(s')/D.|(.s) 

0.01*4-1 

“ .S r - + 5.S 4 4- 12.S * 4- 41.s 2 4- 8-s + 1 ’ 


(3) 
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An Output Feedback Globally Stable 
Controller for Induction Motors 

Gerardo Espinosa-P6rez and Romeo Ortega 

Abstract — We present here a globally stable nonlinear dynamic output 
feedback controller for torque tracking and flux regulation of Induction 
motors. The control law Is globally defined (even In startup), requires 
only measurement of stator variables and rotor speed, and does not 
rely on cancellation of the systems nonlinearities. Our work extends in 
several directions the result of the paper by Ortega et a/., where the 
torque tracking problem was solved for a model where the variables 
are expressed in a frame rotating at an arbitrary angular frequency (dq 
model). First, we obviate the need to transfer the dq control signals of the 
paper by Ortega etaL, to the physical input variables in the stator frame, 
hence providing a directly JmplementabJe control law. Second, besides the 
torque tracking objective, we include the practically important rotor flux 
regulation task. Third, by choosing a more suitable representation of the 
motor model, we simplify the controller structure and provide a better 
understanding of Its derivation and behavior. 

I. Introduction 

It is widely recognized that induction motors, which have been 
extensively utilized in industrial controls, offer a very promising 
alternative for future applications like electrical vehicles and robotics, 
see, e.g., [21] and [I]. Among the advantages of the ac induction 
motor over dc types: they pack more power into less space, they have 
no need for failure-prone brushes, they are more rugged, and they 
have flatter efficiency-versus-speed curves. Also, with the continuous 
drop in the cost of electronics, the ac system is becoming more cost- 
effective. To match the higher quality standards of wideT drive ranges 
and higher reliability requires the development of new robust control 
techniques—a challenging task that has attracted the attention of 
control theoreticians. 

Induction motors can be described |22] by a fifth-order nonlinear 
differential equation with four electrical coordinates (stator and rotor 
currents or fluxes) and one mechanical coordinate (rotor speed). There 
are two physical inputs to the motor (stator voltages), and only 
three stale variables are typically available for measurement (stator 
currents and the rotor speed). Furthermore, the motor parameters vary 
considerably during operation and the generated torque is perturbed 
by an unknown torque load. In typical industrial drives, the controlled 
variables are rotor position and speed, which are indirectly regulated 
via nested PI loops around the fundamental torque control loop [Ilf 
With an eye on electric vehicles and robotic applications, we center 
here directly in torque tracking. 1 

A globally stable output feedback controller for torque regulation 
of a complete induction motor model with unknown torque load was 
recently presented in [16]. These results were further extended to 
torque tracking in [17], The control algorithms in both papers are 
derived based on the so called dq model of the motor. This model 
is obtained applying two coordinate transformations to the basic n,i 
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model. First, we refer to rotor variables to the stator coordinate frame 2 
to obtain the so-called ah model. Then, an additional projection of 
all signals to an arbitrarily rotating frame is applied. In the industrial 
electronics community, where the focus is on steady-state operation 
at constant speed, these transformations are well understood, see, 
e.g., [8], [12]. However, as pointed out in Section II-B, there seems 
to be some confusion among the control workers as to how these 
dynamical models and corresponding controllers are related. One of 
our objectives in writing this paper is to clarify these relationships. 

Since the controllers in [16] and 117] are derived for the dq model, 
an additional step to transfer the control signals to the physical input 
variables in the stator frame is needed for its implementation. In this 
note we show that, contrary to some incorrect statements made in the 
literature [13], this transformation does not require the measurement 
of rotor variables. To enhance the readability of the note, instead of 
deriving the implementable control law “reverse transforming” the 
control of [17], we directly obtain the control in the r\/i model. We 
then show how the scheme of 1171 can be obtained by transforming 
the present control law. In summary, our main contribution is then to 
provide the first directly implementable 1 globally stable nonlinear 
dynamic output feedback controller for torque tracking and flux 
regulation of an induction motor. 

The difficulty of torque control is that, in contrast with dc motors, 
torque is a nonlinear function of fluxes and currents. It has been 
shown in [3] that the full induction motor model (including the 
mechanical dynamics) is not exactly linearizable if we select torque 
as the output. It is, however, input-output linearizable (modulo 
singularity problems) with respect to velocity and rotor flux norm [9]. 
This theoretically interesting property has motivated several authors 
to study the velocity tracking problem, e.g., |10J, [9], [31, [14], [2], 
In [18] it is shown how to achieve global speed regulation from our 
torque tracking controller. It is worth pointing out that, in contrast 
with our control law which is globally defined, even in start up, in 
the aforementioned results (sec also [6]) various singularity problems 
appear. 

The remainder of the note is organized as follows. First, we derive 
in Section II the induction motor basic nrt model, establish some 
key properties, and present the torque tracking and flux regulation 
problem. Section III contains the proposed control law, derived 
directly from the model, hence implementable without further 
transformations. The proof of the main stability result is deferred to 
the Appendix. In Section IV we explain the different transformations 
of the motor model, and using these transformations rederive the 
controller of [ 17J. The performance of the proposed algorithm is 
illustrated via simulations in Section V. Finally, we present some 
concluding remarks. 

II. Model and Problem Formulation 

In this section we derive first the basic motor model (n J model) 
from the total energy function and Lagrange's equations, and establish 
its key passivity property. 4 Also, we present some fundamental 
relationships of the motor that are used in the sequel for the controller 
derivation. 

2 This is a classical approach that simplifies the analysis of the machine 
removing from the electrical equations the dependence on position. 

1 To obtain the equivalent controller, it is only necessary to apply the 
inverse Blondel’s transformation [15]. 

4 It is clear that passivity, being an input-outpui property, is independent 
of the system coordinates thus valid in all motor representations. 
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\ Basic o(1 Model 

We consider a standard three phase single pole pair induction 
motor represented with a two-phase model known as the n 1 model s 

It considers two coils for the stator and two coils for the rotor 
m pairs in two orthogonal n 1 axes The axes for the stator have a 
fixed position, while those corresponding to the rotor are rotating at 
the rotor angular velocity^ We make the usual basic assumptions 
about the mechanical structure balanced energy distribution and 
linear relationship between the flux vector \ = [\J \/] 7 = 

[Ai X 2 Ai, A*] 7 £ R 4 and the current vectoi <j = [ql q*] 1 = 
[q\ <li th CM] 7 € 


A = D (q )q 


(2 1 ) 


where the subscripts ( ) ( ) ( )/ aic used throughout the note 
to denote elcctncal stator, and rotor variables respectively the 
superscript ( ) 7 denotes transposition D (q ) — D l (q ) > 0 is 
the 4 x 4 inductance matrix ot the windings given by 


D (q ) = 


I 12 
I H e~ f ' 


I , c r ‘ 
I I m 


where 7 H I I * s 0 are the stator rotor and mutual inductances 
respectively I is the 2x2 identity matrix and r 11 is the rotation 
matrix 


( ft _ cos (q ) -sin (q ) 
~~ [sin {q ) COS {q ) 


with J the 2 x 2 antisymmetric matrix 


J = 


0 

1 


-1 

0 


II wc neglect shaft compliance the generalized coordinates are the 
electrical charges in each ot the coils q and the angular position 
ol the rotor q Since under this assumption there is no potential 
energy the l agrangian function coincides with the kinetic and the 
total encigics and is given by 


U'l *1 ) = // )'/ = ]<J J 


D (q ) 
0 


0 

D 


(2 2 ) 


where q = [q 1 q ] 7 € R and D > 0 is the rotor inertia 
Applying Euler Lagrange equations we get the state equations 
of the <\ 1 model describing both the elcctncal and mechanical 
subsystems as 

22 (q )q 4- 11 \ {q )q q 4" R q ~ U « (2 4) 


D q - )q* 11 i{q )q 4-7? <y “-77 (2 4) 

where H = diag {/? I i? 7J > 0 1/ = [I, 0] 7 IT, (q ) = 
OD (q )/dq , the control signals u = [u\ are the stator 

voltages 7? It > 0 are the stator and rotor resistances 7? > 0 

is the motor damping coeificient and 77 is the load torque The 
legulated outputs are the generated torque and the rotor flux which 
are given by 

T = ^q 1 IT , (q )q A = L e~ J1 q* 4“ 2 q (2 ^) 


^For furthei details and generalizations see [IS] and [4] 


B Torque Fracking and Hux Regulation Problem 

We consider the induction motor model (2 T) (2 \\ v i 
outputs stator currents q < and rotor speed q control [ 
disturbance' 77 and regulated signals r and \ ^ s \\ 

the torque reference 77 is a bounded ditfertninhk (u 
known first derivative Our objective is then to dts, i lt , 

control law that will ensure global asymptotic torque u i , \ 
flux regulation, thal is lim,_ „ (r-n) = () lim, || \ j| ( , 
all initial conditions and with all internal signals umlonnlv bn m I j 
Furthermore, we require that the motor in steady ^uti opuik n 
a balanced regime 1 e, sinusoidal stator voltage n \ ih j>h 
(one shitted 90 r with respect the other) with cquil unpin ult uid 
frequency 

RcmaikZ l In [18] it is shown how from the toiqm i , ting 
objective above we can solve the spced/position tracking pmhlun 
It essentially amounts to selecting the desired torque as a function 
ol the desired speed 


C Motor Pi opei tu\ 


ljPassnitx The proposition below leveals the fundamental en 
ergy dissipation property of the induction motor 
Proposition 2 1 (Passmts) The motor model (2 }) (2 4) with 
zero load torque defines an output stmth pas iu opeiator u q 
that is there exists < t > 0 ( 6 R such thal /' q l mlr > 

r i Jn ll</"II + ( * (wit ^ II II the Euclidean norm) holds foi all 

t > 0 and all locally square integiable stator voltages Furthcimore 


o r J< '' i ' 

0 0 q 

0 0 


the work less foiccs may be I adored as 

= C(<1 ij)‘i 


n i(</ )<i i] 

- V n \ (q )</ 


n 


-L J r " ,, 
I q'jc " 


where T>(n ) — (’(</ i/ , + C' ((/ </ , 

Pioof Taking the time derivative of the systems total energy 
we gel 


L q 1 


1 P{<1 )<i + 


-IT i(q )q q 


('7 )'7 


= q f { -Rq 4- Mu 4-1) 


- R q 4“ M u 4- ( 


where R = diag {/? 7? } M —[U 7 0] 7 k — [A _r /] / From 

the integration of the last equation wc gel ihe energy balance equation 

at)-L{ 0) = - / q l Rq<1r+ f q l ( Vf u 4-0 ^ 

s v / 0 /o 

Hi I I II i u;% V 1 " " r 

I “*K.| tl I | | I I 


Noting thal L{t) > 0 and setting r i = m { R } and > = 2(0) 

pioves the hrst part of the proposition The second part ol the proof 
is established via direct substitution of the terms □□□ 

The importance of the passivity property for our tracking prob 
lem is best appreciated with the following corollary that will be 
instrumental for the subsequent stability analysis Iis piool follows 
trivially from the derivative of the desired total energy lunclion 
Ci = ^c 1 D(q )< and the skew symmetry properly 


Hor the sake of brevity we restrict our attention to the cast of known dis 
turbance The extension to the unknown but linearly parameteu/td disturbance 
case follows verbatim from |l7f 
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Corollary 2.1: For all locally square integrable q$,fa and all 
£ e ft 5x \ the system 

4 [C(</n, fa) + (72 4 K)]e = 0 (2.6) 

satisfies the dissipation inequality f 7 0 dr > r :i /J \\e\\ 2 dr 4 
^|k(f)ll 2 - lk(0)|| 2 , where c 3 , = A, n ,u{ft 4 AJ}, r, 3= 

A, mn {l>{<!( 5 )} > 0, C 5 = A ni&K )} > 0. Consequently, if 
0 3 0 and ft 4 1C > 0, then lim#—ao r = 0. □□□ 

Remark 2.2: The factorization of the workless forces is not 
unique, but there always exists a suitable selection of C{q. q) to 
enforce the key skew-symmetry property [201. It should be remarked 
that in the factorization of Proposition 2.1, the third and fourth row of 
C(q, q) are independent of fa. This feature will be used in the sequel. 

2) Torque and Rotor Flux: In the remaining part of this subsec¬ 
tion, we derive a key relationship between the rotor flux A, and the 
generated torque r. To this end, we first notice from (2.3) that the 
rotor flux vector given in (2.1) satisfies the differential equation 

A, + R, fa = 0. (2.7) 

Now, from the definition of rotor flux angle p — arctan {A 4 /A 3 }, 
we have 


P = 


_1 JX, = B ' 


IIA, 


l|A.| 


A {Jq, 


where the last equation was obtained using (2.7). Noting that r = 
A T Jfa , we establish the fundamental fact that the rotor flux vector 
rotates at an angular speed 

'-iifr- ,28) 


In other words, fad* fad above define a desired behavior for the 
motor currents that delivers the required torque and insures rotor 
flux regulation. 

On the other hand, if we define the difference between the actual 
and the desired behavior as c:= [rj, ej, ?*]*:= q - fa, from 
Cprollary 2.J we see that if (for the given fad* fad) we can find 
fad and 1 / such that the following system is satisfied for all < 75 , fa: 

^(q^fat 4- C(q^. fa)q d 4 Ufa = Mu 4- £ (3.4) 

with fa \ = [q]d> q!<h fad] T * then we have convergence to the desired 
current behavior. 

It is easy to see that this is achieved with (3.2), (3.3). and 
u = L a fad 4- L ai e Jti 'q t <i 4 Ls, Jc jqt, fa fa u i 4 R«fad (3.5) 

Drufa.d 4 Ltn q1 Je ‘ Jqr, fad 4 Rmfad = -t l . (3.6) 

From Corollary 2.1, it can easily be shown that this control law 
ensures global asymptotic torque tracking and flux regulation; how¬ 
ever, it requires the measurement of the rotor current. To overcome 
this problem, we propose below a scheme where we use instead its 
estimate, add to the desired “energy function" Cj a term that depends 
on the observation errors, and use the damping term K to achieve 
a sign definite derivative. These derivations are summarized in the 
proposition below. 

Proposition 3.1 (Main Result). Consider the induction motor 
model (2.3), (2.4) with measurable outputs stator currents fa and rotor 
speed (fa. Assume the load torque 77 is a known bounded function 
and that the motor parameters are also known. The torque reference 
Td is a bounded differentiable function with known first derivative. 
Let the control law be defined as 

u = T( r,/, Td)Jc Jqb \ 1( t 4 L*,Je~ Jqr q,fa,i - K\{<fa,i)t „ (3.7) 


Our interest in presenting the relations above stems from the fact 
that we can express the torque tracking and flux regulation problem 
in terms of regulation of rotor current and flux as shown below. 

Proposition 2,2 (Rotor Flux Regulation): Given a smooth 
bounded function r,/(f) and a positive constant > 0, let 


A T( l‘ = d 


cos (p t t) 
sin (pd) 


Pit- = 



p,/(0) = 0. 


(2.9) 


Assume lim,—,*, A, = A,,* and limj—^ A, = A,,/, then 

lim*—nu ||A r || = 4, lim*—^ t = r,/. □□□ 


with 


+ I + 

+ xH^ w ) n - 

<r\ = L„ — Li, /L, and controller dynamics (3.1) and 

fad = [L H ,q, Jc- J '"fad 4 Ru.fad 4 t l 4 I\ 2 (fa)fi]\ 

XJyn 


(3.8) 


III. Controller Design 

From Proposition 2.2 and (2.7), we see that the control objective is 
attained if we ensure that the rotor flux converges to the solution of 


= A, </((» = 

n 


and fa converges to 




(3.1) 


(3.2) 


Using (2.1), we can express these two conditions in terms of currents 
by requiring fa to converge to 

^ ,,:= (zr l2+(3-3) 


7 r ,r/(0) = faydu- 


(3.9) 


The gains A'j (fad) and (fa) are given as 
K\(fa<i) = ^( lid - A'2(9,1) = “-7(0?,/ + (’ lL )- 


while the state estimator is 


0 < f < /?, 

(3.10) 


DAq'y)q, + W\fafa 4 R< fa = M.u -I(^, fa)fa 
fa : = fa — fa the observation error and 


(3.11) 


L ( f/r, . f/r, ) = 


Under these conditions, the closed-loop system achieves global 
asymptotic torque tracking and flux norm regulation with balanced 
operation. 
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Remaik 3 1 Wc should iccall that we have posed here a toique 
ontroJ problem where the gjven dcsned toique automatically deter 
nines (via the relationship (2 4)] the rotor speed In this sense q / 
is the rotoi speed that corresponds to the desired torque Notice from 
6) that actually this is the case if q converges to q / In the case 
where we are given a rotor speed or position control objective, n 
should be chosen as proposed m [18] 


where e t(0 * ‘y, - <y, ' and 


- JO 

e q i = 


" / (R H 

l /? T / -f i q ) t 


1(0 , ) 

( q i # = 


- r i 


IV Coordinate Iranskjrmaiions ah and <lq Models 

In this section, wc intioduce the coordinate translormations thal 
give us the ah and riq models Our objective in studying m detail these 
translormations is twofold hrsl, to clearly establish the relationship 
between the vaiious models studied in the control literature, second, 
lo prove that the control law of [ 171 can be obtained by transforming 
tht pieseni controllei (or vice versa) and thal this transformation 
docs not requnc measuiemenl ol the rotoi vaitables 

lo obtain the ah model wc follow the standard pioccduie of 
applying to the r> t model a cooidinate transformation that propels tht* 
lolor cuncnls to the slatoi fiame This is achieved wirli the change 
ol coordinates 


q i ~ 1 i(q )q — 


7 

0 

o 


0 

t / 

0 


<1 


1 urthti lo obtain the <fq model we piojcct the al currents into a 
liamc totaling at an arbiliaiy angulai speed j. as [8] 


'//, 


1 1 ,{q )q i = 


o 

() 


0 

71 

I) 


0 

0 q , 
1 


f 


Jo 


<1 


r 1 i 
j 

„ 'O' 

l 

i 

c 

q i - 





(4 


The last expressions were obtained considering that in this ikns 
relcience frame the desired rotoi flux will rotate at an amnilai speed 
U — q i c pt — H q therefore r t{U 1 \ ~ p l)] / 

Substitution ol the above expressions in (4 2) results in the lo] 
lowing controller 


«,'< = -/ </'<ii+ i - - j 1 ^ 

/ 'j'l'l « + -h'^q" \ ") 

wheie hi 1 — hi and 


_ li_i 
L 



- L 


)f i - 


f /_ 
r , i 


KJ 


where H ~ + H (0) - I) The iclationships between the vanous 
rcpiescnt ilions aie easily defined with tht gtneral rotation matrix 


/(/ M - 


' 0 0 

l) r 7{ I 1 0 

0 0 1 


which clearly satisfies 1 {q H ) /(() tt,)T<q 0) Hence .q^- 

I (0 H )q i 7 ((/ H )q To obtain the inverse transformations 
notice that 7 1 [q t) ) - T '(»/ ft ) 

Applying the lattci transformation to the n i model and picmulti 
plvmg the resulting equation by I (</ H ) yields aftei some simple 
calculations the tlq model studied in [16] and |17| 


P i t q i, + t>7#(v /1 )qii + R<iii— + t (41) 


wheie Vfa / 7 = / ( 1 / H ) \4n the matrices P i { and C i 4 are given 
by 


P, t — diag( P 11 D 


I - diag 



I J 
L I. 


n 


} 


t R 


<■’ II (q 11 J-,) 


l J 
/, J 


1 J ( 

I J J 1 


-q ) 


< /,( <n,) 

0 


e R y 


<!,(•!,,) = [(/ Jq‘'+T. 0]' 

and . n, = [{q! 1 ) 1 {q 1 ')' q ]' 

In order to show that the control ol [171 can be obtained by means 
of the transformation picsented above notice that 

to J to . J t[0 a , ) 

a if = t a = 1 „r 7, / + / s f q 1 

+ 'W,+ /?,<■ q,i- (42) 


= f ] -,[a n i ) n + a i -1 i-, + y i q ] 

The controllei stale is given by 

q 1 = - [7 (q 1 ') 1 Jq 1 j 4- R q i + A 1 'r ~T r/ ] 

with h h = 1 /^[{q 1 1,) +(<//',) ] I he structure of the estimator 

state is easily obtained by a pioeeduie simildi lo the one lollowed 
to obtain the model (4 1) 

We must note the tad that the above controller dtffcis slightly from 
the controllei in 1171 since the dcsned behavioi tor the curicnts (4 1) 
m this case was chosen to obtain the noun of the lotor flux equal to 
i instead of / f as in that papci 
Discussion I) 1 10 m the tlq model (4 1) wc can obtain as a 
pailicular case the classical ah model by choosing “ 0 

2) A key observation is the introduction of ~ in the model (4 I) 
This variable is used in [ 16J and [ 17] as an additional control input 
(denoted u\ in those papeis) to solve the toique control pioblcm 
From the derivations above it is clcai that given u i, and - we can 
obtain the ah controls via Vfn,/ =7 ] [q H )\4nif Ihcpioblem 
is that ti i, and ^ themselves depend on variables cxpiessed in the 
lotatmg frame One ob|cctive of this note is lo obviate the need of 
all these transformations by defining the controllei dynamics directly 
in terms of the signals q / q r/ 

4) It is important to remark that the rfq model above is different 
fiom the field coordinate model used in held oriented control I he 
latter is obtained by selecting the angulai frequency equal lo the 
rotation angle of the flu\ vecloi, see [11 p 216| See also Section 
III A of [14] and [4) It is clear that to obtain the physical signals 
from the field coordinate model, we lequire knowledge of the full 
stale vector This is, however, not the case for out tlq model as 
erroneously pointed out in 111] 
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Fig. 2. Real and observed rotor currents. 


4) The ah model is sometimes given in terms of stator currents and 
rotor fluxes, instead of stator and rotor currents. It is straightforward 
to verify that applying to (4.1) the linear transformation 

11 0 0 “ 

: = L S 9 I 2 L t I 2 0 .r 
(> n i 

and choosing = 0 , one obtains the model studied in several recent 
papers, e.g., jl4| and [ 6 ]. 

V. Simulation Results 

The performance of the control scheme of Proposition 4.3 was 
investigated by simulations. The numerical values of the four-pole 
squirrel-cage induction motor used in [7] were chosen, that is, 
/? H = 0.687 12. /?, - 0.842 St, L„ = 84 mH, L, ^ 86.2 mH, 
= 81.3 mH. J = 0.03 Kg', and h - 0.01 kg's' l . We present 
here simulations of a sinusoidal torque reference with load torque 
T [, -■= (2.76 4- 0.003f/i*) sgn ( 7 -,) 4 - 0 . 167 -,. 

The motor is initially in siand-slill with zero initial conditions 
. The controller is initialized with the values A, f /(0) = 

7 ,( 0 ) = [ 10 . 6, —5. 10 ] ; , and the parameters 1 = 1 , t - 0.5. 
Fig. 1 shows ihc response of the generated and the desired torque, 
and in Fig. 2 ihe rotor currents and their observed values are depicted. 
Notice that although the observation error converges to zero in less 
than 0.3 s, output tracking takes longer because of the slower state 
error convergences, which are shown in Fig. 3 . The required control 
effort is very smooth as shown in Figs. 4 and 5. As seen from the 
figures, and taking into account the fact that the simulation includes 
the motor start-up and zero crossing of the generated torque with a 
highly nonlinear load torque characteristic, the performance of the 
closed-loop system is quite remarkable. 

VI. Concluding Remarks 

We have solved in this note the output feedback torque tracking 
and flux regulation problem for an induction motor model given in 
physical n i frame. The controller is based on the "energy-shaping*' 
methodology of (16]. It is shown that it can be obtained by "rotating- 
back" the control law of [17] of vice versa. 

The following features of the proposed controller are worth 
emphasizing. 



• In contrast with 114), we do not require measurement of rotor 
variables, typically unavailable in applications. 

• The control law is always well defined (i.c., there are no singular 
points), even in start-up. This constitutes an unquestionable 
advantage with respect to existing results, e.g., [6|, where the set 
of singularities is determined by the design methodology, and 
consequently is unrelated to the physical structure of the system 
and very difficult to analyze. 

• Tuning and commissioning are very simple. There is essentially 
only one design parameter f in (3.10), which reflects the con¬ 
fidence in our rotor resistance estimate. Since this estimate is 
usually bad, we essentially are required to inject some high gain 
in the loop, 

• To overcome the need of knowing the torque load, we can 
use a parameter estimator as done in [16] and [17]. A more 
challenging problem is uncertainty in the motor resistances, for 
which solutions are known only with full slate measurement 
[16], [14], It is worth pointing out that in [4] some simulations 
that reveal instability of a scheme including parameter adaptation 
and (a globally stable) state estimator have been reported. 
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Fig. 4. Stator voltages. 



For speed/position control applications, a slight modification to 
the present controller must be made. This has been recently reported 
in [18). 


Appendix 

Proof of Main Result 

First, notice that the controller (3.7) is obtained from the substi¬ 
tution of (3.2), (3.3) into (3.5). Now, the error equation is given 
by 

'/r, )(' + C(qr, , <75 )r + (Tv + h'(qr, , q r , ) )t = 5 ( f/r,. q,i )</, (A. 1) 


where X’ =■ diag {A i ( <jr,,i )Tj. 0 . hi ( q,i )} and 


.9 = 


"0 L h1 .Jc Jtlr, qt, t t 
0 0 
o L.rnlJJ" 


6 7? JX \ 


On the other hand, from (3.11) and (2.3), the observation error 
satisfies the following equation; 


+ (\Y\Uja)q 5 + I((fa- tfr.Dtf, +. /?,</, = 0. (A.2) 


Consider the quadratic function V' — ^v { VlqU +- ,V 
syhose time derivative, taking into account the ami-^ n, lVi 
D — 2 C and A — 2{W\q$ 4- I), yields 

V = — r* f (Tv 4 A. )c 4 (‘ 1 Sq t — (j. It, <} t 1 O 


where we have defined [r J 




and 


C = 


n 4 *: 

hs T 




Checking that (3.10) ensures positive definiteness of Q t we conclude 
that c and q t are bounded and converge to zero. Internal stability 
is established noting that \ rf t is bounded by construction so q, is 
bounded, Boundedness of qr,,i follows from the lad that <y n/ is the 
output of an asymptotically stable filter with bounded inputs (3.q) 
□□□ 
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Eigenstructure Assignment in Linear Descriptor Systems 

Petr Zagalak and Vladimfr KuCera 


Abstract —Given a linear controllable system Ex ~ Fx + Gu, we study 
the limits of linear proportional feedback u = Kx -f v In assigning 
a desired finite and Infinite eigenstructure of the closed-loop system 
Ex = (F 4 GK)p 4 Gv. The closed-loop system is not restricted to be 
regular; Its rank makes part of the specifications. Necessary and sufficient 
conditions are established for the existence of a feedback gain K which 
assigns the desired eigenstructure, and a procedure is outlined to calculate 
one such gain. 


by applying feedback (2) that preserves its regularity, the eigenstruc¬ 
ture specified by invariant polynomials </'i(«K i/'2(»L • • ■ **/'»..(«) 
(decreasingly ordered by degree), and by infinite eigenvalue orders 
</i, • ■ * ,rf m (decreasingly ordered by magnitude) if and only if 

the set of inequalities 

m tn 

y^degV’.M + fl, < y*/-,, j = 1, (4) 

t-J I 

are satisfied with equality holding when j = 1, and 

<h = 0, i > m - q (5) 


I. Introduction 
Let 

E.i = Fa -f Gu (J) 

be a linear descriptor system where E. F are n x n matrices, and G 
is an n x w matrix of rank m over i?, the field of real numbers. 

The problem of eigenstructure assignment by linear proportional 
feedback 


a = I\x -f v (2) 

where K is an m x n matrix over J?, has had a long history. When E 
is nonsingular, the eigenstructure of (1) is specified by the invariant 
polynomials of sE - E, or equivalently by the position and structure 
of the system's finite eigenvalues. Then Rosenbrock [12] showed that 
a controllable system (1) with controllability indexes c\ , ■ ■ ■, c,„ 

(decreasingly ordered by magnitude) can be assigned the eigen* 
structure given by invariant polynomials n (.s). 0z(*L ■ ■■, </■„,(*) 
(decreasingly ordered by degree) if and only if the set of inequalities 

w tn 

y^ilegI,'r(«) < yr„ j = 1.2,■•■.it/ (3) 

>--j 

are satisfied and equality holds when j = 1. This famous result, 
commonly referred to as the fundamental theorem of state feedback, 
identifies the limits of feedback (2) in altering the eigenstructure of 
(1). Alternative proofs were given later by Dickinson [ l|, Flamm [2], 
and KuCera [5]. 

A more general situation arises when E is singular. Then the 
pencil sE - F has finite as well as infinite elementary divisors, or, 
equivalently, the system has finite as well as infinite eigenvalues. 
This means that the system gives rise to exponential as well as 
impulsive modes. Assuming that *E — F is regular, KuCera and 
Zagalak [6] generalized Rosenbrock’s result in the situation where the 
desired eigenstructure is specified solely by finite elementary divisors 
or by invariant polynomials. The other extreme, where only infinite 
elementary divisors are being assigned, was considered by Zagalak 
and KuCera [13]. 

The problem of simultaneously assigning the finite and the infinite 
eigenstructure was solved by the same authors in [14]. The result 
says that a regular controllable system (1) with controllability indexes 
c i. rj. • ■ -. r m (decreasingly ordered by magnitude) can be assigned 

Manuscript received February 24, 1992; revised Muy 2, 1994. This work 
was supported by the Czechoslovak Academy of Sciences under Gram 27 
501. The original version of this paper was presented at the 1st European 
Control Conference, Grenoble, France, July 2-5, 1991. 

The authors arc with the Institute of Information Theory and Automation, 
Academy of Sciences, 182 08 Prague, Czech Republic. 

IEEE Log Number 9407029. 


where q is the number of infinite zeros of (1). The necessity of (4) 
was proven already by Jones. Pugh, and Hayton [3] and conditions 
equivalent to (4) and (5) were given, without proof, in Oz^aldiran 
[ 10 ]. 

The requirement that the resulting feedback system 

E.r = (F 4 GK).r 4 Gv (6) 

be regular is too restrictive in many situations. That is why we relax 
this assumption here and make the rank of sE - (F 4 GK ) a part of 
the specifications. This will allow for desired eigcnstructures specified 
by a number of invariant polynomials less than n. 


II. Basic Concepts 

The system (1) is said to be controllable if iunk[sE - F G] ~ n 
lor all complex ft, finite and infinite. For details, see Lewis [8] and 
Verghese, LCvy, and Kailalh [12], and the references given therein. 

Let ,V(.s), D(s) be, respectively, n x in x m matrices over 
/?[#«], the ring of polynomials in s with coefficients in /?, such that 


[.sE - F 


-G] 


-V(.s) 

D(s) 


= 1 ). 


Then X(s),D(s) are said to form a (right) normal external descrip¬ 
tion of (1) if 

i) 

*V(.v)' 

P (M. 

is a decreasingly column-degree ordered, minimal polynomial basis 
of Kcr [.sE - F —G ]; and 

ii) ,V(s) is a minimal polynomial basis of A (.sE — F), where .4 
is a maximal anihilator of G. 


See Malabre, KuCera, and Zagalak [9] for details. 

Then the controllability indexes of (1) are the column degrees 


r, = deg, 


'.V(M 

D(s) 


i = 1, 2, • • ■, m 


of any normal external description of (1). When deg, D(ft) > 
deg, A’(ft), we call r, a proper controllability index of (I). 


HI. Problem Formulation 

Let (1) be a controllable system with controllability indexes r x > 
ci > ■ ■ ■ > c ni , and let q be the number of the proper controllability 
indexes. We are given r monic polynomials i!'\ (*), ^ 2 (a), ■ ■ •. i/’, (*) 
in R[s] such that i',+j(a) divides </»,(*), / = L 2,-1, and 
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positive integers <J\ > > > <lj These polynomials and 

utegers are not arbitrary, they are assumed to satisfy 

]T deg I ,,(«) + £// = 5Z <’ (7) 

/ i /-I _1 

ir some selection < f > < * 2 > (% of / + m — n controllability 

ndexes of (1) that includes all the proper controllability indexes 
The problem of eigenstructure assignment is then as follows I irst 
i stablish necessary and sufficient conditions under which a feedback 
un A exists in (2) such that i (s) s aie the invariant polynomials 
uid (I s are the infinite eigenvalue orders of the closed loop system 
lb) Second if the conditions are venfied give a procedure to 
calculate one such A 

IV FlJNDAMLNIAI THtORFM 

The following theorem solves the existence part of the eigenstruc 
turc assignment problem 

7 heonrrt The eigenstiucture assignment pioblem has a solution 
if and only il 

n — m + p + q ' i ^ n (8) 


and 


^deg/ (s) + <7 < y^( * y = l 2 / (9) 


where by convention <1 — 0 lor / ✓ /> and t* = 0 for / > / + ni — n 
hoof (Necessity) Wc suppose there exists a feedback gain 
A such that the eigenshuctuic of (6) is given by the lists 
i i(s) i (s) i (s)andr/| (I (I, that satisfy (7) for some 
J Wc wish to prove (8) and (9) 

To handle the limie and the infinite eigenstiucture m a uniform 
way wc shall apply the conlomial mapping 

u - ( 10 ) 

s — a 


where a 0 is a complex number that is not an eigenvalue of (6) 
Let \ (s) l)(s) be a normal external description of (1) Then 


si -r-(.A -G 


D(s )-A Ms) 


- t) 


(ID 


so that \ ( s) D( s) - A M s) is a normal external description of the 
closed loop system (6) Since controllability is invariant undci the 
iction of feedback (2) the systems (I) and (6) aie controllable with 
the same controllability indexes 
Wc define 


n('<) 

<"(«) 



'(-0 1 

diag [(u - 1) 1 


(n -1) ] 02) 


This is a polynomial matrix ovei /?[/<] which is irreducible and 
olumn proper with column degrees f i ( 2 . < Obviously, C [u ) 

Iso has column degrees < i r / We suppose without loss of 
nerality that the matux F has been brought to the form 


E = 


r, 

u 


(H) 


with A | of full row rank say f We define 


P(" ) -(){" )] = diag[fi -1 « — 1 l 1] 

-- v -- 

J 

-^—r-(F+Gh) -G 

II — 1 


This is again a polynomial matrix over /?[«] which i 
and row proper 

It follows from (II) that 


[/’ll/ > 


-Q( ")] 


ll\i <' 

c («) 


- I) 


l emmd 1 of the Appendix implies that the rank deficit nci 
and P{ ii ) are the same namely 


rank ( (u ) = i -f m - n 


since rank P{u ) = t by assumption Further the invuiaut pohno 
mials of D{u ) equal those of Q{u ) namely 

fi(« ) - 1 f f U ) = J 


* | M i ( II ) = II - 1 r + (il ) = H ~ 1 

r + - +i(« ) = 1 f (« ) - 1 

and the nonunit invariant polynomials of C (u ) are equal to those 
of P( ti ) 

<>i (w ) = i i ('<)( ii - 1)** < i (« ) = t, (« )(- D 

", n(") = t ,+ t (" ) " + (") = < + (») (14) 

where 

~ (») — <« — i > 1 ^ 1 j j os* 

Since (12) is irreducible f (ii ) is pnmc to n {ii ) i = 1 2 i + 
m - /i Hence p ^ i + m - n - <] Since i £_ n wc have (8) 

Now suppose without loss of generality that it is the first / -f m -n 
columns ol ( (n ) that arc /?(// ) lincaily independent Let ( (// ) 
be the submatrix of ( (u ) formed by these columns Then by 
assumption (7) C (u ) is column pioper with column degrees < f ^ 
t* * > (\ Then the product <>,(") <> + («) 

is the greatest common divisor of all minors of C (u ) of Older 
» + in — n - ; + 1 which implies 

+ f 

y deg o(tt)— ^ deg/ (//) + // 

i 

+ 

<s. ^ r * ) — I ; f /i# - u (16) 


where // = 0 for i > p has been dtfined foi convenience of 

summation 

Lemma I implies that i (u ) = 1 for / = ; + m - it 4* 1 1 

Adopting the convention that (* = 0 for / > / + ot - " we 

can extend the upper summation limit in (16) to / Finally since 
deg i (/<) = deg i (s) i = 1 2 i the inequalities (9) are 

verified 

(Sufficiency) We consider a list ol monic polynomials 
< I(S) I (s) 1 (S) in li[ s] a list of positive integers 

/fi <1 > <lj and a subset <* <1 of the control 

lability indexes n r > ( that satisfy (7) (8) and (9) We are 

going to construct a matrix A over 7? such that the eigenstructure 
of (6) will be given by these two lists 

To this end we apply to / (s) the inverse Iran simulation of (10) 

to gel the polynomials i (t< ) defined in (I'S) Clcaily deg/ (n ) - 

deg i (s) i = 1 2 i It follows from (9) for / = i + - u + 1 
that ( (tr ) — i (s) = 1 for / > / -f m - m 
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Lot N($), D(a) be a normal external description of (1). Applying 
the same transformation to 

-V(*)' 

D(s) 

and [*F - F - G], where E assumes the form (13), we obtain 
matrices ,4(U'0, B(w) that satisfy 


diag[ ir-l,. ^F-F -G] 
/ 


As in (12), 


'B(w)' 

A(tr) 


= 0 . 


(17) 


B(w)~ 

\A[w) 


is a polynomial matrix over B[w] that is irreducible and column 
proper with column degrees r y , r 2 , • • ■, r m . 

Now we construct the matrix 


T(ir) = diag[^i(ir)r * •, p,if)] 


where 0 ,( 1 /’) are the polynomials defined in (14) and (15). In view 
of (7) and (9), we can apply to V(w) Lemma 2 of the Appendix. 
There results a matrix C(w) having the same invariant polynomials 
<Pi ‘ • i0 r +„i- n (w)i and that is column proper with column 
degrees c* > rj > > r*+ n ,_ Tl . We further define the rn x m 

matrix 


C(w) 


C(w) 0 
0 0 


Let S be a permutation matrix such that the first r + i?i - v columns 
of B(u)S have degrees c*, rj, • • • ,rr + „ ( _ fl . Consider the matrix 


D(u')S' 
C(w) m 


(18) 


If (18) is irreducible, we put C(w) = C(u>)S ~ l . If it is not, there 
must be a zero at w = 1 common to B( w)S and C’( w). This implies 
a zero column among the first p columns of (18). As (8) holds and 
the last r + w - v - p columns of C(w) are linearly independent, 
there exists a nonsingular and constant matrix T such that 

B(w)S' 

C(ir)T 


the rows of these matrices span the same i?-linear spaces. Then, by 
Lemma 3 of the Appendix, equation (20) has a constant solution 
pair A'. Y such that A’ is nonsingular. Thus, K = -A' _1 V 
qualifies as a feedback gain that assigns the desired eigenstructure to 
aE - (F + GK). This immediately follows from Lemma I. 

* 

V. Construction 

We summarize the major steps of the sufficiency pan of the proof 
of the Theorem, which provide a construction for A’. 

a) Given F, F. G with E having the form (13), calculate poly¬ 
nomial matrices A{w), B(w) such that (17) holds where B(w) 
and 

“ B(ir)' 

A(w) m 

are irreducible and column proper, and 

B(ir) 

A( ir) 

is decrcasingly column-degree ordered. 

b) Read out <*i, r\», ■ ■ ■, r„, , the column degrees of 

Bin)' 

Mtr). 

and identify the subset of q proper controllability indexes. 

c) Choose n (*)« , *•, (.s) and r/ 1 , d>. ■ ■ ■. d v and r*, rj, 

■ ■ •, r\that satisfy (7) and check for the existence of A using 
(8) and (9). 

d) Construct an m x m polynomial matrix C(u') having the 
invariant polynomials o,(ir) defined in (14) and (15), and such that 
the composite matrix 

A(u )_ 

is irreducible and column proper with column degrees n , r 2% ■ ■ ■ , < . 

e) Find a constant solution pair X. Y of the equation 

A\4(ir)+ YD(w) = C(n«) 

such that A’ is nonsingular. 

f) Put K = -X ' 1 Y. 

Of course, there may be feedback gains other than A that solve 
the problem. 


is irreducible. Then we put C(w) = C(ir)TS 1 . In either case, the 
matrix 


B{tr) 

C(w) 


(19) 


is irreducible and column proper with column degrees n, r?, - - • 
Applying the transformation (10) to (19), we obtain the matrix 



diag[(* - a)‘ 


■■,(* -a)'" 1 ]. 




This matrix is irreducible, column proper, and its column degrees are 
equal to c \, c 2 . ■ ■ ■, c m . 

Let us now consider the equation 


XD{*) + YN{*) = H{*). (20) 


Since the matrices 




A'(*)' 

Z>(«j 

1 

ff(s) 


are both irreducible and column proper with column degrees 
ci, and A r («), is a normal external description, 


VI. Conclusions 

The limits ol proportional state feedback (2) in altering the eigen- 
structure of the linear controllable system (1) have been established 
in Theorem 1, under the assumption (7) that the total number of the 
dynamical modes desired is equal to a sum of r+m -/» controllability 
indexes. These limits can be summarized as follows. 

i) The eigenvalues can be placed at any position. 

ii) The measure of regularity r — rank (hE — F — GK) is bounded 
by (8). 

iii) At most, r + m - » cyclic chains can be associated with each 
finite eigenvalue, 

iv) At most, r + 7ii -n-q cyclic chains can be associated with the 
infinite eigenvalue, because q is the number of infinite zero chains. 

v) The sizes of the cyclic chains are limited from below by (9). 

vi) When E is nonsingular, all controllability indexes are proper, 
q = in, and no infinite eigenvalue can be assigned. 

The controllability assumption can be relaxed. A generalization of 
the theorem to partially controllable systems is under study. We also 
conjecture that (7) is a necessary restriction on the eigenstructure 
desired whenever r = n. 
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Appfndix 


Itmma I 11V Let P( s), ^(s) and B( s) ( (s) be polynomial 
antes over J?[s] of respective size n xn nx m and n x m. m x m 
h that 


[Pi s) 


"Q(S)] 


'i3(s) 

C(s) 


= 0 


lieie 

[Pi') -<2( s )] and 

ue irreducible, and rank Q[ s) — in Then, 

• the matrices ( (s) and P{ s) have the same unk deficiency 

• the matrices B(s) and Q{ s) have the same invananl polyno¬ 
mials 

• the mail ices ( (s) and P( s) have the same nonunit invariant 
polynomials 

Itmma 2 /IS] Let ( (s) be a column pioper polynomial / x 
/ matnx ovci /?[s] and let ni(s) n (s) o/l(s) be its in 
valiant polynomials arranged so that 04 i(s) divides u (s) / — 
12 / - I Let r ] > ( s > ( k be nonnegative mtegeis 

such that 

t i 

^ deg f > (s) _ ) - 1 2 / 

' > 

with equality holding when / — 1 
Then tlieie exist unmiodular matuces l i(s) and f (s) such that 
the matrix 


C (s) l ,(s)C (s)f (s) 

is column propei with column degrees (, r u 

1 tmma <171 1 et 77( s) \ ( s) and //(si be respectively m x 

// x w and m x m polynomial matrices ovei /?|s) I lien the equation 

\I)( s) + l > ( h) — 77(s) 

has a solution pair \ 1 ovei /? such that \ is nonsmgulai if and 
only il the lows ol the matrices 


'\N' 


Ms) 

D(s) 


W(s) 


span Hit same H lineai spaces 
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Robust Regulation in the Presence 
of Norm-Bounded Uncertainty 

J Abedor K Naapal P P Khaigonekar and K Pool la 


Abstract —We consider robust regulation (against steps and sinusoids) 
in the presence of unstructured uncertainty 1 he unstructured uncertainty 
is norm hounded by a constant that is given a priori This problem 
is equivalent to a certain multiob)ective problem where one objective 
is robust regulation and the other Is the standard objective of H , 
subophmal control It is shown thal a solution lo this problem exists 
if 1 and only if the standard problem admits a solution and certain 
mah ix inequalities are satisfied. Ihcsc sohability conditions arc readily 
computable t ontroller synthesis is also addressed 


I lNIKODI t TION 

Befoic discussing the problem that is the focus of this note 
wc briefly tevicw the standard ptoblent of robusl icgulalion This 
problem will be defined with icspecl lo Fig I Here P is the plant 
and A is the controllei to be designed Both are (mile dimensional 
linear time mvanant systems as are all systems throughout this note 
In the hgurc ?i is the exogenous input is the conhoiled oulpul 
u is the control input and i/ is the incasuicment The signal is 
assumed to be measured Without loss of generality wt assume in 
tact thal consists of the first l channels ol t) 

Suppose now lhal wc are interested m icgulating against sinusoids 
with fiequencies i a \ l et F denote the vetloi space of 
signals consisting of linear combinations of these sinusoids (F will 
include the class of constant signals if one of these fitquenues is 
zero ) The robust regulation synthesis problem is the problem ol 
designing an internally stabilizing controller such thal ihc closed 
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loop system asymptotically rejects T from <r to and does so locally 
robustly. The problem, in other words, is to design K so that 

1) K internally stabilizes I\ 

2) T-Aj+'k ) = 0 for k - 1, * • ■. ,V. and 

3) property 2) holds for all plants in some neighborhood of P in 
the graph topology. 

In part 2), T : „. is the closed-loop transfer function from tr to The 
requirement that this transfer function evaluate to zero at frequencies 

. ■«•,\ is equivalent to the requirement that T be asymptotically 
rejected from tr to Pan 3) is the local robustness requirement. 

Now consider Fig. 2. In this figure the plant P\ is formed from 
the interconnection of the nominal plant P (augmented with the 
additional input nu and the additional output :j) and the unstructured 
norm-bounded uncertainty block A. Again K is the controller to 
be designed, o is the control input, and y is the measurement. The 
exogenous input is »'i, and m is the controlled output, which again is 
assumed to be measured. Without loss of generality, we again assume 
that ^ consists of the first / channels of //. 

The following controller synthesis problem is addressed in this 
paper: given a real number - > 0, design a controller A such thai 
for all stable A with ||A||x < I/~, 

1) K internally stabilizes 

2) Tz j {jJr'k) = I) for k = 1. ■ ■ •, Y. and 

3) property 2) holds for all plants in some neighborhood of P± 
in the graph topology. 

This problem will be called the problem of robust regulation 
in the presence of norm-bounded uncertainty (RRNBU). It differs 
fundamentally from the standard problem of robust regulation in 
that we are requiring the controller to solve the robust regulation 
problem for an a priori given (possibly large) plant uncertainty set. 
To emphasize this point, we note that any controller that solves 
the robust regulation problem for the nominal plant P will, by 
virtue of local robustness, solve the RRNBU problem for small 
enough 1 /«, However there are no guarantees on the size of 1/v 
The amount of unstructured uncertainty that a particular robustly 
regulating controller can tolerate may be arbitrarily small. Given a 
value ^. the results of this note enable one to efficiently check whether 
the corresponding RRNBU problem is solvable, and in the event that 
it is, construct a controller that solves the problem. Results are given 
for both the stale-feedback and output-feedback cases. 

It is shown in Section 11J-A that the RRNBU problem is equivalent 
to a certain multiobjcclive problem that will be discussed in the 
context of Fig. 3. The muliiobjective problem is to design a single 
controller that solves both the robust regulation problem (from «j 
to t- 2 - with if i == 0) and the suboptitnal fontrol problem (from 
if i to :i. with /f 2 ^ 0). Precisely: given a real number * > 0. the 
problem is to design a controller I\ such that 

1) I\ internally stabilizes P. 

2) = 0 for k = l, • ■ •. A\ 

3) property 2) holds for all plants in some neighborhood of P in 
the graph topology, and 

4) ||7-„ m |U < 

Objectives 1) through 3) constitute the standard problem of robust 
regulation as defined previously. Objective 4) is the usual norm 


Fig, 2. Setup for the problem of robust regulation in the presence of 
norm-bounded uncertainty. 

requirement. This multiobjective problem will be called the problem 
of robust regulation with an constraint (RR7-f x C). 

The robust regulation objective is a special case of the robust 
regulator problem with internal stability which was investigated 
extensively in the 1970s using a variety of approaches. Asymptotic 
tracking and asymptotic disturbance rejection are special cases of this 
problem. This was one of the central problems to be solved using 
geometric control theory. See, for example, [I2|, [22], 123J, and the 
references therein. Davison and his coworkcrs used a matrix approach 
to solve this problem in |6]-[8], The robust regulator problem with 
internal stability has also been investigated in the frequency domain 
(31, [5], [11 J. One of the main results of this research is the well- 
known “internal model principle." The literature on this problem is 
extensive and the interested reader should consult these and related 
references for further details. 

The second objective of the RRmultiobjective problem is 
the standard problem of H^_ control theory initialed by Zames |25). 
This problem has also been extensively investigated using both state- 
space and frequency-domain techniques. The book by Francis |10| 
contains an account of the early frequcncy-domain/operator-theorctic 
approach to this problem as well as a large bibliography. For the 
state-space approach, see |9], [20|, and the references therein. 

Problems similar to the RR7Y* C problem have been considered 
by Vidyasagar 121], Sugie and Vidyasagar [19], Hara and Sugie 
1131, 118], and Wu and Man,sour 124]. In all of these notes, a 
frequency-domain interpolation-theoretic approach is taken. One pos¬ 
sible shortcoming of this approach is that the resulting controllers can 
be of unnecessarily high McMillan degree. In an earlier conference 
paper |2], we gave a solution to the RR H in the special case that 
only steps were to he regulated against. 

In this note, wc take a state-space approach to the RRft^C 
problem. One merit of our approach is that the solvability conditions 
are given directly in terms of plant data and the solutions to at most 
two algebraic Riccali equations. It is shown that the RRNBU or 
the RRtt x C problem is solvable if and only if the suhoptimal 
problem is solvable and X matrix inequalities hold, where X is the 
number of frequencies to be regulated against. The matrices are all 
of size /, where / is the dimension of -j. i.e., the number of (scalar) 
signals to be regulated. 

As is well known, the robust regulation objective is equivalent 
to requiring that the controller contain an appropriate “internal 
model" (see, for example, [7|, [12]). Thus an obvious approach 
to the RR tfxC problem is to incorporate this internal model into 
the generalized plant P and then solve a standard ' H ■*_ control 
problem. However, a significant difficulty arises—and this is why the 
problem is interesting—the unstable modes of the internal model are 
unobservable at -i. Since these unstable modes are on the imaginary 
axis, this difficulty cannot be easily circumvented. One approach 
is to use Riccati inequality methods as in [14]-|I6]. Our results 
are closely felated to and can be derived from the recent work of 
Scherer 116); however, the control theoretic problem contexts are 
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ig 1 Setup lor the problem of robust regulation with an H ^ constraint 


quite different The importance of the asymptotic regulation problem 
in control theory is well established, and therefore it is important to 
deiive an explicit solution lo the RRNBU problem 
The icmaindei of this note is organized as follows Notation is 
set in the next section, results (along with proofs) are presented in 
Section III and some background material on robust regulation is 
collected together in the appendix 

11 Notation 

The set of all leal numbers is R the set of all imaginary numbers 
is ) R and the set ol all complex numbers is C For a matrix \I 
M' is its tianspose M* is ns conjugate transpose spci( M) is ns 
set of eigenvalues and />{M) is Us spcctial radius Flic si/c ol a 
matrix will be indicated by a subscript if it is not clear from the 
context We abhieviate the finite dimensional linear hint invariant 
system t — 4» + Bu t/ - ( / ■+ Du by 



where I is the transfer function mapping input // lo output </ 

F mally a contiollci is said lo be admissible for a plant it the icsulting 
closed loop system is both well posed and internally stable 

FII Main Ri si us 

In the lirst part of this section we show that the RRNBIT problem 
tnd the RRH C problem arc equivalent I hen making use of this 
equivalence wc slate and piove the mam results 

A [ qimalt in c of RRNRIJ and RRH Pwhli ms 

Hum cm II l et A be any finite dimensional linear lime 
mvanani conlioller Then A solves the RRNBU problem for 1* 
f and only if A solves Ihc RRH^C pioblem loi V 

Pioof («i=) Let A solve ihc HRHxC problem Then A is 
idmissiblc lor P and icndeis \\T , , || Thus by lilt small 

nn theorem the closed loop system of Fig 2 is internally stable for 
>11 stable A withl|A||^ <1/ By Coiollary then A solves 
Ik robust regulation pioblem lot all stable A with ||A|| ^ 1/ 

1 c A solves the RRNBU pioblem 
(=>) Suppose A solves the RRNBU problem for P Then in 
ulicular, the closed-loop system ol Fig 2 is internally stable foi all 
ible A such that || A|| ^ < 1 / which implies, by the necessity part 
the small gain (heoiem that ||7" , ,iu < Moreover, because 
solves the robust icgulation problem when A — 0 A solves 
k mbust regulation problem for P Thus A solves the RRH^C 
lohlem 

Solution State Feedback Case 

in this section the RRH v C problem, hence the RRNBU problem, 
' solved m the case where the entire state is available for feedback 
We make the following standard assumptions on P 


SFI C ± — I and D 2 i = 0 (stale feedbaik) 

Sr2 (4 B 2 ) is stabili/ablc 
Shi {([, A D[) has no unconiiollahle/unobsu\ ii I 
the imaginary axis 
SF4 D\ 2 C i = 0 and D\ P, - J 
The next definition enables Theorem in K shud i 
cisely 

Definition 12 (internal model mannes) 1 and B jk mlun 
model matrices associated with the lobust legulatinn pioblem dc 
tenmned by ^ i \ if these matnccs satisfy 

1) spcc( 4) = {_t> 0,1 ±1 \ ( 

2) every eigenvalue ol 4 has multiplicity / 

4) 4 is diagonalizable, and 

4) (l II) is controllable 

A discussion of the lole that these matnces pla\ in robust regulation 
can be lound in Appendix A I he stale feedback result follows 
F/uonrn 1 1 Let the plant / > ol Fig 1 satisl\ ihc standard as 
sumptions SFI-SF4 Then the following are equivalent 

i) There exists a controller admissible foi P that solves the robust 
icgulation pioblem liom u to at the liequenues i ^ \ 
while also making ||T , 1 1| ^ (the RIIH > C problem) 

li) Theie exists a controller admissible foi P that lenders 
IK i i II ' < ‘ind 

T k D n Tl s TiBiliili (II 

Ioi all ; = 1 \ Here h = |/,0](y J - l) 1 t as 

I -f ( IIi - B II' i \ and \ is the positive semidchmtc 
stabih/ing solution to 

i \ 4 \ i3\ii]~nri) \ •+(](] ii (2) 

Moieovei il citlici (hence both) of these conditions hold then a 
conlioller A that solves the RRH ^ C problem foi P is given b> 

[ I B\hi)} 1 

A ~ ,- --- 0) 

B l'\\ 1 B \ B I n ’/ 

Here / is the unique solution to / i - \1 - ll[Iii)\ the internal 
model matnces 4 and LI satisfy 1) 4) of Definition 1 2 and U is a 
positive definite matnx that satisfies the Lyapunov inequality 

111 +114+ IIBBJ' IB II I 0 (4) 

Befoic pioving this theorem we address the problem ol cunstiuct 
mg the contiollci given by (+) lo synthesi/t this conlioller one 
must solve a Riccati equation for \ a linear matnx equation foi / 
and a lincai matrix inequality (l Ml) foi U ( omputmg \ and / 
is stiaightforward (Note that / always exists siiilc 1 and 1 have 
no eigenvalues m common ) In the simple bill important case wheie 
the only frequency to be regulated against is i — I) computing 
11 is veiy easy In this case l = l) so anyposilive definite malnx 
is an acceptable T1 The general case is also tiactablc L Ml s are 
finite-dimensional convex tcasibilily problems and thus can be readily 
computed 14) We emphasize rhai il i) oi ii) of the theorem statement 
hold then we are assured that a solution to this l MI exists 
Now we turn our attention to the proof of Iheoicm * J 
Pioof (i^> n) Suppose theie exists a controller A that solves 
the RRH^ C problem foi P Wc fust show that as a consequence 
A solves the RRH x C problem Jor the plant 



Because A solves the H > pioblem foi P there exists a solution \ to 
the algebraic Riccati equation (2) This is part ol the full information 





150 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. 1. JANUARY 1995 



Fig. 4. The closed-loop system consisting ol plant /’imp and controller A\ 
Phnp consists of stable system T 5 with state-feedback D! ? X. 


result in [9, p. 836). Therefore by the Redheffer lemma ([9, Lemma 
9j), K also solves the H v problem for 



which is precisely the plant Pi m|> . Moreover, because A" solves the 
robust regulation problem lor P, K also solves the robust regulation 
problem for any plant for which it is an admissible controller (because 
it has the right internal model). This is just a restatement of Corollary 
A.3. In particular, K solves the robust regulation problem for Pi,,,,,. 
Thus, A' solves for the RRH x C problem for Pi,,,,,, as claimed. 

Before continuing with the forma! proof, we given an intuitive 
argument which will clarify the role of the coupling conditions (1). 
Let us write Pm,,, as the stable system T (stable because X is ihc 
stabilizing Riccati solution) with state-feedback £ ~ /}J.Y.r+ */, and 
close the controller loop with J\ as shown in Fig. 4. The controller 
A' solves the robust regulation problem for P,,,,,. so sinusoids ot 
frequency *t7 are rejected from any input of Pi m ,» to r-. In particular, 
sinusoids are rejected from r to rj. Now wc apply a unit amplitude 
sinusoid of frequency -a to input r. Since r 2 is asymptotically zero 
we have, in the frequency domain 

0 = Tr,,r + r/ Jt c 

In general, in order for this equation to be satisfied, £ will have to 
have some nonzero amplitude, say "a. Since iq = £. therefore, n 
will also have amplitude Thus, || T ,,, || >. > ~a- This argumenl 
goes through for every frequency that is regulated against, so the 
robust regulation constraint defines .Y lower bounds on ||7 , ,. 1 ,|| x . 
The matrix inequalities (l) are satisfied only if all -Y lower bounds 
arc less than ■). We first establish some notation before making these 
arguments precise. Open-loop transfer functions of P will be denoted 
by the symbol P with the appropriate subscripts. Thus, the open-loop 
transfer function from r to r> 2 will be denoted by P, 2 ,. Closed- 
loop transfer functions, i.e., transfer functions of the feedback system 
of Fig. 4, will be denoted in the usual way; thus T\ 1 , denotes the 
closed-loop transfer function from r to n- 
Now we investigate the feedback system of Fig. 4 in more detail. 
In the frequency domain £ = 7(,(«w)r. but since ri =- £ (see 
Fig. 4), we may also write £ = TV, ,(*)/'■ We also have v 2 — 
T, ,,(»)>■ -F Pi 2 (,{* )£. hence, substituting for £. = P* IJf (.s)r 4* 

T’rj t.(«)r,. 1 , (*)/•. Thus 

T,. JI {») = T > , i A») + T,. ji l»)T ,,.(«). 

Now let -ca be a frequency that is regulated against. Because K solves 
the robust regulation problem for Pi M(| ,. we know that T r2 , (7-t7 ) = 
0. hence 

7\,,, (./-■*) +'7V,t(M )T,. l , (ja'n) = o. (5) 

We note that P t it ( 7*7 ) ~ TkB\ and Pv^ij+'k ) - T k B 2 , wher e 7 'a 
is defined in ii) of the theorem statement. Thus (5) can be rewritten 


as 7 a D 1 4 * 7) B 2 T Vx , ( 7*7 ) = 0 . which implies 

T k B Jo V (M )7Vi, (M = l\Di B\Tl, ( 6 ) 

From Theorem A.1 we know that Pi,,,,, cannot have transmission 
zeros from u to r-i at the frequency 70 . 7 . Thus, because transmission 
zeros are invariant under state-feedback, P cannot have transmission 
zeros from £ to r a , i.e., the matrix 

7 ^ 7 1 - A B 2 
[/|«] 0 

must have independent rows. This, in turn, implies that 7 a Pi has 
independent rows. In addition, because ||r,,, || ^ < 7 . we also know 
that - T r{ , {jjJk )T V{ , ( 7^7 )* > 0* These two facts imply that 
> TkDiT,.,, (ja’t )T, l , Now bring in 

( 6 ) to conclude v7a B 2 B! 2 T^ > J\B\B\Tl. 

(ii=* i). The first step is to show that 

rV, 2 LDiD[L r ~ LUiB\V)r <r 0 (7) 

for any left eigenvector r of .1. With this in mind let r be a left 
eigenvector of .4 and suppose that the corresponding eigenvalue is 

j+'h* so 

r'A=j+'U'\ («) 

We recall from ii) of the theorem statement that L is defined by 
LA - AL = /)[//()]. Multiplying this equation through on the 
left by r 4 . using (8), and then rearranging, we obtain - r* L — 
i'*B[]t 0 ]( 7 «l 7 I - -4) -1 . Using Ihe definition of 7 a this can be 
rewritten as 

-r'L = (9) 

Now from ii) of the theorem, we have - “77 B 1 U\ 7V — 
TlBjB'/I'l < 0 . Because v* B / 0 (since, by Theorem A. 2 . 
the internal model ( .4. B) is controllable), it follows that 

Z~ J i'B7 k lUB'\Ttn'i'- 1 * BTJhJljI [ W i' < 0. 

Hence, by (9), z ’ J r m LB\ B\ Vr — r* LBjB'J/ r * 0 . thus proving 
the claim. 

Next, we bring a result of Scherer (117, p. 128, Theorem 4|) to 
conclude that, by virtue of inequality (7), there exists a W > l) such 
that 

AW + 1IV4' 4 - ~LB\ B\ \! - LBjB'iL* ■' 0 . 

Thus the controller K given in the theorem statement is well dehned. 
Next, we show that this controller solves the H > problem for ]\ II 
the controller loop of TV,,,,,, is closed with A we obtain 

'A- BjB!,L'W' L B 2 B',L'W 1 7i," 

T ^ B\iii 1 ] .4 n 

-B'.LAV- l L B!,L’W~ [ | (T 
A change of coordinates via \j (i y l results in 



~ A 

-n.n'j .'u - 1 

n, 

T ri - 

0 

A - UijlH.L'W 1 

LB, 


0 


1 ) 


which can be simplified by eliminating stable unobservable modes 
to yield 

Li- UhD!,L'\Y- 1 LB\ r.4 B 

j ^ :- = .- 

-B , 1 L t W~ [ 0 [C U 
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We note that 

i'h 1 + n '.4 + 0 'ii ~'nn '\i 1 + c'< 

= 11 ‘(111+114 ' + ■) ’LBtD'iL'-ID.D'jL'W 1 

< » 

\here the inequality is a consequence of (4) It lollows from 
ilit hounded real lemma (the algebraic Riccati inequality analysis 
itsull—see tor example [26 Lemma 2 2)) that h solves the H x 
problem tor G ilin Thus, again bringing in Redhcffer ([9, Lemma 
op A also solves the H^ problem tor P In addition since A has 
the internal model structure ol Theorem A2A also solves the robusi 
regulation problem lor P hence the RRH problem foi P 

t Solution Output Feeclbat k Case 

In this section the RRfr * C pioblem hence the RRNBU problem 
is solved in the output feedback case We make the following standard 
assumptions on P 

OH (4 77 J is stabilizablc and (( 1) is detectable 

OF2 (f i 4 D\ ) has no uncontrollable/unobseivable modes on 
the imaginary axis 

Of 1 1)\ ( , - 0 and D\ 77, - I 
ON 77,77', = 0 and 77 i O' , W 

Ihioxm N let the plant /’ of I ig ^ satisfy the standaid as 
sumptions OF I OF4 Then the following are equivalent 

i) Tlieie exists a contiollei admissible foi P that solves the robust 

regulation problem from ti to at the fiequencies * i \ 

while also making ||7 , , || <■ (theRRtf^C problem) 

ii) I here exists a controllci admissible loi P that icndcis 
||T , ,l|-, <- and 

1,11 mi ^ (/, /T, +[/,()])(/*75, + [/,()])’ (10) 

toi all A = l \ Hcic 

1, = [/,()]<" (/-*!— 4) 1 
1=4 + 1 ( , + 1 ( ( 

77, - w ' 

/ -= \(1 - 1 \ ) 1 

\ is the positive semidelimte stabilising solution to the alge 
braic Riceati equation 

4'\ 4 \ 14 \( B\B\ - P B )\ +( { ( i 0 

and 4 is the positive senndetimic stabilizing solution to the 
Riccati equation 

n 41 i'4 1 ( u ,(i -([c n +c,n; - o 

Moreover it either (hence bolh) ol these conditions hold then .1 
ontiollci that solves Ihe KRtt^C. problem loi P is given by (11) 
hiund at the bottom ol the page Mete 7 is the unique solution 
to M - \l - fl [/,()]C ( „ - C (7 4 4 7) the internal 

nodel matrices 4 and B satisfy 1) 4) of Definition 4 2 and II is a 
Positive dehmte matrix satisfying the l yapunov inequality 

4 II 4' + ■) *( 7 77, - Z7[/,()])(777, - 0[i,Oj)' 

0 (12) 



Fig S Tht closed loop system (.(insisting ol pi int f * , mil min k / 
Py i | consists ol stable system P wiih stale (eedhaek U / 


hoof (r* u) This proof parallels lhal ol the slate feedback 
case Suppose there exists a conlrollei h that solves the RR H .( 
pioblem foi P The fust step is to show that A also solves the 
RRHvC problem loi 



' 1 4 B IV 7 

O] 

B " 

r „, - 

B'7 

0 

7 


C 

I 

0 


where 4 7? i and ( are defined m the theorem statement Io prove 
this wc make use of two system transformations as in |20| 

Because thcie exists a controller A that solves the H problem loi 
P (and P satisfies assumptions OFI OF4) Iht Riccati solutions V 
and ) of the thcoiem statement exist This follows from |9 lheorem 
4] In particular 4 exists There I ore by the dual ol the Redhcllei 
Lemma (|9 Lemma 9|) A also solves the H problem lor the 
plant 


'4+ ir;r, 

B\ 

B 

< 1 

0 

ih 

( 

/ 

0 


Sirmldtly because there exists a contioller A that solves the H x 
pioblem foi (14) wc know that there exists a unique stabilizing 
solution 7 _ 0 to the X Riccati equation for (14) In lact it can 
be easily venhcd lhal 7 — \ (/ - 1 \ ) 1 Ihus again bringing 

111 Redhctier A also solves ihe H ^ pioblem lor 


14 4 ( J( i 4 D, 77,7 

B\ 

B " 

D' / 

0 

7 

c.+ d\z 

7 

0 


which is piecisely the plant Py , , 

Next we write 7T,,, as the stable system P with state feedback 
£ = B' / \ 4 u and close the contioller loop with A as shown 
in Fig S The remaindei ol the proof then consists ol an analysis 
of Tig 5 |usl as the feedback system of Fig 4 was analy/cd m the 
state feedback case The details arc omitted 

(n^ t) We show that A given by (II) solves the H * problem 
lor P by showing lhal it solves the H x pioblem lor T\ i, , and then 
appealing to the Redheffer lemmu (|9, Lemma 9]) and its dual A 
realization tor the closed loop system consisting of l\ , and 7v 
(stacking plant slates on lop ol controller stales) is shown in the 


A ~ 

i 

BiD'.l'W 1 

0 

4 + 1 ( 2 C\C,-C',( )-n R'7-DiD'J' 11 1 1 

D[I,0} 

nj 


D' Z/11 1 

-n\/-D'L'\\ '1 

0 
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r.4 + D-iB! 2 Z DiD' 2 L'W-' 

-DiD'.Z - DjD 2 L'\V~ 1 1. 

D i 

b[h ojTa -i 

0 


D 2 n' 2 L'\Y-' 

A-Ux'Oi-ThD'.L'W 

2J, 

. n'j,z m.L'iv 1 

-D'-.Z - D'iLAY- ' L 

0 


equation found at the top of the page. After changing coordinates via 

7 0 0 ' 

7, -7 0 

1 0 -7 


With every frequency */’a to be regulated against, associate system 
matrices .4* and Dk as follows. If ^ — 0, choose integrator 
dynamics 

.4/l: = 0 6 7\ ,x/ and D i: =Je/?' W . 


and eliminating stable unobservable and uncontrollable modes, we 

obtain 


.4 - LI)>n 2 L'\Y-' 

/./?, - j3[r ( o] 

-B 2 L'\Y _l 

0 


Now we can appeal to the bounded real lemma (as in the state- 
feedback case) to conclude that K solves the H * problem for 
and thus for P as well. But 7v' also has the internal model structure 
of Theorem A.2, so A solves the RRTYxC problem for P. 


Appfndix A 

The Robust Rhgul ation Problem 


This appendix includes well-known results on the robust regulation 
problem that arc used in the proofs of Section III. A precise definition 
of the robust regulation problem is given in the introduction, and all 
theorems here apply to the plant of Fig. 1. The first theorem addresses 
the question of when there exists a controller that solves this problem. 

Theorem A J /7 ]. 1121, /// There exists a controller that solves 
the robust regulation problem determined by -ei. ■ • ■ ,-e\ for P if 
and only if 

i) (A, B 2 ) is stabilizable and {C' 2 . .1) is detectable, and 

ii) the matrix 1 1 has independent rows lor every 

k = 


A controller I\ solves the robust regulation problem if and only tl 
it contains the right “internal model." I his “internal model principle" 
|l], [31, [51, [7|, [8], [I1J, [121, |221, |23| is the central result in 
robust regulation. A particularly simple statement of this result is 
given in the following. 

Theorem A.2 11J: A controller I\ solves the robust regulation 
problem determined by *j \. • • •, a*\ for P if and only if 


i) K is admissible for P. and 

ii) K admits the realization 


A ~ 

.4 0 

+ ■+ 

B[h 0] 

A 


* * 

* 


(14) 


where 


a) spec(A) = {±j *'\. • ■ ■, ±j*'\ }, 

b) every eigenvalue of .4 has multiplicity /. 

c) .4 is diagonalizablc, and 

d) (.4, D) is controllable. 

Matrices ,4 and D satisfying a)—d) of Theorem A.2 can be realized 
in many different ways. We give one realization here. Another may 
be found in, for example, |8J. 


If u'a / 0. choose harmonic oscillator dynamics 




6 R 


'll V }( 


and 


ih '• — 


0 

7 


6 


R’ lxl 


Now set 



.4, 


r 3 

ll 


and H\ — 



.4 \ _ 




It is straightforward to verify that -4 and U as constiuctcd do, in lad, 
satisfy requirements a)-d) of Theorem A.2. 

The last result on robust regulation is a straightforward consc 
qucncc of Theorem A.2. 

Corollary A.3 Suppose A solves the robust regulation problem 
for P. Then 1\ solves the robust regulation problem for any other 
finite-dimensional linear time-invariant plani lor which it is an 
admissible controller. 
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Frequency-Domain Criteria of Robust 
Stability for Slowly Time-Varying Systems 

A Megrtlski 


Ahstiait —I hi problem ol stability ol feedback systems with structured 
slowly time-varying uncertain gams is considered For the ease when 
the pair “uncertain gain/ils derivative’ belongs lo a given convex set, 
a sufficient frequency domain condition of stability Is obtained 1 his 
condition is an MIMO generalization of the S1SO results derived in the 
60s in the context of the positivity theory 


I Introduction 

Consider the system with uncert unty in the feedback loop shown 
m Fig 1 where P is the lmcai time invariant (LT1) nominal plant 
md A is the lineal block representing the uncertainty t,{t) - 
I he (unknown) function ) is such that for any t the 
pair [b{t) b{t)) belongs lo a given set P of pans ot matnees Note 
iliat in the case when = 0 foi any (A f>i) 6 P the setup describes 
m LTI system with a real parametric uncertainty 
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Fig I 



y 


The setup from Fig 1 cun be tepresenled in the cqm\ 1 n t n 
i(f)= i(f)i{t) 4(f) = 4oH-fii b{t)C { ( s (0 s(m 

tl 11 

where i (/) is the slate vector of P q = ( „ / and \ U ( aic 
given matrices A well studied ease oi the problem is whin 

4o = 0 Pn - ( - J 


P= {(*11 M INI < u R(M _ -n IIMI . t) (12) 

ic the pointwise bounds on the norms ol V(/) and 1(f) and on 
(he I yapunov index of i(/) are given (\\F || and R(7 )) denote the 
largest singular value and the laigest leal pail ol eigenvalue of / 
respectivel> / denotes a unit matrix ) ll was shown in 11) that system 
(1 I) (1 2) is stable provided that < x r u ( \/ n) is sufficiently small 
Explicit bounds t- {) { \1 a) were obtained lor example in [2| and 
[1) where a more general formulation is also considered vuth M 
t\ and f being the average values of || 4(f)|| R( 4(/)) and || 4(f)|| 
respectively See also |4| for ihe infinite dimensional ease and see 
[1j foi the application of the technique m the gam scheduling and 
adaptive control The general lesulls mentioned above establish the 
so called fiozcn time punciple in the analysis ol slowly time varying 
(STV) systems The issues ot comern however die the eonservalivity 
of the upper bound f < fn( \f n) and the complexity ol the analysis 
required lo check the condition R( 4(/)) < -o First most practical 
systems of interest do have structure which means that 4( ) is not 
expected to be an arbitrary matnx function satisfying the hounding 
conditions (12) F or example when the ordtr of the plant P on Fig 
I is grcalei than one the coefficients of 4 are not independent Since 
the stability condition t < *o( W o) does not utilize such mlorma 
lion the bound may be very conservative Although no systematic 
stud) of the eonservalivity is delivered in the note a reasonable 
example ot a second order system is given for which the enter ion 
from Theorem l hclow (which does utilize the structure) gives about 
10()()() times laiger bound on Ihe admissible rate ol variation Second 
when a genenc set ol expected values of matnees f(f) is given the 
pioblem ot finding the upper bound n in (I 2) is equivalent to a robust 
stability problem in the piesenct ol a multivariable real parametric 
uncertainty which is known to be NP hard 
An alternative appioach to studying STV systems was developed 
within the positivity theory (see for example [6| and [7| and 
references therein) In many (but not in all) examples this approach 
yields much (thousands times) less conservative upper bounds than 
the unstructured one However the classical papers like [6| and [71 
treat the case of a single time varying parameter only In this note 
relatively simple sutliueni frequency domain conditions of robust 
stability in the STV case are proved With some reservations the 
tesult can be viewed as a generalization of the criteria [6| |7| to 
the case ol any number of time varying coefficients The analysts 
required bv the new criteria is similar to that used in the real // case 
(see tor example |8]) where the untompuiable real//is replaced 
by a convenient upper bound A similai approach to the analysis of 
STV systems was developed independently in |9| 
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II. Main Result 

We consider system equations given by (1.1) where An is a Hurwitz 
w x n matrix and P is a given convex compact set of pairs of m x / 
matrices containing the pair of zero matrices. For a complex matrix 
F. let F* denote the Hermitian conjugate. We shall write F > 0 if 
the real part qf f'Ff is not negative for any complex vector /. 

Definition 1: The uncertain STV system (1.1) is called stable 
(exponentially) if there exist n. > 0 such that \,r(t\)\ < 

j' or an y sa ti s fying (i.i) and for any 

1 1 > / (). 

Definition 2: System (1.1) is called P — stable if 

a) A t = 0 for any (V A, ) 6 P [i.e., .4 = const in (1.1)]; 

b) there exist v > 0 and an / x / transfer matrix P (bounded in 
the right-half plane) such that 

[/ — Ao 0(jij)]^(jjj) — cl > 0 V(Au, Aj ) 6 P. u, 1 E R 

where G(s ) - C\){sl —* .4o)“ 1 Bo . 

It is well known and easy to show (and follows from the proof of 
Theorem I below) that P - stability implies the ordinary exponential 
stability. The advantage of using the P - stability is in the relative 
simplicity of its verification. A possible disadvantage is that a stability 
criterion based on the P - stability may be conservative. However, 
in many applications, the P - stability is “close" to the standard 
robust stability. For example, if m = 1. i.e.. Bn is a vector, then 
P - stability is equivalent to robust stability (see [10] and [I IJ). 

Let us formulate the main result of this note. For r 6 (0, -R( 4«)), 
the modified system (I.I) is delined by 

.4 1 = 4o 4- H\ = iio, C’i = Co, 


V ] 




(Ao, A j ) 6 P 


} 


where .4j, B i, G|, Pi are the modilied ,4 ( >, Bo. Co. P. 

Theorem /: System (1.1) is stable if the modified system (1.1) is 
P-stable, i.e., if there exists r > 0, r € (II, -R( 4u)), and an n\ x m 
rational function P (bounded in the right-half plane Re (a) > 0) 
such that 



P(,M - r[ > () (2.1) 


lor any -c 6 /?, (fyi. A t ) 6 P. 

Remark J: For a fixed r > 0, condition (2.1) is convex in 
P(-). The following procedure can be used lo apply Theorem 1 in 
robustness analysis. 

a) choose a finite set {o 0 . (>\. ■ ■ ■. } of scalar rational functions 

(bounded in the right-half plane); 

b) for a fixed r > 0, use standard convex optimization to check 
whether there exists P(x) = Yl' 1 , a\ ^(where P , are constant 
real matrices) satisfying (2.1) (for some r > 0, for any ^ e R , 
(A(j. A | ) £ P); 

c) use random search over r to find whether the function P in b) 
exists for some r t (0. -R( 4d)). It can be shown that, when P is a 
polytope and r is fixed, the search for P , can be reduced to solving 
a system of Linear Matrix Inequalities (LMI's), see ) 121. 

Remark 2: Consider the problem of robust stabilization of the 
uncertain STV system on Fig. 1. This is the case when the transfer 
matrix G of P is not fixed but has to he chosen from an affine 
set given by the Youla parameterization G — Go + G\HG >, 
where Go, G\. G> arc given stable transfer matrices, and H ranges 
over the set of stable transfer matrices of appropriate dimension. 
When (t> — 1 (i.e., in the state feedback stabilization problem), 
the set of H. for which the robust stability of the STV system 
can be proved by using Theorem 1 with a fixed /■, allows a 
convex parameterization, which reduces the problem to solving a 


system of LMI's. This parameterization is similar to the projective 
parameterization introduced in [10]: 

//(*)=.*(*)«(*+»■)*' 

where (P, P) is the “parameter"—arbitrary pair of stable transfer 
matrices of appropriate size such that 

P(- ^A 0 - \i(ju' - r)ty(juj) 

+G)(jj.' - r)<b(j^ - /')] > <'I 

for some r > 0 for any -c C R, (A„. A, ) £ P. 


III. The “Rank One" Cask 

In this section, we discuss application of the main result in the 
case when m = 1, i.e., £ is scalar. Note that this is not the case of 
one single uncertain coefficient: basically, this is the case when we 
investigate stability of the linear differential equation 

+ ■■• + /()(M«' = 0 (.VI) 

where n is scalar. The following result is a simple corollary to 
Theorem l, based on the fact [10], [ll| that the P - stability is 
equivalent to the ordinary exponential stability in the rank one case 
(m - 1). 

Theorem 2 Let k be a convex compact subset of R 2k such that 
/ A ,. 0. 0, ■ ■. 0 ) £ A.' for some Hurwitz polynomial 
f(s) = fo + f\ s ■+ ■ ■ ■ + )\-\s k 1 4 .s* . The differential equation 
(3.1) with the time-varying coefficients such that 

(/«(/). fiith ■■■.//. i(M. /»(/). /,(/), ■ - ■ - h i (M)CA, 


for any /, is stable if for some /■ 0 any polynomial 

mm = + i -o 1 -' + ■"+ 

<V2> 

with f ,.■■■■ f k i' Ho ■ // 1 *" *" * iJk i) t A. , is a Hurwitz 

polynomial. 

Consider, as an example, the second-ordei differential equation 

«’(/) + /i (/)«• (/) + /j(/)« (t)~ 0 (3.3) 


where w{t) is scalar, and f,(i) are uncertain coefhcienls. Assume 
that f,(t) are within 100% of their (known) minimal values J‘\ = 1 
(s 1 ) and J> = F > 2. i.e., 

!</.(/) <2. F<fA1)<'>r. (3.4) 

Our aim is to find admissible relative rate of variation a tor 
i.e., to find a > 0 (as large as possible) such that the conditions 

|/i(/)|<«l \ fr ( t )\< r<i (3.5) 

and (3.4) guarantee exponential stability of (3.3). 

First, let us apply Theorem 2 ol Ibis note in the case of the system 
setup (3.3M3.5). The sufficient condition of stability is existence of 
r > 0 such that all the polynomials 

p(s) = (s- r) J + - )■)+ (j, ~ 


/] € [V 2], |</, | < «. f, € \t\ 2 F], I;/..I < Fa 

are Hurwitz. It is easy to see (remember that F > 2) that the condition 
is satisfied for r = 1/1, a < 1/4. Therefore, a < 1/1 is a criterion 
of stability given by Theorem 2. 

Now let us apply the result [2] (Theorem 3.2, (ii), with corrections), 




I \ 


vmy 


(3.6) 
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m analysis of the system (3 3)-0 5) To do this, we should represent 
il 3) in a state-space iorm (11), One can use the canonical lorm, 01 
u\y equivalent state-space realization, 


0 

hd) -/!(/) 


01 


MD^S 


0 

-hit) 


1 

—/l(/) 


s- 1 


Indeed, we have 

R( 4(f)) > -0 r )tr( 4(f)) =: -0 j 


|| i(/)|| > y'ldei 4(f)| « /IF || 4(0|| > sup|MO| - « 


existence of r/i > 0 such that 

\I<»t - \hl< .H | ' i/| |( | j , 

foi all ) and tor all / e Ii We will piove (Ins by show in tlm 
(i 7* ( — 7< > • 11 

Suppose (with no loss ot geneialily) that / has compel uppon 
Lei ii = J*< y = T($i i,(f) - < 'ill) u i j / 

IJ (/) = f ,f v(f) It is easy to check lhai i a t/ aic squ ail 
summable, and it ~ Vi, u ~G,^i where G (si bis-,) 

Iheiqtoie, hy (2 I), using the Parseval loimula we have (i a 
(b u -b,/(2i ))v, ) > 1 1» |'foiany(«S ) C O Since the opei nois 
Ti* and Jr <t> are causal and the last inequality holds loi an arhitiarv 
i € 1 2 , we have tr \{2rJ)(t) - I)(1))]] (/)} > 0 foi any t where 


Hence, the criterion would not give a hound heller than a < 
1/2 3 l I * Even for the smallest value of 7 under consideration 
J = 2, this is about 30000 times worse than the criterion « x 1 /4 
Also, the last bound yields a —+ 0 as 7 —+ x while the bound 
uivcn by rheoiem 2 does not depend on / € [2 x ) Therefore, toi 
the example under consideration the cnteiion [2] does not give good 
piactical numbers, and also does not allow us to obscivc the important 
thcoielical phenomenon the admissible bound tor the iclativc late ot 
variation does not tend to zero as the phase ol the nominal system 
poles tends to ±n/2 

IV Concluding Rf marks 

Ihe sulliueni criterion of stability ptescnlcd in this papei is tar 
trom being ncccssaiy It appeals that the uitenon would provide 
a good balance between complexity and conservalivily ol analysis 
toi dynamical systems with low to medium' speed lime-varying 
coelhcients (u not too Iasi) II can be shown that the criterion is 
based on using a ccitam set ol Integral Quadratic Constraints (see 
11TJ) describing STV gains TTierelore il can be used in the case 
ol a mixed uncertainty combining leal paramedic uncertainty fast 
time varying gams conic nonltncarities unmodeled 1 TI dynamics, 
and so on 1 hese extensions will be discussed m a forthcoming paper 

V APPrNDlX 

Pi oof of I fuoiem I In this pi oof 1 t denotes the standaid Hilbert 
space ol squaic summable (unctions h R —» R 1 with the norm | | 
and the scalai product ( ) 11 ^ is a bounded measuicable matrix 

valued function on R f/s denotes the operator of multiplication 
by ic {Mf,h)(t) = b(t)h(t) acting from one I - space to 
(generally) another Similarly if b — 7(s) is a rational matrix 
valued function bounded on the imaginary axis the // dcnoics the 
I oran opcraioi ol multiplication in the tiequency domain Iu(jjl) — 
r ()//( fj, )V-t £ R toi h\ — T] h where denotes the Founei 
liansfonn Remember that 7/ h = Ii < and that it T is bounded 
“i the right-half plane Re(s) > 0 then the opoiator 7 j is causal, 
i e (7y h )(/) - 0 ii /;(f j ) = 0 for any t\ < t 

It is sulhcieni to show that the opeiator / - \h 1< is invertible lor 
my />( ), and that the norm of (I - \hh ) 1 is bounded Since the 
cM of b( ) is convex and contains the zero function one only needs 
to show that there exists q > 0 such that 

It - \hT<t\>q\i\ (4 1) 

tor any and tor any t £ l 1 Note that since (0 0) 6 P, condition 
- 1) implies that ty(i^) - ql > 0 for any ^ £ R therefore, 
since 'P is bounded tn the right-half plane, #(s) - ql > 0 foi 
4ny s with Re(s) > 0 Hence, the rational function 't(s) 1 where 
^(s) = ^r(s-|-i)| S well dehned and bounded in the right-half plane 
Therefore, the operator 7* is invertible and it is sufhcient to prove 


D(1) =r [J -/»(/) - (T\ 


(t) = 


«(/) 

Hit) 

id) 


U (t) 


-f 


(r)t( )\ r r/r 


Note (hat the matrix H is bounded and U ( 7 ) = 0 loi r approaching 
- x I lence 


(l 7 p I - \1* 1 < <t 1 ) - r 1 1 | 



(whcic the integrals are ovei R) 


11 *dt - u /< ' Pll r It 

Si ((2/73- D)\) )ilt ^ I) 
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Routing and Scheduling in Heterogeneous 
Systems: A Sample Path Approach 

Panayotis D. Sparaggis 

Abstract —Consider the problem of routing customers to a set or A' 
parallel servers that have different rates. Each server has a buffer with 
Infinite capacity. The arrival process Is general and the service times 
are assumed to be i.i.d. exponential random variables. Using sample 
path arguments, we show that, given any Bernoulli policy v. there exists 
another policy p which outperforms r by partially using a randomized 
version of a round-robin policy. Moreover, p is easily specified and 
implemented. We present extensions of our results to systems with finite 
capacities and service times that have an increasing hazard rate. Finally, 
a similar result is shown to hold In the context of scheduling customers 
from a set of A" parallel queues. 

1. Introduction 

Consider a queueing system with A' parallel queues, each of which 
has a buffer with infinite or finite capacity and a dedicated server. 
There is a single arrival stream and a controller that routes customers 
to one of the A queues as soon as they arrive. The situation arises 
in at least two contexts: communication networks and multiprocessor 
computer systems. In the context of computer networks, for example, 
there arc usually multiple paths to remote destinations and routing 
decisions have to be made locally, i.e., at the source node, based on 
information regarding the delay characteristics of the available paths. 

The determination of a routing policy that is optimal with respect 
to a certain performance criterion (such as throughput or delay) has 
been the subject of extensive research in the past. In the simple 
case, i.e., when service times are i.i.d. exponential random variables 
and the controller has queue length information available at arrival 
instants, the shortest queue (SQ) policy has been shown to maximize 
throughput in systems with infinite capacities (sec |3], [19], and [21 ]). 
This is often referred to as Ihe symmetric routing problem. In addition, 
it has been shown that the SO policy stochastically minimizes the 
number of customers that are lost by any time t when the buffers 
have finite queueing space, e.g.. [5], [9J, [16], A counterexample for 
nonexponential service times is provided in [20], Very recently, the 
optimality of the SQ policy was shown to hold over all service lime 
distributions that are increasing in likelihood ratio |I7|. When the 
rates at the A servers are not equal, however, the problem becomes 
significantly more difficult (see [I], [21). A routing policy which for 
an infinite number of stations performs almost optimally over the 
class of deterministic policies is described in [II]. 

In an interesting variation of the symmetric routing problem, the 
controller docs not have any queue length information. Indeed, this is 
a more plausible assumption in many applications. It has been shown 
that, when the servers are homogeneous and the customers have either 
exponential [3] or deterministic [12], [13] service times, the round- 
robin (RR) policy that alternates among the queues maximizes the 
system’s throughput. Recently, it was proven in |6| that, in fact, the 
optimality of the RR policy covers all service-time distributions that 
have an increasing hazard rate. Extending the basic symmetric routing 
problem, it was assumed in [I3| that there exists multiple arrival 
streams, each of which has a controller that selects one of the A’ 
queues each time a customer has to be routed, based only on its own 
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previous routing decisions. When the service times are deterministic 
and all the servers operate at the same rate, it was shown in [13J 
that a set of randomized RR policies (see Section II for a definition) 
becomes an equilibrium set of policies. That is, if all controllers use a 
randomized version of RR then, from the perspective of minimizing 
the expected steady-state delay of customers on its own stream, no 
individual controller has incentive to deviate from this scheme. 

When the servers operate at different rates, a simple routing policy 
is to use Bernoulli splitting, i.e., route a customer to queue i with 
probability p ,. p> = 1. regardless of past decisions. The 

idea is to allocate larger portions of the incoming traffic to faster 
servers, effectively aiming to materialize the most benefits of load 
distributing. It is the main result of this note that if the service 
times are exponential then, given any Bernoulli policy tt, there exists 
another routing policy, which partly uses a randomized version of 
RR, and which is superior to tt in the sense of separable increasing 
convex ordering (the definition given in the following) of the queue 
length process. This will be proved in Section IV. Note that 7r need 
not be the optimal Bernoulli policy. As it will be seen, given the 
routing probabilities for 7r, policy p can be easily specified and 
implemented. Thus, this result has immediate applications since, in 
practice, probabilistic splitting is a common policy for static routing. 
When the system is Markovian, a numerical example shows that p can 
significantly reduce the stationary mean system delay at moderate-lo- 
liigh traffic. 

A similar result is shown to hold in the context of scheduling 
customers from A' queues, each of which receives customers from a 
dedicated arrival stream (see Section V). Here, it is assumed that there 
is a single server which may switch to one of the queues according 
to u scheduling policy that is determined by a controller. The server 
can be thought as a communication channel shared by a number 
of stations. In this case, our result covers probabilistic collision-free 
access policies that are implemented in a distributed lushion. Another 
application arises in the context of scheduling transmissions in packet 
radio networks, where time slots are assigned to various classes of 
customers satisfying certain interference constraints (see, for example, 
141 and [15]). When scheduling is done probabilistically, the policy 
is often referred to as random polling. 

II. Preliminaries 

Let q, r. be two vectors in H h . Wc introduce the notation y* to 
denote the kih largest element in vector q and define the following 
ordering (see |8j). 

Definition I : Vector q is said to weakly majorize vector r (written 
r X tl q) if 

»-i /i 

In particular, if { r, = i ( l <- then q is said to inajori/e r. The 
ordering in Definition 1 is usually referred to as weak submajorization 
(as opposed to Ihe weak supermajorizalion ordering: sec [8J for 
a comprehensive treatment). Throughout the rest of the note, we 
will use the broad term majori/.ation to refer to the ordering 
Majorization has been a useful analysis tool in the context of routing 
or scheduling of customers (see, for example, [71 and |9|). The 
ordering makes precise the idea that the vectors of queue lengths 
in one system are “less" or “more balanced" than those in another 
system, thus, setting a convenient framework for making sample path 
comparisons among different load distributing policies. 

Following the notation in [8| let C J denote the class of functions 
/: R A — R of the form f(q) = , //(y,). where //: R —► R 
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r ig 1 An txamplc ot a system with multiple arrival streams 


is increasing and convex ( J is a subscl of the class of Schur 
convex functions defined on U h (see [8)) thus having the property 
r < q ^ fir) < i(fl) V/ f ( J A natural way to associate 
( J with a stochastic majonzation ordering between random vectois 
is the following (see |8| for a complete discussion of stochastic 
majonzution) 

Definition 2 If TV and M are random vectois we have 

M < f \ N if F[i>(M)]< r[<>(N)\ Vo 

The aforementioned ordering is sometimes referred to as separable 
increasing convex ordering ([6]) Finally we write \N(f) 1 >_ 

0} < \M{t) I ^ 0} if 

7 i 

(N(U) N(1 ))<, (M(t i) M(t )) 

lor all n 1 1 t 

III Tut Randomi/ki Round Robin Pol ic y 

In this section we considci a system of A parallel queues that 
admit customers liom \f external airival streams Fach queue has 
infinite capacity and a dedicated server Customers from each stream 
arrive at a contiollei that roules ihtm lo one of the A queues (see 
Fig I) I he contiollcr has available the past routing decisions on its 
own stream but no queue length information We assume that the 
service times at each queue form a sequence ot i i d exponential 
landom variables with a constant rate fi On the other hand the 
sequence ol arrival limes on any particular stream is assumed to be 
independent of all service limes but otherwise arbitrary For example 
it can be a deterministic sequence I he arrival processes are mutually 
independent although not necessarily identically distributed 

Wc are intetcsted in the class ol symmetric routing policies L 
defined as fed lows A policy belongs to L if foi each arrival stieam 
the probability that the /th customer is routed lo queue / is 1/A tor 
ill ) = 1 A i == 1 2 An example is a policy that uses 

Bernoulli splitting with equal routing probabilities at each controller 
Another example ts a policy that uses a randomized round robin (Rl) 
policy (introduced in [IT]) at each controller Recall lhat given the 
first A customers to amve a round robin (RR) policy routes exactly 
me customer to each ol the h queues according to some routing 
naltern and repeats this pattern therealtei R1 can be thought as a 
tochastic version of RR the controller uses a round robin policy that 
elects the routing pattern for the first A customers randomly in such 
i way that the probability ot the /th customer being routed to queue 
I is 1/A / i = 1 A An example is a policy that selects a 
uniting pattern among the A cyclic shifts of the set {1 A } with 
probability \/h Given a policy n in 3] let A *(/) denote the number 
ol customers in queue / at time t and AT* (f) - ( A , ,r (/) , \ J[ ( t)) 

finally let D*{t) denote the total number of departures by time t 
under n 

Using dynamic programming it was shown in [ 1TJ that in a 
system that consists ol A queues with deterministic servers and 


W independent but identically distributed miv ii w M 
RT policies employed by the M controllers is \ 

policies m the following sense if each ol M I i , u , 

R3 policy then the Uth controller should do the n , i 
minimize the expected delay ot its own cusionuis o \ \ 
to show that foi the system considered in this siu j \ 

policies minimizes the queue length process in the su »| cj 
increasing convex ordering, among all policies in (Hu 11 
proof follows the steps taken in [ 1T| 

More specifically as it will become cleai in the followm m >i 1 
to prove the optimality of a set ol policies it sufhees to show ih il 
RR is optimal among all policies lhat use no queue length information 
when U = 1 and in addition there exist A independent airival 
streams one to each queue, that are identical!) distributed Let 1 1 
be the class of feasible routing policies foi this system The lesult 
can be shown by constructing a modified probability space in which 
all arrivals from the A dedicated streams occur simultaneous!) at 
the A queues and all departures from the A queues also occur 
simultaneously at the jump points of a Poisson process with paiametei 
Ii Given a policy n f L| let (TV 71 (/) t ^ 0} denote the queue length 
piocess in this modified space Clearly \ n {t) = , \ n (t) for all 
/ > 0 and / = 1 A but \N*(t) t > ()| jr H[ {TVV) t > 0} 
That is the joint statistics of the queue lengths are modified 
A slight extension of an argument given in P| (sec also 118] for 
moic details) shows that N 1 1 (t) ~< TV*(/) foi all r G -i / ^ 0 

Then by an elementary property of majou/ation 

N" (1 )M (AfVi) N’(t )) 

V// 1 1 / 

Since the marginal distributions ol the queue lengths are not modified 
by the construction it follows that , Yl, i H'/O V ))] - 
YL’\ Yl, i I M ^ r ( 1 1 ))] for all r F Di <i increasing and convex 
Thus we gel the following proposition 

Pioposition / In a system with A dedicated airival streams and 
one common external stream RR minimizes the queue lengths in the 
following sense 

{TV' r '(t) t > 0} < # , lAT(f) t > 0} 

for all 7r in D| provided that A ' 1 (0) = i \/(0) lot all / y - 
1 A 

Going back lo our original problem it now becomes easy to see 
that a set ot R3 policies employed by the U controllers outpeiloims 
all policies in 1- Let p be a policy in 31 such that on ai least one 
stream say stream l a policy different from R^ is used Ihekeyisin 
noting that due to the way £ is defined the aggregate arnval process 
from streams 2 to \/ is such that each queue sees an identically 
distributed subsequence of arrivals In other words conditioned on 
arrivals that are not from the first stream the arrival piocess at queue 
/ is equal in law to that at queue y for all / and y Therefore it 
follows from Proposition 1 that a policy p which copies p on all 
streams except lrom the stream 1 where it applies R1 instead does 
outperform p in the sense of separable increasing convex oidering of 
the queue length process Repeating the argument foi all sticams on 
which p does not use R1 we get the following result 
Proposition 2 In a system with M common external arrival 
streams a set of R^ policies minimizes the queue lengths in the 
following sense 

(jv M m i > o} <,, {jv t (/) t > til 

for all n m ^ provided that \ /M (0) =,/ V/(0) for all / / - 
1 A 
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Fig, 2. A comparison of three policies. 


Remark I: As a consequence of the aforementioned result, it 
follows that a set of R3 policies maximizes the expected value of 
£)*(/)* for all t > 0, among all policies in D. Note, however, that this 
set of policies need not be optimal over all feasible routing policies 
(in particular, nonsymmetric policies). For example. A' = M and the 
arrival processes are identically distributed renewal processes with 
deterministic interarrival times, then the optimal policy is the one 
that assigns a single queue to each stream, so that all customers 
from that stream are routed to one particular queue. It follows that 
this scheme reduces to an RR policy applied to the aggregate arrival 
process (streams 1 to A/), which by Proposition 1 ensures optimality. 

A numerical example is given in Fig. 2. There arc four Poisson 
streams, each with rate A, and four queues, each with an exponential 
server of rate p = 1. The graph compares the steady-state mean 
system delay of a set of Bernoulli policies against that of a set of R3 
policies as well as a set of &emi-R3 policies. The latter is defined to 
be one in which each of streams I and 2 apply an R3 algorithm to 
queues 1 and 2. and each of streams 3 and 4 apply an R3 algorithm to 
queues 3 and 4. We assume that all Bernoulli policies use a vector of 
equal routing probabilities, which then makes the system equivalent 
to a set of four independent M/M/1 queues. The performance of the 
R3 and seini-R3 policies was determined via a simulation experiment. 
Each point in the graph represents an average of 100 replications of 
the experiment, each run lor 10 000 customers. 

As seen in the graph, the sel of R3 policies reduces the mean 
system delay considerably, especially al heavy load where it pays 
off significantly to avoid routing consecutively to the same queue. 
It is worth noting that the sel of semi-R3 policies does indeed 
perform better than the set of R3 policies at light traffic (although 
the difference in the mean delay is small, but observable within the 
confidence interval of our simulation, and is not clearly seen in the 
graph). This was first noted in [13J in the context of queues with 
deterministic servers. The intuition lies on a trade-off between the 
degree of sharing of individual servers by customers from different 
arrival streams and the variability in the arrival process that is seen 
by a particular server. 

IV. Routinc. in Heterogeneous Systems 

In this section, we consider a system consisting of A heterogeneous 
queues each of which has infinite capacity and a dedicated server. 
The service times at queue / form a sequence of i.i.d. exponential 
random variables with rate //,. There is one arrival stream (sec 
Fig. 3) and a controller that has no queue length information and 
must select a queue for each customer that arrives only based on 
information regarding its past routing decisions. We assume that the 
arrival process is independent from all the service processes bui 
is otherwise arbitrary. Let E/, denote the class of feasible routing 
policies. 



Fig. A system with heterogeneous servers. 


We use the term probabilistic policy to refer to any routing policy 
in Hi, that uses Bernoulli splitting, i.e., routes an arriving customer 
to queue i with probabilily p,. p, — 1. regardless of pasl 

decisions. Our objective is to show that, given any probabilistic 
rouling policy zr, there exists another policy p which partly employs 
an R3 algorithm, and which performs better than zr in the sense 
of separable increasing convex ordering of the queue lengths. In 
particular, policy p uses a Bernoulli algorithm to split ihe arrival 
process into a finite number of subprocesses, effectively creating a 
finite number of substreams of customers. Then, lor each substream, 
P cither routes all customers to a particular queue or applies an R3 
algorithm, possibly only lo a subset of the K queues. The exacl 
way in which the splitting of the arrival process is done depends on 
the routing probabilities for zr and is described in the following. We 
prove our result for K — 2 and later discuss how the result can he 
generalized to arbitrary 7v. 

Theorem I: For any probabilistic policy zr in Hi. employed m a 
system with two queues, ihere exists a policy p in D/, lhat partly 
employs an R3 algorithm such that Ihe following is true: 

{AT(/):/> Oil., {AT(M:/>(W 

provided that *V/’(D) =*i .Y^ri)) for all /. j = 1. ■ ■. 1(. 

Proof We condition on iniLial queue lengths, arrival limes, and 
service limes. We assume that service completions at queue / can only 
occur al the jump points of a Poisson process with rale p,. Consider 
a probabilistic policy zr lhal routes an arriving customer lo queue I 
or 2 with probabilities p\. p 2 . respectively, p i + p-j — 1. Without 
loss of generality, we can assume that p\ > pj. The prool is by 
construction. Specifically, we construct a policy p as follows. When 
a customer arrives, p draws a random number u uniformly from the 
interval |(), 11. If u 6 [0, p\ - p_*]. Ihe customer is rouled lo queue 
1. Otherwise, the customer is rouled to either queue 1 or queue 2 
according to an R3 algorithm applied to queues 1 and 2. That is, for 
the subsequence of arrivals associated with random numbers that fall 
in the interval (p \ - p ±, 11 policy p uses an R3 routing policy. 

For each of the two policies, i.e., zr and p. we create two copies of 
the system, as follows. First, we divide the interval I = [U. 1] into 
three subintervals, namely, 7 1 = [0. p\ ~ py]. A> = (pi - p-j, pi], 
and J .i = {p \. 1], In the first copy of the system, zr draws a random 
number u from [0, lj and routes lo queue I if </ £ /j U A. otherwise 
it routes to queue 2. Policy p, on the other hand, draws a random 
number u from [0. 1] and routes lo queue I if </ E /i. otherwise uses 
an R3 algorithm that starts from queue 1 if the first random number 
u that falls in 7? U I\ is such thal // E A>; otherwise, it starts the 
algorithm from queue 2. In the second copy of ihe system, n routes to 
queue I if u € 1\ U/.i, otherwise it routes to queue 2. Policy p, routes 
to queue I if u E 7j. otherwise uses an R3 algorithm that starts from 
queue 1 if the first random number a thal falls in A U/i is such that 
k E /.*; otherwise, it starts the algorithm from queue 2 (see Fig. 4). 
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I ig 4 The construction R1(/) denote** R) starts horn queue / 



To distinguish between the two copies we use the notation 
(»?(t) (tri| (t) mjl)) to denote the vectors ol the Fig 5 Comparing tt (Bernoulli) ind / 

queue lengths tor the hrst and the second copy respectively = 

(> 7 r Noting that |/j U h\ = \l } U i t | = y>i and \I_\ - |/<| - y>, 

it is seen that the marginal queue lengths in each ol the two copies - . , , . T . w 

_ 11 , * , f , to queues 2 to A tor a (A -In 

arc stochastically equal to those of the onginal system tor either ~ . 

_ . . . , , , . irathc (A - 1) uses an R1 algo 

t or n Coupling the random numbers that p and tt use foi making , , . t/ , 

and h Jor a -(JUv i - l'h ) port i 

routing decisions the key ol the proof is in showing that the foi owing . .. . n . . 

, ^ finally (A) loutes all customers froi 

relations are true on a sample path tor all t ^ 0 ...... , , 

r ■ nnrlirtn ni h» irjfh/ t r\ <iiiunp h P 


0 01 02 03 04 05 06 07 1)8 09 

A 


-< {n 7(0 i"U f )) (1) 

(«i(0 t))< («'(/) #«'(/)) U) 

Wt only discuss (I) relation (2) follows smiilail> Consider the 
two cioss systems namely (//', (/) m\ [t)) and (//[(/) mj{t)) By 
assumption service completions occur simultaneously in these two 
systems (at the jump points of a Poisson process with rate p\) and 
so do arrivals associated with random numbers diawn from 1 1 As 
to arrivals associated with random numbers from h U /» note that 
under () these alternate between the queues with lenglhs //'j(f) and 
m | (t) Thus the cross system undu /' (see the left hand side ol (I)) 
can be thought as one consisting of two queues with homogeneous 
exponential set vers two dedicated arrival streams (amvals associated 
with a s inii) and one common external stieam (amvals associated 
with i/s in fjU/i) with a controller that uses an RR policy The 
same is true for the cross system under tt except that the controller 
on the external stream does not use an RR policy Thus (1) follows 
by the aigumcnts piccedtng Proposition 1 

Let n (/) — (»](/) n]{t) in](t) m 2 {1)) - p r Prom (1) 

(2) and an elementary property of ma)ouzation it follows that 

(«'(/i) n'U/)M (n n {t\) n"(ti)) V/ t\ tt 


n {tt)) 


y^l7(» ; i ( f )) + '/(»' {t ))] + i ))1 

l _J 

1 

> )) + ))] 

i 

/ 

+ )) + i/(»>T(f ))] (1) 

/ I 

for all <} increasing and convex Taking expectations and recalling 
that the marginal queue lengths for each of the two copies are 
stochastically equal to those of the original system for both p and tt 
the result follows ■ 

As it is seen from the proof p can be easily implemented let 
us now describe how p is specified for arbitrary A For simplicity 
assume the routing probabilities for n are in decreasing order i e 
p i < p 2 < < ph Then p acts as follows 1) uses an RT 

algorithm applied to queues 1 to A tor a hp\ portion ol the 
incoming traffic, 2) uses another independent RT algorithm applied 


to queues 2 to A tor a (A - 1 )(p - pi) portion ol the incoming 
irathc (A' - 1) uses an R1 algorithm applied to queues A - 1 
and A lor a 2{pk \ - y>/ v ) portion of the incoming traffic, and 
finally (A r ) loutes all customers Irom the remaining (pf s - y>/ N i) 
portion of the irathc to queue A Note that these portions of the 
incoming traffic are specified by a probabilistic touting algorithm 

In the interest of space wc will sketch the proof lor A = 1 This 
essentially dcscnhes the inductive argument that extends the prool to 
arbitrary A First define a policy p\ as follows Policy p\ uses an 
RT algorithm applied to queues I 2 ^ for a 3y>i portion of the traffic 
routes a 2{p - p\) portion of the traffic to queue 2 and a y>» - p „ 
portion of the tiafhc to queue 1 To prove lhal /m is belter than k 
one needs to construct three copies ol the system such that the ordei 
of amvals to queues 1 2 1 under RT is 1 2 and T foi cops 1 2 ^ 
and 1 tor copy 2 and finally 1 1 and 2 lor copy Then similaily 
to Theorem 1 (see (1) and (2)) it can he shown lhal 

{n't (O »' 1 {1) "'I (f)) -s (//"i (f) n n Jt) ti \{t )) / - 1 2 1 

(4) 

where n ] denotes the queue length process at queue / ill copy / 
under policy — p\ r 

Now let p be a policy that uses an RT algorithm applied to queues 
1 2 3 for a 3yn portion of the traffic uses another R3 algorithm 
applied to queues 2 3 for a 2{p - y>i) portion of the traffic and 
routes d p \ - p portion of the traftic to queue 3 An application oi 
Theorem 1 to queues 2 and T proves that p outpciforms p\ Thus 
p is the optimal policy in 

A numerical example is given m big S There art two queues with 
service limes that are exponential the rates art //1 = 0 0 p “01 
Customers arnvt from a Poisson stream ol late \ The optimal 
muting probabilities toi a Bernoulli policy (in leims of minimizing the 
steady state mean system delay) arc known eg | IS] to be given by 

_J>| y/fc- Viy/Fi + *y/F\ ,1 
1>l A( v ^77 +JJT) J 

P^ = 1 ~ y/i p i p (^) 


The graph compares the mean delay undei a probabilistic routing 
policy 7r that uses the aforementioned muting probabilities with a 
policy p that routes a (y>i - y>j) portion ot the incoming traftic to 
queue 1 and uses an RT algorithm for the remaining portion of the 
truffic The graph also piesents a lower bound on the performance 
oi policies that use no queue length information expressed as the 
mean delay of a preemptive M/M/2 queue that is the slow server 
i e , the one with rate //^ can be preempted so that the fast seiver is 
always kept busy as long as there are customers in the system This 
bound will, of course never be achieved 
As it is seen in the graph, policy p reduces the mean system 
delay considerably at heavy load, compared to tt At light load all 



160 


l£EE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. I. JANUARY 1995 



Fig. 6. A scheduling system with A' queues. 


three policies behave approximately ihe same, as the fast server is 
almost exclusively used. In particular, the Bernoulli policy and p have 
exactly the same performance since under the former, customers arc 
always routed to the fast server: for A < 0.1, it follows from (5) that 
pi = 1. Thus, by the way p is constructed, it follows that it also 
always routes to the fasi server. On the other hand, at heavy load, the 
performance of p approaches that of the M/M/2 system. In contrast, 
the Bernoulli policy increases the mean system delay away from its 
lower bound. 

V. Thk Scheduling Problem 

In this section, we consider a system consisting of A queues each 
of which has infinite capacity. Queue i receives customers from 
a Poisson stream with rate A,. There is a single server that may 
switch to one of the queues according to a scheduling policy that is 
determined by a controller (see Fig. 6). We assume that the controller 
has no queue length information and schedules the server based only 
on the history of its past decisions. Once the controller selects a queue 
to which the server is scheduled, we assume that a single customer 
from that queue is served (if one exists) before the server switches to 
another queue. If immediately upon the time the server joins a queue, 
that queue is empty, it is assumed that the server remains idle for a 
time period equal to the service time of a single customer; then it 
joins another queue. This is a reasonable assumption, for example, 
when time is slotted and all service times are deterministic equal 
to the duration of a time slot. In the following, we assume that 
the distribution of the service times is general and that the arrival 
processes are mutually independent us well as independent of the 
service processes. 

The interest in studying these systems primarily arises in the 
context of multiple access communication systems. One may think 
of A stations sharing a common channel and wishing to transmit 
fixed-length packets at the beginning of time slots. One of the types 
of policies that can be used for accessing the channel arc collision- 
free policies, under which it is not possible for two or more stations 
to attempt transmission of a packet at the beginning of the same 
time slot. Such policies can be deterministic. e.g„ lime division 
multiplexing, or probabilistic. The latter can be implemented in a 
distributed fashion, i.e., without any need for centralized coordination, 
provided that all stations use the same random number sequence. 

Let h be the class of scheduling policies for the system that 
we have just described. The result in this section is similar to that 
proved in Section IV. That is, given any probabilistic scheduling 
policy 7T in there exists another policy p that partly employs 
an R3 algorithm, and that outperforms n in the sense of separable 
increasing convex ordering of queue lengths. Then, p can be easily 
constructed on basis of these probabilities and be guaranteed to reduce 
the total workload at all times. The proof and the construction are 
similar to those in Section IV and are omitted. 


1 2 312 3 123 1 2 



Fig. 7. Routing customers to queues. Arrivals from the external stream (thick 
lines) are as if the RR algorithm applied only to them. 


Theorem 2: For any probabilistic policy it in Us,/, there exists a 
policy p that partly employs an R3 algorithm such that the following 
is true: 

t > o} < , {N*(tyt> 0} 

f i 

provided that A7'(0) =,t X* (l)) for all L j — 1. ■ • ■. K. 

Remark 2: It is not difficult to sec that the arrival processes 
need not, in fact, be Poisson. Note that in the routing problem, 
we were not able to relax the assumption of the service times 
being exponentially distributed, since we required that both real and 
fictitious service completions be synchronized at the jump points of 
the service-counting process. This simplifies the proof considerably. 
In the scheduling system, of course, arrivals are always real, thus, 
there is no need to use the memoryless property of the exponential 
distribution. 

VI. Extensions 

This section contains a discussion of further issues that arise in 
the context of routing. Similar remarks can he made, of course, for 
the scheduling problem. First, it is interesting to study systems with 
finite capacities. A slight extension of Theorem I in [ 121 shows that 
in a system with M external arrival streams and A homogeneous 
exponential servers, a set of R3 policies minimizes the expected 
number of departures by any time 1 and, at the same time, it minimizes 
the expected number of losses by t. As a consequence. Theorem I 
can be extended to systems with finite buffers. As one expects, due 
lo losses that may occur, the E\ ordering on the queue lengths no 
longer holds when capacities are finite. 

Theorem I does in fact hold for I HR service-time distributions. 
The key is in extending Theorem 1 in [6| (that proves the optimality 
of the RR policy when the service limes form a sequence of i.i.d. 
r.v.s with IHR distribution) to systems that may admit arrivals from 
A’ dedicated arrival streams. I he extension is rather straightforward. 
Since the arrival process in [6] is assumed to be arbitrary, one can 
simply assume that arrivals from the dedicated streams that can be 
assumed to occur simultaneously at the /v queues are part of the 
external arrival process and are routed according to the RR policy. 
This docs not modify the algorithm for the original common external 
arrival process (see, for example. Fig. 7, where K = 3). 
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Mefhods and Theory for Off-Line Mdihini It t. 

S Yakowitz and J Mu 

Abstract —Many problem* in machine learning can lu ,d> h M t»d n 
the sequential design task or finding the minimum ol an unknown mm, 
and possibly discontinuous function on the basis ol noisv measure mi m 
In the present work, it is presumed that there is no penalty to, 
choices during the experimental stage, and at some tmu m.( known 
to the decision maker, or under his eontiol, the experimentation will 
be terminated, and the decision maker will need to speuh the point 
considered best, on the basis of the experimentation In this non, we seek 
the best trade-off between i) acquiring new test points, and id relisting 
at points previously selected so as to improve the estimates ol rclutive 
performance The algorithm is shown to achieve a performance standard 
described herein I his decision setting would seem natural for I unction 
minimization in a simulation context or tor tuning up a production 
process prior to putting il into service 

I iNIKODIK DON 

I here aie mynads of situations in which one is contionicd with 
a sequence of stochastically similar decision problems In such 
situations as m real life one often has the oppoiumily to selecl better 
decisions as a data bast of actions and outcomes accumulate When 
this sifting and exploration ot decision space is done by a computer 
algorithm we tcun the process machine learning Such an activity 
can be abstractly viewed as the process of finding the minimum ol 
jn unknown function on the basis ol noisy mcasuiemails In the 
present note the locus is on this abstraction 

A chaiactenstic of off line learning is that thcie is no need to 
he concerned with performance duiing the exploration stage (By 
contrast on line learning seeks to minimize observed loss as the 
process evolves ) The criterion of goodness ol otl line algorithms is 
(hat ihc decision or design piovided at the end ol the session performs 
as well as possible II is lacilly assumed ihat this decision point will 
be selected foi implementation without lurlhci testing I or example, 
m blackjack during simulations no money is to be gamed ot lost In 
a long sequence of cxploialoiy games one could occasionally list to 
sec how good the best tens count strategy lound thus fat is and end 
the experimental session and head oft lo Nevada when perloimance 
seems promising The blackjack study |l| howevu employed on 
line learning but in view ol icsults to follow it appears that the 
learning plan could have been sharpened further 
The goal ot the present theoretical study is to pose and examine 
oil line learning algorithms We are awate of a lively lileiatuie (eg 
Devroye [2] |1] Jarvis [4| Gunn [S] and Malvas [6|) which 
includes convergence icsults but to our knowledge idles have not 
been established for off line methods Ihc conclusion substantiated 
heiein is that significantly faster conveigencc rates can be achieved 
off line undci various hypotheses than by on line algorithms 
We proceed now with the particulars A lormal statement ot the 
problem we intend to address is as follows 

Definition I / Let /(i ) denote a bounded real valued measurable 
function on an infinite Borcl action set \ C 1 The oft line learning 
pioblem is the task of sequentially 

Manusuipt received Novcmbei 17 1992 revised May 21 1991 December 
10 1991 and March 21 1994 The work ol S Vakowil/ w is supiiorted in 
pait by the National Science f oundalion under giants ICS 891T642 and JNT 
92014K) and by the National Institutes ol Health under giant A129426 01 
The authors ait with the Systems and Industrial I iigiiiLcring Department 
University of Arizonu Tucson A/ 8^721 USA 
IL \ E Log Numbci 9407011 
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]) choosing design points .v„ , on the basis of a history of noisy 
observations 

\h = J/U/) “ SUj) + f(.r ; ). j < «. (1) 

and 

2) designating a particular design point ./■* on the basis of these 
observations, so that for some specified positive number r, as n f 

P[f(K) “ /imit > '*] 0. (2) 

Convergence rate assurances are desirable. 

In (2) is the minimum (or Lebesgue-essemial infimum) of 
/(jr). Also t(.Vj ) in (1) is a 0-mean random variable and the sequence 

(■*’,/)} is hypothesized to satisfy certain assumptions to be staled 
in Section [I. 

One can intuitively justify an off-line learning criterion by imag¬ 
ining that one can perform experiments until some unknown time 
when it will be demanded that the experimentation cease and the 
best operating point, i.e., , found so far be declared and used. 

II. An Off-Ltne Learning Algorithm 
A. The Off-Line Method 

The simple plan of the algorithm is occasionally to add a new 
randomly chosen design point to our list of test points. At other 
times, one refines the estimate of the expected performance of design 
points already on the list. 

We first offer a generic off-line learning algorithm, and the subse¬ 
quent discussions will be aimed at stating properties of this algorithm 
under various postulates. In what follows, the symbol C will represent 
various positive constants, the values of which are not important 
for the associated discussions. The algorithm has as parameters 
a probability density function p( ) on the action set X and a 
nondecreasing integer sequence {XP {//)}„. The density does not 
affect the rate of convergence (provided it satisfies postulates to be 
stated). If the user has some intuition about the location of promising 
points in decision space -V, this could be reflected by relatively large 
density values. The XP(n) sequence specifies times at which new 
decision points should be acquired and added to our experimentation 
list. The selection of this sequence should depend on properties one is 
willing to assume about the noise. In Section II-C, the important cases 
of finite moment and finite moment generating function are covered. 

Algorithm Parameters: i) p(.r) is a probability density function 
having as support all of a (measurable) decision space -V C R (i . (If 
■V is discrete, take p(.r) to be a probability mass function.) 

ii) {XP[n ) | is a nondecreasing unbounded sequence of integers 
such that 

XP{\) — 1, XP(n)/n I 0. and VP(i»)- YP(i» - 1) f (0. 1} 

If supj E|f(.i )| ,) < , let XP{u) oc ln(n). 

If sup, E|expl0f(.r))| < -x, let XP{n) oc y/77. 

Set n = 1 and proceed to the iterative step. 

The Iterative Step: 

New Sample Times (Steps I and 2) If it = 1 or XP(n - 1) < 
X Pin), randomly select and test a new point from X. 

1) Choose a point, designated by t(XP(n)), from -V at random 
according to the pdf p(>r). Set = t(XP(n)) and observe y u = 

2) Start a running average ni\ t ^ n) for observations at 
t(XP(n)) by declaring 

111 

and start a counter at /(.VP(u)) by setting XS(XP(n)) = 1. 


Remark: The user-specified sequence ‘\V P{ n )" (See item ii)) tells 
the number of New [sample] Points that have been gathered by 
rime n, The random quantity “.V S[j)" which is determined by the 
realization of the learning process, gives the number of samples that 
have been taken at a given sample point t(j), j < XP(n). 

Go to 6. 

Resample Times (Steps 3, 4, and 5): Else if XP(n - 1) = 
XP(n ), resample at the point which has been tested the fewest 
number of times. 

3) Let I be any index i, 1 < i < XP(n), such that 

XS(i)< NS(j), 1< j< XP(n). (3) 

Set .r n = 1(1), and observe y„. 

4) Update the sample mean and sample counter. Set 

nt[ = [nij ■ .V Si I) + //„]/( XS(I) +1). (4) 

and 

A\9(7) = Y.SV) + 1. (5) 

5) At all times n such that Y P(u) > 1, announce as the best 
point (BP) the test point 1(1*), with I* = i where / is any index 
less than XP(n) such that 

m t < m 1 < ./ < X P(u). (6) 

(Note that HXP(n)) is not a candidate for the BP because il it has 
recently been selected, XS(XP(n)) <T XS(j),j < XP(n). Thus, 
the precision of the running mean m y/< as an estimate of J[.i \r) 
is worse than that of the precision of the other running means.) 

6) Set ii =n + l. Repeal the iterative step. (There is no official 
stopping condition, although one anticipates that at some arbitrary 
time, not known to the designer, the experimentation will be halted.) 

[End of Algorithm 1 

The properties of the algorithm will require various hypotheses, 
some of which will always he assumed in force. These are now 
presented. 

H. Convergence Analysis 

Action Space Assumption: The set X is either discrete (hut infi¬ 
nite) or has positive Lcbesguc measure. 

Noise Hypothesis I • For any 1 £ .V, define T, = {* (.r ,): .v, — /}. 
Then this random subsequence, if not empty, is independent and 
identically distributed (i.i.d.). Any collection n . /, (J) . ■ ■ ■. of such 
sequences are mutually stochastically independent. 

Noise Hypothesis 2: Let us suppose that for any error tolerance d 
and any number X of observations of V(.r), at any fixed .», there is 
a known function P(d. A ) which satisfies the following conditions: 

i) : uniformly in ./■ £ .V 

P[\>»AX)-/(.<■){> d]<P(d.X). (7) 

Here, we have let m ,(X) represent the mean of a sample of Y(.v) 
of si/e V at design point ,r. 

ii) : If A is regarded as a variable on the positive numbers, then 
for any d, P(d, A ) is continuously differentiable and strictly convex. 

The off-line algorithm assures that the number of observations, 
■V5(/) t made at each test point /(/), is about the same for each i, 

1 < / < A P. A property of this strategy which is fundamental to 
our analysis is the following proposition. 
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proposition 2 / Assume property 11 ) of the Noise Hypothesis 
2 Then among all static algorithms for allocating M observations 
among a given number A P ol test points the allocation A oc 
\f/\ P to each test point t{i) is the one which minimizes 

£ P(d \ ) (8) 

i \i 

Proof By the Lagrange multiplier procedure, the minimizer 
of (8), subject to £, w A = \1 must satisfy lor each /, 
l £ j < A P and lor some number A 

53 Pi'l 'N ) + \ ( 5Z ' - 1/ ) | - o (9) 

i __ \i y^ \i J J 

Irom which we have the following necessary condition 

0/0\,P(il V) = -A 1 < ) < A P (10) 

But this evidently implies that the derivatives of P(d A,) must all be 
equal By the strict convexity hypothesis the derivatives are strictly 
monoiomcdlly incicasing so the value at which they equal any given 
value - \ must be unique That in lum implies uniqueness of the 
optimizing value A y That is all the A s arc the same QLD 

In this study the objective of learning is to try to assure that 
at the designated best design point the objective function value is 
within some given tolerance « of optimality That is in icfeicnce to 
Algorithm Step S we seek a testing plan which guaiantces that 


Since In ( A P* ( n ))/A P* (u) -+ On suffices loi 1 11, n 
satisfy the relation 

A P*{n) - Integer Part[P(r/2 n/A P*(r/))/ln( / 

We suinmaii7C the developments on an achievable i it » 
in Proposition 2 2 

Proposition 2 2 Under the preceding noise and duiM j ^ 
hypotheses it A P*(n) satisfies (IS) then given anv loluaiiu i 
for some 0 < p < 1 

r\td(rd,))) > f„„ +,\- ) t i„i 

Usually the value / (</2) is not known to the designei bui (W>) 
remains satisfied lor any positive value*' 1 of this constant The 
effect ol omitting the divisor in (IS) is perhaps lo require a value 
0 closer to 1 

C Applnations and Implications 

The eases ol obvious interest are when the noises are known to have 
uniformly bounded moments of some order // ] or alternatively are 

known to have a moment generating function 1 bus in the nolalion 
ol (I), respectively 


sup F \f (/ )|' < ( 

(17) 

up P[exp (Of ( i ))] ^ C 

(IK) 


P[Mislake ( n )] - P\f(t(D) s Uu+i) I 0 (11) 

at some assured rale 4 Let 


J in) - mm f(t(i)) 

i \ / ( ) 


We construcl a sequence giving an assuied idle by noting that lor 
functions (J\{ \ P(n) n) and Q ( A P(n) n) dehned below 


P( Mistake (//)) P\f (/i) > / +</2] 

+ P lllfl\ | m - f (t(i))\ s i / 
v A P(n) n) + Q (A P(n) n) (12) 


It will be apparent that lor fixed n as A P increases (}\ decreases 
while () gets largei So the logical thing lo do is to balance these 
piobdbilities Specifically selecl A P*(n) to assure that for some 
constants C i > C >0 and all n 


for all 0 in some interval 7 Beiwten ihe two conditions of Imilely 
many moments (17) and infinitely many moments (18) essentially 
all of the well known landom variable families will be accounted for 
in the following stalcmeni 

Proposition 2 i Under the conditions of Ihe Proposition 2 2 toi 
the moment condition (17) the condition (11) is satished il 

A P{n) ~ ( log ( n) (19) 


and for (18) if 


A Fill) ~ t y/Ti 


( 20 ) 


In the moment bound case we have 

/’(Mistake ( n )] = ()(n " 11 log(/>)) (21) 

and in the finite moment generating function setting 

P[Mistakc(n)J = (Hp^ ) (22) 


<V,?i(AP’(i») n)M ( )(\P(i,) n) >t *<^i( A /'*(«) n) 

(H) 

lowaid achieving this design dehne P(r/) — P\I(T) </] wheie 7 
is distributed according to the algoiithm seaich density p( ) Then 
Qi( A P(//) n) = (1 - / (r /2)) N 1 ( 1 Under the independence 
assumption of Noise Hypothesis J 

p|m.i\|m - f(/(/))| > t /ij 

£ Q (A P(n) n) 

= A P*(n)P(</> n /A P*(ii)) 

(1 + (j( A P* (n )P{( /2 n/\/ Jt (;i)))) 

where r/(AP*(») P(</2 ;i/A P*{n))) — o( A P*(n )P[t /2 n/ 
A P* in))) Thus by simply equating Q i and Q we obtain the 
implicit relation 


for some U dependent) numbei f> in the open unit interval 

Proof For (19) a moment/probabilily bound eg [7] undei 
the moment condition (17) is that 

P(r/ n) = C/n l 1 

Fqualion (21) lesulls from substitution ol this bound into (1*>) 

I hen by the equation preceding (14) and the balance relation (H) 
we have 

P[Mistake(/#)] < C log (n)/n l 1 (21) 

which gives (21) 

Similarly from Cramers theorem eg |8 p 9] we have the 
assurance that the existence of the moment generating function 
implies the exponential inequality tor (7) 

P{(l n) =■ Cexp {-Qn) (24) 


A P* ( n ) 


Inf A P>))P(r/2 »»/AP*(n))/ln(l - F(</2)) 

(14) 


where Q - (}(d) is the dominating point of the laige deviation rate 
function Sec [8] tor details on its calculation In the Gaussian case 
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Q{d) « d 2 /2<r 2 where <r 2 is on upper bound for the variances of 

6 -V. 

Set p < 1 to be some number slightly larger than exp (— C^(r/}), 
say /> = J exp ( -Q(d)) and note that for any i > 1, > i/n for 

n sufficiently large. In view of (24), this gives (20), and (22) by the 
arguments for the moment case. QED 

Under the circumstance that either the noise has a symmetric 
distribution, or that the distribution function of the noise is given, 
one can estimate f(t(i)) + C\ for some constanl C not depending on 
t{i), by med,, the sample median at t(i). In the known-noise case, if 
the median is not defined, use the sample minimum of any defined 
quantile. For quantiles, probability bounds of type (20) are satisfied 
[9], That is, for fixed d > 0, and some positive constant C 

P[|med, - f(Hi)) + C | >d] = (Hexp(-C'XS(i))), 1 < i < XP. 


of each control .r, actually applied, as opposed to what is estimated to 
be the best control found so far, i.e., /(/*), there is the potential that 
the expected performance improvement is slower than for off-line 
learning. 

Toward comparison with Proposition 2.3, one can approximate the 
probability Po.\ (n ) of error in the on-line case by the first difference, 

Pov(n) = V./(n) = ./(;; + 1) -«/(;;) — P[on-line Mistake (i#)]* 

From Theorem 3 of the aforementioned work, for the moment 
bound case (17) and the exponential case (18), it turns out that the 
corresponding rates of convergence to 0 are, respectively, 

Poy(v) = 0(log(»)/>f J/v ) (27) 


The analysis for the moment generating function case then implies 
that for t(I* ) the minimizer of the sample medians, and for XP( n) cc 
y/n y the probability of mistake is 0(p v/7r ), some 0 < p < 1. These 
observations might be useful if the noise has undefined expectation, 
as in the case of the Cauchy family. 

We now argue that the rate cannot be uniformly improved by any 
other sampling plan. 

Proposition 2.4: Over the class of noise processes satisfying (18), 
the fastest achievable rate of convergence of probability of mistakes 
to 0, with increasing decision time ?i T is ()(pv 7 '), for some (problem 
dependent) number p in the open-unit interval. 

Proof: It suffices to exhibit a single learning problem for which 
no improvement over ()(p y f") is possible. The waiting time until a 
sample point T satisfies f{T) < + c/2 is geometrically dis¬ 

tributed. The only conceivable way to improve upon the conjectured 
rate is to gather new design points at a rate faster than \/n . This, of 
course, entails requiring that *Y S{i)/\fn = o(l) at all design points 
t(i): i < NP(n). 

Now suppose that unknown to the decision maker, f{x) only has 
two points, say 0 and l, in its range, and that 0 < p\: = P[f(T) = 
1] < 1. Further, it is presumed that the noise e(t ) = 0 if f(t ) = 0 and 
standard normal otherwise, and that the tolerance f = 1/2, so that 
a mistake occurs whenever /(*(/*)) = 1. It is now easy to confirm 
that for "bad” indexing the sample points t(j): /( t(j )) = 1, and w , 
being the sample mean or median at t y , that for some number C > 0 


P 


min m , < 0 
Lebad 


> exp (-C-XSM(h)) 


(25) 


where XSM ( u) = min XS{i). i < XP{ u ). Now from (25) it is 
clear that the probability that f(t{P)) = 1 cannot diminish faster 
than w ith p = exp(-C). Thus, any sampling plan for 

which XSM(n) — will have larger asymptotic probability 

of mistakes than XP*(n) in (15). 


D, Comparison to On-Line Learning Rates 

The on-line criterion is that for ,r„ the point tested at epoch n, a 
mistake occurs if 


f (*Pn ) /mm C. 

Yakowitz and Lowe 110] have analyzed the expected number of 
mistakes in the first n decision times. The expected number of 
mistakes can be written as 

n 

J(n) = ]5TV[/(.r,) > /„„„ + f]■ (26) 

L 

This criterion can be viewed as the measure of cumulative perfor¬ 
mance in the control process, up to time ». Since it measures the cost 


and 


Po\ in) = 0(log( n)/ i?). (28) 

It is known [111 that even in the case of finite decision space ,Y, and 
known parametric noise having moment generating functions, the best 
achievable rate for on-line strategies is P[Mistake (//)] = 0(1/;/). 


111. Conclusions 

The present noie contributes to the important decision-theoretic 
issue of how to undertake experimental design during an exploration 
phase so thal the terminal decision is as wise as possible. This is an 
important piece of Ihe larger picture of machine learning, a rigorous 
“black-box” approach to asymptotically optimal decision making. 

From applications, e.g., [12] [14], there is a convincing case that 
our machine learning techniques are feasible and useful for problems 
for which alternatives (such as genetic algorithms and neural nets) 
do not yet seem viable. Much of the theory is in place, and in 
particular, (here arc results about what is achievable [10], [111, 
[15], [16] and algorithms for attainment of the achievable. Some 
nontrivial toy problems have been mastered and initial investigations 
on serious problems (AIDS policy [13] and assembly line control 
[17]) have been published. We foresee a rich future for this statistical 
methodology. Ongoing investigations include algorithms for identifi¬ 
cation, control, and prediction for nonlinear syslems to complement 
methodology of linear systems. 

The present note, which is our first foray into off-line strategics, has 
contributed to the understanding of convergence behavior. We have 
noted that Gurin, Devroye, and others had already investigated off¬ 
line algorithms and established conditions for convergence. Bui to our 
knowledge, we are the first to “tune” these strategies to consider rates 
and trade-off between allocation of times to acquiring and testing new 
samples vrs refining estimates of samples already collected. Clearly, 
algorithms in this note are to be preferred to the on-line methods when 
there is an experimental or simulation phase of system tuning during 
which actual performance is of no concern. The methods offered 
in Section II are optimal, in a minimax rate sense, with respect to 
asymptotic convergence, but they arc practical, and easy to code. 
The memory requirements are small: at time //, one needs only to 
have stored the test points, the sample means, and the sample counts, 
i.c., the triplets {( t{i ). //?,, XS (/)): 1 < / < XP{i) |. Even for a 
huge number of algorithm iterations, and a fairly large decision-space 
dimension, this is a modest demand because .Y P[ ii ) grows at most as 
s/Ti. In many cases, the lime to the final mistake can be seen [18] to 
have a short (ail. In brief, the claim is that these learning algorithms 
are simple, efficient, and plainly feasible. 
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Discrete-Time Observers with 
Random Noises in Dynamic Rlotk 

h A Lyashenko and 1 B Rwsliki 

Abstract —1 he minimum variance state estimation ol Imi u si, USI „ 
discrete-time systems by an observer of reduced-ordti Is must m 
There is an additional random noise with known intensity in tin cl mi )mu 
block of the observer The local optimal reduced-order state isnmator i 
found which takes into account the presence of such noise 1 he « quulious 
of an optimal stationary observer are derived for linear stochustK tum 
invariant system. 

1 Introduction 

We considei the lollowing discrete time lineai system 


f 1 ■ l » +■ ( 1 

(l) 

V “ ( » + v 

(2) 


where i t U 1 / 6 /? are the state and the mcasuiemenl veclois 
respectively 1 and C are matrices of appropriate dimensions 
(rankK ) = m ) the vectors £ and 1 / are zero mean discrete lime 
noise signals that satisfy 

£’U'=H >0 £<///' -1 ^0 0 ) 

We assume also lhal 1 0 is Gaussian with mean and covariance given 
by 

E 10 ~ 7( E( 10 - To )(10 “ 1 c ) 1 = A( (4) 

Furthermore i () £ and// aie mutually independent 
An optimal (minimum variance) estimalt / ol / and Us eo 
vaiiance A arc usually formed by the discrete Kalman Bucy filter 
(KBF) In practice the presence of disregarded errors connected with 
inaccuracy in the model used or with loundoff errors m the computer 
during digital computations can easily lead to entirely unacceptable 
estimates and moreover to divergence of KBI |1] |1] In (101 such 
errois were modeled by random disturbances in the filter An optimal 
estimator was constiucted which takes into account the influence 
ol this additional noise In some cases it enables us lo mipiove 
essentially the quality ol the estimates obrained 

The present note that deals with similai investigation of reduced 
Older observer (ROO) with additional noise in ihe dynamic block 
goes along with vast tcscarch into robust estimation |4( [5] It turns 
out that the estimates of ROO may be much better than iht full ordei 
filter s ones 

This note is structured as follows In Scclion II wc design 
ohsei vers lor three different situations ROO 1—there is no additional 
noise ROO 2—the presence of noise in the dynamic block is ignored, 
ROO 1—the influence of such noise is taken into account by seeking 
out the parameters of an optimal observer The erior equations aic 
given foi all of these estimators In Section III we consider the design 
ol an optimal stationary observer with additional noise tor lime 
invariant systems The equations for its paiameters are picsented 
vSection IV contains an example that demonstrates the stability of 
ROO in the presence of noise in the dynamic block 
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II. Reduced-Order Observers with where i ?, 1 is (n - r »0 x and Rf is (n - mm) x (m - mi). 

Additional Noises and Their Errors So, if we know A, then the error matrix A,+i is entirely defined by 

Define a reduced-order observer for system (J) as a pair of dynamic = W+i : ^h-i] an< ^ satisfies the equation 

Mock 

1 Ai+i = 1(A,. 7?,-j-i) -I- S,-n 

= F,z, + D,y, 4- ia-h (5) 


and connection 


= P*h + Riih- 


where 

*,+i(A. m = Un - rin,(A)(;„ - no 7 + nv, +[ n' , 


o n 
o Q ,+1 


Mere i, € R' , .r, € i?" are the estimation vectors for 7>, and rj q "I [0 0 ’ 

.r M respectively: T,, P,, D,. P ( , 7?, are matrices of appropriate /p jp ■ + l = q y • 

dimensions; m, € P 7 ‘ is a zero mean discrete-time random signal J 

with covariance £V,i •*? = Q l% that is independent of other system An optimal estimator is derived from the following problem: 

wises. 

Case v, = 0 has been completely investigated in [6]. We assume 7 — tr4» l+ i(A,. 7 ?,+i ) — ♦ min. 

as usual that parameters of ROO satisfy the Huddle structure +l 


J = tr^ t .f i(A,, P,+i) —► min. 


F, = r (+ i.4 f P f , D, = T'+tA.R, (7) 

and the limitation on precision 

FT, + 7?,C = J„. (8) 

With respect to (7), (8), the error matrix A, =■ E{. r, - Jr,)( x, -.r, ) T 
3f estimate j*, is given by the recursive relation 

-X+i = p (+1 r, H n,(A 4 )r (+1 p l+1 -+- t 

+ r,+,g, + iP r + a (9) 

where 

n,(A) = .4,^ 

Let the first m i components of measurement vector (2) be disturbed 
by comparatively small noises. These measurements would be used 
for reducing the order of an observer to r = n — in i. We assume also, 
without loss of generality, that system (1), (2) has the observation 
canonic form 


Let there be no noise in the dynamic block of ROO (v, — 0). 
Then the parameters of an optimal observer are defined as follows: 

j?;; 1 , = ♦,+,( a!"). (i3) 


/ \(n\ \ (i) \ 

Wi+1 (-j, ). A {) = Ao 


fi(A) = [n? 1 !A): n? a (A)r ,, ](cn # (A)r l 4 r,+,) + . ( i5) 


1 1 

+1 _ 

U! 1 

+ >. 

.4f 

ul - .r! 

l + v.‘ 


Here, .rj, y) . . i/, 1 are in j -dimensional vectors; .rf , (; are (n — rr# i )- 

dimensional vectors and t/f. iff are (;// - ni|) vectors. The matrices 
.4,. C,. and Y are partitioned according to 


_ u" .4 ( i -n _ r/ fM1 0 ] r _ \v, u vr 

Al “ a" Af J ' Cl ~ c ’ 0 c ‘ ' “ r al r; 2a 


respectively, where A] 1 . V / 1 are ni\ x /1/1: is nil x (n - im ): 

Af 1 is (11 — in 1 ) x /n1; Af~ is ( h — in 1) x (n — ru \ r. C — j : 0 ] 

is (m - in j) x (11 — mm): Y , l ‘ 2 ~ Y ? 1 is Mi| x (mi - mm); V 2 ' 2 
is (rn - Mi]) x (ru - in 1). 

Since 1 ;/ is small, we adopt .rj ~ y! that entails (see (6)-(8)) the 
following: 

jy — ^ . t> _ ^'"1 ^ 

L 7 __ * n) Rf ■ 


l ^> 11 

*• = 1 *;- J» 


$!' = r ( ". 4>, 12 = [1 * 11 : r, l2 ]^f (A). 


4’if 1 = . 


= n'f*,(A)-[n?i 1 (A): n 2 -,(A)r 7 ]4>,'(A). 

Similar equations were derived in [6). The observer which satisfies 
(13H16)< will be denoted ROO-1. The estimate .i\'\ of ROO-1 has 
the error A[ ]) . 

Now, wc consider the estimator (5), (6) with disturbances v, ^ f). 
Let the matrix /?,+! be given as before by (13MI6); = R]+ r 

Such construction will be called ROO-2 and its estimate will be 
designated as The matrix of real error A ( ( 2) is connected with 
the expected error A ( t n according to 

S\‘ 2) = A ( , n 4- G ,. (17) 

The additive matrix G\, caused by noise />,. has the following 
structure: 

ro i) 1 


T, = [-/?,' ; - n;C), 


where (11 — m 1 ) x (11 — m 1 ) matrix G, is calculated from the recurrent 
equation 

1 R, + iCI, P/41 4 " Qi 41» Go = 0. ( 18 ) 

d h 1 = {In - n\"c)A„ 

Obviously, ROO-2 is not optimal because we ignore the information 
about the additional noise when we choose the coefficient 7?,+|. 
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ROO T that optimally uses such information for the selection /? +i 
satisfies 

« ( +, = * +,(A (i ’) (ig> 

A ( ;’, =4> n (A M, )+S H Aj,' 1 - An (20) 

The next lemma proves the optimum of ROO I and ROO T 
Lemma Let 7? be an arbitrary n x m matrix that satishes (II) 
Then 

a,,</?) = a 1 + (/?-/?';’ >[fn (ak ' T ij 

(/?-/?'{’,)' +1 n (7? ) (21) 

where 


f\i = n Vi 1 'O' 1 I. = 0 

ThL proof follows dnettly fiom the iclalions (12) (IS) (16) (19) 
and (20) This lemma is lorreci foi ROO 1 when o - 0 

Non ROO 1 T art only local optimal observers Actually the 
values t)l 7? +1 aie chosen step by step independently to minimi/e 
/ = trA i with given A It may be easily shown that the joint 
selection of matrices 7?i /? /? +i enables us to obtain a smaller 

value of tr \ + 1 We remark that both pioeeduies yield the same 
lesull foi an obsciver of full older (in this ease i/ij = 0 7 — 0) 


III S I AT ION \RY Rfl)l ( 11) ORDIK OHSIKVI RS 
We considei the following time mvanant system 
i i — 41 ft i 


Ci f // 


wheie 




/ 0 
0 ( 


[i 


= 11 - 0 Lii >/' =] > 0 

and the slalionaiy observer 

i = / 4- Dij + / + 


where 


P = 


• - r + n v 

r i 


II - 


, » 

/?’ II 


( 22 ) 


( 21 ) 


/? = [/?' n ] 


r-=Tir n-T\n 7 -[-/?' t 


1 he constant matrix P is chosen from the set R — {It/(I - IK. ) 1 — 
stable} For any II fc R the sequence ot matrices A (II) from (12) 
has the limit A(/?) — lirn >. A (/?) which satisfies the Lyapunov 
matnx tquation 

a = (/, - m >n(A)(i - nc)' +m /?' + s 


n(A) 4A4 ( +!l (24) 

In such circumstances the problem of the design of an optimal 
stationary ROO may be formulated as follows 

tr A —* mm (2S) 

f\£R 


where A satisfies (24) Lagrange function foi tin sJi , 

H = 7/( A \ 7?) = tr A 

H- tr [(A — (7 -7?f )II( A)(/ IK i + ' \ 


where 


A = 


[A 11 
A 1 


A 1 

A 


\ 

\ 1 


\ 

\ 


1 


J 


are the error matrix of stationary observer and the inatm ul 1 n m L 
multipliers respectively 


ri(A) = 


n 11 
n 1 


n 1 

ii 


Here A 11 \ M fl 11 are dm x mi A 1 — A 1 \ \ ,; 

T1 1 - IT 1 are m , x ( n - m \ ) A \ II ue (n - m,) x 
(ii - w\) The necessaiy conditions of exliemum ue presented by 
the nonlinear system 


A" \ 11 -t) 


a 1 i 11 z? 1 ' - i 1 n J -o (26» 

a - /v 1 (n 1 +i "w 1 ' n l 'n' u \ n 

-in /?'+(/ , - ii ( )ii /( 

+ /.’'n 1 (/ - n ( )’ 

- a ,-/?() n (/ - ii <)'-() = o 

1 + \" - [It 4" - (I , - II C I 4 ']' 

\ [/?' l" (/ - II C ) 1 '] - 0 

\' - [7? 1 t" - (I - Tl ( ) 1 ']' 

\ [/V 1 r - (/ , n ( ) I ] - II (27) 

1 , + V - |/f‘ 4 1 — (/ ,-!?€)* ]' 

\ [/?' 4' - (I - n ( )4 ] - 0 

\ [J/'lll" +1 ") + /? (( II 1 +1 ') -II ] 

4- V ' \ " 0 (28) 

\ [/?'(n' c 1 + \' )+ /? icn ( 1 +4 )-n <'l 

+ \ V - o 

The equations (26) (28) give us the stationary obstivtrs that arc 
analogous to ROO 1 T from Section II I ho ctroi of stationary ROO 

2 is defined from (24) by substitution H { 1 7? (l Section IV 

contains the comparison of stationary observers I 2 T 

Note Tor system (22) as in |9| we may find the limiting (/ —* x ) 
values of the observers from Section II These limiting estimator 
will differ from the corresponding stationary ones of Section 111 Wc 
remark that such difference does not take place when //' = 0 (fust 
m i measurements without noises) and in case ot full order [10| 
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IV. Example 

We consider the following differential system: 

x - A.r(t) + bm (29) 


vhere 



[0 ll 


[ol 


\x l (t)] 

A ~ 

n o 

„ h = 

1 

. .>■(/) = 

..-•-(f) 


= EmiiT) = w-r) 

vith discrete measurements in steady time moments 1,+ t = t, + h 

a, = [1:0].r,. (30) 

"he state vector x, = ,r(/ t ) of continuous-time system (29) satisfies 
;he following discrete system: 


vhere 


•»‘«+L = 


1 h 
(I 1 


1-1 


r 1 i+i 

Ul = /. 


exp[.i(f,+i - t)]IhI£(t) 


(31) 


et. = o. eu! = 


h */3 /i' J /2 

*72 /. 


We compare the estimation precision of Kalman filter [10] 


■t'l+i = 


1 h 
0 1 


x, + 


(f/n i “’ [I • ) + 


"/+1 


and the reduced-order observer (23), (24) 

u = (1 - hr) -i - !J,hr J + /', T j. 


(32) 


(33) 


.r,+i 


.<L+i 

-f + i + r ,</ f +1 



3); The parameters of KBF and ROO are selected with account 
of additional noise. So, we obtain 

, = f - = i(, + v V + \.„ + 4 

/»(7 + /i/3) - V v ■* / 


A- = 


+ li/2 


/>(*, +/,/:jh- r/ 


^ + + ^ (1,y + / ' ) + ttO; + *))■ (36) 


The error behavior of the optimal stationary filter and observer 
with noises depending on the discretization step h of system (29) is 
shown in Fig. I. 

If there is no additional noise then the estimation errors of KBF 
and ROO are the same and are improved by decrease of step 
Jr. n (Sj , 1 ] = ()[h). The presence of noise makes the eslimation 
error of KBF essentially larger when h is small: ( s( / ' M - ()(\/1r). 
The optimization of KBF only lowers the order of vanishing: ^ ( / n — 
0(1///). The errors of ROO-2, 3 keep in the natural monotony on //. 

We can correct KBF-2, 3 by the suitable selection of step // |7]. 
However, the Fig. I demonstrates that this method is less effective 
than the use of corresponding ROO. It may be also shown that the 
employment of a reduced-order observer is more preferable here than 
the known regularization [8] of the Kalman filter. 


V. Conclusion 


which contain the mutually independent noises i >), uf. and v t with 
characteristics Eu} = Euf = Eu, = 0. E{p] ) j = E{u?)* = 
E{u,f = q. We shall take the magnitude <*> = lim,_.^ E(xj - xj) 2 
as a precision test and denote the error of KBF as h /, and the error 
of ROO as K. 

I): Let u] = v 2 = v, = 0 (there is no additional noise). Then the 
optimal parameters of KBF-1 and ROO-1 are equivalent 


In this note, we consider the slate estimation of linear discrete- 
time stochastic system by a reduced-order observer with noise in 
the dynamic block. It turns out that the estimation quality of ROO 
may be much better than the quality of full-order filter. We derived 
the local optimal observer which takes into account the influence of 
additional noise. The optimal stationary observer was designed for 
time-invariant systems. An example of Section IV demonstrates the 
stability of ROO in the presence of noise in the dynamic block. 


k = 


3+\/3 

A(2 + x/3) 





(34) 


2): Let the filter and the observer be disturbed by noise but the 
parameters r and k be chosen as in (34) (the presence of noise is 
ignored). Then 




4\/3-G' 


y ci) .m , «V5 
7 =7 


f 


(35) 
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Nevanlinna-Pick Interpolation Problem 
for Two Frequency Scale Systems 

Hossein M Oloomi 


Abstract — 1 he Nevanlinna-Pick interpolation problem for a parametric 
set ol data is considered Sufficient conditions are obtained that guaran¬ 
tees a two frequency scale solution Decomposition of the problem into 
two Nevanlinna-Pick interpolation pioblcins with smaller data si/e is 
presented An algorithm is developed which computes the solution as the 
result of combining the solutions for these two smaller problems 


another nearby solution which is easiei to obt un Jn , I 
which is obtained as the lesult of solving two smallu Nhu 
problems and is close to the actual solution in sonu m mm f 
This idea will be pursued later in this note 

We refet the interested rcadet to |9| for the dUmu i , n 
tianster functions We will denote the set of all I bS ti msl h n< n 
by T When n(s 0 € T we will denote its slow nu' j isi | , 
functions by // s (s) and a 1 (j>) respectively when ; \u \ a 
also need the definition of strictly hounded leal lum turns *lmh vu 
will give here toi completeness 

Definition ! A function o(s) analytic in lh [*] 0 is s f uri io In 

a strictly bounded real (SBR) function if |n(/^ )| < 1 for all n \\ 

The following lemma will be needed in the sequel 
I tmma I I el «( s &) f 7 have no unstable lost poles ana h tu 
SBR slow and fast transfer functions </ s (s) and u l (;>) respectively 
Then tor sufficiently small </(s f) is an SBR iransler function 

a id 

In [ 10) it has been proven lhal if i/(s O £ T has no unstable 

lost poles and it \\u (s)|M £ and ||i/ / (p)|| x < then for 

sufficiently small t ||n(s Oik < + 0(0 We specialize this 

result to the present situation We have || it (0|U - 0 s < 1 and 
\\u { (p)|U ~ V 1 < 1 Let ti\i -max|ft s H 1 f Then there exists a 
constant A such that foi sufficiently small < || r# (s 0||x ^ M/ + Ar 

When A < II then ||</(s f )|| v l 0 u 4- A # (I / 1 On 

the othei hand, when A > 0 set t* — - 1 ^ s 0 Then foi all 

0 s ' < '* ||«( - 0||> s » /+A<- < Cu+h (' = 1 Heme 

||##( s 0|| x < 1 ie tor sufficienily small f u(s 0 is an SBR 
function QbD 


1 In i kodl c 1 ION 

Many intciesting pioblems of uicuits and systems as well as 
a number ol control pioblcins can be foimulated as the Nevan 
ltnna Pick (NP) interpolation problem [1| Indeed many early at 
tempts to solve the II ^ problem used this inteipolalion technique as 
its mathematical foundation Examples include [2] and |T| The NP 
interpolation problem was also used to solve the robust stabilization 
problem in |4] and |5| A good account of the theory is given in 
|6| [81 We use the veision presented in [4] 

I he NP pmblem considered in this note is different from the 
standard NP pioblcm in a numbei of ways The main difference is in 
the data set The data considcied here is moie general because it is 
paramedic This should he contrasted with the standard case where 
data is a fixed set ol numbers Howcvei only first order singularity 
and analytic pertuibation arc allowed This particular structuie was 
considered since it enables us to seek a two frequency scale solution 
Nonetheless, generalization to higher older singularities is possible 
piovided one allow* multifrequency scale solutions 
Two frequency scale (TFS) transfer functions were first introduced 
in [91 The most notable feature of a TFS transtci function is that it 
can be studied in terms of two lowei order transfer functions referred 
to as the slow transfci function and the fast transfer function It is 
evident that when complexity and high dimensionality is of ma|or 
concern TFS property is a highly useful attribute I his is precisely 
the motivation behind the present note 

Once it is guaranteed that the NP problem yields a TFS solution, 
it certainly makes sense to approximate the solution by constructing 
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The author is with the Department of riectncal Engineering Purdue 
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II Tub INMKPUI \fION PkOIHLM 
Suppose we aie given two sequences ol / complex numbers 


MO MO Mj( o wo 

where each a (e ) and MO is a function of t and analytic at * - 0 In 
addition suppose theic exists an r* 0 such that tot all 0 < * * r* 

R< [o (r)] > 0 and | MO| 1 toi / = 1 / The NP problem 

tor this set of data is one of finding an SBR function i/( s 0 which 
interpolates the data i e 

u(n (f) O - MO / = 1 / 


It is well known that the solution exists (sec |4|) iff the Pick matrix 


PU ) 


Pii(0 fPi (O 

<P I(O *P (O 


wheie 

P\\{ O 


P i (O 


Pui O 


{ 

{ 

{ 


1 - MO <,(0 

<> (O + o,(f) 


1- HOVO 

fit (f ) + (T j(i) 


l - MO *,(0 

(O + 0,(0 


} 

} 

} 


I -1 l> 

y = 1 'i 

i = i h 

;-/,+! I 

i = f, + 1 I 

1=1 1+1 I 


0018 9286/95$04 00 ® IW Ibbb 
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Moreover, the solution «(a. f): = f) can be generated recur- 2) The Slow NP Problem: Consider two sequences of l\ complex 
stvely from the algorithm (see [7]) numbers 

y (H f) . = (f* -Q ; (f))ll H |(h,f) + ^(f)(FA-f rr,(Q) n,\ 

' ' ' (f« + n 7 (f))-|-7) y (r)(f,v-n y (f))i/# +l (.s. f)' 

j = /.--•,/ j -f 1, (1) v s ... j s 


, (* ~ i\j(f))Hj+\(s* f) + f)j{f)(s + 0,(0 ) 

** ' * (s -f rr,(f)) + Ji j (f )(a - i (.s. f) ’ 

j = h.”A ( 2 ) 

where < 1 / 4.1 (*, f) is any SBR function, and the Fenyves array i,,, (r) 
is generated via 

^ i( 0 := i,( 0 , 1 = 


with the property that R[o, s ] > 0 and | Cl < 1 for / = 1. ■ ■ ■./j. We 
seek to find an SBR function u s (a) such that 

« S (n f S ) - / - 1 . 


The solution exists iff 



K(0 + n,(0 H ,(0 - ^,U^ 

K(( ) - a, (/)')(1 - T '(7)T|7f )j ‘ 


1 < J < ' - J < f- L 


and is given by 

s, (j^- n J *)« 1 (X) -h_/^ S (.s + O y S ) 

S : ~ (^+n y s )+^ s Y^-'ny)u / s M ("s)' 


■■■.!. (5) 


*, ,(0, J = 

It is well-known that PfO > 0 iff < 1, V/. j [7]. In 

particular, P(f) > 0 implies |^(0| < 1. Vj. We call this the full- 
order NP problem. Associated with this problem, we consider two 
subproblems. 

I) The Fast NP Problem: Consider two sequences of {I - l\) 
complex numbers 

i i 

n , 1 + , 


i/ s (.s):- , (,s) 

where n+ is any SBR function and 

C.:- C- / - U---./. 


(ft) 


c 


i-M ■ ■ 


(n: + n;)( c,- c,) 


(rO-o;)(l-y f V,) 


1 < ; < / - 1 - 1 , 


with the property that /?r [n I ] > 0 and | l/ | < 1 for / - /| + 1 . ■ ■ ■. /. 
We seek to find an SBR function n 1 (yy) such that 

« f (a; ) = .*,*. j = /, + 1 . 


The solution exists iff 



/ — /1 + 1 ,■■■,/, 

,/ = /t + / 


and is given by 

f, . _ )»|-n(p) + ^ (p + 7»| ) 

,J ' {p + 7 rj ) + />' 

j 


!.■■■.I i+l, O) 


« J (y j ):= «», + i(p) 


(4) 


> 0 iff | *, s J < 1. Ilencc, 7 ,s > l) implies |/y ; s | < 1. 

Wc relate the slow and fast NP problems to the full-order NP 
problem by setting 

(<0- J>- («,(()). J,(0)). j - 1. ••••/,. 

): = («,(«). <,(»)). j = 

It then follows that C,~ I, ,(•))• /O ~ /»,(<)) lor/, j — 1 Ii, 
and i , 7 ( = I, ,(()). /i, = />j(») lor i. j = 1 , 

Our objectives are as follows: 

■ to show that the solvability condition for the full-order problem 
is implied by the solvability conditions for the slow and the fast 
subproblcms; 

• to construct an SBR-TFS transfer function n s/ (.s. 0 form the 
solutions of the slow and fast subproblems which approximates 
the solution i/( .s, f) of the full-order problem arbitrarily closely, 
i.c>, an SBR transfer function m s/ (*. e) G T, such that 


where u|+](y>) is any SBR function and 

i : = ‘li • i ~ 1 1 -f 1. ■ ■ ■ /, 

jt _ i«r +«! H *!, - o 

V y+J : ---;- zzj -;—■ 

K ~ HI “ 1, j *1 ,) 

h + 1 < j < ' “ 1 < y -1- 

f, ,/ 

Pj : = 4 j j * 

> 0 iff |,i' ( | < 1 . Hence, > 0 implies | < 1 . 


II If ( AT, f) — U S7 (*, f)|| = O(f). 

II 11 ^*. 

Since the size of data arc smaller for the slow and the fast problems 
when compared to the full-order problem, it is evident that one gains a 
computational advantage by solving for // s/ (*. 0 instead of n(s, f). 

III. On The Solvability Condition 

In this section, we will show that the NP problem is solvable for the 
full-order problem if it is solvable for the slow and fast subproblems. 

Theorem J: Let P s > 0 and P F > 0. Then there exists an f* > 0 
such that for all f G (0. f*), P(f) > 0. 
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Proof Since Pn(0) = P^ > 0, for sufficiently small f 
Pn(f) > 0 Hence, for sufficiently small f, exists Thus, 

using a well-known result on the similarity translormation 111 J we 
conclude that P(f) is similar to the lowei block triangular matrix 

"Pn (0 0 

★ f[r*A*) - fPi2if)Pn l {r)Pi2( + )] m 

where ★ stands for the block whose value is unimportant in the 
tollowing development Therefore P(t) > 0 itt L(e) = P i(t) - 
f r Pu(f r )P 1 / (^)T^u(f) > 0 But since 1(0) = P[ (0) - P } s 0 
we see that for sufficiently small r T (f) > 0 Hence there exists an 
t * > 0 such that for all f £ (0 f* ), conditions P^ s 0 and P 1 > 0 
imply P(f) >0 Q F D 


IV 1 hf TFS Son n ION 


The main slep towards the construction of </ 1 (s r) is to find 
conditions which ensures a TF S solution to the full order algorithm 
(l)-(2) To this end we mtioducc the following family of transfer 
functions 


..s, , _ m + />!'»I 

I \ s I —/ —/ / S / \ 

n , -p, n , "/(I*-) 


1=1 1 1 + 1 (7) 


«! (•-) 


i'll i ip) + p, 
1 + 


> = i i i 


(«) 


and prove a lemma about them 

lemma 2 Let a , s , i(s) and uj { f |(y/) be two SBR transler func 
tions Then </ s (s) defined by (7) and {])) defined by ( 8 ) are 
SBR translu functions for y - / li 4 1 and j — I\ I 
respectively 

Pi oof Wc piovc by induction that u 1 (j>) is an SBR transfer 
function The pi oof for </, (s) is quite similar and is omitted Wt art 
given that #//, ( j (//) is an SBR transfer function So suppose tt f n (p) 
is also an SBR tiansfer function and consider u 1 (ji) obtained horn 
( 8 ) Certainly i/J (/>) is a proper tational function It is also analytic 
in the closed light half plant since otherwise 1 + p ] t/j +1 (c.) = 0 
lor some (, wuh Th [cj > 0 But then |<#J+|(0| = I > 1 
which leads to a conliadiction in view of the maximum principle To 
tomplelt the pi oof we have by a simple calculation 

, | / , ,|* (l-l/’J-Hl-l"! |(*-)| > 

1 - \u (i+ ) - ---;-~r 

1 1 |i + i>,", 


Since |/i, | < 1 and ln'f |(fa )| < i il lollows lhal a 1 , (]>) is an SBR 
function Q P D 

We now use Lemma 2 to show that the algonthm (lM2) will 
indeed produce a TFS solution provided it is initialized with a 
correct ITS iranslei function 

Them cm 2 Lcl i // +1 (/>) and f/, s i-i ( s ) be two SBR transfer func 
lions used to inmate the algorithms OM4) and (^Mb) icspectively 
where </, s l4 , (s) is generated fiom (7) using an SBR transfer lunction 
i /,+1 (s) as the initial data Let uhi( s f) E l have no unstable 
lost poles and let <//+i(*) and uj +i (p) be its slow and fast transfei 
functions respectively Then the transfer function i/(s O generated 
via the lull ordei algorithm (l)-(2) is an SBR-TFS transfer function 
Pioof By Lemma 1 mn(s 0 is an SBR transfer function 
Therefore from the general theory *) pioduccd by algonthm 
(1)—(2) is also an SBR transfer function It remains to show that 
i/ (s r) is a TFS transfer function I or that, we consider two cases 
Case I —/i 4 1 < j < / We prove by induction By assumption 
i#/ 41 (v <-) £ 7 So suppose Hyfi(s O E T and invoke [9 
Theoiem 2 2 ] to write 


/-h ( s * 1 / n ( r s *) 

dj J+ | ( s f ) 4 s k | l (f S f ) 


(9) 


where 

1) i/i /+ i(i O n. #4 ,(fs O d, /tl (s ,) / 

are polynomials with coefficients anal>tu u u 

2) deg[d, 7 + i(* f)] = deg[d, ;f ,(s 0 )J I 

^ deg(d.> Hiti 1 0 ] = deg Id Hl (^ 0 )) 

4) deg[//i O] < A, fl , 

5) deg [// , + r )] < deg [<L ,+ i(j> O) 

6 ) The constant terms of i# , n (y> O and / , (/ Ul i ltl i, 

7CIO 

Now the first and the last conditions allow us to expand the 
polynomials m (9) as 


"\ J m(s O - in(‘)s* 

J. 0 


"2 ,+j(f S f) - ^ A j + lU){r') k 

/ l 

* 41 

di (i i( s O - /-n ) s ^ 

c ( 


R i 

<h , n ^ ;4 ., (f)(/ s)' 

A I 

So using these expansions wc substitute (9) in ( 1 ) and express 

ui( s e) as 


/ v _ "1 y( s f ) + s* U (fs O 
" /(S d, (s e)+s* d (Ts o 

where h , — A ; f ( and 

o i /(s f) "= (rs — n ,(f ))//1 y+i(s f) 

4- n,{ 4 o,(f ))d I H I (s « ) 


(10) 


(ID 


" ,(<s <) — (f S - 0,(0)/'- ! + i(^ s O 4 4 o 7 (r)) 

d f ) + rs (^i (0 4- Pi ( f ) T)\ ,1 1(0) (12) 


di ;(s 0 = (fs + n,(0)dj / + |( S 0 + /' / (0(fi-d ) (f)) 

«. ,h(- (») + /*,(f)V,‘ ,v,('))'* 41 (H) 


<h ,( fs f ) = ( fS 4- o,(f ))d^ y+ i ((■ S e) 4 - Oj(f )) 

» iJ+l (rs 0 4 <-(77,*+ /?,(<■)v,‘ ; n (o) 04> 

We claim that 

1 ) a i j (s e) n 2 ; (fs f) di y(s f) and d ,(«-s f) arc poly 
normals with coefficients analytic at t = 0 This follows since 
all polynomials in (9) have coefficients analytic at <- - 0 

2) (hi/[tli f)] = dff/[di ; (s ())] —A y To piove this claim 
wc note that il 

then degree of d| (s e) in (13) does not drop to a lower value 
when ( is set to zero That is for sufficiently small«- the claim 
is Hue On the other hand, setting r = 0 in (1) and companng 
the result with (7) we sec by induction that Uj{ s 0 ) — i/ 7 (s) 
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for j —/,■■■./1 + 1. Thus, upon computing u\(n) from (9) 
and substituting the result in (7), we deduce that 

-n'j + * f , 

a-; a’‘7 + ',(0) ' 

But then the propemess of u y s (■*•) implies that u ; s ('x) / x. 
Hence, (15) holds. 

3) rfr// [r /2 ,(j>. f)] — rjrtj [r / 2 ,(/a 0)]. The method of proof for 
this part is similar to the previous pari. That is, we see trom 
(14) that if 

/v;; l (») + /vvf , ,7 l (0) d 6 ) 

then for sufficiently small r, the claim is true. On the other 
hand, computing u t (p/t, t )|,_n from ( 1 ) and comparing it 
with u\ (p) in (3), it follows hy induction that the two transfer 
functions are equal for each j = /, • - *. /i 4 1. Thus, upon 
computing from (9) and substituting the result in (3), 

we deduce that 

+ ^ v,Vi«» 

But the propemess ot u \ ip) implies that (x) ^ 0 . Hence, 
(16) holds. 

4) f/rjfy[»i y («, f)] < kj. This follows easily by examining ( 11 ). 

5) (hp[n 2 ,{p, f)] < d(fi[<h iip- 0]- This follows easily by 
examining ( 12 ). 

6) The constant terms of n 2 j{p . e) and <h j(p - 0 are both /ero. 

This follows by examining (12) and (14), noting that the con¬ 
stant terms of the polynomials » 2 , + i (p. f) and f / 2 O 

are zeros. 

Therefore, by [9, Theorem 2.2], Uj (a. f ) € T, lor j = / ( 41 - ■ ■ ■. /. 
Case 2 —1 < j < /r We prove by induction. By the previous 
step, we know that 1/^41 (s, f) £ T, . So, let us suppose 11 ,+ \ (s, <) £ 
T, and invoke [9, Theorem 2.2] to express u , \ 1 (*. t) as in (9) with 
the six properties as was staled jn Case 1. next, we substitute (9) in 
(2) and express if,( a. 0 as in (10) where A, = k,\i 4 1, and 

»1 ;(*■ «0 = ( S - H,(f ) )ll | J 41 (s. O 

4 P;(0(- s 4 O )(* ))d\ 

4* ).Vj ,+,(r) 4 ,+i(0) 

(17) 

Mj ; (#A, 0 = m y + l(KH, 0 4 P,[t)<h 0 

, Y/+j(« r - s < 0 \ 1 , A 

- 'U(0(---I 

4/^(Oo ; (0 ^ 2 ■ ■■■ ■ - 

(18) 


<h ji*' O = (* 4 f»j(r))d, , + l (.s. f) 

4/>_,(*)(* - n ; (f))ii] f) 

4 f(7T,(r)Z)] j+1 (f) - ,-m(0) 

(19) 

r/ 2 r) *= (l 2 j+\(fs* f) 4 7 >,( 0 » 2 ,+j(< r *. 0 

+ n » ^ 2 - f gj j 4 l (r)) 

- )o,(f) f) - f.vj , +l (f)j. 

( 20 ) 


We claim the following: 

1 ) "i "2 /(cs, f), r/i j(&< f), and (f ‘2 j(fi s. r) are poly¬ 

nomials with coefficients analytic at f = 0. This follows since 
all polynomials in (9) have coefficients analytic at t — 0 . 

2) I j(.s, 0] = </< 0 [di ())] = kf, This is proved by 

noting from (19) that if 

dJ%(0) +P < s A',*77,(0) *0 (21) 


then for sufficiently small f, the claim is true. On the other 
hand, selling r = 0 in (2) and comparing the result with (5), 
we sec by induction that u ; (.s, 0 ) = Uj{s) for j = h . ■ ■ ■. 1 . 
Thus, upon computing u' +l (s) from (9) and substituting ihe 
result in (5), we deduce that 

x of 1 (ii)+?», s A',‘ 

But then the propemess of i^( .s) implies that f/ y s ( x) ^ x. 
Hence, (21) holds. 

3) <l(<j[<h jii>< *■)]■=■ <h(j\di ,(p. 0)]. To prove this, note trom 
( 20 ) that if 

0l%<O) + /C Y.V' ; V,(0)#0 (22) 


then tor sufficiently small t, the claim is true. On the olhci 
hand, computing Uj(p/f , t )|, 0 from ( 2 ) and comparing it with 
</\ {p) in (3), we see by induction that the two transfer functions 
are equal for each / — /j,*■■,1. Thus, upon computing 
ti I ,+\(p) from (9) and substituting the result in (H), we deduce 
that 

/ ^ j V+1 (^) 4 pf D 2 1 i , (0) 

" r r - r 

0 /^( 0 ) 4 Wi(«)) 


But then properness of i/J (.s) implies that //J ( v) ^ x. 
Hence, (22) holds 

4) </i q [if 1 j( s. r)] < kj. This follows easily hy examining (11). 

5) ,{j>. r)] < r U<j[di ,(p. #■)]. This tollows easily by 
examining ( 12 ). 

6 ) The constant terms of n> ,(p, f) and <1 > ,(p* *) are both zero. 
This follows by examining (12) and (14) and noting lhal Ihe 
constant polynomials n 2 1 {p. *) and r/ 2 H 1 (/», r) are zero. 


Therefore, by [9, Theorem 2.2], i/ 7 (.s. f) £ 7, for j = I.- -,/i. 
Combining the two cases, we conclude that 1 / y (s, « ) £ T, tor all j. 
This completes the proof. 

Q.B.D, 

We now present an algorithm for the construction of the TFS-SBR 
transfer function n sh (.s, f). 

Algorithm. 

1 ) Check to see if P r > 0 and P s > 0 . If these conditions 
arc satisfied, proceed to the next step. Otherwise, terminate the 
algorithm. 

2) Choose an SBR transfer function <//+i00 and compute 

from (3). 

3) Choose an SBR transfer function and compute 

»'/ S 1 + l(") from (7). 

4) Use the SBR transfer function it^+ilp) obtained in Step 2 to 
compute uf (p) from ( 8 ). 

5) Use the SBR transfer function m / s 1 + j(*») obtained in Step 3 to 
compute i/i s (*) from (5). 

6 ) Set v J ( p):= u{ (p) and i/ s (s):= a^f.s). 

7) Set ?/ sf (a, r): = u s (,s) 4 - u f [fs) - // s (x). 

We have the following theorem. 
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rheotem ? Let u(s *) be Ihc solution to the full-order NP 
problem (IM2) with the initial data given by Theorem 2 and let 
i/ s/ (* 0 be the transfer Iunction constructed in the alorementioncd 
algorithm Then for sufficiently small f 

1 ) i/ s * (s ?) is a TFS-5BR transfer function, and 

2) ||i/(s O - u 1 (s f)|| x = 0(0 

PrAof By Theorem 2, it(s t) is a TFS-SBR transfer function 
Also it s/ (s f)eT because it has the same slow and fast transfer 
functions as i/(s f) Since both of these slow and last transfer 
functions are SBR functions by l emma 1 u 1 (s f ) is also an SBR 
transfer function This proves part 1 of the theorem Pan 2 follows 
from [9 Theorem 4 1J 

QED 


V CON( I IJSION 

In this note we have considered a version of the NP interpolation 
problem We have obtained conditions which guarantee a IPS solu 
lion We have formulated two smaller NP inteipolation problems and 
have shown that they can be solved in parallel We have reduced the 
solvability condition in terms of the solvability conditions foi these 
two smaller problems The immediate gam here is the reduction in 
the verification piocess and hence the coinputei time We have used 
the solutions obtained from these two smallci problems and have 
constructed a solution which can become arbitrarily close in the /7 X 
norm lo tht solution of the original NP pioblcm The icsults of this 
note should prove useful both from the computational standpoint as 
well as m the areas such as robust stabilization problem lor TFS 
systems 
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Multiscale Smoothing Error Modi! 

Mark R L uettgen and Alan S Wills! \ 


Abstract —A class of multiscale stochastic models 1 i M <) n 
recursive dynamics on trees has recently bicn introduce IP, ( j 
are interesting because they can he used to represent i ( i ( 
of physical phenomena and because they lead lo efficient < it 
(or estimation and likelihood calculation In this paper, \u \ i m,i, 
complete statistical characterisation of the error associated with sm Un d 
estimates of the multiscale stochastic processes described hv these ntu'<i I 
In particular, we show that the smoothing error is itself a niiiltisc \[\ 
stochastic priKess with parameters that can be explicitly ealuilated 

I Introduction 

A class of multiscale models describing stochusiic processes in 
dexed by the nodes of a tiee has recently been introduced in [l| 
[2) These models can be ustd to capture a surprisingly uch class 
of physical phenomena For instance experimental results in |2| 
illustrate that they can be used to model the statistical self similarity 
exhibited by stochastic processes with generalized powei spectra of 
the form 1// 1 and in [4] wc describe how they can be used to 
represent any 1 D Markov piocess or 2 1) Maikov random field 
Moreover this class of models leads to efficient algorithms foi 
estimation and likelihood calculation and as a result provides a useful 
framewoik lor a variety of signal and image processing problems [1) 
PI |4H6| 

Knowledge of the error statistics of smoothed estimates ol such 
processes is essential lor the development ol a number ol important 
new applications including tor instance so called mapping problems 
[7] the multiscale counterpart to the model validation problem in |8] 
and certain oceanogiaphic problems [9] Several such applications 
have been developed in the context ol 1 D Gauss-Markov models 
by exploiting relatively recent results that show that the smoothing 
erroi processes associated with Gauss-Markov modtls are themselves 
Gauss Markov processes 17| |8) [10] 1111 1 In this papci we derive 
a dynamic model foi the smoothing error process associated with mul 
liscale stochastic models In particular wc show that the smoothing 
uioi is itself a multiscale stochastic piocess wilh parameters that 
can be explicitly computed Ihcse results geneializc previous results 
lor Gauss-Markov pioccsses since these processes coricspond to a 
degeneiate form of the multiscale models and provide the necessary 
framework for applications such as those mentioned above 

This paper is organized as follows In Section II we briefly review 
the class of multiscale stochastic models ol intciesl here and ihc 
scale recursive estimation algorithm associated with them In Section 
III we derive a multiscale model lor the smoothing eiror process 
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'More generally Levy a al |12] have recently shown that the smooth 
mg error processes associated with the class of Gaussian reciprocal pro 
cesses which contains the class of Gauss-Markov processes art themselves 
Gaussian icciprocal See also [ 14J for similar results conespondmg to 2 D 
Gauss-Markov random fields 
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Fig. 1. Mulliscalc stochastic processes arc indexed by a f/lh-ordcr tree. The 
parent of a node .* on the tree is denoted and its q offspring are denoted 

A-n i. ■ • ■, Mk ir 


II. Multiscale Stochastic Modeling and OptimalEstimation 

The models presented in this section describe niultiscale Gaussian 
stochastic processes indexed by nodes on a tree. A r/th order tree is 
a pyramidal structure of nodes connected such that each node of the 
tree has q offspring (sec Fig. I). We denote nodes on the tree with an 
abstract index .s, and define an upward (tine-to-coarse) shift operator 
T such that is the parent of node a. We also define a corresponding 

set of downward shift operators a j.a,, such that st\ i. • • •, .so,, 

are the offspring of node .s. In addition, we denote the set of nodes 
on the tree as 7 and the set of nodes that includes node .•* and all 
of its descendants as 7,. i.e., 7 S = {<r \a = s or a is a descendant 
of .s }. Also, the complement of 7* is denoted 7*. The staiislical 
characterization of the model state .r(.s) € R" is then given by 

.r(«) - A(*).r(*T) + £(*)«•(*) (1) 

under the assumptions that .r(0) ~ A - (0. 7(0)). »’(.s) ~ 

A'(0, /). A(s) and D{*) are matrices of appropriate size, and 
« - 0 is the root node at the top of the tree. The driving noise 

ir(tf) € R n> is white, i.e., </ (*) and ir(r r) are independent if 

h ^ a , and independent of the initial condition .r(0). The class 
of models (1) has a statistical structure that can be exploited lo 
develop efficient signal processing algorithms. In particular, noie 
that any given node on the (/th-order tree can be viewed as a 
boundary between q + 1 subsets of nodes (<y corresponding to 
paths leading towards offspring and one corresponding to a path 
leading towards a parent). An important property of the model 
(I) is that, conditioned on the value of the state at any node, the 
values of the state corresponding to the q -I- 1 subsets of nodes are 
independent. This fact is the basis for the development in |1], |2) 
of an algorithm for computing smoothed estimates of ,r(s) based on 
noisy measurements ,/y( a ) € R r of the form 


for every * at this finest scale we set .r(/»|V, + ) to zero and P^V/) 
to the solution at the finest level of the tree of the Lyapunov equation: 

P(s) = A(x)P(n)A‘ (*) + D(s)B r (») (3) 

where P[.s) denotes the covariance of the process ,r(*) at node *. 
Suppose then that we have .r(.‘?|>7 1 ’) and ) at a given node 

h. This estimate is updated to incorporate the measurement </(«) 
according to the following: 

V) = .r(«|V s + ) + K {s)[i/{x) - C( s).v( «|V+ )] (4) 

P(s\y„) = [i- h-(s)C(«)]P(s\r?) ( 5 ) 

where h ( .■>) = / > (s|VJ 4 )C’ («)[C‘(*')/■’(«|1 JC’ f (*) + /?(.s)] _l . 

Suppose next that we have the updated estimates .r(*n,|V„ 0l ) at 
all of the immediate descendants of node s. The next step involves 
the use of these estimates to predict ,r(w) at the next coarser scale, 

i. e., to compute .i , (w|V.,« I ). Using the following upward model for 
the multiscale process [1|, [2J: 

F(s).r(s) + 7F(.s) (6) 


with the measurement equation again given by (2), and where 
F(s) = P(s^)A l 1 and E[tr{ n)tr 1 (.s)] = P(s^) - 

Pia^A 1 {s)P(,s)~ l A(s)P{s^)) = Q{.s). we compute the line-lo- 
coar.se pic dieted estimates: 

J‘(«|V, ll4 )= F(st\,)Sist\,\y sni ) (7) 

P(b\Y MI , t ) = r(*n, )P(*(\, |V„„ ( )F T {. sn,) + (j(b (\,). (8) 


The estimates .r(.s|V„ ni ),/ = 1. ■ ■ ■ ,q are then merged to obtain 


l 

•/ 

(1 - q)P(s) 1 + '('ll-,.,) 

I -1 


p(*\y:) = 


( 10 ) 


We assume here that P{s) and P{s^\Y s ) are invertible for all » so 
that the upward model given by (6) and the merge operation given 
by (9), (10) are well-defined. As discussed at the end of the next 
section, this restriction can be removed. 

The recursion given by the update, predict and merge equations 
proceeds up the tree until one obtains the smoothed estimate of the 
root node, .r(0|Vu). This estimate initializes a downward sweep in 
which r(.s|V 0 ) is computed according lo 


■i(s|Vo) = .r(.s|K) + ./(.s)[.r(.s 2 ; |Vo) - .r(js-)‘|l \)] (11) 

P(*|Vo) = P{s\YA + J(s)[P{.^\y 0 ) - P{s-\y,)\J' (s) ( 12 ) 
./(*)= P(x\\\)F , (s)P-'(x-- l \VA. (13) 


//(«) = C(.s).r(.s) +<■(«) (2) 

where <•(*) ~ A'(0. /?(*)), and is independent of both ir(n) 
and .i‘(0). The algorithm for computing the smoothed estimates 
of .i‘(*) is a generalization to //th-ordcr trees of the well-known 
Rauch-Tung-Striebel algorithm for smoothing 1-D Gauss-Markov 
processes. We briefly review the multiscale smoothing algorithm next, 
and then derive a model for the error associated svith the smoothed 
estimates. 

We denote the set of states defined at nodes in 7* as .Y„, i.e., 
A\ - {f (rr)|rr € 7.}. and similarly V, = {,y(rr)|<r € 7.}- I'he 
set of measurements in the subtree strictly below * is denoted V/. 
i.e., Y? - {ill *)\v is a descendant of *}. We also define j-(*|V) as 
the expected value of -r(.s) given measurements in the set V and the 
corresponding error covariance as 7(*|V). 

The upward sweep of the smoothing algorithm begins with the ini¬ 
tialization of x{*\Y ?) and Pi ) at the finest level. In particular, 


Note that (12) characterizes the smoothing error covariance at any 
given lattice site s. but docs not provide information about the 
correlation structure of the error process. The goal in the next section 
is to provide a mulliscale model for the smoothing error process, 
i.e., to show that the error satisfies a recursion of the form (1), and 
to calculate the associated model parameters. This then provides the 
complete statistical characterization of the smoothing error that we 
seek. 

III. Mijltiscale Smoothing Error Models 

Given two nodes .s and o- £ 7,‘ on the tree, wc can represent .v(<t) 
in terms of r(.*T) and an additive noise term 

= + ( ,4 ) 

with y*rr ^ independent of the set of states .r(*T) U A\ and the 
corresponding set of measurements s/(«T) by tracing a path 
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from (T to and using the upward dynamics (6) and downward 
dynamics (1) to eliminate state variables along the way The state 
transition matrix in this construction is a function of the 

upward and downward prediction matrices A and F along the path 
whereas yv ^ is a linear function of the upward and downward 
driving noises u and 7T Since T „ is independent of the set ol 
states i (*“) U \ , and the measurement </( s ~) U 1 wc have that 
»(<r|K) = n /(*~|K) which using (14) implies that 

= ) + V(T - (15) 

where we have defined the cnor in # (s|l ) as i (s|T ) = # (s) — 

i(h|} ) As a result we see that r (s|T ) has the Markov property 

B{i(s|i )|«M^ ) o e T } 

= E{i(s|K)|,(,-|) ) {y K e 7, }} 

- )|i(h-|i )} + £?(i(s|i )|{y Kern 

— E{ f ( a| 1 )| < (s \\ )} (16) 

The hrst equality in (16) iollows from (15) the second from the 
orthogonality of Y to i(s ) and ) and the last Irom the 

orthogonality ol Y to i(s) and 1 Now using the upward 

dynamics (6) the upward sweep prediction equation (7) and standaid 
lineal least squares formulas wc can write 

i (s|l )= /(n)i(s-|1 ) + m(s) (17) 


where T{ s) is given hy (M) and wheie from (16) u (s) is indtpui 
dent of {/ ((T|s)|^7 € T | and has covaiianu 

P(s|i )-P(s|i )F'{s)i> '(s |y )Ks)/»(s|i ) (IS) 

Next note that the independence of */ (s) and {i (rr |} )|rr G I } 
implies that <r(s) is also independent ol the residual information 
about i( s) that is contained in the set of all available measurements 
)( but not cont lined in 1 In particulai at each node in T a 
icsidual component / (<t) that is orthogonil to the measuiemcnts in 


the set 1 can be defined as 

i (a) = i/(a) - E{ii((t)\\ } 

= ( (<t)i(it|) ) + i (a) (19) 

Denoting / — {/ (tt)|<t g T } it is clear that span 

) — span p / } that // 1 1 and that i I u(s) Taking the 
expected value of both sides of (17) conditioned on / we obtain 

E{i(s|> )|l' } = /(s)E{l(s-|l )|l \ (20) 

Finally noting that 

j(‘>|1j)= l ( s|T )+/?{l(s|T )\o } (21) 

and then subtracting (20) from (17) results in 

r (s|l 0 ) = 7 (*)iM1 0 ) + ii(s) (22 ) 


which is a mulliscale model for the smoothing error ol precisely the 
same foim as (1) 

This model is, of course consistent with the error covariance 
computation in (12) In particular using the Lyapunov equation for 
(22) we obtain 

P(»M= /(s)P(--|i«,)/ 7 (s) + P(s|y ) 

= P(f|K)+ /(«)[P(s7|l 0 )-P(s-|l„)]J r (,) (2V 

In addition, on first-order trees, the model (1) reduces to a standard 
Gauss-Markov model and hence (22) generalizes to r/lh order trees 
the corresponding 1 D time series result The derivation here is 


related to but is in fact substantially simpler ihm t!i 
based on backwards prediction error models in |S| 

Finally we note that it is possible to derive a multi i il 
error model without assuming invertibilitv ol I s m 
We refer the readci to Appendix D of [14| loi a itlu K \ 
of a likelihood calculation algonthm for (h (2) ih u ail * „ 

deficient P{ s) and P( ) A slighi \ anation of the u fi 
in that derivation can be used to show that a muliisc tie i 
error model allowing for rank deficient P( s) and / (s |) L , n h 
wntten precisely as in (22) but with the gain l(s) jj VU , h> 

/<•>) = r(s|iar'(s)/- t (s |> , 

I{s) = P(h ) 4 7 (s)s)^ (2S) 

and the covanancc of </(s) given by 

PHi )-u»\s )P 7 (s)p f (s |i,)r<s)p(s|i ) (26) 

where the superscript f denotes the Mooie Penrose pseudo 
invcisc [15] 
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A Generalized Popov-Belevitch-Hautus 
Test of Observability 

Bijoy K. Ghosh and Joachim Rosenthal 


Abstract —In this paper, an earlier result on the problem of observabil¬ 
ity of a linear dynamical system due to Popov-Belevltch-Hautus has been 
generalized and applied to the problem of observing the Initial condition 
of a linear dynamical system described on the space of d dimensional 
affine planes in R n . 


Let us assume that the points on the plane (1.4) satisfy a dynamical 
system 

\ =A\+b (1.5) 

where .-1 is an arbitrary 3x3 matrix and hi s a 3x1 vector given by 


/"II 

"12 

"n \ 


.4 = 17 21 

"22 

"21 

(1.6) 

\"ll 

"12 

".n / 



h = (hi h') In) 1 , (1.7) 


1. Introduction and Motivation 

In this paper we generalize the well known Popov-Belevitch- 
Hautus test (see [3|) on the observability of a linear dynamical system. 
Let K denote either the field of real (K = R) or the field of complex 
(K = <C) numbers. Let .4 be an // x n matrix and lei C be a p x n 
matrix defined over K. Consider the linear time invariant system 

i = Ar. a = C\r, .r 6 K '\y C K ; \ (1.1) 

The well-known Hautus test |3| gives a necessary and sufficient 
condition, when the state vector ,r(t) can be observed from the oulpul 
measurement y(t). To be precise one has the following. 

Theorem I (Hautus 13]): System (LI) is observable over either R 
or C if, and only if 


respectively, and where \ is given by 

v = (V v Z)'. 

One can compute a dynamical system for the shape parameters p. </. i\ 
and a described as follows: 


d 

(p \ 
'1 

- (~ A ' 

(A 

(p \ 
<1 

Jt 

— S 

-U 1 


— ,S 


) 



'■ / 


Typically a point on the plane (1.4) is observed with the aid of 
a CCD camera that projects (A\ I. Z) perspectively onto an image 
plane. Let ( j/i , // 2 ) be the coordinates of the image plane and assume 
that the perspective projection is defined as 


rank 


.4 - A/ 

C 


n. for all A 6 C. 


0 . 2 ) 


'/1 


JX 

z+r 



(1.9) 


It may be remarked that the rank can only be less than n if A is an 
eigenvalue of the matrix A. 

In this paper we consider dynamical systems for which the output 
vector is not observed exactly but can be ascertained with an ambigu¬ 
ity restricted to a <7—dimensional affine subsapee. The problem that 
we propose to consider is to compute if possible the initial condition 
and hence the states of the dynamical system up to a d— dimensional 
affine subspace. Thus for the dynamical system (LI), if we assume 
that the output vector //(t) is observed up to a ^/-dimensional plane 
given by an equation of the form 

W)v(t) =■ n(t) G.3) 

where 0(f) is a (p - d) x n matrix function of time having full rank 
for all bul countably many instances of time and //(/) is a vector 
function of time. The problem is to derive conditions on .4 and C 
under which .r(0) can be observed up to a d —dimensional plane. 

The above class of problem occurs in machine vision as has already 
been introduced in 16], 11). Specifically if we consider a plane in R l 
with coordinates (A\ W Z) given by 

sZ = \>X + (jY + r. (1.4) 
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where f is the focal length of the camera. One can easily compute a 
differential equation that is satistied by the coordinates (;/i. ijj) and 
is given by 


</•)'/! “ ,</l + l/U/l + </!»/_' + (ih >h + !h*l i'/.O 

thifr = 1/2 + .Vf.'/j + + jilhiji + V 2 ) (1.10) 


where 


fm\ 


/ 0 
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f" 11 > 
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i«i 1 

III 


-h'l 
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h\ - /"j 1 

"11 — " 11 
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-/" 12 
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~/"2 1 

"21 

J/fi 
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hi - fa2 2 

"22 — " H 

</7 


~h\ 
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" 11 

Ih 
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-h\ 

“/"V2 

" 12 
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V 0 
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-/ 

1 / 


tv \ 
'/ 
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(i.u) 


and where 

b' = (ft, bt-uuf. ft, -«,,/) = (ft',. I,!,. b\). (1.12) 


The equation (1.10) is known as the “optical flow” and in the literature 
various algorithms exist as to how one can estimate (;/i. //■)) for a 
given pair 0/i. t/ 2 ) (sec Horn |4)). 

For our purposes we would like to view (1.8), (1.11) as a linear 
system for which the output vector t/j is not observed but instead one 
observes the vector (//j. i )>, i/j, >/■_>) at various points on the image 
plane. Note that for almost every point on the image plane, (1.10) 
describes a homogeneous seven-dimensional plane in R'*. Thus if 
one observes ( 1 ) 1 . // 2 , 1 / 1 . 1 ) 2 ) for 3 points on the image plane, the 
output vector in (1.11) is observed up to a homogeneous 3-plane. 


(X) 18-9286/95$04.00 © 1995 IEEE 
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On the other hand, it 4 points arc observed the output vector in 
(1 11) is observed up to a homogeneous line Various othei cases 
can be demonstiated likewise Note in particulai that by observing 
the vector (//]. //j i/i 7 j) for 1 or 2 points on the image plane, 
the output vector is observed up to respectively a seven- or hve 
dimensional plane in R" 1 Such an observation does not shed any 
new information on the vector (p //, s ;) Jn practice if the vector 
(p (/, * i ) is recovered only up to a '/-dimensional plane where 
(I > 1 one would typically use multiple cameras to determine the 
exact value of (p q s ;) 


II Problim Formulation and Main Rfsuit 

In order to introduce the mam result consideied in this paper 
let Pi) C K be a '/-dimensional affine subspace not necessarily 
passing through the ongin In this paper we shall use the expiession 
'/—dimensional atfine subspacc 1 to mean a ‘ '/-dimensional plane ’ 
We say that the dynamical system (11) obseives P t > if tor any 
0 < t\ < / it is possible to calculate Pn from the observation 
ot the moving plane ( P{i) = C ' w /h in K' /j < / < t Our 
main theorem is described as follows 

rhioum 2 (Main Ihtonm) System (1 I) observes any '/— 
dimensional affine subspace Pn m K if for any set of eigenvalues 
\ ( \ / ol 4 one has 


(4- V/) ( 1-A,/) 

( 


Moreover this condition is also necessary if d — 0 or il all eigenvalues 
ol the matnx 4 aie in K 

Rtmwk 1 Note that over the complex nuinbeis C condition (2 1) 
is necessaiy and sufficient Moieovci if '/ — 0 Theorem 2 reduces 
to Theorem 1 Finally it '/ — 1 this lesuli implies the one given in 
16] due to Wang Marlin Dayawansa and Ghosh 

I he following two examples explain the ingredienis of our lesuli 
r\umpU 4 C onsider the real sysiem 


" 1 y 

0 I )) 


Remark 5 Note that the equality ol the iwo sci 
tor all but possibly a finite numbei ol time mst 
correctly conclude from this that in principle obs 
ascertained on the basis of these finitely tnanv i 
However wc would still like to say thai the luus 
unobservable on the basis ol any arbitrary lime intu 
Example 6 Consider the fourth order svsiem 
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Because the eigenvalues of 4 are leal condition (2 1) in Theorem 2 
is necessaiy and sufficient In particulai if E R is a point it can 
be obseived from </(/) = C ' because 

‘-\ 1 

lank 0 -A = 2 (2 4) 

1 0 _ 

tor all \ G R including the case when \ is an eigenvalue of 
4 However il /o C R* is a line, it cannot be observed trom 
J(t) = ( ( u / ( because for every pan of lines / () and U in R 2 and 
for all but a finite number ot tune instants t we have 

k t b 6 /n 1 = ft e = Ct "b b e M (2 4) 

Thus the lines / 0 and /| are both mapped to the entire real axis and 
therefore they cannot be observed We also note that 


™An — \j 

A n \i ^2 
0 

for every pair of eigenvalues A 0 Ajof l (2 S) 


1 0 0 () 
0 1 0 0 
0 0 I 0 


A direct compulation shows that the pair (4 ( ) is obscivjhlc and 
the matnx 4 has eigenvalues 0 12 4 Since loi any 2 ugcnv ilues 
An A i the nullspacc of ( 4 - A t) /)( 4 - \|7) is equal to the sum 
ot the eigenspaces Ker( 4 - \i J) and Kei ( 4 - \i/) and none ol 
those sums contains the vector (0 0 0 l) 1 £ Ker(( ) it follows 
from Theorem 2 that if /« C R 1 is a line in R l it can he obseived 
fiom ( 1{t) = C ( "in which is a motion ot lines in R 1 On the oilier 
hand one immediately verifies that 


l( !-/)( 1-2/) 


It therefore lollows that certain two planes Pn C R 1 cannot l>c 
observed from (< u Pc C R 4 Specifically considei the veclors 
in = (-12 20 8 2b) 1 m = (44 -GO -24 -84)' and i - 
(-24 40 17 4s) y One immediately verities that < ) i] <2 tire 
eigenvectors loi icsponding to the eigenvalues 0 I and 2 Also note 
that in -f 11 + 1 - (0 II 0 1)' Let V be the three dimensional 

subspace in R 4 spanned by the veclors i ( i\ and 1 It can be 
verified that for all bul a limle set of values ol / ( < W P is a two 
dimensional plane in R 4 lo sec this note that ( < j ( « x '• 
tor 1 = 0 J 2 where A, is the eigenvalue conespondmg lo the 
eigenvector i y Since ( ii Ci\ C i are linearly dependent it 
follows that ( ' l 'io ( ( Xl n ( < s, i) are hneaily dependent as 
well Thus for any 0 <*. t[ < t and tor almost every pair 
of two dimensional planes Q\ and Q such that Q\ ^ Q and 
Q\ C P (h C P we have 

Ct u Qx - C ( U Q. 

toi /1 < t <t Hence the planes Q \ and Qi cannot be observed 


III An Associatfd Dynamical System 

The proof of Theorem 2 will be broken down in a sequence of 
lemmas The proof is mainly based on a careful study of a djnannial 
system defined on the K vector space /\ k K the A-fold wedge 
product of K 1 (see [2] for a reference) This system has also been 
used in [6] lo derive the results there 
First recall that /\* K' is linearly generated by the vectors 

{ 1 1 A A u |i, 6 K 1 / = 1 U 

Addition in f\ h IK" is multilinear and alternating in the components 
It {' 1 r, } is a basis of K then it follows from the multilineanty 
and the alternating property of the wedge product that 


In lact for Ao = Ai =0 rank drops to 1 


B *= {< 


A r IJL |1 < 1 1 < < n < » I 
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m 


is a basis of /\ K K”. In particular one has dim /\ k K tl = ^ ^. If 

a vectoT r 6 /\* K" has a representation r = n A ■ ■ - A ,r* for 
some particular vectors r, C K". / = I, • * ■,A\ one says that r is a 
decomposable vector. The coordinates of a decomposable vector with 
respect to the canonical basis ti are sometimes called the Pliicker 
coordinates of r. 

Next define linear maps 

• 4 : A k "-A k " 

A 

•i'\ A • ■ ■ A J‘a i A ■ ■ * A .r,~ i A A.r, A . r, ^ i ■ ■ ■ A .rrf3.1) 

(_i 

and 

c -k K "-A K '' 

J‘\ A * * • A .r a ►-» CV| A - - • A C'.i* - (3.2) 

J and C' induce the dynamical system 

A=.U\ V CY. (3.3) 

The state space of (3.3) is the vector space A* K" and the output 
space is the vector space /\ k K r . We would like to remark that if the 
trajectories ( \r i (/),-■-. C (t) are solutions of the system (1.1) then 
C(,ri (f) A ■ • • A.i‘a(M) is a solution of system (3.3). It is our goal to 
show that, provided the eigenvalues of are in K. (2.1) is equivalent 
to a particular notion of observability of the system (3.3) and that this 
condition is also necessary and sufficient for ihe observability of I\t 
under the output function ('r ' 1 f\,. The following lemmas prepare 
for this result. 

Lemma 7: The (unique) solution of the initial value problem 
~ Ui(M A-- - A.n(f)) = AUlit) A-- - A .!■*(/)) 

at 

.ri (0) A-- - A .»■*(()) = r, A ■ ■ ■ A i’a 
is given through 

.i|(/)A — A.a(f> = < "r , A---A. "a. (3.4) 

Proof: Differentiate (3.4) and recall the definition of A.Q.E.D. 
Lemma X- Let r i. f a G K m be vectors and 

cj. Oj, ■ ■ *»( a G K be scalars. Let i ^ i‘i + o + + a. 

Then u tallows that 

A 

(A ~ rl )(.# i A .rj A ■ • • A ./'a ) i A ■ • • A .r,_i 

1 

A {A - r, J ).r, A .i*h i ■ ■ ■ A .i\. 

(3.5) 


Note that 

(.1 - rj)(.r\ A.rj A ■ • ■ A .i*a ) 

- (-4./*, A .i'j A ■ ■ ■ A .i’a ) 

- r i (.rt A .rj ■ ■ • A .a) 

4- (-*’i A . 1.1*2 A ■ ■ ■ A r a ) 

- cj(j*i A .1*2 • ■ ■ A j * ) 

+ ... 


(ri A .r-j A ■ ■ • A ) - a (.»‘l A .i\> • • • A .i* ) 

k 

i A ■ • ■ A .r, - 1 A (.4 — r,f) 

* A j‘ ( • ■ • A .fA. Q.E.D. 


Lemma 9: Let {.rj, ■, .r w J C K" be a K-basis of generalized 

eigenvectors of the matrix .4 having corresponding eigenvalues 
{At. ■ * ■. A„ } (possibly repeated) then 

A • ■ ■ A ,r, t |1 < / ) < < >k < n | (3.6) 

is a K—bus is of generalized eigenvectors of the matrix .4 with 
corresponding eigenvalues A,, 4* ■ - * 4* A u . 

Proof: Clearly the set of vectors (3.6) are linearly independent 
and therefore form a basis. Assume that the vectors ■ ■ ■ .v, k have 
a nilpotency index ///,, i.e., 

(.4 - ,\ u ;'.r (4 ^0. (.4 — A,, /)'"'* -r,, - 0. (3.7) 

In particular, if m, k = 1 it follows that ,r, k is an eigenvector with 
A f/ being the corresponding eigenvalue. Let us define 

(j = m ( , 4-f m,, 4- 1 - A* (3.S) 

it is trivial to verify using Lemma H that 

{A - (A,, +--- + A , k )l) ,, .r li A---A.i (a -fl. (3.9) 

OLD. 

Lemma 10' Let .V be a nilpotenl operator acting on K 7 . For every 
vector v G K 7 there is a unique u G K 7 such that 

/ - u + X u + ■' - + X' 1 ii. 

Moreover if m is the nilpotency index of r then { ii. ■ ■ ■. .V" 1 u } 
are linearly independent. 

Proof ■ The unique vector n is given through ir. = ( / - Y)r. 
The linear independence is clear. Q.E.D. 

Bel ore we state the next result we note the following. 

Remark II. Note that not every vector m the vector space /\ A K" 
is of the form .rj A ■ ■ ■ A .r A and those that arc, would be known as 
decomposable vectors. 

The next result establishes the crucial relation between the observ¬ 
ability of the pair (.4, C) and the condition (2.1). 

Proposition 12. Assume that the eigenvalues ot the matrix 1 aie 
in K. Then the following conditions are equivalent: 

I )There arc eigenvalues A, A,, of .4 and a nonzero vector 
< G K" such that 

2)There is n A G K and a decomposable vector U A ■ ■ ■ A ^ 6 
f\ k K" such that 



3)Thc dynamical system (3.3) has a decomposable vector <\i A 
■■■Ann G /\ k K n in its unobservable subspace. 

Proof: 1 )-+2): Let Ai. • ■ •. A, be the eigenvalues of .4 and lei 

K" =r ■ (3.12) 

be the decomposition into generalized eigenspaces. This decomposi¬ 
tion induces a decomposition 

e — n 4-F i ,. 

Lei A = {/ 1 .■■■,/,,} be the eigenvalues appearing in the product 
r:=(A- A, 1 /)-..(4-A u /) 
and denote by m(/t ). - •■- m{l r ) their multiplicity, i.e., we have 
/» = (-4 — /i/)"'" 11 (.4 — 

From the .4 invariance of the generalized eigenspaces Y\ t it follows 
that i'i t = 0 if h £ A. Moreover if h € A then <>/, has nilpotency 
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index at most r >i(/i). In the following we restrict ourselves to the 
case when t*/, has nilpotency index m(h). The (easier) other cases 
are similar. By Lemma 10 we have an expansion 

r= Y (A-hI) J, u h + ••• + Y (3.13) 

;i=0 


In this summation there arc m(/i ) + ■■■ + mf/,,) = k summands 
which we like to denote by ,h. ■ ■ ■, Jk . By Lemma 10 those vectors 
are linearly independent and from Lemma 9 it follows that 1: = A 
■ ■ ■ A h is an eigenvector of A with corresponding eigenvalue A: = 

A M -f-(-A /|L . Finally, from (3.10) it follows that {C.*i, • • ■, C h } 

is a linearly dependent set. It follows 


( J 


— (A (l + ■ ■ ■ + A , K )I' 
C 


A\ A-- - A .1 k = 0. 


(3.14) 


2) -*3): The vector A ■ ■ ■ A h is necessarily an eigenvector of 
.4 and therefore in the unobservable subspace V of the system (3.3). 

3) —* I): The fact that condition 3) implies condition 1) is nontrivial. 
Our proof follows mainly ideas already developed m (6| and in 
principle it should be possible to generalize the proof given in [6]. 
This however amounts to a large case by case search. In order to 
avoid those tedious arguments we will deviate at a crucial point from 
this program. 

The proof is structured as follows. Consider the decomposable 
vector n: = o! A ■ ■ ■ A <u in the unobservable subspace V of the 
system (3.3) whose existence we assume. Using the fact that V is 
\ invariant we will construct a polynomial /(./■) E K[.r] which has 
the property that f(A) a is a decomposable eigenvector of .4./(.4)n 
is then necessarily in the unobservable subspace T and this implies 
2) and from there we will imply 1). The details are now described 
as follows. 

Consider the set of eigenvalues { Ai. • -.A,} C K of 4 and 
arrange the order such that 


Re A, < Re A, + j 


or 


where is the component of n, in \ \ t and where the 
is taken over all indexes (/ 1 . ■ ■ *. u ) having the prupom , 

■■■ + A,* = A. 

From the fact that the matrix /J is assumed to be in i\)>. 
and from the assumption that Ai, ■ ■ * A„ aic ordered, it fol, 
there is one eigenvalue // such that the component u t , of a , A- 
in TT, ( is nonzero and decomposable, i.e.. 

w fl - r t] A--Ar lt . (3 l/j 

Indeed, r, ( can be chosen in the following way hoi i - l ... /, 
consider the decomposition 

n t = Y , ’ , > n.!K) 

induced by the eigcnspace decomposition (3.12). Then choose t, 
as the first index with the property, that r, t ^ 0. By definition 
r ,! A A r, k is nonzero, decomposable, and it represents the 
component of oj A • • - A tu in IT,,. 

Let in be the order of tr t ,. It is our goal to calculate the eigenvector 
C4-p/) M, - , n „ and to show that this vector is decomposable as well. 
For this consider the initial value problem 

4(.n(O A ■ • • A n(O) = (--l - fil)lfx(t) A ■ ■ • A .n(/)) 

(If 

.v\ (0) A • • • A J’k(0) = v, | A • ■ ■ A r, f . 

Using Lemma 7 and Umma 8 one verifies that the solution is given 
through 

< ( * A — A, ( 1 ,u r, , . (3.19) 

Because i\ | E I j . j - 1. ■ ■ ■. k. we have a polynomial expansion 

»»*(y) — i 

, ( , = Y M - (3.20) 

h j = I) 

where ni(j) is the nilpotency index of i\ ( . 

Expanding each r, ; in terms of the standard basis \ .i i, • ■ ■, .i „} C 
K" we get an expansion 


Re A, = Rc A ,4 i 
and 

Im A, < Im A,-| j. 

Let us choose a set of generalized eigenvectors {.n. * ■ ■. ./■„ f of .4 
and consider the decomposition of K" into generalized eigenspaces 
given through K" = { 1 1 a, ■ Arrange the order of {,n , ■ • ■. ,r M | in 

such a way that {r i. ■ ■ ■. .r ,, | forms a basis of \ \,. {.r,, 4 1 , ■ ■ ■, . 1 *, 2 } 
lorms a basis of \\ 2 and so on. 

Let n: = n 1 A ■ ■ ■ A <u be a decomposable vector in the unobserv¬ 
able subspace T T . Expand , b,,.r,*j = 1 . ■ ■ ■, k\ in terms of 

this basis. In this way we associate to n a coefficient matrix D =- b, t 
whose entries are unique up to premultiplication by an clement of the 
special linear group .*>/*: — \T E Mat,, * „ |det [T) = 1}. Without 
loss of generality we can therefore assume that the matrix Z? is in 
echelon form. 

Consider now the decomposition of /\ k K" into generalized 
eigenspaces. 

k 

/\K" = I A_A, 1+ +V U Ua. (3.15) 

If ir\ denotes the component of nj A ■ ■ • A n* in \\\ then 

h 'a = A ' " A r ‘k • (3.16) 


< M Y Am A --A,,(3.21) 

J" »1^ m •k* » 

In this summation f {l] ia) (M are the PliJcker coordinates of the 
vector r 1 ir fl and we will abbreviate them by /,(/). By (3.20) it 
follows that f,{t) arc all polynomials of degree at most j m{j )- 
k. 

It fact we can say more. Differentiating both sides in (3.19) 
m - 1 times and substituting t - 0 results in the eigenvectoi 
(.4 - ///)"' 1 tr,, on the left side of the equality sign. On the right 
side this operation results in 

(111 1)! ^ //-,, A ■ ■ ■ A .v , k , 

1 " 1 k‘ u 

where </, is the coefficient of the monom in the polynomial 
/,(/). By definition we have (.4 - ///)'" 1 «,, # 0 and (A - 
ir,, = 0. We conclude that each polynomial /,(/) has degree at 
most in - 1 and some coefficients rj, are nonzero. In addition note 
that the vector r ( 1 ,,l)l ir fl is a decomposable vector at all time / > 0 
and the Pliicker relations (compare to |5, Section 31) 

/'+1 

(OR) ■ J{j, 0 

(3.22) 

have therefore to be satisfied for all f > 0 as well. By doing Ihe 
same argument as in |5, Theorem 3.6 and Example 3.7) we conclude 



m 
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that the Pliicker coordinates are also satisfied for the coordinates g ,. 
But this means that (A - /#7) m ~ 1 /r, l is a decomposable eigenvector 
wtrich wc denote by 

.<|A-A h. 

Consider once more the eigenspace decomposition 

ni A * ■ ■ A *u — A 
\ 

as induced by the decomposition (3.15). Let n/(A) be the nilpotency 
index of i/'a and define the openilor 

/(.4):= (-4 - ;//)"“ ' II { 4 ~ A/)'" (A) . (3.23) 

A direct calculation shows that 

/(.4)<i) A ■ • • A <t» = A---A h- (3.24) 

V/« 

We conclude thai the decomposable eigenvector >1\ A • • • A ,/a is in 
the unobservable subspace V of the system (3.3) and this implies 2). 

Actually we have shown more; i, € V, f and if the coefficient r, 
is repeated m times in the set {ri. - - -. a) then d, has nilpotency 
index at most m. But this means that 

(3 - A, a 7) ■ ■ ■ (.1 - A u /),/, =0 

for j = 1. ■ -, A . 

By linear dependence of the set {£' /|. ■ ■. CJ* } there exist scalars 
c \. • ■ ■, t'k not all zero such that 


riC /i + ■ ■ ■ + <kC h = 0. 


But then 


ir = r, t| H-1- (*a J* 


has all required properties tor 1). Q.E.D. 

A direct consequence is the following Lemma whose proof is clear. 
Lemma IJ: If for any set of eigenvalues Ai.■■■, A* of.4 one has 

[<-i~A,VV--( 1 - A* /) 

C 


rank 


(3.25) 


then there is no real decomposable vector in the unobservable 
subspace P of the system (3.3). 

Remark 14 In general the converse is not true as it is demon¬ 
strated through an example in |6J. 


IV. Proof of the Main Theorem 

Proof: We first show the sufficiency of the criterion (2.1). Let 
TV (}u C K" be two (/-dimensional planes with Pu ^ Qn. 
L-ct r/ii E (A) be a point having the property that r/ () g P 0 . Let 
{■i'll - ■■.-»./} C be a set of points chosen in such a way that the 
decomposable vector 

if = (<in - ,r tl ) A (.ft - .ret) A • ■ • A (.r,f - .i-o) 

is nonzero. If the rank condition (2.1) holds, it follows from Propo¬ 
sition 12 and Lemma 13 that there is no decomposable vector in 
the unobservable .subspace T of “the augmented system" (3.3). If 
therefore follows that 

C w(t) = ( C t go — C't ^.ro) A (CV ' f .ri -Ct 

•" A{C( U .v t( - Ct 4# -fo) 

is nonzero for all time t with the exception of a set of measure zero. 
But then we have that Ct 1 1 (jo $ Ct u Fu for almost all time /. In 
other words Ct y, Qu ^ Ct w Pq. 


In order to prove the necessity part assume that all eigenvalues of 
A are in K. Assume that there is a set of eigenvalues Ao, -.A f / 
of .4 such that the rank condition (2.1) is not satisfied. Furthermore 
assume that for any set of eigenvalues /in. ■ * •. //</-i of .4 


rank 


(A - /in/) • ■ * (A - /i,/-i7) 

C 


(4.1) 


If this (technical) condition is not satisfied wc will be able to show 
at the end of the proof that (d - 1)-dimensional subspaces cunnot 
be observed in general. 

By Proposition 12 there exists a nonzero, decomposable vector 
■fo A .i i A- ■ • A .v,i in the unobservable subspace l of “the augmented 
system” (3.3) (assuming k = <7 + 1). Define V: = span j.r 0 , - • ■, r,/} 
and let Po, Qo C V be two (/-dimensional subspaces satisfying 
P) f- Qi). By ihe assumption it follows that 

span {CV u .rn, ■ ■ -. C V w j\/} 


is a (7-dimcnsional subspace for t almost everywhere. But this means 
that the two different subspaces Pu and Q o in K" produce the same 
moving plane CV P {) =. Ct Ai Qo in K 1 ' for all time t with the 
possible exception of a set of measure zero. 

Assume now that (4.1) is not satisfied and let d be the largest 
integer having the property that 


rank 


(.4 — An J ) ■ • ■ (.4 — A (/ /) 
C 


< n 


(4.2) 


for some eigenvalues A u . ■ • •, A f/ - but 


rank 


[A - // n f) * 


C 


J) 


(4.3) 


for all eigenvalues //o. ■ ■ •, //,,_, of-4. Using the same argument 
as before one shows the existence of two subspaces Pu and Qd of 
dimension d which cannot be distinguished in the observation. This 
completes the proof. Q.E.D. 
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A Note on Robust Pole Placement 

Marek K Solak and Albert C Peng 


Abstract — Pole placement algorithm in single input-single output 
SISO) systems is discussed with respect to the corresponding, real 
siabi’itv radius of the resulting closed loop polynomial 


1 Introduction 

Various robust stability problems are discussed in terms of stability 
ladu 111, [2], [9]—[ 14] This approach is attractive because only one 
\anable (stability radius) describes stability margin in a parameter 
space Numerous methods exist to calculate stability radii (structured 
ind unstructured) [l| [2| [9]-[14] and in many cases these meth 
ods allow to determine destabilizing direction ol perturbation Real 
stability radius appears also as a practical tool in designing robust 
stabilizing controller for various control systems 11], [2] 

I his note provides a useful suggestion how pole placement of 
a closed loop system affects robust stability measured by its real 
stability radius It is also interesting to notice when root localization 
ot a given Hurwit/ polynomial is known then complicated numerical 
minimization of its distance (L from the set ot polynomials with roots 
on the imaginary axis can be avoided in many cases r/U ) niay have 
many spikes and local minima ovei [0 -x) 

Consider a nominal Hurwitz polynomial 

Pi I (S) = <i[) " S + <t\ 1 s 1 4- + a - p[ , 5 ( s) (1) 

S(s) — [s s 1 si] (2) 

peituibcd by a polynomial 

<*>(*■) = <S,s -n,s '4- 4^ ,s h =b' S( s) O) 


The real unstructuied stability radius tor p, , (s)/„ ,„(p i ) can be 
calculated via various methods All ot these methods aie based on 
the paper ot Fam and Meditch | lb] Define 


rhtoum I 
1 1 \ 


(I 


— mf 

( 


{ 


•I^ = mf 

* o 


l‘ 1 (*•)= (■>) + »/' ( 

- ) 

(4) 

|l p 91| [2] [91-1141 



= >,, (s)j = min(|n| 

1 l« "1 <u 

(S) 

1 + - 1 + + 

b> (/*>F 

1 4 . ^ Hi n 

} 


li = Ip 

(6a) 

( \j' (j- 1 -)) + [v'> O* )]’ 1 

1 1 + -'+ + -*< / 

n =2p 4 1 

(6b) 


where |u[] " | denotes the distance ot p , , (s) from a zcio at mhnilv 
|'/ 1 | denotes the distance from a zero at the ongin and denotes 
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the distance from the set of polynomials with ztn > >n i 
axis at 

In many pole placement algorithms we are able u i \ 
[6]), localization ot p u „ (s) via (stale space) tee \b\ k 
polynomial 


/ /(“) - Pi 

is generated f 1], [1] Some other feedback cuntiolleis icl Sui m 
require solution of a diophantme polynomial equation 

j»(s)o( s) -+ <y( s)Fi( s) *-11 , ( s) 

where n„ „ (s) is a polynomial with arbitrary location of loots within 
the left complex half plane At this stage an open question k mains 
ol how to select either f(s) or „ (s) to assure maximum valui ol 
the corresponding stability radius On the other hand manv piactical 
problems impose specific regions (cf [7]) in the left complex half 
plane where loots of p/ (s) or ii , „ (s) shall be located 

This note discusses a connection between leal unstiuclured stahil 
ity radius of a given Hurwitz polynomial and its root localization 


II Robust Pott PiACtMFM 

Presentation of the mam result of this note will lequirt additional 
definitions Let 5 (/») denote the set ot all Hurwitz polynomials with 
loots outside interior ot the unit circle (single or multiple roots on 
the circumference of the unit circle are allowed) Additionally let 
HP (p) denote the set of all Hurwitz polynomials with roots in ihe 
region bounded by the hyperbola (including roots on the hyperbola) 


(t-) (t-) 


r 1 


(7) 


where n denotes real part of the root and i denotes complex part 
ol the root 

The main result ol this note is formulated in Ihe following theoiem 
The proof is a little complicated and is cained out in the next section 
7 hi ore m2 If 


Pi M £ S # (p) D HP {p) 


then 


1 tn [pi ii ( s )] |'in 


( 8 ) 


R(maik l 

The condition p t , (s) (E S' (j>) can be easily verified Assuming 
that p , (s) has no roots at infinity we can easily set lhat p , (s) £ 
.S' (p) iff 

<i( s ) ” s l>i i (l/ s )£ S () ) 

where 5 (y>) denotes the set ol all Schui polynomials (polynomials 
with all roots inside the unit circle) There is a fair number of 
algebraic procedures to check if ryM € .S (y») (cl |4) |5]) Applying 
the method m [ 15] wc are also able to check if p t , (**)*: HP ip) 
It is important to notice that stabilization algorithms arc mainly 
based on an a priori localization of roots ot a nominal closed 
loop system characteristic polynomial within the left complex halt 
plane and do not require application of Ci stability polynomial tests 
let [I^D 
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Tig 1 Root lotdii/aiion ol polynomials destabilized through zero at infinity 


where pi 5 1,1 and <*o are the highest term coefficients of p u m ( s) and 

A(*) 

| a mi 

0 S |Ao| J = < 1 (17) 

Once again (17) contradicts (10) for t u = 0 Jt fin = 0, \pn m \ 2 > 0 
and (16) never happens □ 

To prove the main result of this note we shall need the following 
matrix 

H(/0 = S(>t)S'(-,t) t 6 <-x x.) (18) 

where )] denotes the set of all eigenvalues of E( ft) 

Pi opo\ition 2 

E(jt) is a Hennilian matnx (19) 

rank H( //) = 1 f € (- 'v -\ ) (20) 

(r[E(jt)] = { \(1) I) 0} (21) 


III PROOf OF IHIORFM 2 

The proof will be subdivided into a few steps In the first step 
the well known Rouche theorem will be extended to accommodate 
Hurwit/ polynomials (a nonclosed Jordan curve, imaginary axis, will 
be considered) Then a simple approximation of will be provided 
An elementary induction argument will conclude the ptoof 

Proposition / Extension of the Rouche’ theorem for nonclosed 
Jordan curves (cf [4], [8|) 

For a given Hurwilz polynomial p u „ ( s ) - p n , ,£( s) a perturbed 
polynomial 

P* M ~Pu „ (“) -h s) = (p[ , +6 )S(s) (9) 

is Hurwit7 as well if 


MO 


= yv k ^ (Mn 

^ l» + i 

in ' 


! r ( fi) 


The corresponding eigenvectors are 


vi {)O - S(//) 
v t (jt) G kcr S'(- )t) 
A = 1 n 


1?±\ 
t - ±1 


( 22 ) 


(25) 


Proof 1( is easily verified that E(jt) is a Heinutian matrix o( 
rank 1 Hence Z(/t) has i/ eigenvalues equal to 0 and one nonzero 
eigenvalue X(t) Moreover in this particular case 


fidt)fi(-it) < p, , (jt)p t , (-it) t > u (10) 

Proof The pioof lollows directly from the zeio exclusion 
principle Consider a parametrized polynomial, ct [4 Lxertise 10 
P *51 

/■(- \) = />„ . (■>)+ \ € [0 1] (II) 

In case of finite zeros of F( s \) let us assume that condition (10) 
holds, but the polynomial 

f (s 1) =P» m (s) f fi(s) (12) 

is unstable Due to continuity principle (81 (171 theic exists V> G 
[0 I] and to ^ 0 such that 

f(lt\) Ajj) =■ p u m ( jtn ) + Ad^>( itn) = 0 (15) 

0 < |\ \ 2 — lj >n »i (/ f o) I _ Pu m ( ftn )pn i n ( -"jtn ) 

1 ! |Nj*o)| 2 fi(ltn)fi(-jtn) 

and the latter condition contradicts (10) F( * X) can also be desta¬ 
bilized through a zero at infinity F{ s \) has a zero at infinity iff 
s / (1/** A) has a zero at the origin In this particular case there 
exists \ 0 € [0 1] such that for s 0 = 0 

s u F(1/sij \o) - s"[/# n m (1 /so) n- AoA( 1 /So)] - 0 (15) 

Po ,n + =0 dcgji,, „,(s) = deg MM 


XU) = trE( it) 

and the proposition lollows □ 

Pi opo sition ? If 

\\*>\\ [l/X(t)]p n (-It) t> 0 (24) 

and the polynomial p x ,, (s) == p[ n S(s) ts Huiwitz, then the 
polynomial 

jMM = iPn „ + $')*(••) (25) 

is Hurwit/ as well 

Proof First let us observe, that according to Proposition 1 

jMM = (pi, m + *') S (M 

is stable if 

Pu 1Bl H(/f)p,. >6'3(j1)fi t>() (26) 

According to the Rayleigh principle for Hermitian matrices [17 
Section 6 2] 

fi'ZljDfi < A(f)||«|| J (27) 


(16) 


and we have the thesis ol Proposition 5 


□ 



I r TRANSACTIONS ON AUTOMATIC CONTROL, VQL 40 ( NO 1 JANUARY IW.S 


PioofofTheoiem 2 From (24) we have (ct [221) 

l,m,x (Pi.< m) > ;>|f J^jP» ... (jt)]>n n, (~jt ) (28) 

id m particular 

*L — (it)p lunll { —jt) (29) 

)ur intention is to show that all Hurwitz polynomials with roots 
outside the unit circle (including circumference) and within the 
hyperbola (7) die destabilized by the zeio at inhmty, le, i mi * — 
,/J, 1,1 1 The proof will use the induction argument with respect to the 
polynomial degree n. Coefficient “noirT notion will be omitted 
( a\c m ~ 1 

pl in(s) = r 'o s 4 ai (10) 

It s = satisfies |s| > 1 then i IllM “ min(|a 0 | |m|) = \a u \ 

Cast n = 2 // 2 1)m ( s ) ,s a nominal, Hurwitz polynomial of degree 
2 and two cases can occur 

OK’ ii, (s) has two complex, conjugate roots 
2)pi m(s) has two real loots 
In case 1) 

pt mi( s) — «o[( ** 4 o ) 2 4 C] (11) 



Fig 2 A (eedback control system 

IV Robust Stabilizing Coni roller Disk \ 

A standaid feedback stabilization problem is to desi u iUk a 
proper, possibly unstable transfer function) a propci contiollu such 
that the denominator of the closed loop transfer function is m ihk 


t\ s) 

= j> i 


(U» 

p(s) = y/os ,,J 

+i>i •>" 

' '+ +i<,, 

(17a) 

7( s ) — 4 (j\ •»"“ 

■' + 

+ iy„ . in > n. 

(17b) 

r(s) 

i = d~ [ 

(s)ll(s) 

(18) 


j'i i. (iO p:, i. i-iO 


<({**) “ tfo **' 1 4 <i\ 1 J 4 4 d ini 


,t l +2t in 2 - ) 2 ) + (n-*+ j-)- 
"" H + t T + 1 

- % r +7 7 +T (,2) 

if 2(0 - > 1 (n“ + f“) > 1 

Flence in view of (28) 

t i lX > (H) 

On the other hand, / miv S |"o| and case 1) has been proved foi 
complex roots. Case 2) is an immediate consequence ot Case 1) 
Assume that Theorem 2 holds tor a polynomial „ (s) of degree 
n we have to prove its validity tor a polynomial of degree » + 1 
Fust, let us assume that 

0 = Pu ml •»)(■» + '0* O > 1 


h{s) = Flos ' - 4- fij s"‘ _l 4 4 h niJ /II, > /II » (1%) 

The transler function ot the closed loop system is 

7 (s *__ = mill ( 40) 

1 p( s)r/( s) -F (j( s)b{ s) n(s) 


n(s) = ii 0 s" " 1 4 4»» 


{ pud n 
P<lU(j 


4 r/o^u uj - n w i =• m 
otherwise 


~-J f )[P"V„ (/ f )Pn + „! (-//)] 


Under very mild assumptions concerning p( s ) and »/M 
(p rj —coprime |1), [IS]) a proper controller can always be 
determined such that roots of m(*) are located arbitrarily in the left 
complex half plane fl|, |1J, |18] II parameters of the plant are 
uncertain, for instance, cf (17) 


Pu ... (i f )Pn .„( — yf )(^~ 4 ) 

14/ J 4 4 M 4n 


ii in ^ ^ hi 

p 5> <p, 


HIIJJ ^ , y , ” 

'/. <■ <1 - <1 


_ Pu 0* )p, M l i-jt) 1 4 C + 4 f"' _ , 2 V 

1 4f J + 4/ >M 1 4 f 2 4 4f i,M+1 " 5 


2 14T4 4 t J 


4 t 2 4 4 


rJirTT) < r + n 


=- K.ni(0[(h 4 O)* 4 * 2 ], then 


A, H -(0 


(-//»] 

_ Pn (zOK* », (-/') 14 f 2 4 4 f >M 

14/.'4 4 14/^4 4 f i(,,+2) 

[f'4 2r(o 2 - i 2 )4(n’4 i 2 )] 


It is obvious, under assumptions of Theorem 2 that <7,i <. of,, and 
1 heorem 2 is proved □ 


it is quite reasonable to hnd a contiollei r(s) (18) such that loots 
of the nominal polynomial m„ „(s) (40) (41) are located within 
appropriate region Ct of the left complex half plane and 

"n inM = Pit III ( ^)m( s) 4 fji M MMs) (41) 

a stability hyperspheie of m„ ,,,(*#) contains all uncertain polynomials 
n(s) (40H42) 

Usually, as a nominal polynomial u n ,„( s) a polynomial is selected 
such that its roots are all located within a speuhed region (t ol the 
left complex half plane, possibly contained within the hyperbola (7) 
and outside the unit circle centered at the origin II pu ..M and 
q tl ln (s) are coprime [1] a compensator r( s) = a 1 (s)/>( s) (^8) can 
always be established, such that (41) holds 

k in 

...(•*) = mJJ 1,1 JJ(« 4 Si)J^J[( 1 4 n )' 4 f*] 

r I /l 

u = dcgn n In ( s) - 2m 4 A 


( 44 ) 




; IEEE 


s, > 1, € R, i =1, 2,/'*, k 



/ = 1 . 2 .• • ■, m. 

Theorem 3: Stability radius of (44) [cf. (40), (41) and 

<43)] 

r tUi xK [»»imn.(«)] = KH* (45) 

Proof: The proof is a direct consequence of Theorem 2. □ 

Remark 2: Formulas (44) and (45) can be used to design proper, 
stabilizing robust feedback controllers for uncertain, single input 
single output systems (36). Parameters hqJh) can be selected to 
provide an appropriate value of (45). 

V. Conclusion and Discussion 
This note presented a connection between pole placement in single 
input-single output systems and robust stability measure provided by 
real stability radius. 
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Pole Assignment for Uncertain Systems 
in a Specified Disk by State Feedback 

Germain Garcia and Jacques Bemussou 

Abstract —This paper presents a method for assigning the poles in 
a specified disk by state feedback for a linear discrete or continuous 
time uncertain system, the uncertainty being norm bounded. For this the 
“quadratic d stabilizability" concept which is the counterpart of quadratic 
stabilizability in the context of pole placement is defined and a necessary 
and sufficient condition for quadratic d stabilizabillty derived. This 
condition expressed as a parameter dependent discrete Riccati equation 
enables to design the control gain matrix by solving iteratively a discrete 
Riccati equation. 

1. Introduction 

Among the different ways for defining performance specifications, 
one of the most popular is to put some constraints on the poles 
location for the closed loop transfer matrix. In practice, the exact 
pole location is not required and it is often sufficient to locate them 
in a prescribed region in the left half complex plane for continuous 
time systems or in the unit disk for discrete time ones, 

In this paper, the problem of pole assignment in a specified disk 
for linear discrete or continuous time uncertain system is addressed. 
In 111, the authors use a method based on conformal mapping 
which leads lo a complex control law determination. In (2|, another 
method working with the discreie Riccati equation for control law 
design is proposed. More recently, matricial algebraic tests for the 
spectrum of the stale matrix to be clustered in a subregion of the 
complex plane were developed. In this direction, efforts to construct 
a modified Lyapunov matrix equation leading to efficient synthesis 
procedures have been made |3|-[6]. Unlike previous works, in the 
most recent works, uncertain systems are considered. A sufficient 
condition for disk location in the case of continuous or discrete 
systems is given in |5|. Robustness issues are discussed and margins 
for disk location are provided. In j4] and |3| conditions for pole 
placement in several regions (circle, vertical strip, sector...) are 
stated for uncertain systems, the parameter uncertainty being defined 
through convex bounded polytopes. The control law is derived via 
convex programming. 

Here, following the same ideas as for quadratic stabilizability, 
a similar concept for pole assignment referred as “quadratic d 
stabilizability” is defined. First, a necessary and sufficient condition 
for quadratic <7 stabilizability is given for uncertainty affecting only 
the dynamical matrix. Expressed as a parameter dependent discrete 
Riccati equation, it leads to an algorithm allowing the state feedback 
gain design. The condition generalizes the sufficient condition given 
in [5|. The necessary and sufficient condition is extended to systems 
with uncertainty in both slate and input matrices. This condition, 
although more involved, can be stated as a parameter dependent “aug¬ 
mented” Riccati equation. The paper ends with numerical examples 
illustrating the theory. 

Manuscript received December 8, 1993; revised May 5. 1994. 
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II PRFLIMINARIFS 

i et us consider a continuous or discrete time system described by 

Mi(n] = (4 + Ai)i(f) + ij H (f) (i) 

lien 4 E 93' *' B 6 9V'*' u{t) 6 ' is the input and 

t) - 9V is the state is the derivation operator in the continuous 
nine case [i e , ^[i (f)] — t (01 and lhe delay operator for the discrete 
i me one [i c , A[i (t)] = t(A 1)J A 4 is the uncertainty of norm 
bounded type written as 

A 4 = Drr (2) 

\shere D E fH' *' £ E 93* y dehne the slruclurc of the uncertainty 
and the parameter uncertainty h belongs to the set 

N = {T E 93 *' r # / < 1} (3) 

Ihioughout this paper the expression (•) 0 K 0) means 

that matrix (•) is positive definite (negative definite) and 1 denotes 
the identity matrix of appropriate dimension The problem to be 
iddressed is to determine the state feedback 

u[t) = hi(1) (4) 

such that the closed loop system poles lie in the disk D(a / ) with 
center t\ + /() and radius i Foi the sake of simplicity let us intioducc 
the lollowtng notations 

1 = 4 + UK 4 = ( t-nl)/; 

1 -(4 + Bh m1)/i 

A 1 = A \/i 

4 A = l 4- Bk -f A 1 


The above definition states that & system is (|u uir im. 
it there exists a single matrix P satisfying |7) loi ill b 
the uncertainty domain It should be interestinc to ul a i 
P to a Lyapunov matrix The existence ol a single 1 \ ipum 
for all the sysleim in the uncenainty domain ensuu s \U L v ^ 
system stability for time varying uncertainties 
7 heoinn 2 If the system (1) is quadratically / stibib 
it is quadratically stahilizable and P is a L\apunov matrix I 
systems in the uncertainty domain 

Piaof First, obviously P > 0 For continuous s>stuns \\ 
and i < |n| Developing C) we obtain 

(-1 +AI)7> + /'( 1 -f A 4) - n '(4 + A 1) i’( 1 | Ah 

4 i 1 (/ . )/ 0 

and then 

(4 + A 4)'/M P{ l +A4)sn '(4 + A !)'/’( 1 4 it) 

- <> '(/ - )/’ 0 

Thus the system is quadratically stabih/able 

Tor disciete systems the relation with quadratic subili/uhility is 
not so sliaighlloiward In fact the difficulty is due to the presence 
m the equation of the term 

-o[t l + A D'P+Pl 4 4 A 4)] 

Suppose that system (l) is quadratically »/ stabih/able Using nota 
tioris (7) is wntten 



l tt us fust consider the case m ^ 0 I he circles m the unit disk aie 
then charactcu/ed hy n 4-; < 1 Multiplying (8) on the Iclt and on 
^ the right by the symmetric matrix 


1 rum the gcneial result on matnx loot clustering presented in |7) the 
following theorem can be deduced 
I fu oi cm I Let l E 93 * be a given matrix The eigenvalues of 
1 belong to D{n ; ) if and only if there exists a symmetiic matrix 
P E 93 * such that 


( 


-l> 1 
i' 



( 6 ) 


Pioof The pioot is obtained from the standard Schur coniple 
ment result and from |7] 

From now on only the stable D{<\ / ) location cases are consul 
ered le the one where D{(\ / ) belongs to lhe left half complex 
plane for continuous systems and to die unit disk foi discrete systems 
A concept related to Theorem 1 which follows the same philosophy 
is the quadiatic stabilreability [8] can be introduced 1 his concept 
referred in the sequel as “quadratic <I slabilizability extends the 
condition of Theorem 1 to the case of uncertain system in a similar 
way as quadratic stabili/ability extends the concept of stability to 
uncertain system 

Definition / lhe system (I) is a ‘quadratically d stabih/ablc by 
a linear state feedback //(/) — A i(/) it and only if there exists a 
symmetric matrix P 6 93 * such that 


V 


( 


4', 


1 

t A 4', 


4 + A 1, 

-P 


^\ < 0 


(7) 


we obtain 



1 

r y, 


) 



< 0 


where 7, satisfies 1'7 X P = - 2P-h\ P/i + 1/ii 
0 with ; 0 t\ >0 I his implies -n( A \P 4- P I \) > 

-2m( n 4- ;)P and thus 


V a 4- (n — i )/>- >n(n + » )P 

= 4yr4 ^ + n J p 

< 4 \P )/’- <i( i'^P+P I a) 

< 0 


But as 0 < m 4- i < 1 wc conclude that 

PA A - P < 4 ' + P < 0 

The system is quadraticully stabih/able and P is a l yapunov matrix 
The case n < 0 with n - i > -1 is treated in much the same 
way as tor n > 0 beginning by multiplying (8) on the left and on 
the right by 


( 


-nr 

i 


i 

-p-'/i 


) 


for all \ G £K 2 " and P G N 


[] 



m 
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III. Main Result 

W« arc now in a position to give the main result of this paper. 
Denoting by 

E, =E/y/7. D, =D/s/T'< 

4„ = (.4 - nl)//, n, = D/r 

the following theorem can be stated. 

Theorem 3: Let Q and R two positive definite symmetric matrices 
of appropriate dimension. The system (I) is quadrutically d slabiiiz- 
able by a linear state feedback i/(t) = A’.r(t) if and only tf there 
exist < > 0 and a positive definite symmetric matrix P € 9T*" 
satisfying the following discrete Riccati equation: 

A',AP" l +D,H 'D’-fD.D') '. 4 „ 

- r + , 1 E[ E, + Q = 0 ( 9 ) 

with 


Lemma 4 [J11' Let Q and R be two positive symmetric matrices 
such that the Riccati equation (9) has positive definite symmetric 
solution for f* > 0. Then for any given positive definite symmetric 
matrices Q and R , there exists e > 0 such that (9) has a definite 
positive symmetric solution for all r €(0, c] with R replaced by R 
and Q by Q. • 

The above lemma states that the existence of a matrix P satisfying 
the conditions of Theorem 3 does not depend upon the choice of the 
7? and Q matrices. 

Lemma 5* Lei D,, .4, E, be matrices of appropriate dimension. 
Let F G N and P be a positive definite symmetric matrix satisfying 

f '\-d\pd, >o. on 

Then, 

A’PD, FE , + E\ F‘D\ PA + E\ F’D\ PD, FE , 

< A*PD,{f~ x l - D\ PD, ) 'n\PA + r l F\E>. • 


( ! 1 - D\ PD, > 0. (10) 

Then ihe control gain matrix A is given by 

K = ‘4 B,ir'D\ -*D,D\r { A„. ( 11 ) 

Remark. Let P = *P and using the matrix inversion lemma, the 
Riccati equation (9) can be written in a more “classical form”: 

4/Mm - r- X P{ D, D,) 

(<n + n:rB, b\pd, Y'fn'Ap, 

' v d'pd, D'.PD, -1 ) \d\) " 

+ E\ E, + <Q = 0. 

The Riccati equation (9) is ot a type which also arises in linear 
quadratic differential games (discrete time case) in which the cost 
functional is (sec [9|) 

■A(iu. I'JL) = !h A- +f"\Riik ~ 

k~-i) 

where u k is the minimizing control, e k is the maximizing control, 
and the system under consideration is described by state equation 

•«‘H 1 = A„.i; 4 B, m 4 D, r k 
i/a = E,.n. 

Pi oof of 7 he ore m 3 For the proof, some technical lemmas are 
necessary. In what follows, we give almost all them without proofs 
which can be found in the quoted references. 

Lemma I IK) / Let 5. V. and Z be given by k x k symmetric 
matrices such that S’ > 0, V < 0. and Z > 0. Furthermore, assume 
that 


(i/'Vi/)-’ - -l(i/'.S’v ){il , Zff) > 0 

for all nonzero // € . Then, there exists a constant A > 0 such 

that the matrix X 2 S 4 \Y 4 Z is negative definite. • 

Lemma 2 (Fmsler Lemma). Lei X be a k x k symmetric matrix 
and R, fc SH**'" such that i/.Y»/ < 0 for all // ^ 0 satisfying 
Bitj = 0. Then there exists a positive detinue symmetric matrix 
1 x * such that the matrix A’ - is negative definite. • 

Lemma 3 110 /. 11 Ip Given .r e W”. u 6 9T. 

Max {(/D, FE, //)': F' F < 1) = D, D\ .n,‘ E\ E, </. (12) 


Proof If we define 

V = (T 'l- D', FD, r l/ * D[ PA - {<- l l - D[ TD, ) X/ 'FE,. 

We have Y'Y > 0 and then 

A'PD, (*' ] 1 - D[D,) 1 D\ PA - A'PD, FE, 

- E\F'D\ PA 4 E\F'{f' [ 1 - D\ PD, )FF, > 0. 


Rearranging some terms, the lemma is proved. □ 

Necessity: Suppose the system (1) quadratically d slabilizable by 
a state feedback u(t) — Then, there exists a positive definite 

symmetric matrix P 6 91" x " such that for all A’ € 9t" >M and F € N: 

V '( ~ P ' A..+D,FE,+B.K\ 

\A'„ + E', r D\ + h"n[ -P Y " U 

which can be written 

'•■(4 Y)' 

+ x '(e: rn: D 'o r ') f <° »» 

And then for all Y ^0 such that [B\ U]A' - O, wc have 

v (4 n YY<«- 

This implies that 

-'I" >r 

< -1Mu.A ^ F’f o' 0 

for all A” ^ 0 such that \B\ 0]A’ = 0. And hence 

(' (4 - 4 )' 

for all A’ ^ 0 such that [B[ 0]A _ = 0. Writing Y = [.r'//T b y 
Lemma 3 we obtain 

( 44 ' - 4 )' 

>«(\* S)- YA "(S b°e.) X <l4 ' 
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dll A 7 * 0 such that [D[ 0] \ ==0 Now denoting by T a 
itnx whose columns form a set of basis vector lor the linear space 
£ JH* 1 ' [#' 0 ]\ = 0 } and defining 



Now denoting f =■ (P 1 +/J J? 1 It' Dp \ \ 

that 

l',(F '+/?,/» 7) - ,D D') '4 

-Tun 1 1 ) i + i i 

(17) is now given by 

£ _ [ 4' (P 1 - fD D') 1 l -Q 

-I'n n 'u'i - i ip ,i> u i 


It follows by Lemma I that there exists a constant <- s 0 such that 



for all \ ^ 0 such that [B 1 0] \ - 0 By 1 emma 2 there exists a 
positive definite symmetric matrix U - B 1 such that the matrix 

/-P J +tl) D 4 \ _ (B B ] B 0\ 

[ 4 ' -p 4 . • f r ) ^ 0 Oj 

is negative definite P being positive definite by hypothesis This 
implies 

4 (P ' + B R ' D — tU D ) 1 4 - /’ + . 7 l < 0 

And there exists Q ~ (J > 0 such that 

i 7 ’ 1 + n n 1 n - n ) 1 4 - r 4 < 1 r7 n 7 0 

It remains to ■diow the following inequality 

1 7 -nrn 0 us) 


Following the same development as above we obtain 

■'( D „ D -V)' 


1 , [u 0 \, 

+ , ' (0 / r )' 0 


lor all \ ^ 0 such that [0 P A J \ 0 Now fiorn Lemma 2 there 

exists a mati ix I = l >0 such that 


r -P 1 +(/), I) 


4 \ A) 0 \ 

-/’+' / r j VO l\ B \ B h ) 


which implies that P 1 - tD D > 0 01 equivalently r ‘l 

D PI) > 0 

Snffre if a/c \ Suppose thal condition ol Iheorcm 1 is salished and 
let 


C (4 + A \ ) P( 4 -h Jx 4 )~P 

Using Lemma *5 and (S), we have 

£<[!' P4 -P+4 PD (/ 7 -D'PD) 'D'P 4 
+ f *£'F] 

= [4' (P '-7>r>7 -l 4 -p + f 'F7 ] (16) 


A simple calculation shows that with A given bv ill) 

4' (P '—*■/)//) 1 i - r'(P 1 -</> /> )/ o 

and then 

L < -[Q 4- A'AA] < -y < 0 

Ihcn the system is quadialically r/ stabih/ahle and tlu proof is 
completed U 

A c/ Stabilization Algorithm 

Theorem 4 gives a condition which allows lo dense the d 
stabilization algorithm presented below 

Step 1 Choose any positive definite symmelnc weighting mall ices 
Q G 9T ' and li t K and initialize » to some 
starling value 

Step 2 Determine whether the Riccati equation ( l >) has positive 
definite solution satisfying (10) If such a solution exists 
the algorithm succeeds unci a contiol law is determined by 
(11) If not go to step 4 

Step 4 lake r - <-/l If ^ is less than some computational 
dccuidcy * stop the dlgoiithm fails Otherwise go to 
step 2 

Remark In order to solve the Riccati equation in step 2 some 
standaid methods can be used set loi example |12| (141 

H C ompanson to a Pnxrous Rc salt 

In this paragiaph wt examine the relation between the icsults given 
in this paper and the icsults in (S| First suppose thal system (1) is 
well known (i e I) — 0 and 1=0) that is 

« s [»(0] “ 1»(/) + Bait) (IK) 

Following the same steps as foi the proof ol Thtorcm 4 the result 
below can be proved 

( oi oil an I Let Q and B be two positive definite symmetric 
matrices of appropriate dimension The system (18) is <1 stabili/ablc 
by a linear state lecdback u(t) = A i (/) (i e all poles of the closed 
loop system lie in D(a i )) if and only if there exists a positive 
definite symmetric matrix I* £ * satisfying the following 

disciete Riccati equation 

4 </’’ + />/? 7P) 1 4 - P + 7-0 (19) 


By the conditions of Iheorcm 4 

P =. V (P ' + DJ? ‘D'-fDD) 1 4 + * 7 ' L +7 

Replacing in (16) leads to 

C < [4'. (P~ [ - tD D 1 ) 1 4 

- (P 1 + n n~'n' - ff) d 1 ) 1 4 - 7 ] <i7> 


Then the control gam matrix A is given by 

a = -ii 'n'{P ' +n,n 'n') '4 

- -(,‘77+ D'PD) 'il'Pl 4-dl) (20) 

# 

Thus the sufficient condition stated in |5] is also necessary 



tar 
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IV UNCFRlAINrY IN BOTH THE STATE AND INPUT MATRICES 

In this section, the problem of uncertainty on both state and input 
matrices is investigated Lei the system 

*['('>] = ( 4 + 4 4)! (f) + (77 4 SD)u(f) (21) 


[A 4 AJ5) = D/[r, r,] 

Wc begin by defining an "extended system’ 114] 

/>[\ (#)] = (/“ 4 AJ)\</)4 D,<(t) 


- * (?) 


= (o °.) 


M^DrTkjr Dr 


Proof The system (21) is quadratically d stabilizable by a linear 
state teedback u(t) = A i(f) if and only if there exists a positive 
dehnite symmetnc matrix P such that 

[( i + A4/i) + (JB, + AB/i)A]P[(4 + A A/i) 

+ (fl +AZ?/i)A]' -P <0 

this is equivalent to 

<•' 0l [^ +AF| (/, r r 

~ {hr /£■)]«<" 

and then there exists (> -> 0 (sufficiently small) such that 




(.' o) <f+ 


pk' 

kPh 1 + ,,i 


(J + A F) 


SB — SB/) tr=(E[/y/i hi/\f\) p = n + rn 

Iheotm 4 let Qr £ 9V * y und Hr £ 93 x be positive 
dehnite symmetnc matrices The system (21) is quadratically d 
stabilizable by a linear state feedback //(f) = A /(f) it and only 
tt there exist ij > 0 and a positive dehnite symmetric matrix 
\\ £ 9V A/ satisfying the following discrete Riccati equation 

-F'd! _l +DsP r [ n F - iiDfD'j:) ‘F-ll 

+ ii 'r',r> + Qf ^ o (2D 


I) ' 1 - D 1 , SDr > 0 


The control gain matrix is then given by 
I# = S )>, 1 1 

where S = II 1 is partitioned as follows 


-0’ !:) 


S, G9t 


s £ * 


Foi the proof wc need the lollowing lemmas 
Limma 6 Let a d stable system (i e all Us modes are in D(t\ i)) 
be given by 

*[»<*)] = 4* (T) (26) 

Thus theic exists a definite positive symmetnc matrix \ such that 
M \ -1 <0 

Then 

3 M 1' -H <M) 

is salished with 11 - 1 ~ l • 

lemma 7 The system (21) is quadratically d stabilizable if and 
only if there exists W = VV' > 0 such that 

(» 0)|[f + ADW(f + Sf)’ - <0 • 


P Ph 
KP KPh + ,>1 


For the converse see [IS] 

Proof of Huorcm 4 Fiom lemma 7 the necessity is derived 
m a similai way as lor Theorem 3 We provt only the sufficiency 
Suppose that the condition of theorem is salished and let 

H = [4 4- A 1 + (B + A/i )S S, J ] 

S,[4 4 A l 4(JJ 4 AB )S S, ’] - S, 

Dcvtloping this expression leads to 

H = ( 4 4 A 4 ) S, ( 4 4 A \ ) - S, 

4(0 4 AD )S,( 1 4 A 4 )' 

4(4 4 A 4 )Sj(B 4 AD ) 

4 (D 4 AD )S'Sr'SjD 4 AD ) 

13ut by definite posmviiy of S we have 

S. - s A, 1 S, > « 

and then 

Will o,[(* B+ / B ) 

(;■ :.)(* v “v n )'-(? ;,)]c.) 

01 

tt<(l 0){[/-4 A?]V[/4 A*]'- S}(1 0)' 

By Lemma S we have 

H <(1 0)[F(S 1 - », ’/-)/>) '/■' - A4»)/3>DV](1 0)' 

with by (23) 

//i - r% sf; > 0 

Because of the structure of Bj and by (23) it is now easy to see 
that the condition above can be written as follows 

H<( 1 0)[F(A 1 -«; '/>£>) 'J'-S 

■" D^D_f*Df 4 0)* < 0 
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and it follows that the system 

f| (T)] _ [( 4 4 A4) 4 (II + AU)S S, 'I (/) 

is quadraticdlly d stabilizable lor all uncertainty in the uncertainty 
domain By Lemma 6 the system 

!>[.(/)] - [( 14 A4)4(04 AD)S S, '].(/) 

is quadratically tl stabilizable ovci the uncertainty domain □ 

V Nlmiricai Lxamplls 

A C orUinuous System / If)/ Let the continuous time uncertain 
system 

i{t) - ( 1 + D1 1 )i(t) + Du(t) (27) 


/ 0S2 17 70 00 21 \ /—01 21 

4 = In 17 -0 7") 11 10 ] 73-1 0 

\ i) o -250 ) \>>o 

The uncertainty slructuie is described by 


/) — 0 2 r > J / [0 10 0 10 1 00) / = / €= SH 

We wanl to place the poles in a disk centered at n — -3 and with a 
ladius i - 4 5 This problem was tieated in [4J lor convex bounded 
unceitamtits The Q and U matrices are chosen to be 

p " °\ 

q - o i 0 n - i 

\" o 1/ 

The algorithm is initialized with f = 10 After 4 iterations for 
* = 1 25 

( 3 210 1 1 402 —11 SS7\ 

P = 13 4G2 80 288 -05 432 

y—11 887 - 03 452 1 11 423/ 

. 1 1 - 72' PD = 0 205 
The control gam matrix is then given by 

h = [0 0016_> 0 0173 0 0178] 


Tig I shows the closed loop modes tor 200 values of t such that 

-i < r < 1 

A Disinit S>w tm Let the discrete time uncertain system 

!(/ + ])=■ ( 1 4- DTE ) i (/) 4 Bu(t) (2H) 


/-II 50 

1 00 

1 00 \ 


( ] \ 

1 -3 

-0 25 

- \ 00 

n 


V-2 25 

—0 2 > 

0 >0 1 


V 0 / 


I he uncertainty structure is described by 

/o im\ 

D — [ 0 1607 ] I =[0 1041 008 \\ 0 \W)] 
VO 1607 / 


t = 1 6 <n 

I his nominal system has an uncontrollable stable mode al 0 7S Wc* 
tiy to place the poles in the circle centered at n - 0 > with a ladius 
i =.0 3 T or the Q and H matrices we take 

q = i n = i 

The algonthm is initialized with r — 10 Attei 7 iterations < — 
0 0781 and 


/ 50151 2 138 -0 1)1 

r - 2 138 2 ojo i m 

V—0 331 1 300 2 491 


r 1-/7 PD = 0 SI 4 

The control gain matrix is then given by 

A =*[10421 - 1 0133 - 0 9906] 

Fig 2 shows the closed loop modes for 200 values of / such that 
-1 < / s 1 
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VI. Conclusion 

In this paper, pole assignment in a specified disk lor discrete 
or continuous time uncertain systems has been investigated. The 
condition derived is expressed us u parameter dependent discrete 
Riccati equation leading to a simple procedure for the design of the 
state feedback gain matrix. Numerical examples for continuous and 
discrete time systems have been presented showing the efficiency 
of the proposed method. The results of this paper are based on the 
strong assumption of state availability. In practice, for most systems, 
only partial state being available the problem of pole assignment by 
output feedback is of great interest. This problem will be addressed 
in the near future. 
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Modified Output Error 

Identification—Elimination of the SPR Condition 

Amir Betser and Ezra Zeheb 


Abstract —In this note, we present a modified adaptive output error 
algorithm for identification. We refer to both the continuous time and the 
discrete time cases. In the standard algorithm, stability requires that a 
certain transfer function should be Strictly Positive Real (SPR); however, 
in the present algorithm, the SPR condition, which is difficult to satisfy, is 
eliminated. This is achieved by adding a fixed feedback gain. When there 
exists an a priori partial knowledge about the parameters of the plant, we 
provide a simple procedure to design that gain. An illustrative example 
comparing the modified algorithm to the standard one is provided. 

I. Introduction 

A sufficient condition for asymptotic stability of the adaptive 
output error algorithm for identification is that a certain transfer 
function should be a Strictly Positive Real (SPR) rational function 
[IM3]. A drawback of the algorithm is the need to determine the 
SPR function. T his is a very difficult task which has not yet been 
satisfactorily solved. Although the numerator of that function is an 
arbitrary polynomial which is chosen by the designer of the system, 
the denominator of the function must be the denominator of the 
transfer function of the plant, which is either unknown or only 
partially known. 

Several papers give necessary and sufficient conditions to check 
whether a transfer function, with uncertain parameters, is a SPR 
transfer function [4]-[8]. Some papers treated the design problem, 
where there exists an a priori knowledge about the plant, i.e.. it is 
assumed that the denominator a ( s) of the transfer function of the plant 
belongs to a certain set of stable polynomials. In the continuous lime 
case, only sufficient conditions were found for the existence of a fixed 
polynomial (/(*) such that */(.s)/o(.s) is an SPR transfer function, 
where <t(s) is the above-mentioned denominator [4|, |9]-[I1J. As 
mentioned above, the design of such a polynomial //(*) is (for 
degrees higher than 3). difficult and not yet satisfactorily solved even 
when there is an a priori knowledge about the plant. In the discrete 
time case, necessary and sufficient conditions were found for the 
existence of a fixed polynomial r(:“ J ) such that r(*~ 1 )/n( :“ 1 ) is 
a Discrete SPR transfer function where n( ; _l ) is the denominator of 
the transfer function of the plant, which belongs to a certain stable 
set of polynomials |9]. 

In this note, we present a modified output error algorithm, per¬ 
taining to both the continuous time and the discrete time cases. 
The modification includes a fixed gain feedback which, if designed 
properly, eliminates the SPR requirement. 

The modified algorithms are presented in Sections II and III for the 
continuous lime and the discrete time cases, respectively. In Section 
IV, we use the modified algorithms to solve the design problem, 
where the denominator of the plant belongs to the same set of stable 
polynomials mentioned above. In Section V. we give an illustrative 
example where the advantages of the modified algorithm are evident. 
Section VI is the conclusion. 

Manuscript received January. 1993; revised March, 1994. This work was 
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The authors are with the Department of Electrical Engineering, 
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11 MouiFibD Algorithm--Continuous Timt Casf 
We stait by following the derivation described in [21 of the output 
ior algorithm Consider the identification problem of a single input 
igle output, linear time invariant system described by the transfer 
mction 


_ m-) _ e: u b 

«(•>) •>" + E,_i 


m < it 


(1) 


iiuiote the input and rhe output of the system by nit) and i/(/), 
n spectively We assume that n{ s) is a stable polynomial (i e all the 
a ros ol r/( s) are in the open left halt plane) Letr,(s) = s /ryb), 
- 1 1 it where <j{ s) = s' + J] , <j s' is a stable 

irburary polynomial The input-output lelationship tan be expressed 
as tollows 

</(/( = c,! U)Hu (2) 


where 

On {1) ~ [f u 1 t\ -l i i/ - / tf\ O) 


= \b n h a i - <j\ a - 0 ] (4) 

and 1 u T ti are the outputs ol the stable filters f (s) with the inputs 
i/(h #/(/) lespectively At this point we depait from the standard 
output eiror algorithm We dchne oui output error as usually done by 

• it) - (lit) tfit) (^) 

However our //(/) dilleis Ironi the standard dchmlion We add a 
feedback gain as tollows 

,(/) = , ' (/)»(/) + /. (O / 0 (6) 

when. 

(M [/ u 1 </ i i V - / v] (7) 


0' (M = [/|) b ill - Ij\ (I - <1 ] (M 

I I - 0 1 m and if ~f/ r 12 n are the estimated 
values Notice that in the siandard algorithm i/(/) - t>‘ (t)H(t) le 
I n 

I he adaplative algonthm is 




I ig | (a) Fsiunaltd pirainclus vusus sampling instants miHlihul algo 

nihm discrtk iinu cast (b) Output snoi vusus sampling insianis— mndiliul 
algoiiihni disc rtic tunc cast 


H{t) - rt(t)n(t) ( > 0 ( 9 ) 

We will now state and prove a theorem, on which our modified output 
eiror identification algorithm is based and justified 

Ihioum 1 Iheie exists a scalar I* * 0 such that foi all 

/ > /* the modified output error algorithm described by (2)-(9) 
is asymptotically stable when the input «(/) is bounded le all 
the signals m the system are bounded and him x'(0 = h 
It in addition the input u{t) is persistently excited (PF) then 
lim* ^ H{t) — H {) exponentially last 
To prove Theoiein I, we need the following lemma which appears 

m 1121 

Lemma l l ct (){ w) be a rational lunclion in the complex variable 
s with real coefficients 

= s u <i /o 

E _ 0 •' - 

having a positive leading coefficient (, /d , and satisfying 

Q( s)^0 in R Re[n] > 0 (10) 

Also, if Q(s) vanishes at mhmty the multiplicity ol the zero is not 
greater than 1, i c, m > n - 1 


Let I be a positive finite numbei which is suiflucnil) laige Then 
the rational function F(s) defined by 

i /. \^ mi 


1 (s) - 


l + /y(-) 


is strictly positive ical (SPR) 

Ptooj of I/uniem / For the case I - f) it was shown in |2) that 

.(f) = -//( s)(o 7 0) (12) 


lli') = 

rt(s) 


H = H u -0 (14) 

A similar derivation which will be omitted foi the sake of bievity 
reveals that for the case of //( t ) as in (6) with \ 0 (H) should 

be replaced by 

__ nsi 


Hi s) = 


ri( s) + /r/( s) 




m 
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It is well known [IJ [2] that if //(*) is an SPR function, then the 
system is asymptotically stable If, in addition, the input u(t) is Pb, 
then the parameters converge exponentially to their correct values 
i e , hnif — 0 exponentially fast 
Define now ()(*) = </{*)/<*(«) which satisfies the conditions 
of Lemma 1 then, (15) becomes //(*) = Q(*)/l + IQ{*>) and 
according to Lemma J we can find / > 0 sufficiently large such that 
IT(a) is SPR □ 


Proof of fheotem 2 A similar derivation to the one earned out 
in 121, for the case l = 0, yields tor the proposed case of y{ A), as in 
(21) with / > 0 the following expression 


')(<>'(*-1 )«(*)) 


( 28 ) 


H( l ) = 


J 

H / 


i + j+/ !£ i <lf 


(29) 


III MoPIFILU AlCiORHIIM—- Discreif Timl CASr 
Wc consider a plant desenbed by 

</(/« ) ~ ^ "■ 0 + - j ) (16) 

1 y-0 

=r<>l{k 1)0. (17) 

where 

r>!) (/ — 1 ) = [«/(A ) tt(k - in) t/(k - l) 4 /( A — II)] (IK) 


where 

0 (A) = 0(A) -0o (50) 

It is well known | i]-|51 that if II ( 1 ) is an SPR function then the 
system is asymptotically stable If in addition the input u(k) is PE 
then the parameters conveige exponentially to their correct values 
le I mu , 0(A) “ 0 exponentially fast 
Obviously there exists a sufficiently large / such that 1 /I + 
/ JT , \u | <. 1 and according to Lemma 2 H{ J ) in (29) is 
SPR foi such an / □ 


0/ = \l>\ h -ff, - 1 / ] (19) 

The plant is assumed to be stable 1 c all the zeros of n{ 1 ) = I 4- 
53 , n 1 arc in the open unit disk The output error is 

< (A ) = tj(t ) - i/(A ) (20) 

As 111 the continuous time case the standard t/(l j is modihed by 
using feedback in the system s realization 
i/(A) is defined as 

" (A) - - iwm+moi > > (i (2 d 

where 

o r (/ - 1) = [a(k ) */(/ — /!/) i/(A — 1) i/(A - 11 )] (22) 

«(/)'=[/)„ I, -,/ ] (21) 

1> 1 — 0 1 in and a i - 12 n are the estimated values 
Notict that although (21) is algebraically equivalent to 

< /(A) .'(A -I )«{k)-l((k) (24) 

which is the foim we actually desire lor our stability analysis (24) 
is not directly implemcntablc whertas (21) is readily implementablc 
The adaptation algonthm is 

= 77 >U)'#(A) v(A + D] (2S| 

1 -f tn(k )' n(k ) 

We state and prove Ihcoicm 2 which is the discrete time analog of 
Them cm 1 

lhtonm2 There exists a scalar /* > 0 such that for all / > 
I* the modified output erroi algorithm described by (16) (25) 
is asymptotically stable when the input «(/ ) is bounded le all 
the signals in tltt sysiem uie bounded and lirm X <(A) = 0 
If in addition ihe input u{l ) is persislently excited (PE) then 
firm x 0(A) “ 0o exponentially fast 
To prove Theorem 2 we need the following lemma which appears 
m mi 

Lemma 2 Tht discrete transfer function 

Q{ ’’-rfr 1 ;,,— <“• 

is SPR if 


IV Thi Dfsic.n Procfdijrl 

We assume now that we have some a puon knowledge about the 
denominaloi of the plant We will point oul how this knowledge 
allows us to design the feedback gain I which eliminates the SPR 
requuemcnl 

In the continuous time case we assume that *( s) the denominaloi 
of the plant belongs to the sU of inteival stable polynomials A 1 e 

(/( s) = S 4 - y^ fi s ( 51 ) 


(i E [t± a ] / 1 2 n (52) 


and a a 1 = \ 2 11 arc given leal numbers satisfying <1 N 

n i— \ 2 n 1 he hist step of the design procedure is to choose 
an arbitrary stable polynomial (/(s) of ckgrcL n As discussed in [ 121 
il 


/ ^ 



(55) 


then the function 77(s) 111 (15) is SPR and the adaptive system 
is asymptotically stable Now let the four Kharitonov polynomials 
which are associated with the set A 114] be denoted by a (s) 

* = 12 \ 1 It has been proved in 115) that 



Iheieforc any feedback gain / which satisfies 

'=•- ,{ Re [^ tJ} 

guarantees the asymptotic stability of the adaptive system 

The right hand side of (5*5) can leadily be computed since it 
consists of four distinct fixed coefficients functions of a single 
variable « 

I 11 a more general case where the denominator of the plant n[s) 
belongs to a convex polylope of stable polynomials it is evident that 
the coefficients can be bounded by a box and the procedure carried 
out as in the interval case 

The design procedure in the discrete time case is carried out in a 
similar way and will be omitted for the sake of brevity 


Ei" 1 < 1 


(27) 
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Htmatk In [IT], 1161, and [I7|, a cnicnon was given tor the 
smplotic stability of the output cnor algorithm ([11] was geneial 
d in H81) This criterion srates that foi a large enough adaptation 
un or input signal magnitude the system is asymptotically stable 
the present note, there ate no restrictions on the adaptation gain 
i on the input signal magnitude 

V Simui AriON Rrst lts 

In the discrete^time case, we investigate the example in (19|, 
note that it is only the example which is being used and not the 
technique pioposcd in [19] The boundedness con|eclute in [19] was 
disproved in |20] The plant is a second-older one with two unknown 
paiamcters, a i n described by 

</(/) = -fli i/(k - J) - a i/(l - 2) 4 ti(l ) 

= 1 0iy(A - 1 )-()br/(A -2) + r/(A) 06) 

[lie tiue parameter vector is - [—ni -a ] [l 0 —0 h] The 
tianslei tunction of the cnor system in the original algorithm is 
H 1 1 ) 1/1 - 1 U 1 + (J S which is not an SPR tiansfei 

function It can be shown that fot I ^ I 4 the modified H ( 1 ) 
is SPR I oi convenience wc chose / i Now // ( 1 ) 

0 2 r )/l - I) [ '+02 is an SPR transler function 

In |19| it was lound (by tnal and erioi) that it the adaptation gam 
f 0 (M) 1773 then the output erioi algonthm is not stable Wc* 
simulate this example with < 0 001 I ig 1(a) contains the plots 

ol the estimated paiametcrs versus lime P(l ) I ig 1(b) contains the 
plot ol the output trior versus time • ( / ) This hguie shows that 
contiadiclory to the original algorithm in the modified algorithm 
the paiametcrs converge to their correct values and the output error 
approaches aio 

VI Conch sion 

The SPR condition is a md)or drawback ol the output eiror 
algonthm because the denominator of the SPK transfer function 
has to be the denominator of the unknown plant In the disci etc 
lime case it is possible to overcome this condition while m the 
continuous time case only sufficient conditions wuc found to sahsly 
the SPR condition In the piesenl nott it was shown that by a slight 
modification of the output eiroi algorithm we eliminate the SPR 
condition both in the continuous and discrete time cases An a pi ion 
knowledge about the denominaloi ol Ihc plant allows us to design 
a feedback gain which guaiantecs the asymptotic stability oi the 
algorithm The design algonthm is very simple and by simulation 
wc demonstrate the advantages of the modified algonthm over the 
original one 
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Comments on “Explicit Asymmetric Bounds for Robust and 
Stability of Continuous and Discrete-Time Systems” p _ (£■/ p4 + 4' pe ,)/2 > = 1 

Jian Xiao , 

/„ = r! pe,12 11 = 1 w 


Abstract —Hits note comments the results of a recent paper, 1 we point 
out that Theorem 2 h incorrectly stated A correct version of the theorem 
is provided- 

In the recent note, 1 (he authors gave sufficient conditions for the 
robust stability of systems with parameter uncertainly Since both 
the signs and the ranges of the uncertain parameters are taken into 
consideration the proposed critenons are less conservative than the 
previous results 

In this note, we identity a minor error in lheir Theorem 2 In this 
theorem the authors pointed out, for discrete time system 

i(A + 1) = ( 4 + F)i(A) (1) 

with r ~ i A,f is asymptotically stable if 

A 

J 


P is the solution of equation 

i' i'A-p + n = o 

Since F n is generally not <1 Hermilian matrix Lemma 2 1 is nol 
valid here and the theorem is generally not true To give a correct 
statement wc first define two kind ot matrices 

r, =(F‘PI , + L‘,rE )/2 I J = 1 WI<J 

t. = r‘ pe /2 / = i iii 

note that both / , and 6 are Hermilian matrices and 

i i 


where 


\, r , v (/’ ) lor A > 0 
A,mi(F) for A <0 


I 


\.hin(F J for A Kj £ l) 

\,». (f ) for A A ; < 0 


( 2 ) 


We can give the conect version of this Iheoicm as lollows 
7 htotem I I he system in (1) is asymptotically stable if 

\ + £**„ +£** f 'i 


wheie \ is defined by (2) and 


Manuscript lettived September 6 1994 

The author is with the Dcpjilmcnt of l kctrical fneineumg Soulhwcsi 
Jiaotong University Chengdu 610011 Sichuan P R China 
II fch t og Number 940700S 

1 / (vao and P I Anlsakhs /hH Ttan\ Automat Court vol pp 
112 IIS 199^ 


9 “A,, x (6 ) 


in 


(\ „(/ j) lot A A ^ 0 
\\ (/ ,) tor / A, 0 


w i ) 


0018 92K6/9SV)4<X1 & I99S JfcH 



M rRANS ACTIONS ON AUTOMATIC CONTROL VOL 41) NO 2 FEBRUARY 1991 


Scanning the Issue* 


adaptive Control of Plants with Unknown Hystereses, 

/ to and Kokohn it 

I his paper studies the problem of adaptively controlling systems 
uith unknown hystcrcses Hysteresis phenomena caused by suction, 
nucnctisin 01 gears with backlash often severely limit system 
performance and give rise to inaccuiacy or oscillations even leading 
in instability Since hysteiesis characleiislics aie generally nondiffer 
I ntiablc nonlmeanlies the existing results of adaptive contiol theory 
which deal primarily with linear or differentiable nonlineai systems 
ue not applicable to systems with hysteresis The authors develop a 
mw method to parameterize the unknown variables of a hysteresis 
model and develop a hysteiesis inverse I hey present an adaptive 
method lor a known linear plant following the h\steresis as well as 
foi an unknown plant An adaptive hysteresis invcise cascaded with 
the plant is employed to cancel the effects ol hysteresis so that the 
remaining part ol the controller can retain us linear structure I he 
analytical results piove the global boundedness of the closed loop 
signals for wide class ol hysteresis models 

Robust Stability under A Class of Nonlinear Parametric 
Perturbations, F u Dasgupta and BlondeI 

In verifying robust stability in lineai systems dependent on real 
valued uncertain parameters there is a fundamental trade oil between 
llic generality nl functional dependence on the unknown parameters 
and the complexity of the tests lot verifying stability Ihcrefore a 
kc) engineering challenge in these problems is to identify structures 
ol uncertainly that lacilitate tractable analysis while capluiing pracli 
cally important cases This papei s contribution tails in this category 
Tor a transfer function description of finite dimensional linear time 
invariant systems the authors derive tiactablc necessary and sufliuenl 
conditions for robust stability when given a particular structure 
of nonlineai paiamelei dependence in the associated characteristic 
function Ihe authors motivate their structure as one that might 
commonly arise horn a cascade of parameter dependent single 
input single output plants with a hxed contioller If each ol these 
uncertain plants has a numerator and denominatoi that aie powers 
ol polynomials having affine dependence on mdepcndenl parameters 
and the paiamelers themselves are restricted to a hypei rectangle in 
U the stiucture considered in this paper results 

I he necessary and sufficient conditions for robust Hurwitz stability 
on this structure are derived using the concept of the frequency 
dependent value set foi the characteristic function of interest In 
particular the authors identify simple one dimensional subsets ol the 
parameter set that must he tested to ensure robust stability ovei the 
entire parameter set These subsets are the edges of the parameter 
set hyper rectangle and certain internal line segments that may be 
frequency dependent Moreover the paper shows special cases of this 
uncertainty foi which these internal line segments remain indcpendenl 
of frequency To facilitate construction of algomhms to test these 
geometric conditions the authors go on to identify a complex-valued 
function of frequency with the pioperty that its image avoids the 
negative real axis if and only if the original uncertain system is 
robustly Hurwitz stable 


*Thi\ section is written hy the Tiansaclions Lditorial Board 


Discrete-Time Observers for Singularly Perturb* d 
Continuous-Time Systems, Shouse and ta\lot 

This paper deals with constructing observers toi minimi u 
continuous time singularly perturbed systems The ohservei design 
concept adopted here is based on inversion of state to measurement 
maps The two lime scale nature of the systems is exploited b> 
decomposing the problem into one of designing separate observe 1 !s 
lor the approximate slow and fast subsystem models The resulting 
observers are of reduced order and aie also implemented in a muhnau 
fashion As a result significant savings in computational and/or 
memory requirements are obtained Also the stiffness inherent in a 
full ordei observer design is eliminated 

lhe development and analysis of stabilizing obseiver based ton 
trol algorithms is not attempted in this paper Nevertheless the use 
of observer b ised feedback is utilized in a case study ol velocity 
control of a two phase permanent magnet synthionous moloi using 
only phase current measurements It is demonsttaled via simulations, 
that the reduced order observer designs proposed m this paper result 
in good perlormance 

Adaptive Rack-Pressure Congestion Control Rased on 
Local Information, Tawmlas 

A new distributed control policy foi a multiclass multistage queue 
ing systems that models virtual circuit and datagram communication 
m tworks and a class of manulactunng systems is presented in this 
paper 1 he policy aims at controlling the congestion in lhe network hy 
dynamically allocating the servers to the butlers icgulatinp the flows 
in the network and iouting the served traffic Ihe author proposes a 
policy called Adaptive Back Picssuie (ABP) congestion polic) that 
performs all these control functions in a disliihuicd fashion based on 
local information For arrival pioccsscs with bounded bursliness 
the authoi shows that the ABP policy stabilizes the network in the 
sense that it ensuies bounded backlog in the network as long as the 
utilization of each server is less than its service capacity An adaptive 
version ol the ABP polic) is also discussed 

Stability of Queueing Networks and Scheduling Policies, 

Kumai and Me\n 

l he objective ol this paper is to establish stability results loi 
queueing networks that involve scheduling servers among several 
competing queues Given that solutions to the stability problem 
(namely by determining a steady state probability distribution) aie 
rare outside the special class of queueing networks possessing a 
product form solution the authors propose and develop a program 
matte procedure for obtaining sufficient conditions for the stability 
of queueing networks operating under scheduling policies 1 his 
procedure uses lineai progiammmg to programmatically construct 
an appropriate quadiatn Lyapunov funclion from which the stability 
results are deduced This method can provide stability results that 
could be very difficult to obtain analytically and it is applied very 
successfully to sevetal example problems in the paper 

Second-order Properties of Families of Discrete-Event 
Systems, Rajan and Aqianal 

This paper considers a class of timed discrete-event systems 
termed synchronous discrete event systems (SDFS) Any such system 



198 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL VOL 40 NO 2 PEBRUARY 1995 


can be described directly in terms of a language/score space, as 
opposed to a state evolution mechanism SDES provide a unified 
framework for modeling a large class of Petri nets and queueing 
systems including tandem, cycle, fork-join, and spht-merge networks 
with general blocking and starvation The objective of the paper is 
to study the concavity of the event counting process of SDES In 
particular, the paper presents a sufficient condition on score spaces 
that ensures that the event counting process of one system dominates 


the event counting processes of a collection of systems This result 
may be applied to conflict-free Petn nets to deduce the concavity of 
the firing process as function of the initial marking or to the above- 
mentioned families of queueing networks to deduce concavity of 
the throughpul as a function of various parameters such as buffer 
configuration, initial job configuration, and blocking parameters 
Overall, the authors present a new and interesting perspective that 
covers and extends earlier results 
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Editorial 

The 1994 George S. Axelby 
Outstanding Paper Award 

(Covering the Period January 1992-December 1993) 


T HE George S. Axelby Outstanding Paper Award is given 
each year by the Control Systems Society to the authors 
of outstanding papers published in the IEEE Transactions 
on Automatic Control during the two calendar years pre¬ 
ceding the year of the Award. At most three papers can be 
selected each year. Papers are judged on their originality, their 
potential impact on the foundations of control, their importance 
and practical significance in applications, and on their clarity. 

Nominations for the Award were solicited this year through 
announcements in the IEEE Control Systems Magazine 
and the Transactions on Automatic Control The Axelby 
Award Committee makes use of selected reviews and panel 
evaluations to evaluate and rank those papers which are 
nominated. As a result of this process, the following paper 
was selected this ycai to leceive this prestigious Award: 

Sheng-Lucn Chung, Sl6phanc La fortune, and Feng Lin, 
"Limiled lookahead policies m supervisory control of 
discrete-event systems," vol. 37, no. 12, pp. 1921-1935, 
Dec. 1992. 

Prof. Chung is with the Electrical Engineering Technology 
Department of the National Taiwan Institute of Technology, 
Taipei, Republic of China. Prof. Lafortune is with the De¬ 
partment of Electrical Engineering and Computer Science, 
University of Michigan, Ann Arbor, Michigan, 48109 USA. 
Prof. Lin is with the Department of Electrical and Com¬ 
puter Engineering, Wayne Stale University, Detroit, Michigan, 
48202 USA. 

The authors received iheir Award during the Luncheon 
'Ward Ceremonies of the 33rd IEEE Conference on Decision 
and Control, at Lake Buena Vista, Florida, USA on December 
15, 1994. The Award consists of a certificate and plaque and 
is named for George S. Axelby, founding editor of the IEEE 
Transactions on Automatic Control. 

The following "Scanning the Issue" paragraph, writlen by 
Associate Editor C. G. Cassandras, is reprinted here again as 
a service to the reader: 

One approach toward controlling discrete-event systems 
involves the design of “supervisors” whose purpose 
it is, for example, to ensure that the system reaches 
certain desirable stale sets or avoids undesirable ones. 


The design of supervisors is based on finite automata 
models both for the system to be controlled and for 
the process that describes its desired behavior. For 
complex systems, however, this task becomes extremely 
difficult for a number of reasons; for instance, one cannot 
overcome the exponential growth of the state space that 
accompanies complexity. The authors introduce an on¬ 
line scheme based on the idea of projecting the behavior 
of the system for N steps into the future after every 
event occurrence and then selecting a control action 
based on this limited information. This gives rise to 
various "limited lookahead policies’’ (LLP) resulting 
in an LLP supervisor. The authors also introduce the 
notion of an "attitude" regarding the control action 
selected, and they analyze a conservative and an op¬ 
timistic attitude, for which they derive a number of 
properties. Lower bounds on the number of steps N 
defining the lookahead horizon are provided such that 
the LLP scheme’s performance is at lcasi as good as 
an off-line scheme with complete information. Finally, 
a simplified subway system is used as an example to 
illustrate the applicability of LLP supervisory control 
and related implementation issues. 

Nominations for the 1995 George S. Axelby Outstanding 
Paper Award, covering the period January 1993-Deccmber 

1994, are already open. Any reader who wishes to nominate 
a paper is urged to do so by the closing dale of May 15, 

1995. No nomination form is required. A letter identifying the 
paper by title, author, and issue of publication and including a 
description of the contribution of the paper should be submit¬ 
ted. Alternatively, an electronic message carrying the above 
information would be sufficient. Please send nominations, by 
May 15, 1995, to 

Michael K. Sain 

Freimann Professor of Electrical Engineering 

University of Notre Dame 

Notre Dame, IN 46556 USA 

FAX: (219) 631-4393 

email: jordan@mcdugorje.ec.nd.edu 

Michael K. Sain 

1994 Axelby Award Chairman 
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Abstract— For a system with hysteresis, we present a pa¬ 
rameterized hysteresis model and develop a hysteresis inverse. 
We then design adaptive controllers with an adaptive hysteresis 
inverse Tor plants with unknown hystereses. A new adaptive 
controller structure is introduced which is capable of achieving a 
linear parameterization and a linear error model in the presence 
of a hysteresis nonlinearity. A robust adaptive law is used to 
update the controller parameters and hysteresis inverse param¬ 
eters, which ensures the global boundedness of the closed-loop 
signals for a wide class of hysteresis models. Simulations show 
that the use of the adaptive hysteresis inverse leads to major 
improvements of system performance. 

I. Introduction 

H YSTERESIS phenomena caused by sliction, magnetism, 
or gears with blacklash commonly exist in physical sys¬ 
tems [ 1 ]-|6), and often severely limit system performance such 
as giving rise to undesirable inaccuracy or oscillations, even 
leading to instability. Hysteresis characteristics are generally 
nondifferentiable nonlinearities and usually unknown. Most 
results of adaptive control theory are for linear or differentiable 
nonlinear systems, and are not applicable to systems with 
nondifferentiable nonlinearilies. 

It is a challenging task of major practical interest to develop 
adaptive control schemes for systems with unknown hysterc- 
ses. In this paper, we will pursue this task. The controlled plant 
consists of a linear part preceded by a hysteresis characteristic, 
that is, the hysteresis is present at the actuator of a linear 
part. The linear part carl be cither known or unknown, and 
the hysteresis is assumed to be unknown. The objective is 
to achieve stabilization and tracking in the presence of the 
unknown hysteresis. To solve this problem, we will use the 
"‘adaptive inverse” approach which was proposed in |7|-[9| for 
adaptive control of plants with nondifferentiable nonlinearilies 
such as dead-zone and backlash. In [7], an adaptive dead-zone 
inverse control scheme was proposed for systems having an 
unknown dead zone at the input, using full state measurement. 
In [8], adaptive dead-zone inverse control designs for systems 
with unknown dead zones were developed, using only the 
output measurements. In |9|, the adaptive inverse approach 
is used for adaptive control of systems were designed: one 
for systems with an unknown backlash and a known linear 
part, and the other for both the backlash and the linear 

Manuscript received March 2, 1992; revised December 2, 1993. Recom¬ 
mended by Pasl Associate Editor, A. Arnposuuhis this work was supported 
by the National Science Foundation under Grunts bCS-9203491 and hCS- 
9307545, by the Air Force Office ol Scientific Research under (Irani F-49620- 
92-J-0495, and by a Ford Motor Company grant. 

0. Tao is wilh the Department of Electrical Engineering, University of 
Virginia. Charlottesville, VA 22903 USA. 

P. V. Kokotov k is with the Department ol Electrical and Computer 
Engineering. University ol California, Santa Barbara, CA 93106 USA. 

IEEE Log Numbei 9407562 


part unknown. In this paper, we will employ an adaptive 
hysteresis inverse cascaded with the plant to cancel the effects 
of hysteresis so that the remaining part of the controller can 
retain its linear structure. 

The paper is organized as follows. In Section II, we present 
a hysteresis model characterized by a set of parameters, and 
formulate the control problem. In Section III, we present a 
mathematical model of a hysteresis inverse and its implemen¬ 
tation. In Section IV, we develop a parameterization for an 
estimate of the hysteresis inverse. In Section V, we design an 
adaptive controller for plants with a known linear part and an 
unknown hysteresis. In Section VI, we present a solution to the 
more general adaptive control problem in which both the linear 
part and hysteresis are unknown. In Section VII, we present 
simulation results for the proposed adaptive control schemes. 

Models of backlash, electromagnetic, and other types of hys¬ 
teresis can be found in [1|-[6J. However, a general hysteresis 
model would not be convenient because of its complexity. We 
will use a simplified hysteresis model which captures most 
of the hysteresis characteristics and is useful for parameter 
adaptive control. We will show that our hysteresis model has 
a parametcrizablc right inverse which cancels Ihe effect of 
the hysteresis when cascaded with the hysteresis. An adaptive 
hysteresis inverse is implemented with parameters updated 
on-line by adaptive laws. 

When the adaptive hysteresis inverse is used for control, 
the effect of the hysteresis may not be completely cancelled, 
and there is a control error we express in two parts: first, a 
parameterizable pail, and second, an unknown but bounded 
part. 

For plants with an unknown linear part, the usual linear 
controller structure is modified in order to achieve a linear 
parameterization and a linear error model, which are both 
crucial for the development of robust adaptive laws. As a result 
of the linear parameterization of the adaptive hysteresis inverse 
and the new controller structure, robust adaptive laws can now 
be used to guarantee the closed-loop signal boundedness for a 
wide class of hysteresis characteristics. Our simulations show 
that major improvements of system performance have been 
achieved by the adaptive hysteresis inverse. 

II. Plant and Control Objective 

The plant to be controlled has a linear part and a hysteresis 
characteristic //(■) as its input: 

y{l) = G{H)[u]{t), u(t) = H(v(l)) (2.1) 

where (?(.s) = k p (Z(#)/P(s)), Z(s), P(s) are monic poly¬ 
nomials, k p is a constant scalar, y(t) is the measured output, 

v{t) is the control input, and //(■) = H(m t < r*,7?4, tw^r- 
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Fig. I. Hysteresis model. 


r- n 7 / 4 . f/; ■) is parameterized by constants m t , c t , 7 / 4 ,, m r . 
c r , 774 , r;j and described by two half-lines (see Fig. 1): 



u(t) - m t v(t.) + c t 

^ ct + mici 

, V(t) > Vi = 

774 - 714 

(2.2) 


n(t) - rni,v(t) + n,. 

Cb F m r (’r 

, 7/(4) < 7/9 =- 

774 . — rni 

(2.3) 

and 

two line segments: 



u(l) 

— - c r ), v -2 

F 

li 

V 

VI 

(2.4) 


u(t)-riu(v(t) o/), 


<-b + mici 


v 4 < v(t) < Vi 


where v\ , i/ 2 , fa, 7/4 are the values of 7 /( 4 ) at the four opposite 
“comers” of the quadrilateral. 

Along the segments, the lime derivatives of u(t), v(t) are 
of constant sign, namely, u(f) > 0, 7/(4) > 0 for u(l) = 
m,,. (u(4) —< r ), and u(I ) < 0, v (4) < 0 for u(t) = — 

The hysteresis phenomena occur inside the loop formed by 
the half-lines (2-2)—(2.3) and the segments (2.4)-(2.5). Inside 
the hysteresis loop, the relationship between 7 /( 4 ) and ?;(4) is 


u(t) ^ 


f m t v(l) F r,/(4 
\m h v(t) + c.„(f 


(/.) for />(/) < 0 
(4) for 7/(4) > 0 


where c f /(4) E (q, cj ), c„(4) E ( co - (-b) are piecewise constant 
functions which depend on the point where v(t) changes its 
sign and on the past trajectories of (v/(4). 7/(4)), with 
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The relationship ( 2 . 6 ) holds also for a part of one of 
the half-lines: when m t > 774 ,, on the hall-line ( 2 . 2 ) with 


Vi < 7 /( 4 ) < 7 / 3 , 71 ( 4 ) = m t v(i) F c t for v(t) 0: . 
rn t < m fc , on the half-line (2.3) with i.\, < ,cj) 
u(t) = mi, 1 /( 4 ) F Cb for F(0 > 0. 

The signs of (/) and 7 /( 4 ) are not restricted on other parts 
of these two half-lines: <i(4) = m,v(/.) f r ( . r(/) , 

7/(4) = 7747/(0 F ('b ■ 7/(4) < 7 / 4 ; and 7/(4) r ni(V[ \ r n , 

Vi < 1/(4) < 1/3 when m t < 7/4 or 11 ( 4 ) = m ft r{i) 

7/4 < //(4) < 1/2 when m t > 7 / 4 ,. For example, the half-line 
7 i( 4 ) = m(,v(4.) F 74 is bidirectional when 774 > m, ( . 

The motion of u(t) and i/(4) on the half-lines (2,2) ( 2 . 3 ) 
and the segments (2.4H2.5) and inside the hysteresis loop 
can be mathematically described as 

f 7747 /( 4 ) if 7 /( 4 ) > 7 / 3 , 

or if v ,4 < 7 /( 4 ) < v .j. v(t) < 0, 

7/(4) ^ 7 / 1 /(7/(4) - ci) and 

m(4) 4- 7/4i/(/) F /:/>, 

or if 7/4 < //(4) < 7 / 3 , 7 /( 4 ) < 0, 

7 /( 4 ) = 7747 /( 4 ) F <‘b and 714 < m/, 
or if 1/4 < i/(4) < 7/3. 7 /( 4 ) > 0, 

7/(4) =■- 7747 /( 4 ) F ct and 7/4 < mi, 
miji'(t) if /j( 4 ) < 7/4, 

or if 7/4 < i/(4) < 7 / 3 , 7 /( 4 ) > 0, 

7/(4) - { 7 /( 4 ) ^ 77 i r ('/*(/) - r,.) and (2.9) 

m(4) / 7747 /( 4 ) F r,>. 

or if 1/4 < r( 4 ) < 7/3, 7/(4) > 0 , 

71(4) = 7747/(4) F /•/ and 774 > 774, 
or if 7/4 <-;//(/) < 7/3, 7/(4) < 0 . 
i/.( 4 ) -■ 774/1/(4) F ci, and rn, > 7/4, 
m r v(t.) if 74 < //( 4 ) < 7/3, 7 /( 4 ) > 0 and 
7/(4) = 7/4.(7/( 4 ) - r: f .) 

7747 /( 4 ) if 7/4 < i/(4) < 1/3, i/(4) < 0 and 
u(4) = 774 ( 7 /( 4 ) - c/) 

0 if 1/(4) = 0. 

The model of the hysteresis and its two typical minor loops 
are shown in Fig. 1. This model captures most of the hysteresis 
phenomena described by more elaborate models in [5]. e.g., it 
is an extended approximation of the ferromagnetic hysteresis 
model with 7/4 — m r = 00, m t — 774 , - 0 [ 6 |. When 
774 ~ n/./,, the major hysteresis is the out-loop hysteresis, and 
the inner loops degenerate into line segments. When ra t - my,, 
774 - 774 , Cf - -c b > 0 , c r = -ci > 0 . the hysteresis model 
( 2 . 2 )—( 2 . 8 ) becomes the approximation model of | 6 | for the 
ferromagnetic hysteresis. 

The assumptions about the hysteresis characteristic //(•) are 

1) the hysteresis output 7 /( 4 ) is not available for measure¬ 
ment; 

2 ) nit > 0 , nib > 0 , rn y > max{ 774 , 774 ,), nn > 
max{ 774 , m./,}, c t > c b , c r > r/. and the values of these 
hysteresis parameters are unknown. 

The assumptions for the linear part C(/) of the plant are 

3 ) G(,s) is minimum phase; 

4) the relative degree n* of G(,s) is known; 

5) the degree n of P(s) is known; 

6 ) the sign of k p is known. 

The control objective is to design a feedback control i/(4) so 
that all closed-loop signals arc bounded and the plant output 
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y(t) tracks the output y m (t) of the reference model: 

y m (0 -■ W r m (#)[rJ(<) (2.JO) 

where W ni (s) is a stable rational transfer function of relative 
degree n*, and r(t) is a bounded piecewise continuous signal. 
Without loss of generality, we take W nt {$) — P~ [ (s). where 
P m (s ) is a monic Hurwitz polynomial of degree n*. 

A linear model reference controller structure could be used 
to achieve the stated control objective of the hysteresis were 
absent, that is, u(t) = v(t). In the presence of hysteresis, a 
linear controller alone cannot achieve the control objective. 
We propose to use an adaptive hysteresis inverse developed in 
the next sections to cancel the effect of hysteresis. 

HI. Hysteresis Inverse 

Let u d (1) be a control signal to be designed. As the 
inverse of the hysteresis (2.2M2.6), we use the hysteresis-like 
characteristic IJI(-): 



Fig 2 Hysteresis inverse 


v(t) = ///(MO) 

= t‘f, iri),, c b . in,, c>, mi , q; MO) (3.1) 

which is defined by the half-lines (see Fig. 2): 

,a 1 / /A t /A ^ + Cl) 

v{t) = -(MO - Cl), u,l(l) > H| - 

nit mi — nil 

(3.2) 

i,\ 1 i i,\ \ / m,lnit,i, "f f/j) 

"(0 = —(M0 - ci,), u,i(i) < «2 = — 

nib »i, - mi, 


(3.3) 

4 <>) 

(3.4) 

. /.\ 1 /a . niilnibci + (%) 

<>(0 = -«</(0 + O,-- ha < null) < iii- 

nii irii - mi, 

(3.5) 


and the line segments: 

«(0 = — MO + C,, 112 < II,ill) < I'M = 

Ulf 


Along the segments, the time derivatives of v(t), u<i(t) arc 
of constant sign, namely, i/</(/) > 0, #»(/) > 0 for u(0 - 
(l/rn t )ii<i(/) + <v, and !>,#(/) < 0, u(0 < 0 for W0 = 
0/nn)u d (t) + c f . 

Inside the loop lormed by the half-lines (3.2H3.3) and the 
segments (3.4H3.5), the output of the hysteresis inverse v(t) 
is defined as 


m _ j ^(MO 0/(0) for M0 < 0 

l^('id(0 Cull)) for M0 > 0 


(3.6) 


where M0 e (<*. a), r r u (0 E (<‘ 2 , a,) are piecewise constant 
functions which depend on the point where M0 changes its 
sign and on the past trajectories of (W0< w«f(0)i with r i and 
eg defined in (2.7) and (2.8). 

The relationship (3.6) holds also for a part of one of the 
half-lines of the hysteresis inverse HI( )\ when m t > m h , on 
the common part of the half-line (3.2) and the hysteresis loop, 
W0 = (l/ra,)(M0 - ct) for ii d (t) < 0; when rn t < m b , 
on the common part of the half-line (3.3) and the hysteresis 
loop, W0 = (l/wf,)(u (i (0-Cf > ) for Ud(t) > 0. M0 and W0 


are allowed to have both positive and negative signs on other 
parts ot these two half-lines. 

The model of the hysteresis inverse and its iwo typical minor 
loops are shown in Fig. 2. 

The motion of u d [i) and W0 on the hall-lines (3.2)—(3.3) 
and the segments (3.4)—(3 5) and inside the loop is mathemat¬ 
ically described as 


/’(0 


L 

h 


M0 


^M0 

l 


IT,7*i(0 

dbMO 

o 


ii a,iii) > u s , 

or if rrj < ii, ilt) < ii ii, ill) < 0 . 

"(0 ^ ,,77 M0 + < i and 
"(/) / v!„ (M0 -<b)- 

oril i/j < u,ill) <. iii. M0 < *)■ 
»’(0 = -*—(«,/(/) - t;,) and nii < m h 
or if »i < M0 < n !• a(lU) > 1*. 
"(0 - ^(MO - f/)andm, < m ( , 

il M0 < fit, 

or it < M0 < " i- 11 dll) **■ 

"(0 ^ 7,77 M0 + 'V and 
"(0 1 ,t^(M 0 - ff). 
or if 1 < M0 < fij. M 0 > 0 , 

"(0 - ^(MO - If)andm, > rt/ ( , 

orif 1/4 < M0 < "i- M0 < 0. 
i’ll) - ^(MO ~ il,) and in, > m,, 
if f/i < ujlt) < it t, n,ill) > (land 
"(0 = 777 MO + O 
if U 4 < M0 < u\, < Oand 


W0 - 7 ^ M0 + O 
if M0 = °- 


(3.7) 


The proposed hysteresis inverse has the following proper¬ 
ties: 

Proposition 3 / (Hysteresis Inverse): The characteristic 
HI(-) defined by (3.2)-(3.6) is the right inverse of the 
characteristic //(■) defined by (2.2M2.6) in the sense that 


H(HI(u d (t „))) = u fi (to) * i/(///(M0)) = ii«i(0i 

V/ > fa (3.8) 


for any piecewise continuous M0 and any > 0. 



TAO AND KOKOTOVlC PLANTS WITH UNKNOWN HYSTf-RfSES 



fig 'll Adaptive hysknxis invcixe 

The mapping (3 2)~(3 6) may fail lo define a hysteresis 
inverse only if //,/(/) is such that i (/) and //(/) ncvci leave 
the hysteresis loop, that is, t {) in (3 8) cannot be reached Such 
a situation happens when i/ r y(/) is inside the hysictesis inverse 
loop and u(t) ^ </,/(/) initially and the motion of //</(/) is 
such that //(/) never leaves the hysteresis loop to correct the 
error between u t i{() and u(l) Howevei, as u,i(f) is the design 
signal at oui uisposal, an lnitiah/alion of the hysteresis inverse 
by an appropnatc choice of i/,y(/ () ) can always make /(/) 
and u(t) leave the inside of the hysteresis loop at / t) so that 
</(/„) =. u y(/ 0 ) and then Irom Proposition 3 1 u(t) = 11 , 1 ( 1 ) 
for an> / > /0 

In the adaptive contiol problem the hysteresis paiameiers 
aic unknown so the exact hystcicsis inverse (3 2)-(3 6) is 
not implementable We propose to use an adaptive hysteresis 
inverse whose parameters arc updated accoiding to adaptive 
laws 

Let rtif C t m\, i~i, in', 0 m) 0 be the estimates of the 
unknown hysteresis paiameteis m t n mi <i m, ( > m 1 
(i An estimated hysteresis inveise III( ) is defined as 

n(n=-ffl(«A 0 ) 

- HI(rn t r t m\ 0, iTT, mi f/ n i(l)) (3 9) 

Fig 3 shows its characteristic with two typical minor loops, 
as well as two segments indicating two possible motion of 

v(t) and at point S Now, 0/(f) e (0(0 0(0) 
0(0 F (0(0 0(0) depend on the point (u,j(1) 1 (/)) 
where u,i(t) changes the sign, and on the past trajectones 
of the adaptive hysteresis inverse, where 0(0 ^nd 0(0 dre 
calculated from (2 5) and (2 6), but with the estimates of the 
hysteicsis parameters 

F 01 a fixed hysteresis inverse, the estimates m t f, rTT?, 
<\ frT, C, mi u are constants based on certain a pnon 
knowledge of mt 0 , wi, <b n; r , r r , mi <1 For an adaptive 
hysteresis inverse, m t = rrTy(f), O = 0(0* Mb — wi(0 
U, = 0(0, w T = m,(/) f 7 = 0(0 fni = nT,(f) 0 = 0(0 
are time-varying signals generated by an adaptive update law 


To ensure that (3 2)-0 b) are implementable \u , , 

7) positive constants m, u r 

constants cm (bt, 1 1 2 , <iu (i, f 12 arc known such thu / 

< rn f2 , wim < m* < m l2 max {rri r2 m ( , j 1 , 
rub*} < f«/o < nn andfn ^ f/ < 1, 

(11 < <1 S <h S <1 < ' i2 
Even when not stated explicitly, we will use pioj^ti 
lo ensure that the estimated hysteresis paiameiers saiisly itu 
above inequalities 

A numerical scheme lor implementing the estimated !ns 
tciesis inverse (3 8) is developed next 
At time / = if, we have the knowledge of m t (l) t t (f) 
m{t) 0(0 wT, (/) 0(0 w/l0 0(f' «/(/) />(/) lot t - 
/* 1 and the knowledge oi f/,/(f) foi / = With the help 
of Fig 3 where S' - (uj(f k 2 ) v(t k 0) \sc introduce the 
quantities u n < u t u //,/, < ?/,// 

WjfrTT/iO+O I (|) 

'Mr "a/ =■•*-.- -c (3 10) 

mi nn 

0/ - UdUk-i) ~ mt"(U,-'i) 0. ~ O-my/KO 1 ) 

(3 11) 

.. . It «,/(/* 

it l) < U,l(lk) 


^ o-a ii « rf (/* 1! 

a,, 1 = < . "ii. »* 

r Ilf in 

Then o(t k ) is delenmned as 


' t (U-\ ) *1 Wr/(0 ) = "tl(U 1 ) 

<~i) 1 ln,i(h) ± "it 

"i 1 

~ <l) it llil(fk) *• H II 

HI/ 

k) H- f T it II II > ",l(lk ) 

III 

- itj( 1 k) + <i if u<11 <- >‘i(U) < »11 

"I, 

-C ( 11 , 1(1 k) - ',/) it < 1,11 < "iVk) ' «, 1 , *»nd 

111 

<1,1(1 k 1 ) I'lAU ) 

-^(u,t(tk) - < „) If 11,11 < Ii,i(h ) < III, and 

in 

m(U 1 ) < ui(h) 

(3 13) 

The implementation is dynamic because it uses not only 
u ( i(U) but also iid(U-]) and n(l k 1) ll is important that 
this implementation does not need the knowledge of uj(t) 

In our adaptive control schemes, the adaptive hystcicsis 
inverse will be use j in cascade with the plant with a hysteresis, 
and the signal w f /(/) will be generated from a linear controller 
structure 


1V PARAMETFRI7AT10N 

In the absence of the hysteresis, a linear controllei gener¬ 
ating il(I) can be used to achieve the asymptotic tracking of 
<An( 0 by i/(0 When the hysteresis is present, o(f) is the acces¬ 
sible control which, by our design, is the output of the adaptive 
hysteresis inverse with input tid(l) A parameterization ol the 
control erroi u(f) - i/,y(0 will help us to develop suitable 
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controller structures for generating Ud(t) and adaptive laws 
for updating the estimates #Hi(0, MO, m b (t ), cj,(t ), m r (t), 
c r (t), m)(0> ci(t) to implement an adaptive hysteresis inverse 
To proceed, we define the following indicator functions 


fi it MO, u(t)) e {MO, «(<)) | u(t) 

Xt(t) = < = rn t u(t) + c t } (4 l) 

(o otherwise 

f 1 it(v(<), «(0) € {MO, u(0) I 

X*(0 = < = m b v(t) + Cfc} (4 2) 

[ 0 otherwise 

'1 if(i/(f), u(0) e {(u(f), u(t)) I u(t) 

Xr(0={ - m r (v(t) - c r )} (4 3) 

L 0 otherwise 

'i if MO, «(#)) e {MO, «(/)) | u(f) 

Yi(0 = ( = mi MO - (/)} (44) 

; 0 otherwise 

r l if(v(f), u(f)) € {MO, u(0) | Ii(t) 

MO = { = m t v(t) + c d (t)} (4 5) 

L 0 otherwise 

'i if(u(0 ./(0) e {MO, «(0)M0 

MO = { = rn b i'(t) + r„(0} (4 6) 

L 0 otherwise 


r i it (MO v(0) e {(M0M0) I "(0 
=^(M0-^(0)} (47) 

^0 otherwise 

'i if(M0. »»(0) e {(MO "(0) I «(0 
= ^(M0-M0)} (48) 

k 0 otherwise 

l if(M0> v(t)) e {(MO, n(t)) | v(t) 

^)={ = ^(M0 + ™,(00(0)} (49) 

k0 otherwise 

l if (MO. u (0) g {(M0> w(0) I f(0 

= ( = rMtj( Ud (0 + ™H00(0)} (4 l0) 

J) otherwise 

l if (MO, v(0) € {(MO, "(0) I «(0 

M0 = { = ~ 

no < mo < c,( o} 

0 otherwise 

(4 11) 


Xu{0 — 


«t(M0> «(0) € {(M0, u (0) I r(0 
= -Jrr(MO-MO). 

m fc(0 

0,(0 < MO < 0(f)} 


[0 otherwise 

(4 12) 

In defining these indicator functions, we do not repeatedly 
count any intersection of the half-lines and the segments, e g , 
at the top right comer of HI( ) we define x t (t) = 1 and 
5f7(/) = 0, and when it goes down passing the fi(t) point, we 
define Xb(l) = 1 and Xd(t) = 0 With this constraint, only 
one of these functions is nonzero at any given time t 


M0 ■+ M0 + M0 + Xl( 0 + M0 + M0 = 1 (4 13) 


a* 2 (0 = M0 MOMO - o 

/ # I i ) k £ {t I) i l d /v} (4 14) 

All the equalities in (4 l)-(4 12) arc understood in the sense 
of (2 2)-(2 5) and the adaptive veision ol (3 2)-(3 *>), that is, 
inequalities apply whenever they are feasible We note that 
the condition 7) is necessary to ensure a well defined adaptive 
hysteresis inverse with the properties (4 13) and (4 14) 

Using (2 2)-(2 6) (4 l)-(4 6), (4 13), we express n(t) as 

“(0 =Xi{l)(rn t v(l) + (t) + \i(t)(m,v(1) + ti) 

+ MOM.MO -',)) + MOMiMO -i/)) 

+ M0MM0 + M0) + M0MM0 + M0) 

=M0 + Xt( 1 )(ni,t (!) + (,- x,(t)u,i(l)) 

+ M0MM0 + <i - xt (0M0) 

+ MOM. M0 - (,) - M0M0) 
f MOMiMO - <0 - */(0M0) 

+ M0MM0 + M0 - M0M0) 

+ MOMMO + MO - M0M0) + M0 

(4 15) 

where 

<*i(0 = (^f(0 ” Xr(0)(^r J K0 + f /) 

+ (\fc(0 - vT(0)( w 6^(0 + 

+ (\r(f) - 3fT(0)M.("(f) -'.)) 

+ (\i(f) - r/(f))('«i( , "(f) - o)) 

+ (A'.i(f) - Xd(t))(m t v(t) + £rf(/)) 

+ (xu(f) - M0)MM0 + f«(f)) (416) 

Since a projection based on the condition 7) ensures that 
all estimated hysteresis parameters arc bounded, the adaptive 
hysteresis inverse loop is bounded If v(t) is large such 
that (u(f) Urf(f)) is outside the hysteresis inverse loop and 
(u(/), ^(0) I s outside the hysteresis loop, then all \(t)'s and 
*(f)\ are zero except for *r(0 (or x/(0) and ^(0 (or £i (/)), 
so that rfi(/) = 0 from (4 16) This implies that di(t) is 
bounded whenever the estimated hysteresis parameters satisfy 
7), which is crucial for our designs 
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Using (3.2)—(3.6), (4.7H4.12), we express v(t) as 



4(0)) 


+ xt.(0( 


- 0.(0)^ 
i 

! 

+ M') 

v(i) (,w(() 

4- 

4(0) 

4-£)((•)| 

— ,.A u d(l) 4- 7/1.)(/.)f| 
,7/l.)(() 

i(0) 

+ £i(0( 


- MO) 


4- Xu(() 


-M0) 



In view of (4.14), from (4.17), we obtain 

MO'MO = MOOMOOO + 0(0) (4.18) 

XiM)'Ud(t) = Xi,V)('m(t)v(t) + ci(t)) (4.19) 


Proposition 4.1: The unparamcterizablc pari 
control error u{t) - MO is bounded lor any / > u .! 
reduces to zero when the hysteresis parameter emu 
zero. 

The signals *,(<), Xr(0. Xi(0, \,/('). and \ 

which describe the motion of the hysteresis outpui v.i ) 
H(v(t)) [see the first expression for «,(/) in (4.15)1, an not 
available for measurement. We choose to parameterize the 
control error u(f.) - n,i(i) in terms of the measured signals 
MO- MO, Y>(0- xiV), MO, and MO [see the definition 
(4.26) for u/, (/)] which describejthe motion of the adaptive 
hysteresis inverse output i »(/.) = HI(u,i(1)) [see (4.17)J. 

The parameterization (4.27) can be made simpler for some 
special hysteresis characteristics. 

Hysteresis with Equal Slopes m, = m h : The knowledge 
that in, = m.\, can be used to choose m'i(l) m t (l .), 
MO — MO- In this case, we define 

01 - (m», <■,. r-b-. »/',-, rn r c,., mi, mic-i) 7 (4.29) 

MO = (»M0- 0(0, <5(0, "MO- 

7r^r,.((), mi(l). inji'i(t)) 1 (4.30) 


Xr(t)u,l(t) = Xr(0( 7 «r(0'"(0 “ 7/l r (/)M0) (4.20) 

Yf (O'MO = Xi(t-)(fni(t- >(/) - mi(l)e,(l)) (4.21) 


M0M0 = Y.i(0(™i(0<'(0 4 MO) l 4 - 22 ) 

MO'MO = MOOMOOO + MO). (4-23) 

Introducing nTA'AJ) = m, (t)c r (t), injr,(t) — mi(t)'ci(t), and 

MO = (MO- 0(0. 'MO- 0,(0. <n)-(0- 

7/mv( 0- mj(0> intn(t)) 1 (4-24) 

0/* = (m t , ft, 7/1),, /■/,, /Or, 7/lrCr, Hlj, TO)/:/)”, 

Mt) = Oh(t) ~ "I ( 4 - 25 ) 

MO = (~(,v/(0 + M0M0- -y#(0. 

- (MO + M0M0. -3(0. -SUMO- 
MO. -3UM0. yi(0) t 

(4.26) 

from (4.15), (4.18)—(4.26), we have 

7/,(/) = a f /(f) 4- </>/| (0^(0 + r//i(/) (4.27) 


where 

' 4 (^) = rfi (0 4 - Xt /( 0( r: </(0 “ rj </(^)) 

(4.28) 

Thus, we have expressed the control error u,(0 - as 
a sum of a parameterizable part and an unknown disturbance. 
The disturbance ri/, (/) has the following properties. 


w),(0 = (-(3(0 + MO + MOM*'•), -3(0. -3(0. 

-30M0- 3(0. -3UM0. 0(0) r (4.31) 


where 



, 0 otherwise. 

(4.32) 


Symmetric Hysteresis: When the hysteresis is symmetric, 
mt = m/j, m r ~ m/, r# = —cj, > 0, r r = -r-j > 0, we define 


= (//i*, (>. m r , m r r r ) 7 (4.33) 

M0 = (™t(0, 0(0- m;(0> riP-A*-)) 1 (4.34) 

<M0 = (-(3(0 + 3(0 + 3(0M0- -(;?/(/) - 3(0). 
- (3(0 + Xi(4))w(0, 3(0 - 3(0/ (4.35) 


where \>(0 is also defined by (4.32), but with o,(/) — -o(/). 
In both cases, (4.27) and Proposition 4.1 hold. 


V. Adaptive Hysteresis Inverse for G(s) Known 

In this section, we design an adaptive control scheme tor 
the plant (2.1) with a known linear part. 

We use the following linear controller to generate u ( i[i ): 

u„(t) = 0,* T wi(t) 4- OfvAt) + P m v(i) + M(0- (5.1) 

There are two designs for 0*, 0J, O-j,,. and 0* and MO 
and u> 2 (/). 

I) For a(s) = (1, #. ■■ ■, # t, “ 2 ) t , A(«) being any monic 
Hurwitz polynomial of degree n - 1 

WiW = xS W(a w * (0 = (5,2) 
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#5 = k~ l , and 0*, fljjj 6 R' l ~ L , 4o G FL satisfying the 
following Diophantinc equation 

*I r a(*)J’(") + (0* 2 T a(s) + 6T i0 \(*))k p Z{») 

- A(a)(P(a) - k p 0;Z(,)P m (s)). (5.3) 

II) For u(.*) = (1, . 4 , ■■ • ,* T,_1 ) r . A(«) being any monic 
Hurwitz polynomial of degree h, u^(/) and u > 2 (/) still have the 
form (5.2), and 0j = fc” 1 . 0*, 0J e f? rt , flJo — 0 satisfying 
(5.3). 

Then the signal is applied to the adaptive hysteresis 
inverse (3.8) to generate o(t) as the control input to the plant 
(2.1) which has a hysteresis at input. 

To implement an adaptive hysteresis inverse, we can use 
the adaptive version of rhe continuous-time hysteresis inverse 
(3,7) or the discrete-time version (3.13). For the controller 
II, both the continuous-time and discrete-time adaptive hys¬ 
teresis inverses can be implemented because, from (5.1) with 
#20 " (K the derivative of iid(t), which is needed for 

implementing (3.7), is also available (assuming that i(l) is 
bounded). For the controller 11, one can only use Lhc discrete- 
time inverse (3.13) since, in this case, u (l (t) depends on 
which is not measured. For the discrete-time inverse, there is 
no need to assume the boundedness of ?■(/). 

The parameter matching equation (5.3) can be used to derive 

«>> 

+«;„,'/(') + <w;'(.')[//](o. (5.4i 

Using (2.9), (4.27), (5.1), (5.4), introducing <(/) = //(/) - 
Um{ 0i *X- S ) = 0* i,(a)(I - 0j 7 («(' s )/A(* s )))^ we obtain 

the tracking error equation 


Lemma 5.1: The adaptive law (5.8) guarantees 

1 ) 4(0. 4(0- (4(0/1 + OT(00.(0 + 4(0) 6 


2) for some constants A*i > 0, k 2 > 0, and all t 2 > 1 1 > 0, 

k 2 


^||4(/)|| 2 ^<4+| ,J y 


+ OT(00,(0 + 4(0 


di (5.10) 


4(0 


-di < ki 


/, i + gT(0 G,(0 + 4(0 

0 -' b 

+ .lu I + C,T (00.' 


4 (00,(0+o,(o 


Jt. (5.11) 


Proof With (he substitution of (5.5) and (5.6) in (5.7), 
the estimation error r /, (/) becomes 


= <I>I (00,(0 + d(l), d(t) = F(s)[d h ](t). (5.12) 


Using (5.8) and (5.12), wc express the time derivative of the 
positive-detinite function 1/, (</>/,) - J </;/,. along the 

trajectories of (5.8), as 


V; < — . 

*4i + Of (00.(0+ 0 2 (0) 

, _ «/ 2 (0_ 

2(1+ o: (oo ( (o+000) 

rr(4. Affc, <r o )«4(04(0- (5.13) 


Since r/(/) is hounded, it follo ws from ( 5 . 13 ) that 0 /,(/) 6 , 

and MO/^I + G 2 (00,(0 + 4(0). 4(0 f 

The definition of the signal rr(/) — rr(0/ M A/),, rr„) implies 


00 = 7'(.x)[^%,](/) + F(m)[(I,,](I). (5.5) 

We note that F(s) is a stable, strictly proper, and known 
transfer function. 

Introducing 


^(OII4(OII 2 lir„|| 2 £ ^(0114(011(114(011 - 114II) 

<k' 0 n{1)d>l(t)6 h (/) (5.14) 

for some constant > 0. Then, using (5.13) and 


o,(o = 4 (oo,(o-^(0(4"/,](0, 

0,(0 - F(s)[ui,]{t) (5.6) 


114(Oil 2 <2 


lin,iiio,(/)i|Q^(/) 

(i + 4(oo,(0 + 4(0) 2 


o,(0 = <-(0 + 4(0 (5.7) 


+||r,,||V(4. Af h ,ir„)||4(0|| 2 


(5.15) 


we use the following adaptive law to update 0/,(/) 


4(0 -- 


_ I\0,(0o,(0 

i+0f(00, (0 + 4(0 

-r^(4, Mi ,, <t 0 )4(0 


(5.8) 


where I'/, - T' > 0. and a is a “switching-sigma” signal 
[ 10] using a prion knowledge of an upper bound A//, on the 
Euclidean norm ||0jj|| of and a design parameter a 0 > 0: 


"(4. A4, fro) = 


if 114(Oil < A/fc 
if Mi, < ||<M0|| < 2 M h 

114(011 > 2Af fc . 


This adaptive law has the following properties 


(5.9) 


we have part (2) of the lemma. V 

Lemma 5.1 only shows the boundedness of 0),(t) and the 
L 2 properties (5.10)—(5.II) of the adaptive law (5.8). While 
the signal boundedness of the adaptive control system with a 
general hysteresis is still under investigation, we present the 
stability results for some special cases. 

Theorem 5.1: When the hysteresis fi(-) in (2.1) has two 
equal slopes, m t = mh and (4.30), (4.31) are used for adap¬ 
tation and implementation of the adaptive hysteresis inverse, 
all closed-loop signals are bounded. 

The proof for Theorem 5.1 is given in Appendix A. 

This boundedness result holds for the symmetric hysteresis 
case where (4.34), (4.35) are used for adaptation and imple¬ 
mentation of the adaptive hysteresis inverse with a reduced 
system order. 
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VI. Adaptive Hysteresis Inverse for G(*) Unknown 

The problem of adaptively controlling the plant (2.1) with 
an unknown hysteresis //(■) as well as an unknown linear 
part G(s) challenges us with the task of estimating two sets of 
parameters: one from the hysteresis inverse, and the other from 
a linear controller structure. In this case, the linear controller 
structure (5.1) results in a nonlinear parameterization which is 
not suitable for parameter estimation. 

Facing this difficulty, we need to modify the linear structure 
(5.1). Using (4.13), (4.18)—(4-23), (4.25), we express the 
hysteresis inverse (3.2)—(3.6) as 

MO = -ft*? MO + y a(0M0 + v7i(0M0- (6.1) 

Hence, the term 0* 7 ui(t) in (5.1) becomes 

+0* 7 + \ u<-u]{i)- (6-2) 

A(s-) 

Introducing new regressors and parameters as 

= o; =-h; o; (f>.3) 

*',(!) = +• -0-0,](0- ■- 0* (6.4) 

where <y denotes the Kronccker product, .4(.s) - (/,,, sI p , * ■ ■. 
, s"“‘ , /| l ) T , is the p x p identity matrix with p being the 
dimension of 0J r from (6.2), we obtain 

= + (6.5) 

We should note that the dimension of 0^ and its estimate MO 
is p(i\ - 1) (pn) for the controller I (controller II) ot Section 
V. It is this enlarged parameterization that enables us to obtain 
a linear parameterization of the error system, which is suitable 
for controller adaptation. Physically, the hysteresis has eight 
different regions, so that the feedforward part 0\ T (l)uj\(t) has 
eight different forms. A natural parameterization turns out to 
be the one given in (6.5), with the parameter vector (/*. 

This new parameterization brings us to a new controller 
structure for adaptive control 

MO = ^2 (0M0 + 02o(O//(O + M0 r (0 

+ t)J(t)u> 4 (1) + 07 l (t)w :> (1). (6.6) 

02o(O » 0 in (6.6) if u(,s), A(.s) are defined by the controller 
II of Section V. 

In view of (4.27), (5.4), (6.1)—(6.6), the tracking error 
equation now has the form 

c(1) = P*W m {*)[<l> r tj](l) + d(t) (6.7) 

where p* = = 8 {t) + 8*. <i(t) = r*W m («) 

(J - Of(a(s)/H»)))[d h ]{t), and 

0(t) = 0 2 u(0- *.(*)• (0- 0[U), Bl(i)) T (6.8) 


“(0 = (<*#(()< })(t). »■((). U).f(0, (I), a/ ■/ I - n 

The expression of the tracking emu allows us i< 
adaptive linear contiol theory 1111 ,| 12 | to develop ,u ad ( p U 
law for updating parameters of the adaptive hysieievo 
and adaptive linear controller structure which iruki up , lll} 
adaptive controller for the nonlinear plant ( 2 . 1 ). 

Wc note that 0* o , 0J } . 0 2 o(O< MM, «/#,(/) are ditimd m 
different ways when the different designs ot Sections 1 \ and 
V are used; see (4.24H4.35), and (5.1). 

Starting with tracking error equation (6.7). we can design 
adaptive schemes to update the parameter vectoi 0(f) Next, 
we present two such schemes. 

Adaptive Scheme /. Letting p(t) he the estimate ol p* and 
L(. s) he any Hurwitz polynomial of degree n* - 1 such that 
W T „,(.s)L(.s) is strictly positive real, and introducing 

an = ir\")W]v), 

t,(1) =■ 0 1 - L l (s)\8'*i}{l) (6.11) 

wc define the estimation error <(/) from 

<(0 = 'UH £ 2 )](0 (6.12) 

where n ^ 0 , and update 0(1) and p(t) from 

0(f) - -sign[A*|,]TC(0'i0 - (To)O(t) (6.13) 


(>V) = -Ti(0'(0 “ MMO < 6 ' 14 ) 


where L ~ r T > l). "v > 0 . and rr is the switching signal 
defined by (5.9). 

Wc then have the following properties of this adaptive law. 
Lemma 6.1. The adaptive law (6.13)-(6.14) guarantees 

1) 0(1). p(t), <(t) e /»; 

2 ) for some constants k\ > 0 , k\ 0 . and all t 2 > f j > 0 , 




a 


(nan + C(n 


-ill (6.15) 


f 2 (/)(i + c 7 (0ao + { 2 (0)<// 

f l 2 [ j 

< ki + J tl \ + c 1 (nan+CHn : " 

Proof. We rewrite (6.7) as 

an = yv,aai'(»)[f>*(L- - r 7 l ] (.s)H 

+/>*( i-»:' A ir i am\(n. 


(6.16) 


(6.17) 


Substituting (6.12) in (6.17), we have 
t (l) = W m (-H)L(s)[pU> 1 <+H-<^(C\ ¥C 2 ) ^l](l) (6.18) 
where (/(/) = p *(I - d * 7 (u(.s)/A (h)))L '* (*•)['//,](()• 0(0 = 

an - a■ 

Lxt H(.s) L (.s) have a controllable realization (.!,„, B 1U . 
C m ) and r f (/) be its state variable. With this notation, from 
(6.18), wc obtain 


(0 = A„,<- r (t) + B,„v(t). ((1) = ( ; m<\(n (6.19) 


0* — /fl*'J fl* fl* fl*F q+T \1 

“ — v"2 » ^20’ “li ”4 ’ 1 ) 


(6.9) 



20ft 
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whert 

ho = ffvw) + mm 

-m(o(c T (oc(o+e 2 (o)+«*(0- (6.20) 


Strict positive realness of W m (s)L(ft) implies that there 
exist constant matrices Q = Q r > 0 , F = T 1 > 0, vector 
q , and scalar b > 0 such that Aj n P + PA m = 

PB m =5 Hence, the time derivative of V(^ r , 0. VO = 
cJPCf. + |^*|0 7 T“ V + 7~~ 1 0 2 is 

v r <-«K 2 ( 0 (i+C r (OC (0 + C 2 ( 0 ) 

, An _ 

+ " 2 j+C r (0<(0 + ^(0 

- 2|/i*|ff # (/)^ r (0fl(0 - 2a„(i)mm (6.21) 


for some constants aj > 0, rv -2 > 0. This, together with a 
similar argument to (5.14) and (5.15), proves (1) and (2). V 
Adaptive Scheme II: Instead of a choice of L(s) of degree 
71 * - l, we choose L(s) “ P m (u) = of degree //*, 

to obtain the estimation error as 


m± p( nm 

i+awmo+m) 


( 6 - 22 ) 


where <,'(() and (,( I ) are defined in (6.11) with this new L(s). 
We then use (6.13) and (6.14) with the new f(t) defined in 
(6.22) to update the estimates 0(t) and />(!). 

For this scheme, we have the following result. 

Lemma 6.2• The adaptive (6.13). (6.14), (6.23) guarantees 

1) o(t). 0(1), m, ,)(t), < 2 (/)(i + <y(c T mn+em e 

Lck. i 

2) for some constant k$ > 0. k b > 0, and all I 2 > t\ > 0, 


rU 

/ f*(#)(l+ «((,'(/)<(/)+ €* 

■li 1 


mn+m) 


,n 

(6.23) 


(t)))dl 


Sh+ J„ TT^MmTeW) dl - ,6 ' 241 


Proof: With (6.7), (6.11), and (6.22), we write the 
lime derivative of the positive-definite function l 7 (0, i/;) = 
|(l^*|0 7 V + 7~ V 2 ) where tjj(t) = p{t) -/A along 
(6.13) and (6.14), as 

V(/)<-^ 2 (0(i+o(C 7 (/)C(/) + ^(/))) 

_ <jm 

2(i+n(c T mn+em 

- Mt), <td)<J> t (1)0(1) 

- <j(p, M r , oo)ij’{t)p(t). (6.25) 


From this inequality, (6.13) and (6.14) with a defined in (5.9), 
(1), and (2) follow. v 

Similar to Lemma 5.1, Lemmas 6.1 and 6.2 only show 
the boundedness of 9(i) and the L 2 properties (6.23H6.24) 
of the adaptive law (6.I3M6.14). The signal boundedness 
of the adaptive control system with a general hysteresis is 


still a research topic. However, for a wide class of hysteresis 
characteristics, we have the following stability results. 

Theorem 6.1: If the hysteresis //(•) in (2.1) has two equal 
slopes, rn t — nib , and (4.30), (4.31), (6.9), (6.10) are used 
for adaptation and implementation of the adaptive hysteresis 
inverse and the adaptive controller, then all closed-loop signals 
are bounded. 

The proof for Theorem 6.1 is given in Appendix B. 

This boundedness result applies to the symmetric hysteresis 
case where (4.34), (4.35), (6.9), and (6.10) are used for adap¬ 
tation and implementation ot the adaptive hysteresis inverse 
and the adaptive controller ( 6 . 6 ), with a reduced system order. 

While analytically characterizing the tracking performance 
of the closed-loop system with an adaptive hysteresis inverse 
is an important future research topic, we will examine it by 
simulations and compare it with that of a control system 
without hysteresis inverse. 

VII. Simulation Results 

In this section, we present an example with simulation 
results to illustrate the structure and effectiveness of the 
developed adaptive hysteresis control schemes. 

We consider the plant with an unstable linear part G’(.s) — 
2.5/(.s 2 + 2.s - 4) and u symmetric hysteresis characteristic 
H(m f. r t , rrifc, o,, hi,, c f , nif, <’/:•) where rn t — w\ } — 1 . 8 , 
rn, — on = ,'l, r t - -( ft = 1.9. c, ~ —r/ = 2.5, and choose 
a model transfer function - 1 /(.s * 2 -f .Is + 2 ). 

We study the system performance in one ol the five cases: 
a) a fixed linear controller without hysteresis inverse, for ( 7 (.s) 
known; b) an adaptive linear controller without hysteresis 
inverse, for G'(.s) unknown; c) a fixed linear controller with a 
fixed inaccurate hysteresis inverse, for G(s) known; cl) a fixed 
linear controller with an adaptive hysteresis inverse, for G r (,s) 
known; and e) an adaptive linear controller with an adaptive 
hysteresis inverse, for C!(s) unknown. 

The controller a) has the structure 

+ #;—[:</](() 

+®3otf(0 + ®.V(0 (7.D 

where = -1, tfj = -0.4, 0* () =. - 2 . 8 , 0* - 0.4, which arc 
calculated from (5.3) with (!(s) known. 

The controller b) is the adaptive version of a) for G(. s) 
unknown: 

m = 

+0w(1)y(l) + 0*(t)r(t) (7.2) 

where 0 j(/), 0 2 (t), 02 o(O> arc updated from an 

adaptive law similar to (6.11)—(6.14) with a different 
Q(t) = (0,(f), fl 2 (/). 0 2{) (t)< 0 d (t)) T \ and with (9(0) = 
(-1.5, -0.8, -2, 0.8) 7 , ^(0) = 1.25, and the choice of 
L(s) = h + 2, n = I, (t 0 = 0.15, F = 107, 7 = 10, M# = 
5.53, M p = 3. 

These two controllers, well known in the literature, ignore 
the existence of the hysteresis characteristic in the plant and 
have no compensation for the hysteresis //( ). 
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The controller c) is 

v(t) = HI(u,i(t)), 

HI(-) = Hl(m t , c t , m h , c t , m,., c r , m ( , 7r, ) (7.3) 

+0* 2t Mt) + 0*r(t) (7.4) 

where ///(■) is an accurate estimate of ///(■) defined in 
(3.2M3.6), with m* = m b = 2.3, c t = -«■* - 0.9, m r = 
m/ = 2.5, c T — —ct = 0.5, and 0*, 0J, ftJo* are the same 
as that in the controller a). This controller, with ///(•), has 
a fixed but inaccurate compensation for the hysteresis H(-). 
The control law (7.4) was developed in (5.1). 

The controller d) is with an adaptive hysteresis inverse for 
G(.s) known 

v(t) = iri(U',(t)), /?/(■) = HI(m](t), 5.(0, fi(0, 

Wr(/), <v(0- q(0; ■) (7.5) 

where «,</(£) is generated from the same structure as that 
in the controller c) [see (7.4)|, and HI( ) is an adaptive 
estimate of ///(-), with initial parameter estimates rn*(0) = 
m*(0) = 2.3, r?*(0) = -r? 6 (0) = 0.9, m r (0) = mj(0) = 2.5, 
c r ( 0) = -£}(()) = 0.5. The adaptive law for (J h (t.) — 
(/;?*(/.). Ct(t)< m r (t, ), m7M0) 7 * s (5.8) with <r 0 - 0.15, 
r /f = 107, Af/, = 11. 

The controller (e) is with an adaptive hysteresis inverse for 
G(s) unknown 

v(t) = /7/M/.)). 7/7(0 - Hl(m t (ll c t (1 .), m£(/.), S(t), 

m r (f), cv(f), m}(*), rTf(A); ■) (7.6) 

•M') = + MOwW + MM'-) 

+ (M0, M0, M0, M0)^M(0 

4- MVs(*) (7.7) 

where ///(■) is an adaptive hysteresis inverse with the same 
initial parameter estimates as that in the controller d), and the 
adaptive law for 0 (/.) = ( 0 2 (/), M 0 > M)- #f( 0 , M), K 
(l)) T is ( 6 . 11 H& 14) with the choice of L(») — s + 2 , r* = 1 , 
a 0 = 0.15, r = 107, 7 -- 10, M» = 17.38, M p = 3, 
/>(()) = 1.25, and 0 2 (O) = - 0 . 8 , M0) - - 2 , 6 » 3 ( 0 ) = 0 . 8 , 
0 4 (O) = (3.45, 3.75. 1.35, 0.75) T , 0 5 (O) = -1.5. The control 
law (7.7) was developed in (6.6). 

Our simulations indicate 

1) if either a fixed or adaptive linear controller without a 
hysteresis inverse is applied to this plant, then the tracking 
error remains significant for large /; 

2) an inaccurate fixed hysteresis inverse can reduce the 
tracking error; 

3) significant improvements of system tracking performance 
are achieved with the use of an adaptive hysteresis inverse 
either with a fixed linear controller, for G(s) known, or with 
an adaptive linear controller, for G{s) unknown. 



Fig. 4. System responses: fixed controller without hysteresis inverse lor 
G'(.s) known. 




Fig. .5. System responses: adaptive controller without hysteresis inverse for 
G(.s) unknown. 

Some typical system responses for r(t) — !2.7sin(2.3/) are 
shown in Figs. 4—8 for the above five controllers, respectively. 
It is clear that with an adaptive hysteresis inverse, the tracking 
error is significantly reduced with less control effort (the 
control signal is smoother and smaller). 

VIII. Conclusions 

Our construction of an adaptive hysteresis inverse, repa- 
rametcrization of an adaptive linear controller, and choice 
of suitable error models have led to several adaptive control 
schemes applicable to plants with unknown hyslereses. Sim¬ 
ulation results indicate that these adaptive inverse schemes 
promise to significantly improve system performance. 

Appendix A 
Proof of Theorem 5.1 

From Lemma 5.1 and the condition 7), we have that 0h (t ) € 
7>nc and m r (/) > rn n > 0. Introducing m = rn t = inh, 
rh(t) = m t (t) — raj,(/), and using u(t) = v(t) = 
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ao 



(b) 


ng 6 System responses fixed controller with inaccurate hysteresis inverst 
:>r G{ s) known 




Fig 8 System responses adaptive controller with adaptive hysteresis in 
verse for fr(s) unknown 
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(a) 



i/(ii rf (0) we express 


«(0 =■ «</(0 + rf 2 (/) 

w(0 = /n«(l) + (7 2 (0 


sKi(«) = i-A'(s), A» = 


Using (A 1) and (A 3) in (5 I), we have 

7/1 yiJhT r/(s) . vr , 

ttt(' +«d)[ ](/) 


J A(s) 

+ -^*2o(' + ««)[=](')+ :5^»(/) 


+ 


m(l) 

0 - 


m(0 


»< «(■*) »>( ) 
m(/) 1 A(s) /« 


) k >](0 


(A 6) 


Substituting this expression tor «(/) in (A 5), we see that (/) 
and 2 O (0 are related by 


^1 + ft|(s)^«„ - 


m „. T ti(s)m() \\ r , 

-(s I- ri(j)J J [-o](0 


m( ) A(s) m 


"tg 7 System responses fixed controller with adaptive hysteresis in vet sc 
or Ct(s) known 


- (A(s)G J (s) + A,(s 


w 


r. 


*7 


"(■>) 


+ A'i(M 


»«() 


”'( ) ' A(*) 

'+««.)) w to 


(* f «o) 


(A I) 

(A 2) 


+ A 1 (')f-«>1(0 

L 77/ J 




(A 7) 


oi some bounded dzO) ^(0 We then define fictitious 
ignals -i(0 filters A'](s), A'(s) as 

*o(0 - 73— M(0. -(/) = ~— bm "0 > 11 

s * + «u 

(A3) 


a > 0 (A 4) 


(s + a)" 

\nd use (A.3), (A 4), and (2 1) to obtain 

rn(/) + a ( ,A 1 (s)( 2o ](/)-A' 1 (,)[ w ](/) 

= K(*)(r'(n)[ S ](t) (A 5) 


Consider a linear operator 7 (s /) with input 1 (I) We define 
T(<t f) as a stable and proper operator it \\T(s, )[r](/)|| < 
fh / 0 f t- o(# - T) ||r(T)||rfT + /J 2 ||i(/)|| for some constants /f, > 
0, > 0, and a > 0 all t > 0 and any /(/) The operator 

T(s /) is stable and strictly proper if ||7(s, )[r](/)|| < 
fl\ /J f ~ ft(f " r) ||r(r)||r/T tor some constants fi 1 > 0 and 
n: > 0, all / > 0 and any j(/) With this dehmtion, the 
facts that rri(/), w(t) 6 L A'i(s) is strictly stable, and 


m T q(s)m() 
rn(/) 1 A(s) rn 


(s + tl(|)[2 0 ](0 


rn 




*T 


a(«) 


■ 

m 



m 

s 

— Zq 

— s 

— 

Z{) + f/o — 

. 

rn 


rn 

rn 


(/) (A 8) 
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= At(*)[’[^o 2 ] - s [~^2o]’ i + "o—^o 2 ](0 (A 9) 


imply that (m/m(t))6* T (a(s)/A(H))(m(i)/m)(s + a 0 ) and 
Ai(s)(m/m( ))0J o (s + « 0 ) are stable and proper 
By definition, the impulse response function A \ (/) of A ] (s) 
satisfies / () fx |Ai(/)|r// = n*/a Hence there exists a° > 0 
such that for any finite a > n l] the operator 


To(s 




( 


1 + Ai(i 



rn 

Tll(l) 


Pi 


*1 


•i(f) '»(0 
A(s) m 


(s + a<i) 


)) 


-1 


is stable and proper For a fixed a > a° (A 7) implies that 


s )[c](0 -h r/,(0 (A 10) 


where /i(s /) is a stable and proper operator, and r/j(M is a 
bounded signal due to r(t) d 2 (t) 

As in the swapping lemma | i21 we let (Aj Hf Cf) be a 
minimal realization of i (s) define M (s) = ( j( hJ - A /) 1 
H|(s) = (si - Aj) 1 ttf and express £/,(/) as 


wo-iv (s) 


Wi(h)(h -l «(,) 


1 


S + flu 


]0/l 


(0 (All) 


Using (4 13) and (4 31) we rewrite u>/,(0 ds 


w/(0-(-'O) f (\r(/)+ \|(/))|(0 -XflO -\i(0 

C. (/)«(/) \7(0 -\i(/)<(0 CKO) 7 (A 12) 


whose components except for — < (/) aic all bounded Using 
(A 2) and (A3) wc note 

~[uv,](/) = -( o(0 <> <»' +rf4(0 (AH) 

s -f (i{) m 


where r/j(/) is a bounded vector signal 

Filtering both sides of (*> 7) by l/(s + u ( ,) and using (A 3) 
again, wc obtain 


-(/) - 


i 


+ "0 


lVm](0 + 


1 


*> + «0 


[o,-6,](0 (A 14) 


The inequality |f/,(0| (|</t(0|/^ + (JUKhU) + (f,(0) 

(1 + ||U(0|| + 10/(01) and (A 9)-(A 14) imply 


|-(/)|£io(0W 2 (s )[m/s(s )H0]](0 (A IS) 


are bounded On the other hand, if (/U l 
unbounded, then the smallness of i } (t) m the sl { 
results in the boundedness of z(1) given by (A 1 m h 
turn implies that - o (0 in (A 10) is bounded so u 
(A 6), v d (t) in (A 1), uj h (i) in (4 13) ^(f) m hi 
in (5 7) are bounded Hence we conclude that all tl< s i 
signals are bounded 


Applndix B 
PROor or 7 Hr ori m 6 l 

A Proof fm Adaptixe S< heme I 

Using (A 1), (A 3), (A 11), (A 12), and (6 3) wc express 

)( s + (Hi) 

lor some bounded 0 a (1) E 7?" 1 for the controllei I (0 r ,(/) t 
/f" toi the controller 11) of Section V and some bounded 
r 1^(1) Using -u(0 A](s) A(s) dchned in (A 3) (A4) we 
obtain 


z 0 (t) + doAi(s)[~ (J ](/) - A i(s)[«](/) 

- A(s)f; ‘(si- — [vl(0 (B2) 

•> -I- do 

Using (A 1) (6 6) (B I), we obtain 

+ 777^ f, ' < o.(n+ + ,BV 


From (B 2) and (B 3) it follows that 


’ s H «0 


= (a(s)g V 


i 


+ Ai(s)- 


))[»K0 

S 4 flu /n( ) A( s ) 

0M ))[>/](/) 4 Xi(s)[^®,f](/) 


">() 

+ Ai(s)[rf 2 ](/) (- Ai(s)T —(I 

ITU 


(I) 


(B 4) 


Similar to (A 9), we sec liom (B 4) with a sufficiently large 
a > 0 in A (s) Aj(s) that 


where e Lr* and i\(f) 6 L*. is such that 

[/‘ wl< - T ‘ + [TTT[I^n^W) dl ,A,6) 


for some constants k\ ^>0 k 2 > 0 and any t 2 > l\ > 0 
The operator /) is stable and strictly proper, while the 
operator Tj(s /) is stable and proper and has a nonnegative 
impulse response 

If (,/,(f) (/,(/) bounded* then from Lemma 5 1 and (5 7), 
we see that n,(0 ytf) arc bounded, from (A 3), (A 10) that 
z(i ), ^ 0 (/) aic bounded, and from (A 6), (A 8), and (A 1) 
that u(t) and i v r /(f) are bounded Thus all closed-loop signals 


~o(t) = J,(' )[v](/)4f/|(0 < B<i ) 

for some T\ (s I) stable and proper and some </((/) bounded 
Similar to (A 10) with (At B, ()) being a minimal re 
ah/ation of 1/A(s) and W (w) =- C’Ks7 - If) 1 Wr(s) — 
(s/ - we express ((/) as 

£(/) = (s)[W,(i)[w 7 ]^](/) (B 6) 

Using (All), (6 1), (6 4), (5 2), (B *5), in (6 10), we see that 

w(/)=7 1 (s )[(,](/) + <7K0 (B7) 

for some Ti(s I ) stable and proper, and some rfi(() bounded 
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By rewriting (6 12), we get 

y(t) = l/m (/) + <(')- UrrMH*) 

K^C + ^J + HfO (B 8) 

Similar to (A.15), (6 11), (B 6 )-(B 8 ), and Lemma 6 1 imply 
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Rfffri ncfs 


ltf(Ol<'o(0+f 2 (« )[riTj(s )[||/|]](0 (B9) 

where io(1) e L and / j (/) is such that 

C ,B,0) 


lor some constants k j > 0 £4 > 0 and any t 2 > i 1 > 0 
The operator 7 2 (s /) is stable and strictly proper while the 
operator Ti(s /) is stable and proper and has a nonnegativc 
impulse response From here on the closed-loop signal bound 
edness follows from a contradiction argument which is similar 
to that in the proof of Theorem 5 1 


B Proof fo) Adaptixc Scheme II 
Using (A 1), (6 6 ), (B 1 ), we obtain 

"(9 = + "°)N(0 

+ ^7)^ i(/) XR (,, + " o)Irl(/) 

m 
F 


j ^^ 2 o ( 0 ( s + «»)[-](0 
+ + + (B " 

From (A and (BID it fbllows that 

tij, ( )-i-i(s + «l))^[2(,](/) 

- ^A(k)G'~ 1 (s) -F A 1 (s)^(»3()^ ) ) (s + «o) 
+A 1 (s)~ 0 M ( )(s4«o))[ ](/) 


+ A](s) 

+ Ajb) 


[-#jf](0 f A,(s)[i/,](/) 
§*]<'> 


(B 


where / m{ ))0jo( )(* -F n 0 ) * s stable and prof 

because 

= Al(s) [ s [^ 7 ^°“] ~ '[ vM ** "°^ io '] (0 

(B I 


and rri(/) r//(/) 0 io(/), l? 2 o (0 are all bounded 
The remaining part of the proof is analogous 10 
(A9)~(A 14), followed by a contradiction argument similar 
to that used in the proof of Theorem 5 1 
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Robust Stability Under a Class of 
Nonlinear Parametric Perturbations 

Minyue Fu, Member, IEEE, Soura Dasgupta, Senior Membei , IEEE, and Vincent Blondel 


Abstract —This paper considers the robust stability verification 
of linear time-invariant systems admitting a class of nonlin¬ 
ear parametric perturbations. The general setting is one of 
determining the closed-loop stability of systems whose open- 
loop transfer functions consist of powers, products, and ratios 
of polytopes of polynomials. Apart from this general setting, 
two special cases of independent interest are also considered. 
The first special case concerns uncertainties in the open-loop 
gain and real poles and zeros, while the second special case 
treats uncertainties in the open-loop gain and complex poles and 
zeros. In light of the zero exclusion principle, robust stability 
is equivalent to zero exclusion of the value sets of the system 
characteristic function (a value set consists of the values of 
the characteristic functions at a fixed frequency). The main 
results of this paper are as follows. 1) The value set of the 
characteristic function at each fixed frequency is determined by 
the edges and some frequency-dependent internal line segments. 
2) Consequently, Hurwitz invariance verification simplifies to that 
of checking certain continuous scalar functions for avoidance 
of the negative real axis. 3) For the case of real zero-pole-gain 
variations, the critical lines are all frequency independent, and 
therefore, the determination of the robust stability is even simpler. 
4) For the case of complex zero-pole-gain variations, the critical 
internal lines are shown to be either frequency independent or 
to be confined in certain (two-dimensional) planes or (three- 
dimensional) boxes. 


1 Introduction 


T HE lollowing problem is of interest in the robust stabil¬ 
ity vcnhcation ol lineal time-invanant control systems 
depicted in Fig 1 Suppose we arc given a stability region P 
and a family ol open-loop tiansfer 1 unctions parameterized by 
a leal vectoi 7 


T(V) -{/(s 7) 7 eP} (1 I) 

where t (s 7 ) is the transfer function oi the plant and con¬ 
troller, and T is a connected set in R A Determine as simply 
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Fig 1 Closed-loop uncertain s>slem 


as possible it all members of the family of the coiresponding 
characteristic functions 


H(F) = {/t(s, ->) = 1 + /(s, 7) fc F} (12) 


have all zeros contained entirely in V (1 e , the family is D- 
stablc invariant) Generally, the transfer function coefficients 
depend nonlincarly on 7 

One approach to this problem is to tical it in its bioudest 
generality, as is done in | 1 |, | 2 | where a very broad class 
of //(F) is considered Alternatively, one can considci par- 
ticulai parameteri/ations leflecting specific forms of structural 
information supplied by the modeling process. This allows 
formulation of stability vcnhcation schemes which are compu¬ 
tationally less demanding Examples of this approach include 
[ 31 , which considers a family of polynomials admitting inde¬ 
pendent variations in the coefficients, |4|-[61, which account 
for affinely dependent variations, and [71-19|, which consider 
multilinear dependence (see [10], |111 for surveys) Each of 
[3]-[9], exploits the undci lying structural information and 
demonstrates consequent simplifications 

This paper adopts the second approach by locusing on a 
special class ol nonlineai parametric dependence To keep the 
presentation simple, only Hurwitz invariance is investigated 
(1 e„ V is the open left-half plane), although the results do, in 
fact, generalize to more gcneial stability regions Specifically, 
the family of characteristic functions to be considered in this 
paper admits the following form 


/?(s, 7) = 1 4 r/o(s) 




(i v 


where ryo( s ) and the p, o(s) are real scalar rational functions 
and polynomials in s, respectively, 7 , E 1\ C R N represent 
a partition of 7 . 1 c , 

7 = (7i r . 7J, . 7n f O 4 ) 

and 

r = r, xr 2 x ■ x r„ a.5) 
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jP*(s) arc real vector polynomials with dimension N n arid 
fi, and are fixed positive exponents. The quantities <yo(s), 
PioW, P,(*), ft, Nh /*,, < 7 , and F are assumed known. The 
jth element of 7 , [respectively, F t (s) J is denoted by 7 *, 
(respectively, P tJ {x )]. Since V is an axis parallel box, each 
7 ij varies independently of the others within an interval 

7.3 <7,., <7,J- 0 . 6 ) 

Thus, each factor (p,o(*) + 7 f P,(s)) forms a polytope of 
polynomials as 7 , varies in F t . Notice that a polytope can 
simplify to model an uncertain gain in the open-loop transfer 
function. Furthermore, in the case where the exponents p x 
and /', are restricted to be + 1 , the robust stability verification 
of (1.3) is equivalent to that of a subclass of the multilinear 
problem, i.e., the characteristic polynomial associated with 
(1.3) depends on 7 ,^ in a multilinear fashion. To simplify our 
notation, we rewrite (1.3) as follows: 

11 

/i(.N, 7 ) = 1 -f )A)(' s )fJ(/ , io(' s ') + 7, T 7 '(.s))*‘ (1.7) 

Where h, are allowed to he non/ero integers (either positive 
or negative). 

There are several situations in which the setting of (1.7) 
becomes important. By way of background, we cite the results 
of (4), where the setting considered translates to one involv¬ 
ing an uncertain plant having numerator and denominator 
polynomials lying in poly topic sets. Essentially. |4] asks the 
following question: Given a fixed controller r/o(.s), how can 
one determine if the uncertain closed loop is stable? Thus, 
the problem considered in [4| can be viewed as a subclass of 

(1.7) , with n = 2. k\ = 1 , and k> 2 -1. In many applications 

involving process control, th^e overall plant is itself a cascade 
of several subplants, such that physical uncertain parameters 
of a given subplant enjoy physical independence from those 
of the other subplants. Now, if one models each uncertain 
subplanl individually as lying in sets analogous to those in [41, 
one readily obtains a special case of the structure exhibited in 

(1.7) . 

To further illustrate the scope of (1.7), two more examples 
are considered. The first example is of a plant with independent 
real zero, pole and gain variations and is as follows 

n -1 

//(.s. 7) = I + ^/o(*)f[(* + A,) 4, (1.8) 

1=. 1 

where d and A, are uncertain real parameters lying in given 
bounds. In this case, //o(s) can be a given controller, d the 
gain, and A, the zeros and poles of the plant whenever the 
h\ are, respectively, positive or negative. The uncertainty 
assumes that the A, vary independently within given bounds. 
The objective is to verify if ry 0 (,s) stabilizes the plant for all 
possible parameter variations. Therefore, under the assumption 
of no zero-pole cancellation (which is trivial to check in 
this example), the closed-loop system associated with ( 1 . 8 ) 
is robustly stable if and only if the corresponding H(T) is 
Hurwitz invariant. Note that the terms involving cases where 
A, / ±1 rellcct structural preservation of multiplicities. 


To allow for complex zero and pole variations, one may 
include factors of the forms (.s 2 4 -u,*+&,)*', with a, and h t also 
varying independently in intervals. The version of (1.7) given 
in (1.9) below will be referred to as the complex zero-pole-gain 
variation problem: 

r ri — 1 

fc(«. 7 )=i+</flb(")n^+ A ')*’ n 2 +<ij"+i >,) k '’ (i- 9 ) 

f=l j—r+l 

with 

7 = (^1 ■ ^2 «■ ■ ’ 1 A t , flr+i, »■ ■ ■, 1 < bfj _ 1 , d) 

( 1 . 10 ) 

Throughout this paper, we adopt the following assumptions 
on the function //(.s, 7 ) in (1.7). 

Assumption / .1: The function //(.s, 7 ) has no unstable zero- 
pole cancellation for any 7 £ T. 

Assumption 1.2: Continuous variations ol 7 result in con¬ 
tinuous changes in the zeros of h(s , 7 ). 

Assumption 13: The function //(*, 7 ) has no purely imag¬ 
inary poles for any 7 € 1 \ 

We note that Assumption 1.1 is essential to assure the 
internal stability of the closed-loop system. However, violation 
of this assumption will not cause any difficulty for constructing 
the value set H ( jlj , T). Further, in light of the recenl results of 
[ 201 , the vcrificalion of this assumption is a relatively simple 
matter. 

Assumption 1.2, on the other hand, is a standard one 
thal requires the leading coefficient of the overall numerator 
polynomial in (1.7) not to equal zero within ihe specified 
uncertainty bounds. 

Finally, Assumption 1.3 can be simply tested through a 
series of straightforward algebraic [ 181 techniques. 

A. Approach and Mam Results 

As in [ 8 ], J9|, 112], [I3|, we follow the so-called value set 
approach to robust stability analysis. For the family of rational 
functions ( 1 . 2 ), the value set at a frequency lj is defined as 

H { u >. T ) 7): 7 € T}, (1.11) 

and its boundary will be denoted by 0H(lu\ I 1 ). 

Our approach exploits a slight variation of the zero exclu¬ 
sion principle (see, e.g., [9]). Under assumptions (1.1), (1.2), 
and the fact that the value set changes continuously with a;, 
this principle reduces to the following conditions as being 
necessary and sufficient for the Hurwitz invariance of H(Y): 

1 ) at least one member of //(T) is Hurwitz, and 

2) 0 ^ 0(11 (w, T)). Vo; € ft, where 0 denotes “the 
boundary of.” 

There are several appealing qualities of the value set ap¬ 
proach, First, it provides a unifying framework within which 
most of the currently known robust stability results can be 
understood. Second, it can be used to obtain simple and 
transparent proofs of these results (sec, for example, [131,114), 
J19] for proofs of [31, [4|, 115]). Furthermore, while for certain 
special classes of uncertainties the robust stability of a family 
of polynomials reduces to that of some easily characterizablc 
members of this family, the same does not hold for general 
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parameterizations. For example, when the uncertainty set is 
one of multiaffinely parameterized polynomials, in general 
there are no easily characterizable internal subsets of the 
parameter box T, whose Hurwitzness implies the Hurwitz 
invariance of the entire set [23]. In the same vein, it is shown 
in [16| that in considering the Hurwitz invariance of the 
characteristic polynomials of a polytope of (n x n) matrices, 
one has to check all (2 n - 4)-dimensional boundaries of the 
parameter space, a task which is computationally prohibitive 
even for matrices of reasonably small sizes. In such a case, 
the zero exclusion condition 2 ) above provides a relatively 
simple frequency sweeping procedure for verifying Hurwitz 
invariance. Such a graphical approach has been advocated 
through successful application in |9|, |12|, [13], and provides 
the conditions of [ 8 ], [9], Recently, the so-called finite zero ex¬ 
clusion principle has been developed by Rantzer [21] to avoid 
frequency sweeping, and to thus permit fast compulation. This 
approach too relies on the ready calculation of pertinent value 
sets. Besides their utility for robust stability analysis, value sets 
play an important role in the determination of the irequcncy 
response of a family of transfer functions; see, for example, 
1171- Consequently, they can be used in designing robust 
controllers which meet performance considerations that go 
beyond mere closed-loop stabilization; see | 22 ], for example. 

Accordingly, in this paper, we consider both the determina¬ 
tion of value sets as well as the Hurwitz invariance of (1.3). 
The principle contribution is to show that for the general fam¬ 
ily ( 1 .7), at each fixed frequency u. each member of HIT (a;, I") 
has prcimages in certain line segments in the parameter set 
I\ These critical line segments are simply characterized, 
vary with frequency, are independent of the exponents k t 
[see (l.7)|, and consequently the Hurwitz invariance of the 
family of functions in (1.7) becomes equivalent to checking a 
finite number of continuous and piecewise differentiable scalar 
functions in u for avoidance of the negative real axis. 

For the case of (1,8), we show that the critical segments 
are, in fact, the edges of F plus certain simply construclable, 
frequency-independent, 45-degree line segments in the pa¬ 
rameter space. Further, the Hurwitz invariance of //(F) is 
guaranteed by that of these frequency-independent line seg¬ 
ments (including the edges). 

For the case of (1.9), we show that the critical lines 
determining the value set boundaries are either frequency 
independent or, as frequency varies, vary on certain (two- 
dimensional) planes and certain (three-dimensional) boxes in 
T. To check for robust Hurwitzness, it then suffices to check 
these frequency-independent lines, planes, and boxes. 

It is instructive to compare this paper with the work of [241, 
which considers the set of uncertain polynomials 

Ewfipus) (i.i2) 

where the F t (i s) are fixed polynomials while the P n {») vary 
in independent interval sets. Several points of difference are 
noteworthy. First, the set in (1.12) is broader than ours in 
the sense that it allows the sum of more than two factors. It 
is narrower than (1,7) in that the P tJ (i h) vary in independent 


intervals and have unity power, whereas in (| / „, 

variations with arbitrary powers are allowed n 0H < 
greatest difference lies in the approaches employ d , , 
results derived in this paper and [24|. The hue? n, 
a parameter space as opposed to the value set .\ppi ^ 
Hurwitz invariance verification. Its result stales that i r 
to check the Hurwitz invariance of internal manifolds i« i 
of the dimension 

max {ti,}. (I.Hi 

Thus, even for the cases of (1.8) and ( 1 .9). [24| requires check 
ing Hurwitz invariance over manifolds that have dimensions 
that increase with the number of lactors in ( 1 . 8 ) and ( 1 . 9 ). 

Section II considers the general case. The special cases 
of (1.8) and (1.9) are addressed in Sections 111 and IV, 
respectively. Section V is the conclusion. 

II. Value Set Boundaries 

A major objective of this paper is to achieve the following. 
For a given frequency identify a critical subset F, (u) of F 
having the property that for all 

r € F)) (2.14) 

there exists 7 e T f (u;) such that 

7 ) = r (2.15) 

i.c., every point on the boundary of the value set al this 
frequency has at least one preirnage in F ( (u/). In characterizing 
such a F, (u), we will not attempt to extract the smallest 
possible such set, but will be content with one particular choice 
that enjoys the above properties, and at the same time has a 
relatively simple analytical description. 

To this end, we adopt a somewhat indirect approach. Specif¬ 
ically, given we give necessary conditions on a parameter 
vector 7 such that 

7 ) e F)). (2.16) 

Clearly, parameter vectors satisfying these necessary condi¬ 

tions together suffice to define a l\ (u;) meeting the require¬ 
ments specified above. 

In the sequel, a A-side of the parameter box F will rcler to 
a subset of V in which only A* parameters vary and all others 
are fixed at their extreme values. By the same token, a point 
7 in the interior of a A-side has exactly A-eleinents that do 
not take extreme values. A point on the boundary of a A--side 
lies on this A-side, but not in its interior. If 7 is in the interior 
of a A -side of T. then the elements of 7 not fixed at extreme 
values will be called the variables on this A -sidc. We note, that 
every A-side of F is also a hyperreclanglc, that all its /-sides, 
I < A' are also /-sides of I\ and that corners and edges of F 
are its respective 0 - and 1 -sides. Further, the JV 4 -side of 
F (recall that Lhis sum is the dimension of T) is F itself, 

In characterizing 1 \ (u;). we will assume that all edges of V 
are automatically included in r,.(w). Thereafter, for all 

1 < k < 


(2.17) 
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wc will provide for each ide of T necessary conditions for 
7 in its interior to obey (2.16), 

The main results of this section can be summarized in the 

following way. 

Result I: Not all sides of T need contribute interior points 
to F r (cu). In fact, certain sides are such that, irrespective 
of tu, their interior points need never be included in F r (u>). 
In Section ILA, we give a result which characterizes what 
the sides that potentially contribute interior points to r c (u>) 
are. A feature of this characterization is that, for each side 
which in Section II-A is identified as a potential interior 
point contributor, all the sides of this side are also similar 
contributors. 

Result 2: Having eliminated a vast majority of sides by 
virtue of Result 1 , we restrict attention to an arbitrary side 
Q of F whose interior points have been ascertained by Result 
1 as being potential members of T, (uj). For every such Q, at a 
given frequency u>. we associate a unique, possibly frequency- 
dependent affine line L n (Q) such that a 7 in the interior of 
Q obeys (2.16) only if 7 belongs to this line. Consequently, 
together with the edges of F, the union of the intersections of 
all L n {Q) with the interior of the corresponding contributing 
sides Q comprise T ( (u). The affine line L a {Q) associated with 
a given contributing boundary Q is characterized in Section 
1I-B. 

The results to be presented will be illustrated through the 
following example. 

h( w ) = 1 + [(^ 2 - 2 bi! + (juJ + 1)712] + 7n + 732] 

' JU> ' 1 [(1 -w 2 +jw)72i + jw7 2 2][0.5yw + 7 41 ] 

( 2 . 18 ) 

To proceed with the development, observe first that as far 
as the determination of F c (u;) is concerned, one need only 
consider the transfer function 

n 

«/(«. 7) = n<*0(.) + if /’,(-•'))*■ • (2.19) 

7=1 

For unless yo(juj) in (1.3) is zero, at any frequency uj. 

7 ) € d(G(u), T)) (with G(w, F), obviously defined), 
iff h(ju>, 7 ) €? d(H(uj. T)). Of course, if yn(juj) = 0, then at 
this u>, II (u>, F) collapses to a single point, and any 7 E T, 
including any comer, describes ()(H(u, F)). Thus, in this case, 
F r (w) can be trivially constituted by a solitary comer of F. 

Thus, here onwards, attention will be restricted to the 
sets 7 ), <)(G(lj, T)) instead of h(ju>, 7 ). d(H(v, F)), 
respectively. 

A. Result I 

We have the following theorem. 

Theorem 2.1: For a given uj and every v e 0 (H(lj, T)). 
there exists a 7 obeying (2.15), such that [see (1.4)], for each 
/, at most one 7 ,^ is a variable. Consequently, a boundary Q 
of F need contribute internal points to T r (u;) only if, for each 

at most one 7 ,, is a variable in the interior of Q. 

Thus, for example, in (2.18), any side on which both 711 , 
7 i 2 are variables is not included in The proof of this 

theorem relics on the following lemma, proved in Appendix 


A, which involves boundary determination of the power and 
the product of sets of complex numbers. 

Lemma 2.1: Let D\, Dtr'iDa be bounded and closed 
sets of complex numbers. Define, for integers fc n 

D\ k ' ] := {rif' : <1, 6 A : i = 1, 2, ■ ■ •, a), (2.20) 

and 

a f it 

n£>, (M := l f[,if : d, e D,: i = 1, 2, • ■ ■, < (2.21) 

1 = 1 l 1 = 1 

Then 

H(bD,)^ (2.22) 

.*x \.-i / 

We can now prove the theorem. 

Proof of Theorem 2.1: In Lemma 2.1, identify 

t), := + ifP,(») ■ 7. e r,} (2.23) 

and o — n. Then the result follows by noting that every 
element in i)(D t ) has at least one preimage in an edge of 

r, [14]. 


B. Result 2 


From here onwards, in determining contributions to F, (u) 
from the interior of a given side of I\ attention need only be 
restricted to sides Q of F on which at most one element of 
each T, is a variable. 

Call such a prototype side Q. Lump the variables defining 
this side into the vector q - [fli. ■ ■ ■. (/a], k < o. Suppose 
the coefficient polynomial of each q, in (2.19) |i.e., the 
corresponding i^(.s)J, evaluated at ,s -- juj. is nonzero. (If 
the coefficient polynomial is evaluated to be zero, then the 
corresponding q, need not be included in q; see discussion 
later.) Then, through a suitable extraction of the frequency- 
dependent coefficients of q, in (2.19), at every uj, the image of 
the side Q under the mapping defined in (2.19), i.e., G(uj, Q), 
can be described by a set of complex numbers 

f(Q) : = |/(<7) = /ofJ(<Ji + n ‘ + '■ '/ £ Q J (2.24) 

where / 0 is a complex constant, rv, and ft,, / - J, 2, ■ ■ ■. k are 
real constants, and k, ± 0 for all ? = 1, 2, ■, A\ 

Fact 1: Notice that, as u varies, / 0 , rv,, and ft, vary in a 
rational fashion with uj. Further, f(Q) is bounded because of 
Assumption 1.3 and the fact that the coefficient polynomial of 
each q, in (2.19) evaluated at .s = ;juj is nonzero. 

Thus, for example, in (2.18), consider the interior of the 2- 
side defined by 712 = 7 / 2 . 732 = 7 ; 72 - 721 = 7^. 74 1 = 741 . 
722 = 722- In this case - <7i = 711- 92 = 731: 

Q := {Q = [ 9 i- «] r : </," S<h < </, + , ' = 1 , 2 ). (2.25) 


Further 

/(</) = 


(u> 2 - 2 ) 


[(1 - w 2 + juj) 7^ + yw7 22 ][0 J>juj + 7^]] 


<7i + 


Qv+ 07l + 2 ~ 
(uj 2 - 2) _ 


[<72 + uj 2 + 7^]. (2.26) 
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observe that, at J 1 = 2 , the coefficient of 7 n (= <yj) is 
, 10 . Consequently, the representation in (2.26) is infeasible, 
lowever, we argue now that at this u, no interior point of 
t c Q under consideration need be included in F r (w). This is 
iccause, at this u, H{uj, T) is independent of 7 n = q \. Thus, 

we select 7 ji at an extreme value, without changing the 
variable 731 = r/ 2 one does not alter the value of 7 ). 
thus, corresponding to each point q in the interior of Q, there 
lies a point q* on a boundary of Q having precisely the same 
image in the value set space as does q. Then, in determining 
contributors to one need not consider any point in the 

interior of Q, as these points are covered by points on the 
boundary of Q. This observation leads to the following formal 
I act. 

Fait 2: Suppose Q is a side of T that conforms to the 
icquirements of Theorem 2.1. Further suppose, with q t the 
variables on Q , at some frequency u>, and some y. the 
coefficient polynomial of q, in (1.3) is zero. Then, F f (u>) 
will not contain any points in the interior of Q. Also, if the 
coefficient polynomial of each q, is nonzero, then /o in (2.24) 
is nonzero. 

Finally, observe, from the foregoing that the basis for limit¬ 
ing the sides that contribute to \\ (u/) is that the edges of each 
individual factor by themselves cover the value sel boundary of 
dial factor. Although, in general, an /V-dimensional polytope 
has edges, at a given frequency, at most 2 N of 

these edges need be considered for constructing the value set 
boundary of the polytopc. These special 2 N edges are easily 
characterized (see |IK|). Thus, the number of contributing 
sides is even smaller than that specified in Theorem 2.1. 
However, to prevent notational encumbrances, henceforth we 
will adhere to the somewhat conservative characterization 
given in Theorem 2.1. 

Provided in (2.24), ft, / 0 . V/ f {I, • ■ ■. k\, we will call 
die affine line in (2.27), below, the line associated with Q 

I'tAQ) := {(71 <iz '<ik ) 1 - fti 

* (r^i a -2 • ■ ■ rv* ) J : — x. < /> < x }. (2.27) 

The situation where one or more of the ft, equal zero will be 
dealt with later. Observe that the intersection of L n (Q) with 
the interior of Q is given by the segment 

I'(Q) - {</ = (71 • ■ ■ 7* ) T : 7i = I’li - <h, 

(Q) < l> < P + (Q). 7 G Q) (2.28) 

where 

p~ (Q) — infix min 
■ ML -M 

and 

P +[Q)= .ain (2.30) 

■*} I A A J 

where qf and q~ arc the extreme values of the variable q ,. 
Note the following important facts. 

Fact 3: If p+{Q) < p~{Q)* then this set is empty, in which 
case, as will be shown soon, the interior of Q contributes 
nothing to F f (u>). 


Fact4: As the nr„ ft, depend on w (Fact I,, m 
line L(Q). Equally, while at some frequencies. / u/ ,, 
be empty, at others, it may well be. 

Fact 5: As noted earlier, whenevei Q conloim 1 .. x \\ 
requirements of Theorem 2.1, so do all its boundancs ii - , , . 
to see from the foregoing definitions that the line tts us 1 
with a boundary of Q is, in fact, a projection ol ih< 
line onto this boundary. 

We now present the main result of this section, proved m 
Appendix B. 

Theorem 2.2: Consider Q a boundary of V obeying the 
conditions of Theorem 2.1, with the various quantities as 
defined in the foregoing and / 0 / 0 (sec Fact 2). 

i) Suppose in (2.24) that, for some / e {1. ■ ■ 3, - (). 
Then for every q in the interior of Q such that f(q) e 0(f(Q )), 
there exists a q* on an edge of Q such that f(q) =. f(q*). u) 
Suppose ft L / 0, V/ e {1. ■■■,£}, and there exists q in the 
interior of Q such that J'(q) e i)(f(Q)). Then q £ L(Q). lii) 
Suppose, in addition to the conditions set out in ii) 

r Fhen there exists q* on a boundary of Q such that for all 

(J € L(Q) 

/(</) - /('A (2-32) 

Before illustrating this theorem with the example of (2.18), 
we highlight some of its features. First of all, if any ft, = 0, 
then no point in the interior of (} is included in the lormation 
of r r (u/). For, given any 7 in the interior of Q obeying (2.16), 
there is a parameter vector on an edge of F that has the same 
image in the value set space as does 7. Since F,(u/) already 
includes the edges of F, the image of this 7 stands covered 
from the outset. 

Likewise, if (2.31) holds, then all interior points of Q 
mapping to the value set boundary have the same image on 
this boundary, an image also shared by another parameter 
vector lying in the interior of a boundary of Q. Thus, these 
points need not be included as they will be covered when one 
considers the boundaries of Q. 

Further, in the event that i), iii) of the theorem do not hold, 
then the critical subset contributed by interior points of Q 
lies exclusively on the associated line segment. Thus, indeed, 
at every u/, r r (u) comprised exclusively of line segments, at 
most one for each side conforming to the requirements of 
Theorem 2.1. Of course, these segments in general vary with 
u), and should their intersection with an associated Q be empty, 
then the interior of that Q fails to contribute elements to F, (u>). 

Finally, we note that the equations describing the internal 
line segments do not depend on the powers//, and o,. 

We now illustrate these results by invoking the example of 
(2.18). In the example, assume u; = 2, 7 ' = 0, and 7 * ~ 1. 

Consider first any side on which 731 or 732 are variables 
[recall that if both were variables, then such a side will not 
contribute an interior point to F r (u/)). Then i) of Theorem 2.2 
holds, and no point in the interior of this boundary is included 
in r r (u/). 


hr ' 1 ? 1 
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Next, consider the 2-side defined by 731 = 741 = 0 
and 732 = 712 = 722 = 1. Then, on this boundary, with 
(/1 = 7 j 1 and 92 = 721 , (2.31) holds. Consequently, this side 
too contributes no interior points to r, (w). 

Finally, consider the 3-side 712 = 721-1 and 731 = 732 = 
0. Choose r/i = 711 , 92 = 722 , and 93 = 741 . Then, we have 


ft s_ 1 •_+ + j 

m ~*W+i+iMkT]r 


(2.33) 


and Q 

h(ju>, 7 ) = 1 + F(Q, »)U + p) UlQ h 

lj) < /> < ( > + (Q, u>) (2.36) 


where 


HQ) 

F(Q, u>) = go{jw)fn(Q- w) [J w)‘ ,w) 

i=i 


(2.37) 


Then the interior points of such a 3-sidc to be included in 
T,{u) arc given by the segment 

L(Q) = {Ifli. Q' 2 . 9 . 1 ] = f[h 1.5, I] - [0.5, 1 . 0 ]: 

2/:\ <(><!). (2.34) 


C. Humitz Invariance 

Having shown that 0(H(ju>, I')) is mapped from the critical 
line segments in F, we now tum to verifying the Hurwitz 
invariance of (1.7). Essentially, 0(11 (jw, T)) must be checked 
for zero exclusion. 

In view of the definition of r r (w), the Hurwitz invariance 
of (1.7) is equivalent to the requirement that: i) at least one 
member of (1.7) is Hurwitz, and ii) that at every w and all 

7 € r,(w) 


h(ju>. 7 ) / 0 . (2.35) 

Since at every u, F r {u) comprises the edges of T and certain 
cj-specific line segments, to check for Hurwitz invariance, it 
suffices to check that all transfer functions corresponding to 
the edges of (1.7) are Hurwitz, and that the image, in the value 
set space, of each aforementioned w-specific line segment is 
zero exclusive. The principal contribution of this section is 
to demonstrate that the zero exclusivity of these segments 
can be verified by checking certain piecewise continuous and 
piecewise differentiable functions of u for avoidance of the 
negative real axis. 

To avoid notational complexities, we show this fact in a 
somewhat informal fashion. Consider a side Q of T, which 
meets the requirements implicit in Theorem 2.1. Recall that for 
each such Q , potentially there exists an internal line segment 
that contributes to r. M- 

For a prototype Q , the internal line segment is as given in 
(2.2XM2.30), with the value set G(u, V) as in (2.24). Observe 
that the various quantities in these equations depend on u. 
and that at certain frequencies, this segment could be empty 
(Facts 3 and 4), or may not be a member of F r (u;) (e.g., when 
i) and/or iii) of Theorem 2.2 holds, or when /o (sec (2.24) 
and Fact 2) or go(ju) is zero. Call the set of frequencies 
at which L(Q) is a nonempty subset of F r (u;), Then 
we need to check that the image in the value set space of 
L(Q) is zero exclusive at all uj 6 Si(C^). Direct substitution 
of (2.28M2.30) into (2.24), together with the relation among 
G(w, F), H{u. T). and (2.24), shows that at all u> e the 
image of L(Q) is given by (2.36H2.38). In these equations, 
to avoid confusion, unlike (2.28H2.30) and (2.24) the various 
quantities have been expressed explicitly as functions of u> 


and 

HQ) 

M(Q)=^k,(Q). (2.38) 

» = ] 

Recall that the choice of Q (see (2.31) and the discussion sub¬ 
sequent to Theorem 2.2) ensures that M(Q) ± 0 . Moreover, 
from the definition of Q(Q), 

F(Q , u) 1 0 , Vw 6 il(Q) (2.39) 

and 

l>'(Q, u)<p+(Q,w). Vweli(^). (2.40) 

We claim that the bounds p + (Q, w) and p~(Q. u) are contin- 
uous and piecewise differentiable in these frequency ranges. 
To see this, we note that at t (Q, aj) and /i(Q, ut) |scc (2.24)] 
arc continuous and differentiable. The minmax functions in 
(2.29)-(2.30) preserve continuity, and the lack of differentia¬ 
bility occurs at the isolated frequencies where the minmax 
selections ‘’change." 

Thus, from (2.36), one can see that (2.35) holds tor all 7 in 
the interior ot this prototype Q and belonging to IV (wj) iff 

l>(Q.*):= ( -F(Q. 

tf(p {Q,u),p + (Q.u)). \/u> 6 il(Q). (2.41) 


In the above, if M(Q) # ±1, then all the roots of 1 
should be considered. Define the functions 


w) = 


(Q,*) 

(g, -j 

i 


Vu> 6 
otherwise. 


1 /MiQ) 


(2.42) 


The functions ((Q, u j) defined above arc piecewise continuous 
and differentiable. Moreover, \/u E oc) - H((2)}. 

(( Q , u) = J. Recall that, at these frequencies, there are no 
contributions from the interior of Q to IV (u>). and (2.41) need 
not be checked. Furthermore, the required zero exclusion of 
7) for all 7 in the interior of this prototype Q and 
belonging to T, [u) is equivalent to 


*)*(-'xU>). (2.43) 


Wc therefore have the following theorem. 

Theorem 2 J: The family of transfer functions H(T) de¬ 
scribed in (1.7) is Hurwitz invariant if and only if the following 
conditions hold: 

a) h(i s. 7) is Hurwitz for all 7 in the edges of F; b) For each 
u> £ R. and all Q satisfying the requirements of Theorem 2.1, 
the piecewise continuous and differentiable functions u) 
defined in (2.42) avoid the negative real axis. 
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III. Real Zero-Pole-Gain Variations 


d e {d\ (1 }. 


We now consider the special case of (1.2) where the 
, icertain parameters arc real poles, zeros, and gains. 

Consider the family of transfer function //(F) described by 

8) md (1.2). The parameters d and A, vary independently 
• ithin given bounds, i.e., 

<r < d < rf + ; A“ < A, < A+, i = 1. 2, •-■.n - 1 

(3.44) 

.md 

, e r = [A]-, Af] x ■■■ x [A-_ Xl A+_,] x [dr, </+]. (3.45) 

Notice that Assumption 1.3 implies that if a given k, is 
negative, the corresponding interval of A, cannot include zero. 

Recall, from Section II, that apart from the edges of 1\ 
wc need to consider an internal segment associated with each 
side of I\ which conforms to the requirements of Theorem 
2.1. Observe that every A-side Q of T\ A- > 1 conforms to 
this requirement. Consider, now, two possible cases of such 
Apsides Q. 

Case I: The interior of Q has c/as a variable. 

Observe that, with (ft = d in (2.24), fi[ - 0. Thus, because 
of i) of Theorem 2.2, no parameter vector in the interior of such 
a Q is included in T f (u). Consequently, for each 7 G T ( (u;), 
(I is at an extreme value. 

Case IT The variables of (J exclude d 

Assume the variables in Q are for some S C {J, ■ ■ ■, n - J}. 
A M / G S. Observe, in (2.24), that each fi = lj and = 0. 
From i) of Theorem 2.2, one concludes that at u; — 0, F f (u;) 
comprises onl> the edges of L\ Also lrom Theorem 2.2, when 
uj ± 0, a parameter vector in the interior of Q is in 1, (cu) 
only if it obeys 


A, - (iv, V> (Q) < (>< (> + {Q). 

where 

/ e s 

(3.46) 

p~ (Q) ~ max A“ 
,cs 1 

and 


(3.47) 

l> + (Q) = min A/ . 


(3.48) 

In addition [ii) of Theorem 2.2], we must have 


5 >* n - 


(3.49) 


Furthermore, the family of functions H(V) is Hmv u t 
ant if and only if h(; «, 7) is Hurwitz invariant on tn , 
segments and the edges of T. 

A. A Special Case 

Notice that if some of the A, vary in nonova lapping 
intervals, then the number of critical segments reduces, as can 
be easily seen from (3.51). A case of special interest is when 
all the poles and zeros vary in intervals that do not ova lap. In 
such an event, the projections mentioned above are empty, and 
edges suffice for value set boundary and Hurwit/ invariance. 
One thus obtains the corollary below, itself a variation of the 
results (see Remark 3.3 for comparison) of [251, [26|. 

Corollary 3.1 (An Edge Theorem): Consider the paiamcter 
box T in (3.45) and the family of function H(Y) described 
by (1.8). Suppose (A“, A^ ) n (A“. A+) arc empty for all 
1 < t < j < 71. Then the boundary of the value set H( u, T) at 
any frequency u is mapped from the edges of 1\ Furthermore, 
H(V) is Hurwitz invariant if and only if all the edges of //(T) 
are Hurwitz invariant. 

Remark 3.1: For overlapping intervals of A,, however, the 
45-degree line segments are indeed necessary. To show this, 
we provide the following simple example. Consider 

M*, A,. A 2 ) 

0.1(0.8,s 2 + U,8.s 4 4.f>)(.s + A!)(.s + A 2 ) 

,s 4 4- 10,s 3 + 1 i + 1 1.8*4- 0.2 
Ai, A 2 € [-30, 0], (3.54) 

In this example, there are four edges with the associated 
transfer functions given by 

/#(«, U. A 2 ), A 2 6 [-30, 0], 
h(i s, —30, A 2 ), A 2 G [- 
h(s. A). 0), A! G [-30, ()]; 
h(s. A x , -30), A T G [ 

There is only one 45-degree line segment given by 
/i(n, A, A), A G [-30. 0]. 


Thus, we have the following theorem. 

Theorem 3.1: Consider the parameter box 1' in R 1 ' given in 
(3.45). Then T t (u;) is frequency invariant, and comprises the 
edges of T, and all line segments of the form in (3.51M3.53), 
for every Sc {1, ■ ■ *, 7# - 1}, obeying (3.49) and 

max A” < A, < uimA + , Vy G S. (3.50) 

i£S ' “ J ” tfcS 

A, = Aj, V/', j E S (3.51) 

Aj e {a;, a;}, Vjfts (3.52) 


It is straightforward to verify that the transfer functions on all 
the edges are Hurwitz, but some on the 45-degrce line segment 
are not. For example, at A = -15, /*(*, A, A) has the unstable 
zeros 0.2424 ± 1.8914^. 

Remark 3.2: In actual fact, the result in [26] is stronger. It 
requires that only 2 n edges be considered. This fact is also 
recoverable from [27], which also employs the Jacobian rank 
deficiency approach underlying our development. 

Remark 3.3: Neither [26] nor [251 deals with the overlap¬ 
ping root situation considered in Theorem 3.1 above; nor do 
(26] and [25] permit the structural preservation of pole-zero 
multiplicities. 
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IV. Complex Zero-Pole-Gain Variations 

In this section, the complex zero-pole-gain variations case 
of (1.9) is treated. As before, I\ (w) comprise line segments. 

We note that the case u; = 0 is trivial. At this frequency, for 
every Q obeying Theorem 2.1, all the (l t in (2.24) are zero. 
Thus, from Theorem 2.2, T r (0) comprises the edges of V only, 
and ^ 0 will be assumed in the sequel. 

The characterization of the line segments follows as in 
Theorems 2.1 and 2.2, In this section, we will focus mainly 
on showing that these frequency-dependent line segments obey 
the confinement rules stated in the Introduction. To this end, 
we consider the various possible Q obeying Theorem 2.1, 
and consider how the associated line segments change with 
frequency. Observe, as in iii) of Theorem 2.2, that certain sides 
Q can be eliminated according to the combinations of powers 
of their defining factors. In the sequel, we will only consider 
Q on which (2.31) does not hold. Then the only restriction 
that Theorem 2.2 places on Q is that for no i can both a, and 
b t simultaneously be variables. Consider, now, the following 
possible cases of Q. 

Case I: d is a variable on Q As in Section HI, such a Q 
does not contribute interior points to Y r {w). 

Here onwards, wc assume d is at an extreme value. In the 
sequel, without loss of generality, all nonvariables will be 
denoted with a superscript “±," and the A, will always be 
considered as potential variables. 

Case 2: No n, nor any b, is a variable. As in Section III, 
an interior point of Q is in T r (u;) only if it is on a frequency- 
invariant 45-dcgrec line segment similar to that in Section III. 
The set of all such segments will be called L^. 

Case 3: No <i, is a variable, but some b, are. Suppose, 
as before, that q, are the variables. Then if q t = A,, in the 
corresponding factor in (2.24), o 4 — 0 and 

li = tu. (4.55) 

Further, if q t = b ,, then in the corresponding factor in (2.24) 
(\, = -~u; 2 (4.56) 

and 

(4.57) 

Thus, on the line segment associated with Q, the variable A, 
and b, obey 

A, =■ 0-F pw (4.58) 

hi =r -u; 2 + fwaf. (4.59) 

Defining pi = pu> and p> = w 2 . one finds that all interior 
points of Q to be included in T\ (u;) lie on the two-dimensional 
plane, 

7 = p\(\ + p 2 ('2 (4.60) 

where C< 2 £ are frequency-independent constant 
vectors. 


Cased: No b t is a variable. If q, = a,, then in the 
corresponding factor in (2.24) 

<x t = 0 (4.61) 

and 

(4.62) 

Then, as long as all the fi ^ 0, on the line segment associated 
with Q, the variable A, obey (4.58) and a, obey 

a, = () + p(uj - u(4.63) 

Thus, again, this segment is confined to a plane of the form 
of (4.60) with different C\ and C 2 . The corresponding p } and 
p >2 in this case should be defined by p\ =■ pu; and p 2 ~ pw"" 1 . 

Case 5: For some v, a, is a variable, for some others, b, is 
a variable. As before, on the line segment associated with Q, 
the variable A,, b f , and a,, respectively, obey (4.58), (4.59), 
and (4.63). Then with pi = peu, p 2 = uj 2 . pj — pu _1 , and 
C,. i =1, 2, 3 frequency-independent constant vectors {C \, C 2 
different from those in Case 3), this segment can be seen to 
lie on a three-dimensional plane of following form 

7 - p\C\ + p 2 Ci + pjf 3 (4.64) 

'Hie set of two-dimensional planes covered by Cases 3 and 
4 will he denoted L 2 . while the set of boxes in Case 5 will 
be called L j. The zero exclusion principle then immediately 
yields the following theorem. 

Theorem 4.1: With L 2 , and Lj defined as above, 
the family of transfer functions 11 (V) described by (1.9) is 
Hurwitz invariant if and only if all the edges, the internal 
segments in L u the rectangles in /, 2 , and the boxes in Lj are 
Hurwitz invariant. 

We note that the sets in L \, L 2 , and L k \ are easily char¬ 
acterized from the critical frequency-dependent segments they 
contain. As in Section 111, many of these segments, rectangles, 
and boxes will be empty. 

V. Conclusions 

In this paper, we have considered robust stability verification 
of linear time-invariant systems characterized by the class 
of nonlinear parametric perturbations given in (1.7). In light 
of the zero exclusion principle, our focus has been on both 
the verification of Hurwitz invariance and the construction of 
value sets for the system characteristic function. The main 
result on construction of value sets shows that for the class of 
nonlinear parametric perturbations given in (1.7), the value 
set boundary of the characteristic function at each fixed 
frequency is determined by the edges and some frequency- 
dependent internal line segments in the parameter box. This 
result greatly simplifies the construction of the value sets, 
and considerably cases the task of robust stability verification. 
Indeed, a piecewise continuous and differentiable frequency 
sweeping function is found such that Hurwitz invariance of 
the set in question is equivalent to this function’s avoid¬ 
ance of the negative real axis. For the special case of real 
zero-pole gain variations, the critical line segments are all 
frequency independent; hence, the determination of robust 



,, d /,< N0NLINEAH PARAMETRIC PERTURBATIONS 


,lulity is even simpler. For the case of complex zero-pole 
, m variations, the critical internal lines are either frequency 
dependent or vary in certain (two-dimensional) planes or 
l ice-dimensional) boxes. 

Hie key device used in our development concerns a Ja- 
% )»>ian function which helps isolate certain critical subsets 
,tf the parameter box whose elements collectively determine 
Hie value set boundary. This Jacobian-based technique may 
provide an effective tool for the robust stability analysis of 
„ets which are even more general than the ones considered 
hete. Indeed, a similar device is featured in [7], [281, 127] in 
ielation to the multilinear problem. 

Appendix A 
Proof of Lemma 2.1 

We prove this result in two parts. First, we show that for D 
a bounded and dosed set of complex numbers, 

(dD)^ D 0(D (k) ). (A.65) 


Then, dy } converge lo d\ as j —* because , w>ii . t . 

Yet dij # Di for all j because z, - r/j f d 2 d, % ([ 
Therefore, d\ € i)Di. Similarly, it can be shown ih.u v, 
for all other t; hence, the result holds. 

Appendix B 
Proof of Theorem 2.2 

To prove the theorem, we need three lemmas The lust 01 
these is well known (see, e.g., 128], |27]), and hence Us prool 
is omitted. 

Lemma B.l: Consider the hyperrectangle 

Q ~{<1 = ((/i fl 2 - ‘In, ) 7 : 

^ < 7*, ' = 1- 2, ■ ■ •, m) c H' n (B.08) 

and a differentiable complex valued function /( ) ( t ) * C 

with all its first derivatives continuous. In the sequel, we will 
denote 

I(Q) :={/(</): <1 € (B.f> 1 )) 


We then show that for D\ . 75 2 , ■, 75* bounded and closed 

sets of complex numbers, with 


I]P, := | J"Jr/,: dt E D ,: / = 1, 2, • ■ (A.66) 

a n 

i=\ i=l 

Together, these two parts prove Ihe result. 

Proof of Part 7 ■ Given any complex number ^ 6 

()(lf k) ), we need to show that € (r//5)^L By the 
boundedness of D ^ (from that o( D), there exists a sequence 
of complex numbers {zj} outside such that z } —*■ z as 
Since /5^ A) is closed (as D is closed), there exists 
some d e I 5 such that d k — z. Define d\ r r/^, ■ ■ -, r/^ lo 
be the Ath roots of z t . Then ior all t, j, d,, & D because 
z ) - (<//;)* £ D^ k K On the other hand, the sequence \d n } 
has a subsequence converging to d(E D). It follows that 
d E <575, or equivalently, c = d k € ( cJl))^. 

Proof of Part 2; Given any complex number z e 

ixn:-x 75 t ), we need to show z 6 HT-i( ( 577 t )• This clearly 
holds if one of the 75, is just {()}. So, assume every D, 
contains at least one nonzero element. This implies that every 
0D l has at least one nonzero element. By the boundedness 
of (nr =J 75,) (from that of 75,). there exists a sequence of 
complex numbers {z,} outside (1^-1 75,) such that Zj —» z 
as j —> oo. Since (fllLi 75,) is closed (as 75, is closed), there 
exist some d, e 75,, / = 1, 2, ■ ■ .a such that z — duh ■ ■ d a . 

If 2 is zero, it follows that all d n except for one which is 

set to zero, can be chosen to be a nonzero boundary point of 
01),, In the sequel, we assume that at most one d, is zero. 
We claim that d l € 0D X , i = 1, 2, - - -,a. Without loss of 
generality, consider <li. From the foregoing, one can choose 
di € 0D\ if one of the other d, is zero. On the other hand, 
if the remaining d, are nonzero, we define 


d U 


(If" do 


(A.67) 


Then a point q in the interior of Q obeys f(q) e df(Q) only 
if the following Jacobian matrix, evaluated at <], has rank less 
than two 

H^) Rc (“) **(%? 

' ' ' (B.70) 

It should be noled that the lemma above is of little value when 
f(Q) degenerates to a real segment. For when /(■) is real, ihe 
second row of the matrix in (B.70) is identically zero, and 
hence rank deficiency occurs for all q. For such a case, we 
have the following result. 

Lemma B.2. Consider the hyperrectangle Q in (B.68) and 
a real continuous function /(■): Q R. Then f(Q) can be 
mapped from the edges of Q if and only if the two extreme 
points (minimum and maximum) of J\Q) can be mapped from 
the edges of Q. 

Proof: Necessity is obvious. To prove sufficiency, sup¬ 
pose q l and q 2 are the edge points corresponding lo the 
extreme points of f{Q)> Observe that r/ 1 and q 1 can be 
connected by a path entirely in the edges of Q. By continuity 
of /( ■), the image of this path, which is a subset of the edges, 
covers the whole of f(Q). 

The final lemma needed is given below. 

Lemma B.3: Consider the hyperrectangle Q and the 
bounded set f(Q) in (2.24) with (i, ~ (), V5. Then each 
point in f(Q) has at least one preimage in the edges of Q, 
Proof: Observe that the result will not be affected by the 
value of /o. So choose fu = \ . Using Lemma B.2, we simply 
need to show that both the minimum and the maximum of 
f(Q) have preimages in the edges of Q. Take the minimum, 
for example, as the maximum can be dealt with in the same 
manner. Denote the minimum by f m and consider the two 
cases: 1) f m = 0; and 2) / m ^ 0. Case 1) implies that some 
q t -f (x x is zero with k t > 0. In this case, it is obvious that 
setting the other q t at extreme values does not change f m . In 
Case 2), we claim that all the q, must take their extreme value. 
Indeed, if some q % were not at its extreme, f m would not be 



j = 1, 2, ■ 



Wi 
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minimum because we can decrease the value of the function 
by increasing or decreasing this q,. Therefore, in both cases, 
fm can be achieved at an edge point. 

Proof of Theorem 2.2: Consider a point q in the interior 
of Q such that 


/(</) e 0f(Q)- (B.71) 

Proof of i): Suppose first that #• — 0, Vi. Then, Lemma 
B.3 proves the conclusions of 1 ). Next, suppose that at least 
one #, without loss of generality fi\, is nonzero. Suppose also, 
that for some q in the interior of Q, f{q) e df(Q). Then from 
Lemma B.l, Vi, j € { 1 , ■ • • , k), there exist real scalars * and 
y, not both zero, such that 


J ". T " ~ V~Z -■ 

Oqi oqj 

Thus, from (2.24), the above gives 

kf(q) = kjf(q) 

Qi + fli, + iff " qj + otj + jff 

which simplifies to 

i'H += (</j + 


(B.72) 


(B.73) 


(B.74) 


Now, suppose that at least one ft,, i ^ 1 , without loss of 
generality ft, equals zero. Then with i = 1, j = 2, one has 
from (B.74) that 


q 2 + (X '2 = 0. (B.75) 

Thus, f(q) = 0. Moreover, this holds ,no matter what value 
the 0 j, » ^ 2 take. Setting these q, , i ^ 2 to their respective 
extreme values, one provek i). 

Proof of iif: Follows from (B.74). 

Proof of iii): Direct substitution of (B.74) into (2.24) 

yields 


/(</)■ = + (B.76) 

where c is a suitable complex constant and the integer M is 
given by 


M = (B.77) 

i=i 

Thus, when M = 0 . f(q) has tlie same value on the whole 
segment L(Q). Thus, by continuity, the image of this entire 
segment is covered by any one of its endpoints which is on 
a boundary of Q. 
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Discrete-Time Observers for Singularly 
Perturbed Continuous-Time Systems 

Kenneth R. Shouse, Member, IEEE , and David G. Taylor, Member, IEEE 


Abstract-— The use of discrete-time observers for nonlinear 
singularly perturbed continuous-time systems is explored. The 
observer design approach is based on inversion of statc-to- 
measurement maps. The two-time-scale nature of the system is 
exploited by constructing separate reduced-order observers for 
the approximate slow and fast subsystems, using multirate mea¬ 
surements and computation. The reduced-order observers are 
compared to a full-order observer designed for the same system. 
The comparison shows that although the reduced-order observers 
exhibit some approximation error, they also result in reduced 
computational requirements if the inversion is performed 011 -line, 
reduced memory requirements if the inversion is implemented 
by look-up table and reduced stifftiess. The trade-off between 
accuracy and implementation requirements always favors the 
reduced-order approach for systems with significant separation of 
time scales. Numerical examples are provided, including a case- 
study of an observer-based control system for permanent-magnet 
synchronous motors. 


I. Introduction 

I N this paper, the problem of constructing state observers 
for nonlinear continuous-time singularly perturbed systems 
is investigated. One observer design concept that may be 
applied to such systems is based on inversion of the so-called 
state-to-measurement map (a set of algebraic equations) [4|. 
15], Since this design concept is widely applicable, requiring 
only that standard observability assumptions be satisfied by 
the diseretc-time representation of the system, this paper will 
adopt a map-inversion approach and will adhere to the basic 
principles outlined in [4], |5J. 

Unfortunately, when this full-order observer is applied to 
singularly perturbed systems, several difficulties are encoun¬ 
tered. First, the discretization of the continuous-time system 
must be carried out at a sufficiently small sampling interval 
to accurately model the fast states, even though the stales of 
primary interest may be the slow ones. This necessitates the 
use of faster sampling devices and faster processors than would 
otherwise be needed. Second, because of this separation in 
time scales, the equations which must be inverted to construct 
the observer are stiff. This means that accurate numerical 
solution is more difficult than if the system evolved in a single 
time scale. 
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The two-time-scalc nature of the system can be exploited 
by decomposing the problem into one of designing separate 
reduced-order observers for the approximate slow and fast 
subsystems. This decomposition is quite straightforward for 
systems without inputs, hut can also be applied when inputs are 
present with some technical modifications. The outputs of the 
two reduced-order observers can be used to estimate the stale 
of the original system. Since the slow and fast observers are 
separate algorithms, they may be implemented using mullirate 
measurements and computation. Consequently, the computa¬ 
tional and/or memory requirements for state-to-measurement 
map inversion are significantly reduced from those of the 
full-order observer, and the numerical stiffness is relieved. 

All the benefits of the reduced-order observers are achieved 
through a trade-off with accuracy. The reduced-order obset vers 
are only approximate, but each source of observation error 
vanishes as the singular perturbation parameter tends to zero. 
To demonstrate the practical value of reduced-order observers 
tor feedback control, the paper concludes with a detailed case- 
study on velocity regulation of permanent-magnet synchronous 
motors with only stator current measurements. 

II. Fuli -Order Anaiysis 

In this section, the full-order observer tor singularly per¬ 
turbed autonomous systems is analyzed. This analysts is 
presented to provide a reference for comparison with the 
reduced-order observers that follow. 


A The bull-Ordci Observer 

Wc follow precisely the procedure recommended in 14], [51. 
Consider the continuous-time nonlinear system 


■' ~ /)('■• a) 

(2.1) 

i=h(r,z) 

(2.2) 

s) 

(2.3) 


where j E 7? iV , z G 7? A/ , y e 75^, ( > 0 is a small parameter 
and the dot denotes differentiation with respect to time /. It is 
assumed that /i, fi and h arc analytic, that unique solutions 
j(t) and z(() exist, and that // is the only measured quantity. 
For notational convenience the representation 


»’ = /(«>) 


.7/2(l"l, <"2) 


y = h(w) := h(vii. 11'-j) 


(2.4) 

(2.5) 


is used, where w = [tuj w' 2 ]' with wi := .r and m >2 := z. 
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Discretizing this system at the instants t = fiT where T is 
tably chosen on the basis of the .r dynamics, one obtains 

bi[/+ 1] = F( w[/]) (2.6) 

ll[f] ■■= h(u>[f]) (2.7) 

hen := n,(t<7), # [/J := y(f<T) 

» 2 -») 

1-0 ,4 

and where the notation Ljtu denotes the Lie derivative oper- 
alor L'j applied to each component of w [10|. 

Associated with the system (2.6)—(2.7) is the lull-order 
siate-lo-measurement map 


have full column rank. In this case, the full oidei oh ■» s 

w[t] := $ Ar+w - l (C* , 0) -1 C)') |(1 

which has the deadbeat property "'Kl -■ •'■[<] ioi .«') - 
N + M - 1. 

B Full-Order Observer Stiffness 

The full-order observer tor a singularly perturbed 
is inherently stiff. To see this, consider the lincai system 
(2.I3M2.14). To this system, apply the diugonali/tnp tiai.s- 
formation |8] 

\h ~ i -<L2]\r) 

n\'~[ /»i JL-J 


r 1>H 1 

ll o !• (m) 

II («») . (2.9) 

Jiof""" 1 !(„,). 

where F' denotes I composed with itself t times. This statc- 
to measurement map satisfies 

) [/] = H(w\t- N - M + J]) (2.10) 


wheie 1 \f\ is the block ol measurements 

-,i /|t - N - A1 + 1J- 

V[/j- ; (2 11) 

•m 

Assumption 2 I For some set P tl C_ 7v N + there exists a 
function Z/ 1 U) <7 -* R v + u such that w f P u and 

) — H{w) imply w — 11 1 (> ). 

Undci Assumption 2.1, the full-ordei observer lor the sys¬ 
tem (2.6) (2.7) land hence for the system (2.4)—(2.5)1 is given 
by 

u,[f] F** hl - { oir l (Y[(]) (2.12) 

'Hus nonlinear observer is deadbeat on P„, t.e., i/>[Z] - w[f] 
lor all 1 > N + M — 1 if i/'[Z' - JV-A/ + 1] 6 P„. 

To gain insight into the observer design given above and to 
aid in the analysis to follow, the specialization of this observer 
to the linear autonomous case is useful. Consider the system 
(2.4)—(2.5) defined by f(w) =■ Aw and h(w) - ( f w, where 


A 

r 


A\\ A\2 

Mil 

Cv c 2 ]. 


(2.13) 

(2.14) 


Discretizing at the instants t — tfT yields the model 
(2.6H2.7) with F(w[l]) — <l>u>[Z] and h(w\f]) = Cw[f] 
where <I> := oxp(AijT). The specialization of Assumption 2.1 
simply requires that the observability matrix 

C 

r<t> 

0:= . (2.13) 

c »$/v+a/-1 


where /,i and L 2 solve 

A1 1 — A‘22^1 + ( Li A n - (L\ A\ 2 L\ — 0 (2.18) 
f (An ~ A\2.L\)1j2 - l'i(Ari + <Id\A\j) + A\z =- (2.19) 


Note that L\ and are guaranteed to exist provided that A& 
is nonsingular (i.e., that the system is m “standard" form), 
and that t is sufficiently small |8|. Tlic result is 


= 

(i 


.*/ = [(\ 


where A^ A\\ — A\ 2 F\s A ri 4- fL\A\ 2 , (\ 

C]-C and C\ := C 2 ~ In ~ (\L { L 2 ). Discretizing 
at the instants t —hi gives 


M'+ '] 
»/[' + i] 


_ r*t 


#] = K( ] 


where <I>£ ■- oxp(A^<7') = 7,\ -F(J(f) and <b n exp(A T? 7 ). 

In these new coordinates, the observability matrix (2.13) is 
given by 

r c '» 1 

(’£+0(() C,,*,, 


Lc { + o(0 

Since the first N columns ol are nearly the same for small 
r, it is very close to losing rank. The observability matrix 
(2.13) m original coordinates is 


O^Oj 


In — t L 2 Li h'l 
L\ hi 


Since the transformation matrix in (2.23) does not lose rank 
as t 0, O is also ill-conditioned for small (. 

In the nonlinear case, ill-conditioning will be evident when¬ 
ever OIl/Ov) is close to losing rank. This Jacobian clearly 
is the observability matrix of the classical linearization of 
the nonlinear discrete-time system (2.6M2.7). Since classical 
linearization and discretization are commutative |3], however. 
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this Jacobian is also the observability matrix of the discretiza¬ 
tion of some linearization of the system (2.1K2.3). Since 
any such observability matrix is inherently ill-conditioned, it 
follows that the nonlinear observer (2.12) is inherently stiff. 

The stiffness of the full-order observer is a result of the 
inordinately fast sampling required. This fast sampling causes 
an f dependence in H(w ), which leads to loss of rank as 
f —► 0, Since reduced-order observers may be operated in 
their natural time scales (in a multirate fashion), such stiffness 
is not an inherent feature of the approach considered in the 
following sections. 

III. Reduced-Order Design 

The problem of observer design for autonomous nonlin¬ 
ear singularly perturbed systems is now re-examined. The 
proposed reduced-order techniques can be applied to most 
singularly perturbed systems for which a full-order observer 
exists. 

A. Time-Scale Decomposition 

Appropriate design models are needed for the construction 
of reduced-order observers. These models will be determined 
by first obtaining continuous-time reduced-order models and 
by then computing the corresponding discrete-time reduced- 
order models. This is in contrast to [1], where full-order 
discretization is followed by discrete-time order-reduction. 
By performing the order reduction first, no tools from the 
(less developed) discrete-time singular perturbation theory are 
needed. By discretizing only models of reduced order, the 
resulting state-to-measurement maps (and their inverses) will 
be simpler. 

Consider systems of the form (2.1M2.3), namely 

j = /i(j-,z),.r(0) = .r 0 (3.1) 

f j = / 2 (.l',2),2(0) = 2(1 (3.2) 

y = h(x,z) (3.3) 

where all notation and assumptions match those of (2.1 M2.3). 
It is implicitly assumed that J\ and / 2 have 0(1) magnitudes, 
so that the rate-of-change of 2 is about 1/c times larger than 
the rate-of-change of .r. It will be convenient in the sequel lo 
make use of the set notation B,(h) := {\ £ V. 3 : ||a||^ < M- 

Assumption 3.1: For a fixed / > 0, there exist fixed positive 
scalars r x and r z such that (x(t),z(t)) £ B^(r 2 ) x 
for all t £ [(). / ] and such that 0 = h(x< z) has the unique real 
solution 2 - </>(x) on (x,z) £ B^(r x ) x B^(r z ). 

Following [8| t the slow subsystem is defined by 

“jjT = /i (•/**., 0(x*i,)) =: 0) = .r 0 (3.4) 

V* = = : h s {x H ) (3.5) 

and the fast subsystem is defined by 

dzr 

jf- = /2CE0, </>(«*'()) + Zf) =: f/{zf),Zf(Q) ~ - <j>(x 0) 

(3.6) 

tJf = h{x th (l>(x 0 ) + z f ) - h(vo,<f>{:v 0 )) =: h f {z f ) (3.7) 


where r := t/f . It is assumed that a unique solution x g (t) 
of the system (3.4H3.5) exists for all t £ [0, f], and a 
unique solution zj(t) of the system (3.6M3.7) exists for all 

T £ (0, f/f]. 

It is significant that fj(zf) and hj(zf) depend implicitly on 
the initial condition ,ro. This feature is inherent in the time- 
scale decomposition. It should also be emphasized that y H and 
Uf are not signals available for observer design purposes. 

Assumption 3.2: lire fast states are asymptotically stable, 
i.c. j <—*,<() evaluated along x = x s and 

2 - for some fixed constant k > 0. Furthermore, 

j"(0) = xo and z(0) = zo are such that z/(0) belongs to 
the domain of attraction of the equilibrium Zf = 0 of the fast 
subsystem. 

The following well-known theorem provides a method of 
approximating r(/), z(t) and /;(/) by solving the reduced-order 
subsystems (3.4M3.7) [7], |8|. 

Theorem 3.1 /Tikhonov/: If the system (3.1 M3.3) satisfies 
Assumptions 3.1 and 3.2, then there exists an t* >0 such that 


lor any c £ (0,0 

x(t) = s tt {1)+ <)(*) (3.8) 

2(0 = 00 ( 0 )+ */(r) + 0 (c) ( 3 . 9 ) 

,v(0 = /7<(0 + ^//(t) + O(0 (3.10) 

for all t £ [(),/]. Furthermore, for any t* £ ((),/] there exists 
an (** £ (0,0 such that for any < £ (0,c**) 

z(t) = 0(./\(O) + 0(<] (3.11) 


for all t £ [/*./] 

B. Synthesis of Reduced-Order Observers 

Since the reduced-order models (3.4)—(3.5) and (3.6M3.7) 
are decoupled, they may be sampled at their natural rates. The 
slow subsystem is sampled at the instants / = 11 T and the fast 
subsystem at the instants r = mT/(h ), where T < i/N is 
chosen on the basis of the x dynamics and where k > M is an 
integer such that T/k - ()(r). Note that the fast sampling rale 
is precisely k times the slow sampling rate, and also that the 
separation in time scales is maintained in the discrete n and m 
indexes. Throughout this section, time-varying quantities will 
be written as functions of t (or 71 ). or r (or m), in a manner 
consistent with the notation of Theorem 3. 1 . 

The discrete-time model of the slow subsystem is 

s„[n + 1] = F.G'r.M) (3.12) 

.(/*[»] = ft* (.»'<.[«]) (3.13) 

where ,r s [7i] := j-»(h7'), y„[n] ■= ?/.(»7’), and 

FsU's[n]) \ f=,,!„]• < 3,4 > 

I ~0 

The discrete-time model of the fast subsystem is 

Zj[rn + 1] = F f (zf[rn)) (3.15) 

tf/M - hf{zf[ni\) (3.16) 
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iiere 2 /[wi] := Zf{mT/(kc)), := yf{mT/(ke)), and 

».,7, 

]i may seem that constructing observers for the reduced- 
order ;tnodels (3.12H3.13) and (3.15M3.16) is simply a 
matter of repeating the full-order procedure for each of the 
subsystems. This is not possible, however, because only y(t) 
is measured— y H [n] and y/[m] are not known. Intuitively, fast 
variations in the measured output may be attributed to the fast 
states. Consequently, any difference between y{mT/k) and 
y((m - 1 - 1 )? 1 /*) will be approximately equal to the difference 
between yf[m] and y/[m -f 1]. For this reason, the construction 
of the fast observer begins with the fast output function in 
incremental form 


Aftf(zj) := hj o Ff(zf) - hf(zj). 


(3.18) 


Using (3.18), the incremental fast state-to-measurement map 
is defined as 

A hj(zf) 

Ah, o Ff(zj) 


AHf(zj) := 


^A/-l 


( 2 /)J 


(3.19) 


la h f o f; j 

which is implicitly parameterized by xq. Given Ihe block of 
incremental fasl subsystem outputs 


A Y)H := 


Df [in — M + 1] — y / [in — A7] 


:'//["'] - - 1] 
this map satisfies the identity 

AYj[m\ ---■ AH- M\). 


(3.20) 


(3.21) 


2/H = Ff O A//;’(AV>H) 



’y„[n - N + 1 ]' 


'yf[k{n - N + 1 )] 

.*•/[«] := 


+. 



?/«M 


S//IH 


H„(*..Z f \n )): = 


This quantity motivates the slow-plus Gm 
measurement map 1 definition 

hs(Xa) 

>'s 0 F,(x,) 

Ur.oF^’-'UjJ 
\h f (z f [k(n - N + l)]) 1 
hf[zf [k{n - N + 2 )j) ' 


"F 


(3.24) 


which is parameterized implicitly by x,i and explicitly by 
[zf[k(n — N + 1 )]' 


Z f [n] := 


Zf[kn] 


(3.25) 


This map satisfies the identity 

Y,f[n] - /F,/(:r;,[n - N + [],Z f [n]). (3.26) 

Assumption 3.4: For the fixed positive scalar 77 and some 

k; ■■ 


fixed positive scalars r* and there exists a function //~ J * 


Assumption 3.3: For sonic fixed positive scalars 77 and fij, 
there exists a function A HJ 1 : ► 7v M , parameterized 

by .to, such that: 

1) Zf E B\j(r f ) and A Yj = A Hf(zj) imply zj = 

2 ) Mi f 1 (■) depends continuously on the parameter To and 
is continuous on the set Bqm(t + 6j) where r > 0 is 
such that A///(0 a/( 77 )) C Bqm(t). 

Assumption 3.3-2 is required because of the approximation 
error introduced by order reduction. This requirement is not 
restrictive, since a similar assumption would be required in [4], 
1 5] to handle measurement noise and/or parameter uncertainty. 
Invoking Assumption 3.3, whenever zj[m - M] E Bm(vj) 
then 


n QN X n MN - n N such that: 

1) x M E B N (r s ) and Y)/ = imply :/:* - 

H;/(Y„ f ,Zf) for all Zj 6 B MK (r f ). 

2) H~[ (■, •) is continuous on the set Bqn+m+ f>„) 
where r > 0 is such that H ll f(B\r(r ll ),B mn^j)) x 
Bmn('i'j) Q Bqn+ms(i')- 

Again, Assumption 3.4-2 is required to account for or¬ 
der reduction approximation error. Invoking Assumption 3.4, 
whenever x s \n — TV + 1] E Bs{r s ) and Zj[n] E 
then 

■i-M = o H;/ (V./H, Z f In]) (3.27) 

which is computable for n > N - 1. 

Of course, neither (3.22) nor (3.27) is computable using 
known signals. The fictitious signals AY) and Y)/ may be 
approximated by 

[y((rn -M + l)T/k) - y((m - M)T/k)~ 

AY,[rn] := 


Y,j[n] := 


y(ruT/k) - y((m - 1)7’, 
y((n-N + l)T) 

y(nT) 


(3.28) 

(3.29) 


(3.22) 


which is computable for m > M. 

Turning to the slow observer problem, consider the sum of 
blocks of slow and fasl subsystem outputs 


, (3.23) 


and the unknown Z / [n] will be obtained from the fast observer 
defined below, according to 

tz f [k[n-N+ l)Y 


Zr\n\ := 


zj[kn] 


(3.30) 


1 Although in this section it is possible to formulaic an implcmcniable slow 
observer using a “slow" staic-io-measurcment map, this dcliniiion has been 
chosen to be consistent with a similar definition required in the next section. 
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TABLE I 

Implementation Costs of Full- amp Reduced-Order Observers 


Observer 

Numerical (flops) 

Closed-Form (flops) 

Look-Up Tabic 
(bytes) 

Full *Oder 
Theorem 
3.2 

Corollary 

3.1 

tT 

“Ltfl + uifl 

i afi 

0<(N+U) J ) 

* T 

+ (ju^h 

2 6W+2 + 

2 6A + 2 


Note also that F/, A HJ l and H“j all depend implicitly on 
To* Hence, the following assumption is required. 

Assumption 3.5: At least one of the following conditions 

is satisfied: 

1) xq = to + O(t) is known. 

2) /2 (t, z) = f 2 r(r) 4* A 2 z and h(.v , *) - /^(t) + 

If Assumption 3.5-2 is true, then Ff, AHJ 1 , and H~j are 
independent of .r 0 , since f/(zj) = A 2 zj and hf(zj) = Czf. 
If Assumption 3.5-1 is true, then the unknown functions will 
be estimated by 


t'fM ■= Ffct) Urt, 

(3.31) 


(3.32) 

H~f(x,.Z f ) :=■ H;/(x.,Z f )\ 

=1*1, 

(3.33) 


The implementablc reduced-order observers are defined in 
the following theorem, and they are interconnected in the sense 
that Zf[rn] drives x s [v]. 

Theorem 3.2: Suppose (t*[h], Zf[m\) E Bjv(/\) x Sj\/(r/) 
for n sz 0, ■ , ft and m = (),■■■, kn, where h is the greatest 
integer such that f\T < t. If the system (3.1 M3.3) satisfies 
Assumptions 3.1-3.5, then there exists an r* > 0 such that 


for any ( € (0, c* 11 ) 

if [in] := Ff 1 o AJ/^Af/H) (3.34) 

= z f (mT/(k')) + 0( 0 (3.35) 

•M"l ••= ^ 1 o Hytrin),Zf[n]) (3.36) 
= x„(»T) + 0( r) (3.37) 


for n = iV, • • ■, n and m = M, ■ ■, A n. 

Pnw/ 1 Assumptions 3.1 and 3.2 imply that Theorem 3.1 
holds for 7i ~ 0, «. n (and rrt -(),•■, A h). In light of (3.4) 

and the fact T/k = 0(r), it is evident that r N ((rn + 1)!T/A’) = 
x $ (mT/k) + O(f). Since h g is analytic, ii is continuous and 
therefore // s ((rn + 1)77A) - y*(tnT/k) + O(r), which, along 
with (3.10), implies that 

AV)[m] = A Yf[m\ + 0(f). (3.38) 

This, together with Assumptions 3.3 and 3.5, the continuity of 
Fj and the hypothesis that zj\m] E Bm{i */), implies (3.35). 
From (3.10), it follows that 

ftfM = Uj[n] + 0(f). (3.39) 

Using (3.35), it follows that* 

Zf\n] — Zj[n] + 0(f). (3.40) 


Along with Assumption 3.4, die hypothesis that x, [n] e 
B N [r,) and the continuity of F a , (3.39M3.40) imply (3.37). □ 

Since the observers (3.34) and (3.36) are interconnected, it is 
helpful to describe their implementation further. The required 
sequence of calculations is as follows: 

1) Using To if necessary, construct (3.31), (3.32) and (3.33). 

2) Set rn = n = 0. 

3) While n < ft 

a) Measure y(mT/k). 

b) If m > M, compute £/[m] from (3.34) using 
y{(m — M)T/k). ■ ■ ■ ,y(mT/k). 

c) If rn = k(n + 1) then 

• Set ii = n + 1. 

• If n > TV, compute i ,[n] from (3.36) using 
y((ri - N + 1)T), • ■ ■ ,y{nT) and Zf[k(n - 
N + 1)], ■ • ■, zj[kv], 

d) Set m = m f 1. 

4) Stop. 

It the accuracy of the observer during the boundary layer is 
of no concern, then an estimate of zj [rn] is not needed, and it 
is possible to obtain a simplified slow observer. 

Corollary 3.1 . Suppose that j\[v] E B N {r\) for n - 
0, If the system (3.1)—(3.3) satisfies Assumptions 3.1, 

3.2, and 3.4, then for any integer //* E [N,ft] there exists an 
r** >0 such that for any r E (0, (**) 

•t.M := F?- } o//->()'„,[„],()) (3.41) 

= .r„(vT) + 0 {() (3.42) 

for all n = tt *, ■ ■ ■ ,7/. 

Proof: Equation (3.11) implies that, under the hypothesis 
of the corollary, Zf[m] = O(r) for in E [kt*/T, kn], Taking 
v* as the smallest integer such that n* > N - l f 1*/T leads 
to Zf[n\ = ()(e) for v ■= it*, •-,//. This, together with the 
fact that 7/ s /(-,0) = H^f( ,0) (which is evident from (3.24)) 
and (3.36), proves the result. □ 

Corollary 3.1 reinforces the intuitive notion that if the 
parasitic states arc fast enough, then their effects can be 
completely neglected. Note also that the observer (3.41) docs 
not require either the invertibility of the incremental fast state- 
to-measurcmcnl map, knowledge of .ro, or special structure of 
f 2 and h. 

C. Reduced-Order Observer Advantages 

Table I provides a comparison of the implementation re¬ 
quirements for both full-order and reduced-order observers, 
assuming k — i/r. It is further assumed that numerical 
solution of j nonlinear equations requires 0(j s ) floating point 
operations, and that evaluation ot a closed form solution to j 
nonlinear equations requires Q(j 2 ) floating point operations. 
If the inverse of a j-dimensional map is computed off-line and 
stored in a single precision floating point look-up table with h 
bit resolution, then 2 b - 7+2 bytes of memory are assumed to be 
required. As a consequence of the conservative assumptions 
used, the values shown in Table I underestimate the advantages 
of the reduced-order approach. 
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To better illustrate these requirements, consider a fourth 
(.ider system with two slow and two fast states (N - M = 2). 
The limiting ratio of on-line numerical inversion computa- 
i onal requirements is 

| jm full-order flops _ f8, Theorem 3.2 
reduced-order flops \oo, Corollary 3.1 ' 

The 8-bit look-up table memory requirements for the full- and 
reduced-order implementations for this system are given by 

{ 16 Gbyte, full-order 
512 Kbyte, Theorem 3.2 
256 Kbyte, Corollary 3. J. 

Clearly, a look-up table full-order observer is not practical 
for this system, whereas both of the reduced-order schemes 
require only moderately sized look-up tables. 

As for the stiffness, there is no assurance that the reduced- 
order inversions will be well-conditioned, since poor ob¬ 
servability will not be improved by singular perturbation 
approximations, On the other hand, as previously shown, 
the full-order inversions are always ill-conditioned for small 
enough e. 

D. Autonomous Example 
Consider the linear system 


30.9265 

—24.9047 

.7; 4- 

-13.9306 

23.3253' 


35.3095 

-27.4703 

-16.8008 

25.4823 

z 





(3.43) 

24.5739 

-18.8923' 

+ 

-J 2.2427 

19.0368" 


9.4471 

-6.1202 

-5.9734 

6.2865 

z 


(3.44) 

y = [1.5630 1.6125]:/; + [1.7635 2.7318]z (3.45) 



(c) (d) 

Fig. 1. Actual and estimated slate responses for the case f — 0.1)4. (a) n (/) 
and :p.h| [»], (b) ^(f) and (c) :i (0 - <!>\ (.r(/)) and i/, [mi] and (d> 

viU) ~ (M*(0) and Zf‘>\m]. 

Fig. 2(b) compares the matrix 2-norm condition numbers of 
the full-order observability matrix, the slow observability 
matrix, and the incremental fast observability matrix. The full- 
order observer for this system is ill-conditioned—particularly 
for small t, whereas the reduced-order observers are not. 
Fig. 2(c) shows the ratio of the full-order to reduced-order flop 
count. The reduced-order observers require only about 40% of 
the calculations required by the full-order observer, assuming 
that the matrix inversions are computed off-line. Based on the 
earlier analysis, it is expected that this flops ratio will approach 
four as t —* 0. 

IV. Reduced-Order Design with Inputs 


where :ti(0) = 1, .r 2 (0) = 2, zi(0) = -2, z 2 ( 0) = 1, 
T = 0.05, and k is the greatest integer less than or equal 
to 1/e. This system satisfies the requirements of Assumptions 
3.1—3.5. According to Theorem 3.2, when 1/e is an integer, 
the fast observer is given by 


z f [m] = 


y[rn - 1] - y[m — 2] 
[ y[m) - y[m - 1 ] 


2.5922 -3.4629 
1.7732 -2.0408 

(3.46) 

and, otherwise,The numerical matrix in this expression will be 
a perturbation of that shown. The slow observer is given by 


x s [*r/] 


-1.7104 1.9170 
0.1755 0.0236 

y[n - 1] - [1.7635 2.7318 ]z s [k{n - 1)] 
y[n] - [1.7635 2.7318]£/[fen] 


(3.47) 


Fig. 1 shows the actual and reduced-order estimated re¬ 
sponses for the case e = 0.04. The reduced-order observers 
do an excellent job of reconstructing the slow and fast states 
for this value of e. The behavior of the observers as t varies 
from 0.01 to 0.1 is shown in Fig. 2. In Fig. 2(a), the averages 
(over the time indexes) of the absolute values of the observer 
errors are shown. For small values of e the estimates are good, 
but for large enough c, the estimates degrade as expected. 


In this section, the design of reduced-order observers is 
extended to a class of nonlinear singularly perturbed sys¬ 
tems with piecewise-constant inputs. The section concludes 
with a practical application of the theory: velocity regulation 
of permanent-magnet synchronous motors, using only stator 
current measurements. 

A. Decomposition with Piecewise-Constant Inputs 

Nonlinear singularly perturbed systems exhibiting linearity 
in the fast states and the control inputs admit reduced-order 
models that are easy to discretize. Moreover, this class ot 
systems also models the dynamics of some important electro¬ 
mechanical devices. Hence, we consider systems of the form 

x — a\(x) + A i (x)z + B\ (x)Ui ;/;(()) = x i} (4.1) 
ez = a 2 (x) -F A 2 (x)z + B 2 (x)u. z(0) = z {) (4.2) 

y = c(x) *F C(x)z (4.3) 

where x £ , z € u G ‘R r , y 6 7?^, t > 0 is 

a small parameter, and the dot denotes differentiation with 
respect to l. It is assumed that a x , a 2 , A \, A 2 , B \, B 2 , e and C 
are analytic functions, that unique solutions x(t) and z(t) exist, 
and that y is the only measurement available. Furthermore, 
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t 

(b) 



(c) 

Fig. 2 Observer performance as a function ol f (a) Aveiagc absolute 
observer error, (b) Observability matrix condition numbers, and (c) Full-order 
to reduced-order flops ratio 


lt(0 is assumed to be piecewise-constant 

u(i) - v(nT) =: </[//], € [nT, (rt + 1)7") (4.4) 

where T is consistent with the speed of the .r dynamics. The 
need to accommodate piccewise-constant inputs rules out any 
straightforward use of Theorem 3.1. Hence, a systematic ap¬ 
proach to time-scale decomposition for the system (4.1 >-(4.4) 
must be developed. 

As pointed out in [2], each step in the input will excite the 
fast dynamics. For this reason, a family of slow manifolds will 
be considered, each of interest over one control interval. 

Assumption 4.1: For a fixed n > N , there exist fixed 
positive scalars r r , r z and r u such that u(t) € Bnf(r u ) and 
(jr(t)iz(t)) € Bait's) x Bm(v z ) for all i E [0, nT], and such 
that A 2 (.r) is nonsingular for all .r 6 B/v(r*). 


The family of slow manifolds is determined by setting t = 0 
in (4.2)* leading to 

0n(-Tn(O, u[rt]):=-Ao l (jr,(t)){a 2 (x*(t)) + B 2 (j‘m(t))u[n]). 

(4.5) 

The slow subsystem, which models the system assuming 
that z(t) = (j>, f (.r s (t).u[n]), is defined by 

=a,(/ g (/)) + B,(r il (/))4/],j,(ll) - j‘o (4.6) 
= c s (j\(t)) + D«(j\(/))u[n] (4.7) 

where 

u s (.r,) := «i(j\) - A } {j\)A~ 2 1 (.r H )a 2 (.t\) (4.8) 

B,(.r s ) := - A\ (.r s )/l 2 " x (j\)B 2 (j\) (4.9) 

c(j\) - C(.r Ji )A^ 1 (j,)fJ2(j\) (4.10) 

D.(r m ) := -C(r s )Ai 1 (.v s )B 2 (.i<). (4.11) 

A unique solution jr^(t) of the system (4.6)-(4.7) is assumed 
to exist for all t E [0, nT\. 

The family of fast subsystems, which models the deviation 
of z from its quasi-steady-state (j) n (.i s (//7’). u[n]), is defined 
by 

= A 2 (.r,(nT))z nn) (T,,) (4.12) 

Hfb ,)(r„) = r(r,(»7'))2 /(tl) (r„) (4.13) 

in the fast time-scale 

t - nT 

r n -■ (4.14) 

t 

Each member could be initiah/cd with i(n 7 ) — 
</>„(j(nT),i/[n]), but this approach will be taken only 
for n = 0. For n > 1, we substitute ,r s (nT) foi j{nT) and 
use the continuity of 2 at t — n r I ’ to justify substitution of 
-y(n-i)(7’/0 + ,(h7’).h[h - 1]) for z(vT). Ilcncc, 

the family of initial conditions is defined by 

{ 2o - <M- r o. «[0]),n = 0 
zj(n- i)(?) + <f>n- i(j.(»7’), u[n - 1]) (4.15) 

-(/),,(./,(«7’), 1 /[»]).» > 1. 

Note that (4.12)—(4.15) are evaluated in a sequential nature, 
with the nih member of the family depending on the terminal 
value of the (n- l)st member and on r s (nT). This formulation 
was motivated by (and encompasses) the resetting condition 
given in |9], It is significant that (4.12)-(4.15), in general, 
depend on j M (nT). 

The validity of the reduced-order subsystems as asymptotic 
approximations requires a stability assumption on (4.1M4.4). 

Assumption 42: The fast stales are asymptotically stable, 
i,e. Re{A(A 2 (.r))} <-*,<() evaluated on .r = ,/\, for some 
fixed constant n > 0. 

The following theorem provides a method of approximating 
r(f), z{l) and y(t) by solving the reduced-order subsystems 
(4.6H4.15). 
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Theorem 4.1; If the system (4.1M4.4) satisfies Assump¬ 
tions 4.1 and 4.2, then there exists an e* > 0 such that for 
my e E (0,c*) 

x(t) = x 9 {t) + 0(t) (4.16) 

z(t) = w[n]) + 2/(n)(T n ) + 0(0 (4.17) 

1/(0 = Va(t) + Vf(n){Tn) + 0(c) (4.18) 

lor all t E [nT, (n 4- 1)T), and n = (),■•■, h. Moreover, given 
any T > 0, there exists an t** E (0, (*] such that for any 
cE( 0,r**) 

z f(n){T/c) = 0(f) (4.19) 


for all n = n. 

Proof: Over each interval I E [nT, (n 4- 1)T), the 
system (4.1)-(4.4) can be viewed as a special case of system 
(3.1 )-*(3.3) parameterized by i/.[n]. From this point of view, 
Assumptions 4.1 and 4.2 imply Assumptions 3.1 and 3.2. 
Furthermore, note that (4.6)—(4.7) are equivalent to (3.4H3.5) 
except for the initial conditions taken at t = nl\ and note 
that (4.12H4.13) arc equivalent to (3.6)—(3-7) except for the 
parameterizing values of x and the initial conditions taken at 
t — nT. For n -- 0, Theorem 3.1 may be directly applied 
(there are no exceptions for this index), proving the validity 
of (4.16)—(4.18) over t E [0,7 ). For n > 1, the exceptions 
can be summarized by noting that initial conditions (and 
parameterizing values of x) are assigned according to 


x*(nT) 


z f(») 




f ar(nT), 53.1 
l^(nT),54.1 


(4.20) 


z(nT) - 4> n (x(nT), u[n]), 53.1 
Z f(n-1){ j) 4- 4>n~i(xAnT),u[n - 1]) (4.21) 
—<fi n ( x s (nT) , n[n]), |j4.1. 


Now suppose that (4.16)—(4.18) hold for some v! > 0. Since 
and 2 are continuous, x((n'+l)T) = +1)T)+G(e) and 

z((n' + 1)T) = <j> n '(x 8 ((n' + l)T) 9 n{n']) + Zfi n ')(T/t)+0(f). 
Hence, the discrepancies between the assignments indicated by 
(4.20)-(4.21) at n = n f 4-1 arc O(t). Since all the differential 
equations involved admit solutions which depend continuously 
on their initial conditions (and parameterizing values of :r), 
we see that (4.16)—(4.18) hold for n' 4- 1. By induction, 
(4.16)-(4.I8) are proven for all n = (),••■ ,n. The last claim, 
(4.19), is an easy consequence of Assumption 4.2. □ 


The function F* is given by [101 


i4.2Fi 

where b 8j is the .yth column of and u 3 is the jth element 
of u. The family of fast subsystems is sampled at the instants 
r n = ?nT/(kf), where k is an integer (k > M) such that 
T/k = ()(f ), to obtain 


[»),«[»]) 


Eti 

i“0 


.7 = 1 


2 /(n)[ m + 1] = 4>/( n )Z/( n )[m] (4.26) 

W/(»)M = C f(n)Zf{n)H (4.27) 

where 2 /(rl) [m] •= z f(n) (mT/{k()). y f („)[rn\ := !)n n) 
(mT/(k.t)) and 

•»/(„) :=c:xp^ 2 (x„[n])'T\ (4.28) 

Cf (n) ■= <?(*.[»]). (4.29) 

Note that the fast index m is implicitly associated with a slow 
index n. For simplicity, this dependence is suppressed in the 
notation. Note also that throughout this section, time-varying 
quantities are written in a manner consistent with the notation 
of Theorem 4.1. 

The reduced-order observers arc constructed by formulating 
the appropriate state-to-measurement maps. For each u, the 
incremental fast statc-to-measurcment map 


A///(n)( 2 /(n)) • 


£ 7 (")(*/(n) ~ hl)Zf(n) 


(4.30) 


is implicitly parameterized by :r*[n], and satisfies the identity 
Al/( W )[m] = A H fin) {z fitl) [m - A#]) (4.31) 


where AT/( n )[m] is the block of incremental fast subsystem 
outputs 


B Synthesis of Reduced-Order Observers with Inputs 

In order to design reduced-order observers, discrete-time 
representations of the subsystem models are needed. The slow 
subsystem is sampled at the instants t — nT to obtain 


«.[w+ 1] = *»(*.[«].«[»]) 

l/.[n] = /»,(j',[n],w[n]) 

where x 8 [n] := :r,(nT), y„[n] := y„(nT) and 
/i 8 (;n,[n], w[n]) := c 8 (*,[«]■) + D x (x„[n\)u[n}. 


(4.22) 

(4.23) 


AV>(„)H 


’?//(,.)[m - M + 1] - ?// (n) [m - M] 


L »/(n)H-W/(n)[m- '] J 4 32) 

Assumption 4.3: For some positive scalars r 8 and />/, there 
exist functions AHJ^ : —> 1Z M for n = 0, 

parameterized by x A [n], such that 

1) Ay /(n) = A///(„)(*/(„)) implies z fM = A Hj{ n) 
(A!/(„)) for all x 8 [n] € B N (r,). 

2) Aare continuous in the parameterizing .x.,[n] over 
the set tf/v(r 8 + 6j). 


(4.24) 
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From (4.30), AHj^ n) is a matrix pseudo-inverse. However, 
since depends on t*, the inversion may not in general 

be done off-line. Using Assumption 4.3, if .r«[nj 6 B/v(r„), 
then 


conditions (4.15) (along with the definition (4.24)) into (4.34) 
proves that this map satisfies 

H,/(x „[rt - N+l],Zf[n- l],t/[»i - 1]) = };,["]• (4.39) 


*/(H) W = *"n) A tf7(n)( AV /l»)M) (4.33) 

which is computable for m = M , * •, k and n = 0 f —, n. 

In constructing the slow observer, it will be convenient 
to use the notation h*(x H ) := and F“(.t„) := 

F 0 (x»,u). Summing slow and fast subsystem output blocks 
yields the identity 

Y.f[n) = 

*;l«-*+ 1 l(, j [ fI _ N + 1]) ] 

h^ n ~ N+2] ohf n - N+ %-,[u-N+l)) 

h" [n] o o ■ ■ ■ o - N + 1]). 

- N + l])*/(n-JV + |)[0j 

+ CoF^- N+l \, a ['<-N + l])z f(n .^ 2) M 

P ° i r " l "~ 11 o • ■ ■ O F^-^ix.ln -N + 1]) 8/(1|) [01 

(4.34) 


where 



'y»[n - N + ]]■ 


///(»-A+ 1)[0| 

Y, f [n] := 


+ 



y*l n \ 


. .'//„)[()] . 


Assumption 4.4. For the fixed positive scalar r a and some 
positive scalars rj and there exists a function l]~ f [ : 
x R \IN x 7 JfPN ftN such lhat 

1) .r* <5 fljvfr*) and Y sf = II Hf (jr H% Zf,U) imply 

= }J~ f l (Y s f,Zf,U) for all (Z/,/7) € £a/n('/) x 

tfrv(r u ). 

2) is continuous on the set 

Bqn+mn{i “1* M where r > 0 is such that 

Jf tt j(B^(i\),BMrsj(Tf).Bpx{r li )) x B^^(rj) C 

Bqn+ Mb, (r). 

Using Assumption 4.4, if .r*[n - A r 4- 1] E B^(r\) and 
Zf[n] e Ba/a(t/) then 

.r*[n] F;I» A ‘ flJ o//" 1 

{\\f[n] % Z f \n - l],f/[u - 1]) (4.40) 

which is computable for n = AT,.. , 

Neither (4.33) nor (4.40) is implcmentable. The unknown 
signals are estimated by 

Ay y ,„)H := 

■r i(uT + (m - M 4 l)T/k) - U („T 4 (in - M)T/k)' 
ij(iiT 4- viT/k) - u(nT + («/ - 1)7’/A) 

(4.41) 


The slow-plus-fasi staie-lo-measuremcnt map 3 is defined as 


H r f(.t,,Zf[n - 1], f/[/i - 1]) := 

[ //, ul,, - A| (.r.) 1 

^u(n-JV+l] o fJ' fn-A ' +11 (j. < ) 

(^( T *W(n-A)[fc] 

+ CoF: [n ~ N ^(.r f )z H , l ^ +[) [k] 

C'o o ■ ■ - o F? [n *“'(', )*/(.. ,)[*■]. 

(4.36) 


which is parameterized by the vectors 


U[n - 1] : 


“/(«- a’)M 
>[*]. 

“u[n - /V] - 
. u[« - 1] . 


(4.37) 


(4.38) 


Note that the right-hand side of (4.34) is not the same as the 
right-hand side of (4.36). However, substitution of the initial 


2 Since the fast output block in (4,34) depends on an implcmeniablc 
slow observer cannot be formulated using a “slow” state-to-ineasurcmenl 
map. m control to the previous section. 




r//((" w + j) 7) i 


u(»T) 


Zf[» 


Zt(n-l\)[k} 


L z j{" i)[*]J 


(4.42) 


(4.43) 


where i/ (l )[/n] will be supplied by the fasi observer. Also, 
$f{ n ) and require knowledge of ./*[/>], motivating 

the following assumption. 

Assumption 4 5’ At least one of the following conditions 
is satisfied: 

1) To = ./■(, 4 0(0 is known. 

2) The matrices Aaf-r) — A 2 and 0(0 — C are consrant. 
If Assumption 4.5-2 is true, then both $ /( „) and A//,“j 0 

are independent of If Assumption 4.5-1 is true, then 
<!>/(„) and A//y^ are estimated according to 


*/<»> := */(")L.h»,.h 


* r,[n] 


(4.44) 

(4.45) 


where 



■hun = 

pu[n- 1] 

* * 


0 

0 


* ■ 0 0 ), v = 1, • ■ , JV ~ 1 

(4.46) 
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and where will be supplied by the slow observer for 

n = N, • • - i 

The implementable reduced-order observers are defined in 
the following theorem, and they are interconnected in the sense 
that x-d[n] * s used 1° compute zj( n )[k], which in turn is used 
to compute i 9 [w + 1], and so on. 

Theorem 42: Suppose (^[n], */( n )M) 6 Bx(r s ) x 
&m (r/) for n = 0, ■,n and m = (),■■■,A;. If the system 

(4.1H4.4) satisfies Assumptions 4.1-4.5, then there exists an 
t* > 0 such that for any t € (0, a*) 

-7(0H := ■i>J[ l) A//-i ) (Ay> (() H) (4.47) 

= 3j {l) (mT/(k <)) + O(r) (4.48) 

i,[n] := F“ [n_l1 o • • • o F“l"- A ' +, 1 o //~ j 

(t/[«],Z/[H-ll,t/In-l]) (4.49) 

• =;r,(nT) + 0(f) (4.50) 

for all 7 — (),■•*.n, n = Af, • - •,//, and m = M, - ■ ■ .k. 

Proof: By (4.6) and the fact that T/k = O(c), it is 
evident that +• mT/k) = x s (iT + (m -f 1 )T/A;) + 0(c). 
The continuity of //*, along with (4.18) implies that 

AK /(f) [m] = AJ7 (f) H + 0(t). (4-51) 

This fact, along with Assumptions 4.3 and 4.5 and (4.46), 
implies (4.48) for i = 0, • -,N - 1. We now proceed 
inductively in n. The above analysis proves that 

Z f [ri-}] = Zj[n'-i] + 0(f) (4.52) 

for n' = N. Also, (4.18) implies that 

tj[n] - Y H/ [n f ) + O(c) (4.53) 

for vf = N. These two facts, along with Assumption 4.4, the 
continuity of F s and the hypothesis that {:r s \n'iZ f \v'- l])e 
Bj\ {r s ) x Bj\ix(vf) imply (4.50) for n = n f — TV. But 
:r. s [r</] = :/:„[«'] + ()( f) in turn implies (as argued above) 
that Sf(n')[ m j “ z f(n')[ rn ] + 0(f) lor ra ~ .A/. ■ ■, k. Thus, 
repetition of this argument at an arbitrary r\! inductively proves 
the result. Note finally that since this induction stops when 
n* = /I, any cumulative errors are guaranteed to be (){<). □ 

Since the observers (4.47) and (4.49) are quite complex, 
it is helpful to enumerate the sequence of operations that 
implements the algorithm. The observers are evaluated by: 

1) Using = Xq if necessary, construct (4.44) and 

(4.45) for n = 0. 

2) Set m = n = 0. 

3) While n < h 

a) Measure y(nT + mT/k). 

b) If m > M, compute zj( n )[in] from (4.47) using 
y{nT 4- (m - M)T/k ), ■ ■ ■, y(nT + mT/k). 

c) If m - fc, then 

• Set n = n + 1 and m — 0. 

• If n > N, compute x M [tl] from (4.49) 
using tj((n - N + 1)T), ■ ■, y(wTj, 
Zf(n-N) ■■■,«/(„-])[A:] and u[n - 
.JV], ■■•,u[n - 1], Else, set x,[n] according 
to (4.46). 


• Using i»[«] if necessary, construct i-t . 14 , 
(4.45). 

d) Set m =* vi + 1. 

4) Stop. 

Under certain conditions, it is possible to construct a Uw 
observer which is independent of the fast observer, this 
possible because if zj^ is fast enough relative to T. then us 
contribution to the slow observer may be neglected. 

Corollary 4.1: Suppose a;*[n] £ Bx(r s ) for n ~ tL 
If the system (4.1)-(4.4) satisfies Assumptions 4 . 1 , 4 . 2 , and 
4.4, then there exists an t** > 0 such that for any < e. ((),.«■ *'*) 

x„[n] := o • • ■ o F," 0 II;/ 

(V;/[«],0.f/[n- ]]) (4.54) 

= .r,(n7’)4 0(f) (4.55) 

for n =: N, • • ■, h. 

Proof: In light of (4.19), there exists an c** so that (for 
the given T) zj {1l )[k] - O(r) for all v ~ Utilizing 

this fact in (4.49) yields (4.55). D 

Corollary 4.1 is different from Corollary 3.1 in that (4.54) 
produces estimates over the same range of n ns (4.49) does. 
This is because r is assumed to be small enough so that 
the boundary layers are over in one slow sampling interval. 
Note that it is not possible to simply “wait long enough” for 
the boundary layer to be over, because it is excited by each 
step in the slow input. Finally., the observer (4.54) does not 
require either fast subsystem observability, knowledge of xa. 
or special structure of A -2 and C. 

C. Electromechanical System Example 

An important practical application considered now is ve¬ 
locity control of two-phase permanent-magnet synchronous 
motors using only phase current measurements. In [ 6 ], a 
recursive nonlinear observer similar to an Extended Kalman 
Filter is constructed to estimate the rotor motion from stator 
current measurements. The disadvantages of the scheme in 
[ 6 ], in comparison with the methods introduced in this paper, 
include: sampling and computation necessarily occur in the 
electrical time-scale; the innovation gains must be selected for 
a specific and limited range of rolor velocities to avoid poor 
estimation error convergence. 

The two-phase permanent-magnet synchronous motor can 
indeed be modeled as a singularly perturbed nonlinear system 
in the form (4.1H4 4) under standard assumptions. Assuming 
that magnetic saturation effects are negligible, that the air 
gap is smooth, that the distribution of windings eliminates 
harmonic effects, and that the load is a constant inertia, the 
continuous-time dynamic model is 

— = tu (4.56) 

at 

~ sin (N P B) + i 2 can(N p 9)) - (4.57) 

= 1 »! - Rn - K m u>sin(NpO) ( 4 . 58 ) 

L~ = v 2 - Ri 2 - K m u)cw('N p 0) 


(4.59) 
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Fig. 3. Motor response under output feedback control using only the slow 
observer of Corollary 4.1 for L =r L„ ol „ and L = (a) Actual 

rotor velocity u >(t) and (b) Phase 1 current n(f) and slow manifolds 
</>,,(* *(f)< «[»)) for it ■= 0. ■ ■ .4 at L = L, 


where 0 and u> are the angular position and velocity of the 
rotor, i\ and i 2 are the currents in the stator phases, vi and v<> 
are the voltages applied to the stator phases, K in is the magnet 
constant, N ft is the number of magnet pole pairs on the rotor, 
J is the total rotor inertia, B is the total rotor viscous damping 
coefficient, L is the inductance of the stator phases, and B is 
the resistance of the stator phases. Typically, L can be taken 
as a small parameter so that, in the notation of (4.1M4.3), 
f. = s = [0u;]\ 2 = [/j 12 ]', a = [v\ v 2 ]\ and y = [»j ; 2 ]'. 

The reduced-order observers are formulated according to 
Theorem 4.2. The fast observer is decoupled from the slow 
observer and is defined for all 1 < in < k and each fixed n by 

i/(n)lH = ^j (),lW - in\[m - J]) (4.60) 

‘/(. 02 H = 0 ^ ) (»n 2 w - 2[m - 1 ]) (4.61) 

where i n [m] := i(nT + rnT/k) and 0/ := rxp(—i?7’/ Lk). 
The slow observer can be constructed using only the slow 
subsystem output equations. Thus, discretization of the slow 
subsystem is not necessary, and the slow observer is implicitly 
defined for n > 1 as a solution (0*[n], £*[«]) of the system 
of nonlinear algebraic equations 

V’l := jjjr-(i'i[« ~ “ ^(»i[n] - */(n-i)iW)) 

= li.ln] siii(JVp0„[n]) (4.62) 

V'2 : = - 1] - - V(n-l)2W)) 

= w s [n]ooR(7Vp^[n]) (4.63) 


where i[7t] si(?iT), and where i>\ and 02 are computed from 
known quantities. The observer of Corollary 4.1 is recovered 
by setting V( M -i)[fc] = 0 in (4.62H4.63). 

A few comments are in order concerning the inversion of * 
the slow observer equations. Whenever 0i ^ 0 or 02 ^ 0, 
(4.62M4.63) can be solved for 8*[n] and a; a [w]. There are, 
however, multiple solutions. If (0°,u> 0 ) is a solution, then 
so are (0° ± j^j, u°) and (0° ± -$-(2j - l),-6J l) ), for any 
integer j. In the previous section, tfiis type of difficulty was 
avoided by insisting that .r*[ti] € In this example, 

such an approach would require 0 rt [w] E B\(ir/2N V - bo) 
for some bo E (0, n/2N p ). Since limiting the rotor motion 
is not practical, heuristic rules will be used to decide which 
solution of (4.62M4.63) is the correct one. Assuming that T is 
sufficiently small so that 0 does not vary more than ±n/(2N v ) 
from one slow sampling instant to ihe next, it is reasonable 
to choose the solution closest to the previous estimate. If the 
motor starts from rest and 0(0) is known to ±7r/(2N p ), then 
this scheme will provide the correct solution. If 0 1 = 0 2 = 0. 
then d>s[n\ = 0 and thus (4.62)-(4.63) cannol be inverted, so 
the estimate 8# [n] = 8 H [n - 1] will be used in this case. 

The observer-based feedback will be designed from the slow 
subsystem. The simplest such feedback, one that emulates a 
continuous-lime linearizing slow controller, is given by 


1’iM" 

/'2M. 


si.i((V,AMf 

roh(JV|,tf,[/»])_ 




(4.64) 


where u)' 1 is the desired velocity and > 0 is a design 
parameter. The conlinuous-time version of (4.64) acting on the 
slow subsystem (under the assumptions that L -- 0, 0, — 0„ 
and uf s - u s ) results in closed-loop dynamics 0, = u; s , 
uj* = K^(uj d - uj H ). Since (4.64) emulates a continuous-time 
control, T needs to be relatively small. 

Simulations were run with parameters from a commercially 
available motor: N p = 2 pole pairs, B — 0.41812, L = 0.1.31 
mH, K fn = 29.4 mN-m/A, J = 0.0205 gm^/rad, B = 0.013 
gm 2 /radsec, T — 0.001 sec and k ~ 10. Note that the value 
given above for L (i.e. r) will be used as a “nominal” value. 
In all cases, the motor, starting from rest with 0 S [O] = 0 and 
0(0) = 0.2, was commanded to regulate to u) d = 40 rad/sec 
for / E [0,2.5) sec, and lo u d = -40 rad/scc for t E [2.5,5.0] 
sec, with K u -- 13. 

The first case considered corresponds to output feedback 
control using the slow observer of Corollary 4.1. Fig. 3(a) 
shows u(t) for L - L nom and L ~ 10L TMmi . At L = A m)m , 
the controller does an excellent job of providing the desired 
linear response. At L = 1 however, the controller 

has trouble at startup, and the heuristics for choosing the 
correct sign of the velocity fail. In Fig. 3(b), ii(1) and the 
corresponding first five slow manifolds are shown for L = 
Lnom . In this case, T is just large enough for i\(t) to settle 
to its quasi-steady-state value. For L = 10L„ o „,, this property 
no longer holds and, hence, the observer fails severely. 
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(b) 

I ig 4 Motor response under output feedback control using both the slow 
and last observers of Thcoiem 4 2 for T — 10/ and T — dill/ (a) 
Actual roloi velocity u *(t) ind (b) Phise I current /](/) and slow niamlolds 

(i (f) »/[»/]) lor ii — 0 1 at L = 10/ 

The second case considered corresponds lo oulpul feedback 
control using the slow and fast observers of Theorem 4 2 
Fig 4(a) shows uj{t) foi L - 10 L lum <*nd L = r )00 L n 
At L = 10/ u , r , the controllci pcrlorms very well although 
a perceptible steady-slate error occurs Although not evident 
from the plot the steady state error is noi a consequence 
of observation enor but instead is due to the lailure of the 
emulating controller At J - r >00/ „ wl the controller yields 
substantially degraded peiformance In Fig 4(b), y L (/) and 
the corresponding first five slow mandolds are shown lor 
L - 10 L n)w Even though /](() doesn’t settle to its quasi 
steady-state value, use of the fast observer has accounted foi 
this feature, resulting in good performance 

V Conclusions 

In this paper, the observer design problem foi nonlinear 
singularly pettuibed systems has been formulated as one of 
designing separate observers (or the approximate slow and 
fast subsystem models Not only are the resulting observers 
of reduced-order, they aie also implemented in a multirate 
fashion Both of these features result in significant savings in 
computational and/or memory requirements Furthermore, the 
stiffness inherent in a full-order implementation is eliminated 
by this approach 

It has been proven that this reduced-order observer design 
methodology guarantees ()(t) observation errors Hence for 
sufficiently small e, these errors will be acceptable for all 
practical purposes Although the development of stabilizing 
observer-based control algonthms has not been attempted, the 
use of obseiver-based feedback has been considered in a case- 


study For the application to so-called sensork s 
control of permanent-magnet synchionous motoi ,, 
performance has been demonstrated using tmi lultiu i u 
observer designs 
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Adaptive Back-Pressure Congestion 
Control Based on Local Information 

Leandros Tassiulas, Member, IEEE 


Abstract-*-Th* problem of distributed congestion control as it 
arises in communication networks as well as in manufacturing 
systems is studied in this paper. In particular, a multistage 
queueing system that models virtual circuit and datagram com- 
tnunication networks and a class of manufacturing systems are 
considered. The topology may be arbitrary, there are multiple 
traffic classes, and the routing can be class dependent, with 
routes that may form direct or indirect loops. The model incor¬ 
porates the functions of transmission scheduling, flow control, 
and routing, through which congestion control is performed in 
the network. A policy is given that performs these functions 
jointly. According to the policy, heavily loaded queues are given 
higher priority in service. A congested node may reduce the 
flow from upstream nodes through a flow control mechanism. 
Whenever routing is required, it Is performed in such a manner 
that the lightly loaded queues receive most of the traffic. For 
arrival processes with bounded burstiness, the policy guarantees 
bounded backlogs in the network, as long as the load of each 
server is less than one. The actions of each server are based on 
the state of its own queues and of the queues one hop away. 
Therefore, they are implementable in a distributed fashion. An 
adaptive version of the policy is also provided which makes it 
independent of the arrival rates. 

I. Introduction 

D YNAMIC control achieves a better utilization of the 
transmission and switching resources of a communica¬ 
tion network over static schemes. Dynamic schemes have 
been adopted for packet routing, for sharing the transmis¬ 
sion capacity of a link among several competing information 
streams, or for controlling the flow of traffic in a packet 
switching network. In large networks though, where the control 
functions are distributed, dynamic control policies are prone to 
misbehavior, and the network may enter intriguing instability 
modes. The same phenomenon has been observed in manu¬ 
facturing systems with distributed scheduling [10], [6], In this 
work, we present a new distributed dynamic control policy 
in a multistage queueing system that models virtual circuit 
and datagram communication networks, as well as a class of 
manufacturing systems. The policy, which we call the adaptive 
back-pressure (ABP) congestion control policy, schedules the 
servers, routes the served traffic, and controls the flow based 
on local information, and therefore is amenable to distributed 
implementation. Each server is allocated dynamically based 
on the state of the queues that it serves, as well as on the state 
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of downstream queues, which are one hop away. When there 
are routing options, the decision is taken again in a similar 
distributed fashion. U is shown that under the ABP policy, 
the backlog in the network remains bounded as long as the 
utilization of each server is less than one. 

A central problem in the design of virtual circuit (VC) 
communication networks is the sharing of the transmission 
capacity of a link among the virtual circuits that go through it. 
Several schemes have been proposed, including round robin, 
weighted round robin, golden ratio scheduling, virtual dock, 
and processor sharing [4], |9J, f 14], [8], [51. In some of the 
above schemes [41, [5], the link transmission scheduling is 
combined with window flow control. One of the main prob¬ 
lems arising in this context is how to evaluate the throughput 
of a virtual circuit network, that is, the session rates thal 
can be sustained by the network for certain transmission 
control policies, without experiencing instabilities. Cruz |2], 
[3] has considered this problem m virtual circuit networks 
by employing several different link transmission disciplines 
including FIFO, LIFO, and strict priority. He obtained bounds 
for the backlog in the nodes of the network. These bounds 
depend on the session arrival rates and burstiness character¬ 
istics, and guarantee stable operation of the system for the 
traffic parameters for which they remain finite. In certain cases 
though, in networks with virtual circuits that may form cycles, 
these bounds explode for arrival rales which give utilization 
strictly less than one at all network links. In this case, no 
conclusion can he drawn for the stability of the network. 
Chang fl| extended these results, obtaining backlog bounds 
in multiclass networks with routing. The stability problem yet 
remained open for certain cases in networks with cycles. Yaron 
and Sidi [13] addressed the same question with processes 
satisfying constraints on the tails of the backlog distribution. 
Hahne [4] has shown that with round robin scheduling in each 
link and hop-by-hop window flow control, there exist window 
sizes to stabilize the network as long as the link utilization 
is less than one. The selection of the window sizes depends 
on the arrival rates. By the ABP policy, stabilization of the 
network is achieved as long as the utilization of every link is 
less than one, and no knowledge of the arrival rates is required 
for its implementation. When the routes of the packets are 
not prespecified but only their final destinations are given, as 
is the case with datagram communication networks, then the 
ABP policy combines routing with the scheduling mechanisms 
described in the VC case to preserve bounded queue lengths. 

A problem of stability, similar to the one discussed above, 
arises in manufacturing systems operated under distributed 
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scheduling policies, Perkins and Kumar [10] and Kumar 
and Seidman [6] have studied the problem of stability in 
a flexible manufacturing system with distributed scheduling. 
While simple distributed policies are shown to stabilize acyclic 
manufacturing networks in [10], a simple two-stage queueing 
network is presented in [6]. In this model instabilities occur 
in situations in which all servers of the system are strictly un¬ 
derloaded and the system is operated under a work-conserving 
policy. This example demonstrates how instabilities may occur 
in multistage manufacturing systems due to the phenomena 
of starvation and overloading. The queueing network that 
corresponds to the manufacturing system considered falls 
within the scope of the networks presented in our study. The 
ABP policy stabilizes the manufacturing system as long as the 
utilization of each machine is less than one. The scheduling of 
each machine i is determined by the size of the backlog of the 
different part types in i ) as well as the size of the backlogs of 
those part types in downstream machines one hop away from i. 
Occasionally, the machines are forced to idle again, depending 
on the local state. The queueing system presented here extends 
the above manufacturing systems model to include the case 
where a part type may have the option of several alternative 
manufacturing scenarios and routing decisions are made. In 
Section JI-C, this is discussed in more detail. The ABP policy 
in that case combines scheduling and routing decisions to 
achieve the same goal, In [11], a single-class network was 
considered, and a routing policy of the same nature was studied 
for Poisson arrivals. 

For the arrival process, we assume that the burstiness 
is bounded by a deterministic bound [2J, [3|. This traffic 
assumption has been used widely lately since the outputs of the 
traffic regulators that shape the traffic before it enters a high¬ 
speed network satisfy this type of constraint. Lu and Kumar 
[7J have used a similar type of traffic in the context of a 
manufacturing system. 

The rest of the paper is organized as follows. In Section 
II, the queueing network is presented, and the correspondence 
with virtual circuit and datagrams networks as well as man¬ 
ufacturing systems is demonstrated, in Section III, the issue 
of stability is discussed, and the sufficient stability condition 
is shown. In Section IV, a parametric class of policies is 
specified, and their stability is studied. In Section V, the ABP 
policy is specified and investigated. Finally, in Section VI, the 
results are discussed and some open problems are presented. 

II. The Network Model 

We consider a network consisting of N servers and B 
buffers (Fig. 1). Each server i can serve any buffer from the 
set of buffers B t , i = 1, ■ ■ ■, N — it is allocated to the buffers 
according to some scheduling discipline. The sets B x may be 
overlapping, that is, a buffer may be served by more than 
one server simultaneously. The served traffic from buffer j 
can be directed to any buffer of the set 7 Zj. A routing policy 
determines to which buffer in 7Z } the traffic from j is routed. 
From certain buffers, the traffic can be directed out of the 
system; this is indicated by including a 0 in the set 71 r We 
make the natural assumption that, from every buffer, the work 


can be forwarded out of the system if it is itnur.i tlx j\ 
appropriate sequence of buffers. We considei a IlihU m , 
for the work coming into the system. The time is tl 

and the instantanepus late with which work come- m L*.| ^ 
from outside is a,(t ). For simplicity, we assume that tin m i\ ii 
stream can contain no impulses, that is, the arrival ut , l / , 
for all buffers i is bounded uniformly over time b\ a bound 
A. In the time interval (/ Zl t 2 ), an amount of work equal to 



enters buffer t from outside. We assume that the arrival 
streams satisfy some burstiness constraints; that is, there are 
nonnegative numbers a M fc M / = 1, •-,/? such that, tor all 
0 < ti < / 2 , we have 

/ a,(t)dt < - M + k,. (2,1) 

Jti 

We will assume that the long-run average rate with which work 
enters the system in buffer i exists 



and (j i will be referred to as the arrival rate lo buffer /. The 
latter assumption is not needed for the validity of the results. 
It is introduced only because it is conceptually simpler to 
think of the n t as being the arrival rate to buffer f. When 
server i provides service to buffer j 6 B, which is nonempty, 
work leaves the buffer with a constant rate /v, r the service 
rate of server / at buffer j. If server \ is assigned lo buffer 
j continuously for an amount of time T and the traffic is 
directed to buffer / in 7?,. then an amount of work Tit,, is 
transferred from j to /. The selection of buffer j 6 B t served 
by ? and of buffer / £ 72 ; to where the traffic is directed is done 
by the control policy. This work is routed to some buffer in 
71 r We allow more than one server to serve the same buffer; 
in that case, the service rate is assumed lo be equal to the 
sum of the rates of the servers that serve the buffer. When 
server i that serves buffer j and directs the traffic to buffer k 
switches to buffers I and m, respectively, a switchover lime 
is involved during which the server idles. The results in 
the paper hold for arbitrary and distinct switchover times. For 
notational simplification but no loss of generality, however, wc 
will assume that the switchover time denoted hy and be the 
same for all servers and buffers. In the following, we discuss 
how certain networks fall within the scope of ihe above model. 

A. Virtual Circuit Networks 

A VC network is characterized by its topology graph 
G — (V, E ), the transmission rates of the links and the 
established VCs. The topology graph contains one node for 
every network node and one directed link ( =■ (il w) for every 
communication link from node v to node w. The transmission 
rate of link e is denoted by /i,. A virtual circuit / is specified 
by the sequence of links c\ , • ■ *, r* N that traverses as it goes 
through the network. The traffic of VC t enters the network 
at the origin node of e\ and leaves it at the destination node 
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Fig. 1. An example of ihe network specified in Section II is depicted There 
are three servers 1,2, 3 (not depicted) which serve the queues in the sets B i, 
By * and B j, respectively. The traffic from each queue / may be routed to 
any queue j to which there is an arrow from i 

of link e 5v- A scheduling policy schedules for each link the 
transmissions of the virtual circuits going through it. Also, 
at certain times, the link may be forced to idle due to flow 
control actions. 

A virtual circuit network is modeled by the above queueing 
system as follows (Fig. 2). Each link corresponds to a server 
with transmission capacity equal to its service rate. Clearly, 
in this case, the service rate of server (link) j is the same 
for all buffers in B 3 . There is one buffer for every VC i and 
every link e\ is traversed by ?. The buffers of virtual circuit 
i contain traffic of that VC waiting to be transmitted through 
the corresponding link. When a link is allocated to any of the 
buffers of the VC’s going through it, that buffer empties with 
rate equal to the link transmission rate. The traffic from the 
buffer of virtual circuit i at link r* is routed to the buffer of 
the same virtual circuit at link except if r' is the last 
link traversed by VC /; in the latter case, the traffic leaves the 
system. Hence, the set contains a single element for each 
j and no routing is needed. The traffic in the VC's consists of 
Streams of packets, and clearly, the link cannot switch from 
VC to VC in a time period smaller than a packet transmission 
tune. In the assumptions we made, though, in the previous 
section, the traffic is considered as a continuous flow. For the 
results that we obtain here, it is not important whether the 
traffic is continuous or in terms of packets since the policies 
we propose can be easily modified to work with packetized 
traffic. 

B. Datagram Networks 

In datagram networks, the traffic of a communication ses¬ 
sion does not have to follow a specific route, but can be routed 
arbitrarily, and different packets may follow different paths to 
the destination. The packets are differentiated only by their 
destination nodes. When a packet arrives at a node which is not 
its final destination, a decision is made as to which outgoing 
link the packet will follow next; then the packet is placed at 
the outgoing buffer of that link. There are no constraints on 


Fig 2 A virtual circuil network 

which outgoing link the packets of each specific destination 
will follow; this is determined by the routing policy. Packets 
of several different destinations are wailing in the buffer of 
each outgoing link to be transmitted. The link scheduling 
discipline determines which is the packet to be transmitted 
next. In the corresponding queueing model, the buffers are in 
one-to-one correspondence with the links. For each server /, 
there is one buffer in B, for each destination node. The traffic 
with destination node fc, which is transmitted through link t . is 
stored in the corresponding buffer of B,. These buffers do not 
necessarily correspond to physical buffers in the origin node 
of link i. Their introduction enables the differential ion of the 
traffic, based on the destination and the link the traffic will go 
through. If, from the topology of the network or fiom other 
constraints, it turns out that the traffic of a specific destination 
never transverses link /, then the corresponding buffer in B, 
will be permanently empty. 

C. Manufacturing Systems 

In manufacturing systems, several different types of parts 
arc fabricated. A part type, in order to be manufactured, needs 
to be processed by a number of machines in some prespeci- 
fied order. In flexible manufacturing systems, a machine can 
process more than one type of part. In this case, the machine 
has to switch from one part type to another at certain times 
since the different parts are not processed simultaneously. The 
switching of the machine involves a period during which the 
machine idles. The switching should be done rarely enough 
such that the fraction of time that the machine is utilized is 
higher than the loading. 

Perkins and Kumar 110| have obtained a distributed schedul¬ 
ing policy that stabilizes any acyclic network of manufacturing 
machines. When cycles are formed by the routes of the parts 
in the manufacturing system, then the behavior of the system 
is more difficult to characterize. The intrinsic complexity of 
the problem was demonstrated by a counterintuitive example 
of instability in a simple manufacturing system in [6], The 
queueing network considered here is readily interpreted into 
a manufacturing system model. The servers correspond to the 
machines, the arrival streams to the different part types, and 
the buffers of each machine store the part types at intermediate 
processing stages. If a part type needs to be processed by a 
machine more than once in its manufacturing cycle, then it 
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waits in a different buffer each lime, When the part types 
have unique routes through the nianufacturing system, then the 
sets H 3 have a unique element, the next buffer that the parts 
; will join when they leave from j. Kumar and Seidman in [61 
propose a scheduling policy with a supervisor mechanism that 
stabilises the manufacturing system as long as the load of each 
machine is less than one. The supervisory mechanism, though, 
relies on knowledge of the arrival rates. The ABP policy 
stabilizes the system without the arrival rate information. The 
model considered here extends the one considered in [10|, [6] 
to include cases where a part type has the option to follow 
different routes at certain points of its processing, in which 
case routing is performed. 

HI. Necessary and Sufficient 
Stabiuzability Conditions 

Wc arc interested in the average rate with which work can 
he served. Wc consider that certain throughput rules can be 
achieved if there exists a policy under which the network is 
stable when the long-run average arrival rates are equal to the 
desired throughputs. Denote by X 3 (1) the amount of work in 
buffer j at the time /. 

Definition. The system is stable under a policy 7r if 
Inn sup, _, x i(t) < D 

where D may depend only on the arrival rates a — 
(a i, - • •, n tv ), the burstiness coefficients b - (b i, - - ■ < w), 
and the service rates // ~ (fi n : / = 1, • ■, N, j £ & t ), but 
not on the initial condition. 

The following condition on the arrival and service rates is 
necessary and sufficient for the existence of a policy under 
which the network is stable. 

Cl There exist nonnegative numbers / y *, j = 
k £ H, that satisfy the flow conservation equations 

J k, ~ (3,l) 

k j £ II (, k £ n j 

Also, there exist nonnegativc numbers w', / = 1, ■ ■ •,/V, 
j - l, ■■ ■, U such that 

£V,<1, / = 1, - , TV, (3.2a) 

/ 6 », 

£/,( < “>"■ J = (3.2b) 

More specifically, it is shown that Cl is sufficient for 
stability if the burstiness condition (2.1) holds, while Cl 
is necessary for stability when the arrival process satisfies 
a “bounded idleness” condition, to be specified later. No 
existence of the arrival rate is needed. An intuitive justification 
of Cl follows. The number /,*. in the condition Cl represents 
the long-run average rate with which work is transferred 
from buffer j to buffer k. Hence, (3.1) is indeed a flow 
conservation equation. A collection of nonnegative numbers 
fjk, j = l,.. -, B, k £ R, which satisfy (3.1) will be referred 
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Fig. 3. A network wilh routing control. 

to as flow in the following. The number «* rcpicscnts the 
fraction of time that server / spends serving bufler /. Inequality 
(3.2b) expresses the fact that the rate with which work Leaves 
buffer j |the sum on the left side of (3.2b)] cannot be greater 
than the server capacity [the sum on the right of (3.2b)| that 
is allocated to that buffer. 

In the case of a virtual circuit network, condition Cl is 
equivalent to the condition that, for every link, the sum of 
the average rates of all virtual circuits that go through the 
link is strictly less than its transmission capacity. This can 
be easily seen by observing the following. Since there is no 
routing involved, 7? y contains a single element for all j’s and 
is equal to the arrival rale of the virtual circuit that buffer 
j corresponds to (recall that in the VC network case, each 
buffer corresponds to one virtual circuit). Each buffer can be 
served by one server (link) only; therefore, the sums at the 
second inequality in (3.2) are reduced to single terms; these 
inequalities are equivalent to the fact that the traffic load of 
every link is less than one. The proot of the necessity of Cl 
for stabilizability follows, vhile the sufficiency will be proved 
m Section IV where the parametric back-pressure policy, the 
predecessor of the ABP, will be specified. 

A. Necessity 

When the long-run average rate wilh which work goes 
from buffer j to k as well as the long-run average fraction 
of time spent by server / serving buffer j exist, the intuitive 
interpretation of Cl given earlier turns readily into a proof 
of the necessity of Cl. These long-run averages, through, 
do not exist for every policy, even if the system is stable, 
as illustrated in the following counterexample; therefore, a 
different approach should be taken to show the necessity of Cl. 

Counterexample: Consider the system in Fig. 3 that con¬ 
sists of three buffers. Buffer 1 receives work with constant rale 
0.8, and is served by a server of higher service rate; therefore, 
it leaves the buffer instantaneously. The served work is routed 
either to buffer 2 or to 3, from where it leaves the system 
instantaneously. The switchover lime is equal to 0. Clearly, as 
long as the server does not idle, the system will be stable. The 
routing is specified by the variable r(t) which represents the 
buffer that is fed from buffer 1 at lime /. Let 

, __ /2, 3 2k <t < 3 2/rfJ , k ■= 0, 1, - 

“ [3. l\ 2k+i < / < 3 2 * +2 . k = 0, 1. ■ ■ . 

If // 2 is the average flow from buffer 1 to buffer 2 at the ?th 

routing transition, then we can see that f[ 2 fluctuates between 
values which are greater than and less than therefore, 
it cannot converge, and the long-run average flow docs not 
exist. 0 
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Next, we show the necessity of condition Cl. The neces¬ 
sity follows when the arrival streams satisfy the following 
condition. 

SI: There exist nonnegative numbers i = 1, • • -, ti such 
that 



<h(h -ti)-b 9 . 


Condition SI can be viewed as a constraint on the idling 
of the arrival streams. 

Theorem 3.1: If the system is stable and condition SI holds, 
then condition Cl holds as well. 

The proof of the theorem will follow after a lemma. The 
following definition is needed. A collection of nonnegative 
nurrtbcrs Ijki j — 11 1 " ,B,k£ 7?j is called a superflow if 

a j “1“ fkj £ //A" j ~ h -' • ,B (3.3) 

k k£Tt , 


and a strict superflow if the inequality in (3.3) is strict 
whenever ^ keJH f,k > 0* The notions of superflow and strict 
superflow do not correspond to any physical quantities in the 
system, and they arc introduced only to be used in the proofs. 

Lemma 3.J: If /' = (f' jK . j - 1, ■ •, B, k e Rj) is a su¬ 
perflow, then there exists a flow / s= (f /k , j = 1, ■ ■ ■, [}. k € 
7 lj) such that 

/ < /' (3.4) 


where rhe inequality (3.4) holds componentwise. If /' is a strict 
superflow, then there exists a flow / such that (3.4) holds with 
strict inequality, for the nonzero elements of J and /'. 

Proof, Consider the transformation T defined by / i+] = 
T(f* )whcrc 


rH 1 


{ 


-■*E. 


j ft R/ 






/: 


A 


Hit/?, J',l > 

otherwise. 


It is claimed that if /' is a superflow, then /' f 1 is a superflow 
as well and 


/' f ‘ </' (3.5) 


while if /' is a strict superflow, then / H 1 is a strict superllow 
as well, and (3.5) holds with strict inequality. Notice that (3.5) 
follows readily since the fact that /' is a superflow (strict 
superllow) implies readily that 

+ Si fj, 

SleKj //; 

is less (strictly less) than one. From the definition of T(), we 
have 




From (3.5) and (3.5a), it follows that 




(3.5a) 


if /* is a superflow, while (3.5b) holds with strict inequality 
if /* is a strict superflow. Relation (3.5b) shows that / ,+1 is 
a superflow and (3.5b), with strict inequality, that it is a strict 
superflow. 

Obviously, if /' is a flow, then f t+i = /VConsider the 
sequence of superflows J\ i = 0, 1, ■ ■ • defined as /° = /', 
Z' 41 = T’(/*), * = 0, 1, ■ ■ ■. This sequence is nonincreasing, 
and as a consequence, it converges to a fixed point of T() 
denoted by Note, though, that all fixed points of T are 
flows; hence, /°° is a flow which is less than or equal to /' in 
general, and strictly less than /' if J‘ is a strict superflow. 0 

Remark: If /' in the above lemma contains no cycles, 
that is, there is no sequence of buffers such 

that /i = h, lj -fi € Tlj. j = 1, • ■ - 1, // J / M1 > 0, 

j - J, ■■■.A 1 - 1, then the sequence of the superflows /', 
t — 0, 1, ■ ■ • in the proof of Lemma 3.1 will converge after 
a finite number of steps. If there is a cycle in /', then the 
convergence takes an infinite number of steps. 

Now, we can proceed to the proof of the theorem. 

Proof of Theorem 3 . /. Assume that the system is stable. 
From the definition of stability, there exists D such that for 
every X(0), there exists T, which may depend on X(0), for 
which 


E-V,(0 <11 t> T. 


(3.6) 


Assume that initially each buffer has a backlog equal to 21), 
and T is such that (3.6) holds for this initial condition. Let 
be the total amount of work that has been transferred 
from buffer j to buffer k € in the time interval (0, 7'), 
where V > T is a time to be selected appropriately, as we will 
see in the following. Clearly, A' J (T / ) < D, j - 1, ■ ■, B and 

X J (T') = 2D+ £ Qn~ E^>* 

K If-R l kfKj 

+ / a ,(i) (P < P. j— L* ■, B. (3.6a) 
Jo 

Consider the nonnegative vector J - (: j = 1, -.., /J, k t 
7?,) where fjk -• Q,k/T f . If we divide (3.6a) by T\ and 
using SI, we get 

a j + ^2 hi 

* iev K 



If we select D > max^i, ub, and condition SI holds, 
then (3.7) implies that / is a strict superflow. Let TJ k be the 
amount of time that server i serves buffer j and directs the 
traffic to buffer k during the time period from 0 to 7". Define 

A| Tjk 

u j =- J, -■ 


(3.5b) 
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itc each server may serve ai most one buffer at a time, we 

e 

E^ 1 os) 

/etf. 

vc also have 

Qik = E T',kltij A, fc Rj, 

i ietf 

which yields 

E^= E E 7 >- k ^R, 

keRj ktR * jf - 

and therefore 

E /j* - E «>■, kf ' R j oo) 

k € R i ) t 

From Lemma 1 1, since / is a strict superfiow there exists a 
flow /' such that f' jk < J ^ if ^0 Lei 

( — max| / — i It h l Rj fjh > o} 

Clearly ( ^ J Define 

u f — (/ — 1 a i ( B t 

l! holds that 

E";<> 

By multiplying each part of O 9) by r and given the fact that 
l f f k i t / ; a H follows that 

E/;»< E">i / 12 u oi<» 

/ /efr 

o 

The necessity ol strict inequality in relation 0 2a) of con 
dilion Cl is due to the dehmtion of stability that we have 
and more specifically to the requirement that the asymptotic 
bound of the backlog is independent of the initial conditions 
It is possible that bounded backlog is guaranteed without 
strict inequality in 0 2a) This bound, though, cannot be 
independent of the initial condition This is illustrated in the 
following counteicxample Consider the case of a single-server 
queue with constant instantaneous arrival late equal to u and 
seivice rate ft Condition Cl, with strict inequality in 0 2a), 
boils down to the condition a < /x If equality is allowed in 
O 2a), we may have a — // In the latter case, and if the 
instantaneous arrival rate is constant and equal to the service 
rate /x, then the backlog at all times will be equal to the backlog 
at the lime instant 0 Therefore, the network will be unstable 
according to the definition of stability considered in the paper, 
since there is no asymptotic bound in the backlog independent 
of the initial condition 


IV The Parametric Back Prissuri p » 

In this section we will present and study rfu \ 
back-pressure policy PBP(<*) This policy determine 
a server idles or not, and if it does not idle whu h i 
served and where the traffic is directed The freqi i 
which a server is switched from queue lo queue dtp i 
parameter rv > 1 We include a as an argument in tlk 
of the policy to emphasize that dependent e 

PttP(a) While the server / serves a buffer ) { and tint cl 
the traffic to a buffer k {] it is constantly rmmnoim tin 
quantity 

A,(t) - lUrucL.iniaxI Vj(/) V ,,{!))] (il) 

Id) [ ne/?i J 

If 

"/I.* (\ K (0 - A* (/))>() 

and 

1|(0 >«/'., ^ (0) 

then the seivei is rescheduled lo serve the queue / < B and 
the traffic is dnected to queue it E R\ which togethci teali/c 
the maximum in (4 l), with lies biokcn arbitrarily Service is 
reassumed after the switchover time b 

If 

4,(0 <o 

then the server idles tor a time penod b If at die end of the 
idling period it is still 1,(0 < 0 then the server lesiaits 
an idling period of the same duiation Otherwise stiver i 
is rescheduled to serve the queue l E B, and the tiaftic is 
directed to queue n E R-i which together realize the maximum 
in (4 1) 

Some clarifications on the operation of PBP(o) follow 
According to PBP(n) the served traffic of buffer / \> touted 
to buffer it which achieves the maximum m 

nirix{ \/(0 MO) (4 la) 

ni R 

whenevci buffet / is served Server / selects which buffer 
l E B , to serve based on the terms 

//,(max{ \,(/) - V„(/)> I e B 
n€R i 

It selects the one whiLh achieves the maximum m (4 1) Iht 
policy is illustrated in Fig 4 In order to avoid servu / switch 
ing from queue to queue too often the policy reschedules the 
servet only if the quantity (4 la) for the queue / undu seivice 
becomes considerably smaller than 4 T (/) How much smaller 
it should become is determined by n which regulates how 
often the server switches This feature is reminiscent of the 
clear a fraction policy considered in [10] II the backlog in 
all the downstieam queues is larger than ihe backlog in the 
queues of J3 t that is the quantity 4,(0 is negative, then the 
server / idles 

Note that the scheduling of server / relies on information 
aboul the queue lengths of the buffers in B, and in Hj j f B, 
This is local information for server / in seveial practical 




Fig. 4. Server t is allocated to one of the queues 1, 2, 3 When the backlogs 
areas illustrated in the picture, then queue 2 will be selected for service and 
the traffic will be directed to buffer 4. 

systems. For example, in the case of the VC network, the 
buffers in B t and in Tlj, j £ B , reside in the origin and 
destination node of the server/link j ; therefore, policy PBP(a) 
can be implemented in a distributed fashion. 

The stabilizability properties of PBP(r*) are stated in the 
following. 

Theorem 4.1: If the arrival and service rates satisfy condi¬ 
tion Cl, then there exists an a > L such that the system is 
stable under PBP(o). 

Note that the policy PBP(m) needs only the service rates for 
its application. Furthermore, the stability holds for any value of 
the parameter A in the definition of the policy. The proof of the 
theorem relies on the following lemma, which is a drift-type 
condition on the sum of the squared backlogs in the system. 

Lemma 4.1: If the arrival and service rates satisfy condition 
Cl, then there exists an a which depends only on the arrival 
and service rates and the burstiness coefficients such that when 
policy PBP(ci) is employed, the following holds. There exist 
numbers Do, T, and i such that 

+ T) - XX 2 (t) < -c If ^ 00- 

J=1 J-l J-1 

Proof: The proof of the lemma is lengthy, and relies on 
some intermediate results. It is given in Appendix A. Q 
Proof of Theorem 4.1: We show that there exists a 7\ 
which may depend on Jf(0), a, /(, and <\ and a D\ which 
may depend on a, and 6, such that 

mipDW < D'. (4.1b) 

Let T and < be such that 

XX 2 (' + T) - X>, 2 (0 < -< if JX a (t) > Do. 

/si J=[ 

(4.1c) 

From I-emma 4.1, T and r as above exist. Lei k be the smallest 
integer A* such that 

XX 2 (W) < Do. 

J = 1 


TRANSACTIONS ON AUTOMATIC CONTROL, VOL 40. NO. 2, FEBRUARY 1995 



Fig. 5. The link from node 1 to node 2 that cames 4 VC's and the buffers in 
the origin and destination nodes of the link are illustrated. When the backlogs 
are as they appear in the picture, the link will transmit VC2 

From (4.1c), clearly such a k exists. Define T - kT and 

D= ( /D~o + '£(', l T + b ) ) 

1 = 1 

We show by induction that we have 

B 

< 1)D. l>k. (4.Id) 

T-~ 1 

The induction step is as follows. If A ^((7") < Do. 

then, clearly, Ef=i A ’f((l + 1)7") < HD. If Do < 

E?=, XftlT) < BD. then E"i X j((> f ] ) T ) < 

Ej=i A'2(ZT)-r, and therefore. £?_, A’, 2 ((/+l)7) < D. 0 
The actions taken by policy PBP(rr) in the general network 
correspond to scheduling the server, routing of the traffic, and 
idling of the server. If routing is not involved, then the policy 
is simplified considerably. This is the case in a virtual circuit 
network where PBP(a) acts as follows. Every link is allocated 
to the virtual circuits that go through it based on the backlogs 
of the buffers of these VC’s in its origin node and in its 
destination node. For each VC, the differences of the backlog 
of the buffer in the origin node of the link minus that of the 
buffer in the destination node of the link are formed (Fig. 5). 
If all differences are negative, then the link idles. Otherwise, 
the quantity 

A,(1) = max{^(/) - .Y ; (0} (4.1c) 

is computed where l and / are the buffers that correspond to 
the same virtual circuit in the origin and destination node, 
respectively, of link t. If 

A l (t)>,y, ill (X J (t)-X l (n) 

where y, j are the buffers that correspond to the VC currently 
transmitted, then the server switches to the VC that achieves 
the maximum in (4.1e). 

Note that in the model, it is possible that work goes 
through the same buffer more than once. It appears that this 
possibility may lead to instabilities under a distributed policy. 
The following is an intuitive explanation of why this is not 
happening with PBP(r*). The goal of the policy is to push all 
the traffic out of the network. It is possible that work may 
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-it a buffer more than once, following a cycle of buffers 

th small backlogs, instead of following a route out of the 
M work, when the buffers that lead out of the network are 
u digested. Eventually, though, the latter buffers will empty, 
P id the work will find its way out of the network by selecting 
m- route with the low-backlog buffers. 

The PBP(r*) policy achieves a bounded backlog in the 
network under the necessary and sufficient stabilizability con¬ 
dition Cl, and is indeed distributed since the decisions at each 
node rely on the one hop away state information. Nevertheless, 
the arrival and service rates and the topology of the network 
need to be known in order to select the appropriate r». In the 
next section, the ABP policy (an adaptive version of the PBP) 
is obtained, which does not rely on knowledge of the arrival 
and service rates. 

V. The Adaptive Back-Pressure Policy 

The adaptive back-pressure policy is identical to PBP(rO, 
except that <v is not preselected, but is computed at each 
node based on the local (one hop away) state. Each server i 
computes a parameter n,(/) based on the lengths of the queues 
that it may serve and the queues that it may direct the traffic 
to, as follows. Consider a function y : /if 4 —* T? 4 " which is 
nomnereasing, strictly greater than 1, and such that 

lim (/(.r) -- l 

lim .r(//(.r) — 1) = x. (5.1) 

The function y(.r) = 1 4 ■/' u satisfies (5.1) for all l ) < a < 1. 
Define <*,(/) as 

<‘,(0 = y ^y^i;uia*{-Vy(/) - A',(0}|y 

The policy ABP is as follows 

ABP: Each server acts as in the PBP(a) policy with the 
dilference that server t uses the locally computcdnr,(/) instead 
ot r*. 

The following holds for ABP. 

Theorem 5.1: The network is stable under ABP if condition 
Cl holds. 

The proof of Theorem 5.1 is the same as that of Theorem 
4.1, given the drift condition slated in the following lemma. 

Lemma 5 1: If the arrival and service rates satisfy condition 
Cl and the network is operated under ABP, then there exist 
numbers T, and i such that 

B D B 

£*“(/ + T) - £*’(/) < -* if£*J(0 > Ih- 

J = J J = 1 

Proof: The proof is given in Appendix B. 0 

Note that there is no parameter estimation taking place in the 
policy. In fact, the average arrival rates might not even exist. 
The logic of the adaptive policy is that it adjusts a such that it 
goes closer to 1 with a certain rate as the backlog increases. In 


this manner, it is achieved that the servers switch to the , , 

that have the heavier backlog difference frequently m ,, 

VI. Conclusions and Disc ussion 

In the ABP policy, each server is scheduled based *» i- 
lengths of the queues that it serves and of the quern s om tu , 
away, which were assumed to be instantaneously uvaihibk m 
the server in our study. This is not always the case, though 1 m 
example, in the virtual circuit network, the link scheduling was 
based on the queue lengths at the origin and the destination 
nodes of the link. Clearly, the queue lengths in ihe desimaiion 
node could not be made available at the origin node without a 
delay greater than or equal to the propagation delay ol the link. 
In high-speed networks, the link propagation nine is enough 
for the state of the queues to change considerably. TheuTorc, 
there will be a discrepancy between the actual queue lengths 
and those available to the server. Nevertheless, as long as 
the difference between the actual queue lengths and those 
available to the server is bounded independently of the queue 
length values, then the results obtained in the paper remain 
unaffected. That is, the ABP policy stabilizes the network with 
delayed information about the queue lengths as well. 

According to the ABP policy, server i idles if the quantity 
A,(t) becomes less than or equal to 0 or, in other words, if for 
every queue / E B, the backlog of all queues in 7 It is grealer 
than or equal to that of queue /. The results remain unaffected 
when A,{t) is compared with an arbitrary negative number 
instead of 0 in the definition of ABP. Thai is, the server idles 
only if the backlog in the downstream queues becomes greater 
than thai of the upstream queues by a certain amount. 

The delay through the network is an issue left open for 
further research. Bounded backlogs imply bounded delay; 
therefore, under the ABP policy, the delay will be finite when 
the stability condition Cl holds. There are several questions 
which are left open, though. How docs the delay vary when 
the function y that estimates the parameter nr in each node 
changes? Which is a good choice of y as far as the delay 
is concerned? As we said earlier, the backlog in the network 
remains bounded even if the server t idles whenever A } (t) < h 
for an arbitrary h , and not necessarily for h - 0. What is a 
good choice of h for small delays? Also, what will be the 
effect of the delayed information about the slate of the one 
hop away queues on the delays through the network, and how 
will the latter be affected from the propagation delay? The 
investigation of these questions might lead to refinements of 
ABP with improved performance with respect to delay. 

Appendix A 

Lemma 4.1 is proved here. Two lemmas precede its proof. 
The first quantifies the property of policy PBP(o) that the 
switching of the server becomes less frequent as the queue 
lengths increase and more frequent as a approaches 1. Note 
that the maximum rate with which the difference of two queues 
may vary (increase or decrease) is less than or equal to 

N 

ba+ EE^ = a/ - 
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Lemma A.l: If for a server i we have 

~ - ^\ MT (AI) 

then i will switch at most once in the time interval (I, 1 + T) 
and 

EEw (0 - *i(0) 

“ i {7 ' - - - Y,( ' )} } 

-4M(T + t)) 2 (A.2) 

where Q is the amount of work served by server i and 
transferred from buffer j to buffer l during the time interval 

(*, t + n 

Proof: At time (, server t serves queue j\ and directs the 
traffic to queue l[ We distinguish two cases. Assume first that 

/\>i(Aji( 0 - ^ti(O) - ‘»ax|// IJ max{A',(/) - A' ( (/)} j 

(A3) 

Since the maximum rale with which any difference 
/i til (-V /t (o - X\ y (0) may vary is Af, we have 


Assume now that (A.3) does not hold. From the definition 
of the policy at time t , we have 

(*IUn(X .».(0 ~ X tl (t)) > - A,(f)}}. 

)tb x I ICKj I > 

(A.7) 

If server i will not switch queues during the interval (f, / + 7 1 ), 
then from (A.7), relation (A.2) follows easily. If the server will 
switch, then let /i be the first time after / at which the server 
switches. Lei jz be the queue that is served after the switching, 
and lz the queue lo which the traffic is routed. At time 1 1 4* 
we have 

i + A) - Xi 2 (l\ + (>)) 

= uiajcj i^jinnxlX f (t\ 4 A) - 4 A)} 1 

l€b, ^ , J 

> max j //,, ni ax {X, (/) - A,(/)} j - 2M (/, - I + f>) 

(A.8) 

and arguing similarly as above 
I'.'AXjAO-XiAi')) 

> iiiax|//, ) nirix{A l (/) - A’/(/)} ^ —2 AIT (A.8u) 


> (* |mw|/i,jmax{ A ^ (/) - A|(f)} j - 2iW7’j, 
f P (/, f + T). (A.4) 


Also, for any pair of queues ,/ € B,, l £ lij 


> max j//, ; max{A r ,(/) - A'/(f)} j + 2 M 7 \ 

/' £((./ + T). (A.5) 


From (A.4), (A.5) we can sec that if (A.l) holds, then 


Equation (A.8a) together with (A.5) imply that it (A.l) holds, 
then the server will not switch again in the time interval 
(/1 + f>,i + T). Let Q' jt be the amount of work served by 
server / and transferred from butler j to buffer / during the 
time interval (I. / j), lei Q‘ jt be the same quantity for the time 
interval [t \ + f>, / + T) Then, clearly, Q‘ tl — Q‘ jt + and 

£E^'(L(o-a-,(0) 

J<rB,lk Rj 

-=('t - I)/!,,, (A,,(0- A,.(0) 

> -{Ii - f)iri.uc{/i l/ jii.uc{A’ < (f) - A/(/)}l (A.9) 

fv jfctf, [ itRj ) 

while 


«*n (*,.((') - A,,(/')) > ;i, ; (A # (0 - X,(!')), 

j e£»,.^ji,/e7? ( /' €(/,/ + T) (A.5#) 


and the server will not switch in the time interval (/,/+■ 7’). 
Then, clearly 

EEw>- A ' (/ )) 

” ~ A '( 4 )} J (A.6) 

and (A.2) follows. Note that (A.6) assumes that the server is 
always busy during (1, t 4- T) % which is a valid assumption 
based on (A.l). 


EE^-f'i-AiW 

jee,ie-R, 

= ^(*,,(0-^(0) 

= (7’+/-/i- ,.(/)-A',. (/)). (A.10) 

Notice that 

PnAXJI) - XM > pMA/i + 6 ) - X,A<i 4 A)) 

—4Af(#i — / -f A) (A.l 1) 

From (A.8), (A. 10), and (A. 11), wc get 

££0;,(A',(o-*,(/)) 

> (T + f -/i - i^maxj/i.jiuaxfA^I) - A|(/)}} 

^€“1 ( l&IK i J 

-47’Af(/i -/ + <)). (A. 12) 



\sSlULAS ADAPTIVE BACK PRESSURF CONGESTION CONTROL 


om (A 9) and (A 12), we have 

EEw-' ■At)- v,(/)) 

> M'j ~ - MO}} 

v{T + l -/i -/>)inAxj//, J niAx{\ j (/) - A,(/)}| 

- 4TM(ti -t + b) 

- ^)iM<ix|/t,jjnax{ \j(/) MO}} 

-4M{'l+b) 2 (A 12,i) 


0 

In the following we show that one of the differences 
\ f (t) \i(t) / = 1 U / e 'R, is on the order 

of v /£? = , \^(/) A result similar to the lollowing lemma 
has been shown in |ll| The lemma is included here for 
completeness Notice that it holds independently of the policy 
and is a characteristic of the netwoik 
l<mma A 2 There is a constant ( > 0 that depends only 
on the topology of the netwoik so that 


i 


max^ {\,(0- \/(/)} - 


n 






Pi oof Consider the queue y 0 with the maximum length 
(leaily 


in :■ 


V 




Phcie exists a sequence of queues thiough which the work can 
be louted out of the network that is thcie exist a sequence 
/ j t n such that n i B / 1 = /o 0 £ R ^ i £ 7v, 

/ - I ?/ —* 1 Flien we have 


;i 1 


5Zc\. (0-v, ,('))= V„(0 MO 


A — 1 


M0_fe!L. 


inax (\„ - \„ + 1 )> — B ~ > \/ — 


A 1 i 1 


/?» 


Pwof o] Ijemma 4 1 With simple calculations we get 
for every / 

£>;(' +J)-£\ 2 (') 

B 

= 2j^(A,(/+/)- m»m) 

i i 
B 

+ Y,(f -4-7) - M)) 2 (AH) 

J = J 

Notice that 

(A',(f + T) - X^t)) 2 < M 2 1 1 ) = l B 


where M is the maximum rate with which am i 
vary, and it has been defined before lenmn \ 1 
B 

Y,(XA* + T)-X,{i)) i <BAI 1 l' 

1 

fn the following wc proceed to bound the fust t 
right hand side of (A IT) Let Q' }i be the amoum t 
served by server / and transferred from buffer / to i 
during the Ume interval (/ / 4 T) Let Q, be iht i, m it < i 
work transferred to bufler / from outside, and (J f{) Hk tm ^ i t 
of work served by server t and transleircd from hull i i 
outside during (/ t 4 T) Clearly! o j 

A \ 

+ M0 + V,+ EDft-EE'x. 

!ir 

Hence for the fust term on the right side of (AM) wi hau 
B 

£(M + i)- \,(0)M0 

/ 1 

= E<wo+£ £ 5>,M) 

l l / 1/ ,fK,I 1 

EEE^mo 

1-UfK i-l 

11 \ 

,(/)) 

/ 1 ;-l/^7? i-J 

(A 15) 

From the buistmcss constraints on the arrival stieams wi hav<. 

Qi S <*11 + ft/ I “ I W 

which togelhei with condition Cl gives 
zy tt zz 

£q,MK y£«,\.(0 + £\,(0'', 

y 1 /-I /l 

t— 1 \I^K 
B 

+ £voa, 

n n 

= J EE mm) - mom £mv'< 

7 lfrR f 1 

moj 

it 

+ £M0&, < A16 > 

J=1 

In the following, we upper bound the second term on the right 
side of (A 15) Note that 

<y;,(MO-A,(o)>-M^ > = i n 

/ / = 1, /I (A 17) 


t J 
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This is so because, first 

0 <Q),<MT. i = ],••■, N, j, l — ,B 

and, second, if X } (f)~Xt(t) < -MT, then Xj(f')-X,(t') < 
0 for all f € (t, t + T), and Q‘ jt will be equal to 0 since no 
server will serve queue j m the time interval ((, i + T). By 
interchanging the order of the summation, we have 

N B 

= -EEE c W( o-a'ko) (a. is) 

t = i1/e R, 

Define D = From Lemma A.2, we get that for 

some i, say / = to, we have 

<Aojj2*x{Xj(0 ~ *i(0) | 

[ l£Rj J 

= max max<- A r /(/)} > 

«=i, \ J v J 

> mill {/j,, }maxmaxlA\(f) - AV/)} > r \//). 

7 €0m1=i, 7 VtVeRj J v /J “ 

Hence, for £> large enough, inequality (A.l) is satisfied for at 
least one server. Let T be the set of servers for which (A.l) 
holds. Replacing in (A. 18) from (A.2) and (A. 17), we gel 
B N 

?= i 

< A)mdx|/r i; max{A ; (t) - A',(/)}J 

+ 4M{T + b) 2 + NI3 2 M 2 T 2 


< -V ( 1 - V a*. I —Tiuhx 

■erl k " 


Then, we can easily check that 


E ( 1 “ XJ ,c # J- *,(/)}} 


>ti—Trn max {A,(f) - A/(/)}. (A.20) 

a j= 1, ,0,16/?/ M v /j v y 


Clearly 


> max {Xj(t)-Xi(t)\ 
7 = 1, ,£1 1 JK ' V 


(A 21) 


Let /i = max, =1 ^ /fe s ( , t ^ Then, from (A.21) 


^{/Ljinax{X,(0-^/(O}} <i=1, ,AT 

if T ^ 1 ' 


From (A.21), and since 1 /« < 1, we have 


(A.22) 


SlS-'j^-Oal 

> 2jV7’^~ - I^/xn/d 


/I'jiiwxjA'^/) - A'/(/)}} 


From (A 19), (A.20), (A.22), and (A 23), we have 

d ;v 


(A 23) 


EEEwo-A’iio) 


j = l/f R i = l 


• {/AjmaxfA^/)-A',(/)}} 
+ 4M(T + b) 2 + NB 2 M 2 T 2 


< - u—Trrn v/z3 + 4A/Y7 1 4 ti) 2 3 N fi 2 M 2 T 2 

O 

+ N- t i\/D-2Nf( - - lYi\/}J 

O' \0 / 

~ E (E "x] Twj/x^nmxjA^/) - .Y«(/)}} 


~ E^E ,, ij T ^{/ , "J“^ A 'x(0 ~ A';(/)}J 
+ E^ ; ^{^«*ax{A J (0-A J (0}} 

• |t'o»‘ax{A y (/) - -Y,(/)}| (A. 


(A.24) 


Let P be the set of all queues / for which 


(A.19) 


Define 


u - min 
isl, t N 


»miax(A'/(0 - A'j(/)) > ----A77 1 (A.24a) 

«€V j (O' ~ i ) 

Then, from (A. 16), we have 


Eq,Xj(0 

n 

< Y^m +wkmo - a■*(/)} y l< 


] -E°;?’ 


77 / = mm / 
#»i, ,iv jeP, 


(A. 19a) 


+ B Va,^-MT. 

O - 1 

1 5=1 


(A.25) 
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ote that from (A.24a), it j G T, then all i\ such that j e B, 
long to T. Based on this fact, in the following, wc verify that 


"'i«ax{X,(0 - Xk,(i)} ^ fi< 
lev, 



V.j'UULlXAO 



tA.26) 


E*> +T )-!>'?(') 


j=i 


»=j 


<Vi(b) l/2 -C(D) i/l I r 
4(t '-*1/1 


+ ' 


II is clear that if D is large enough, ihc right side u< 
becomes strictly negative and the lemma follows 


\n, 


Note first that 


Appendix B 


i- F \j^ t J 


i T EE u',ti„\iuui{X t {1)-X,{1)) (A.27) 

v ' 


The proof of Lemma 5.1 is given in this Appendix alter 
some intermediate results. Define 

n 1 , = 9(uiax{ /tlJ |nax{X,(0 - A',(/)}| + 2^/) 

Lemma B I If for a server /, we have 


and from the fact that if / 6 /. then all t\ such that y € B t 
belong to T* we have 

( ina x {A,(/) - Xi(t)} 
feM 

> iy'Vu' l , ll) nuix{X J (n-X l [l)}. (A.28) 

„ l&R i 

f Orb 

From (A.27), (A 28), and 0.2), the relation (A.26) follows. By 
adding the inequalities (A 24)—(A.26) and (A. 15), and alter the 
simplifications, we get 

u 

5>V,(H T) - X I (I))X I (I) 

1=1 

s -u-'lmf\/D + AM(1 + b) 2 + NU 2 A! 2 T 2 

n 

IS 13 

+ 'TdYJ'i + A,T (A - 28a) 

,-=i * J fV 

By replacing in (A. 13) irom (A. 14) and (A.28a), we gel 

'£xf(i + T)-'tx 2 (D 

1 1 J =1 

< < yT 1 - C2-T\fi) + < i\fn 
at 



+ rr~~T (A.29) 

(\ ~ 1 

where c lr ■, r are positive constants which depend only on 
the system topology and the arrival and service parameters. 
Select or > 1 such that r 4 - (c^ + r 4 /nr) = -( < and let 
7 1 = (D) ]/1 . Then (A.29) becomes 


max 


^{y'MinaxIA^O - A'i(/)}| > ~ Tf)^ rr ^B.l) 


then y will switch at most once in the time interval (/, 1 4 7 1 ) 
and 

EL g;,(jf,(f) - .y,(0) 

/ f - P t t J? j 

- \M(T 4 M 2 (B.2) 


where is the amount of work served by server / and 
translerred from buffer y to buffer / during the time interval 

M + T) 

Proof The proof follows the same steps as with Lemma 
A.l, except for the following differences. Inequality (A.4) 
holds with nr replaced by r \,(t) on the left side and with rv/ 
on the right side. Inequalities (A.5a), (A.7), (A.9), and (A. 12a) 
hold with rv replaced by n,(t) 0 

Proof of Lemma 5 1 From (A.13MA.15), we get 

£>•*(<+ r)-f\ x 2 (t) 

j 1 1 = 1 

a 

= £rwo 

i=i 

+EE EW'(') 

+ BM 2 T 2 . (B3) 

Also 

fi i? 

s r EE^s (A '' (,) - At(,)) 

/-i i=uen 7 

o 

+£Xj(/)t> r (B 4) 
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Id the following, we upper bound the second term on the right 
side of (B.3). Note first that 

- *,(/)) > -M 2 T\ . = 1, ■ ■ ■, N, 

j, l = l,-■■ ,B. (B.5) 

By interchanging the order of the summation, we have 

EEEw)-^(')) 

»=e=l 

N B 

Q]^) - (B. 6 ) 

Let T be the set of servers for which (B.l) holds. Replacing 
in (B. 6 ) from (A.2) and (B.5), we get 

££Eww>-*,(0) 

5 -E^( r ~ - **<')}} 

^ A \Al r r i M «2 *42r V 2 


g (), and the definition of a 4 (t), we have 


gf 1 - -X,( 0 )} 


> u —;--- r - — Trrrrr: Trn 

~ fl(mmax ;3l , {*/(/) - *i( 0 }) 

x , m ^. „ (B.8) 

J-l, 

Since > 1 , / = 1 , - ■ , TV, t > 0 , we have 

<Nb/i max {.Y.(t) - X/(/)} (B.9) 

jsl, DJfrR, ' ' 

Similarly, since l/a,(f) < 1 , i = 1 , ■ ,7V, / > 0 , 
u j - ^ anc ^ f° r I 1 as defined in (A.l9a), wc have 
(B.10), as found al the bottom of the page. From the definition 
of T and (B.l), (B.10) becomes 


+ 4M(T + by+ NB z M l T l 
< “52 I 1 “ 52 u ; 1 —tttT inax 

b\ h ) n>(t) ,eB ’ 

•{/t ej max{A 7 (0-^i(<)}} 

+ 4M(T + 6) 2 + JVJ? 1 M 2 7' 2 


E (E>; t (^T) - 1 )“ c ^{ /, *^ {jr ' (#, - jf ' (# )}} 

- -l)Jl/T) ~ l ) l ‘ 




(B II) 


if r 

- 52 f 52 h j i * ( ^77T “ 1 ^ nmx 
h\h j w'> /-«• 

■ - -YdOl J (B.7) 

lb simplify the right side of inequality (B.7), we upper 
bound the terms appearing in it in the following. Based on 
the definition ot u and in in (A. 19a), the nomncreasingncss of 


From (B.7MB 9) and (B.l 1 ), we have 

B N 

EEE^w)-^)) 

jszHERj 1 = 1 

- “ W 9(mmax^ B i e7 j, \XA>) - Xi{t))) T "' 

x max {XA1)- X,(!))+AM(T + b)- 

/“l 

NB i M 2 T i + Nti/i max {A .(f) - X,(1)) 

j~ 1, ,B,l±Rj 


- NT {mf)-‘K^*, {XAI) - xm 

- E[ E";) r ys{^i"“!-Y J w - a-.ioi}- 


(B.12) 


SIS. v " l h%{* ji ™ {Xj{t) - A ' ,(/)} } 

4 OT '(sssk(0 - ’Ky^, W(,) - x *» 


> 7V7’I -7 —:- j --——-— r; “l /i max {AT,(/) - X/(f)}. (B.10) 
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i't f be the set ot all queues ) for which 

2,J 

imaax.(X,(t) - Xi(f)) > —■ MT (B 11) 

t&R, {n* - 1) 

I hen from (B 4), we have 
n b 


By replacing in (B 3) from (B 12), (B 17) and usuv (I 
and Lemma B 1, we get 

£>■?('+j> 

1-1 


< -u- 


1 


■ f W( \/ J) 


'ZQr'c.v) < 


i=i 




+7 E^k^ x '^^ - A *(^} E f /i 




h r7? 


" In 1 

+ BYu t - r ^-MT (B 14) 

, <v. - 1 


i- i « 


Notice that 


q{m( \/T)) 

+ \M{I + f>) 2 -1 NB 2 W 2 1 

+ m ,„^- N r( q{ J frn 

l* I 

Hr BM 2 1 2 + £/., V /) + IBMI )j< 

D 1 


a \1) 




<2BMlJ2" + 2BM l E," 1 


1 = 1 

/* n 


“ 'min,-, zjt rv7 — 1 


+ /?M7y>,-=-(B 10) 

i-i ()(!« V I) + IMI) ~ 1 

II we select / = (/)) ]/M then (B 10) becomes 

±\U> + i)-i^n 


/ — 1 


1-1 


< 2B\11^1 + 2BMlE"' 


1 = 1 1 = 1 
1 


</(// maxuh-R { X j(M — X/fO}) — 4 
Defining D — 1 \^(/) we have 

B B 


<l(<iVD) 


(B If) 


'{(/(IMD 1 / 2 ) J 

V D(qlt W 1)1 1) 


(B 20) 


E \. (/)//, '_'VB.\/T) (B 16) where r i r * aic constants that depend only on the system 

^ ^ ^ n Cm/ d lim <■/ i \ - I PVRk / ■ 'S 0 UK H 


I 1 /I 

From (B IS) and (B 16) lelalion (B 14) becomes 

B 

E^M') 

B 

i 

1 = 1 


parameters Since lim,_* x tj( /) = 1 lliere exisls r , > 0 such 
that lor P sufficiently large. 


l 


-< \ • 


i-' 


l<r- () * 0 (B 21) 


i i 


U n 

< E h i^T) + 2BMlE A ' + 2ljMI E ai 


1=1 l 1 

I 


</(/im.ix JSS | » Its { \,(0 - \,(/)l 1 2MJ) - 1 
+ /£uwx,\,(/)- \*(f)}5^/„ (B 17) 


Also there exists a r m such that lor P sulliciently laige 

<,/> l/1 + r 2 \//7 < ( io\/7) (B 22) 

Hence (B 20) becomes 

£>;<' + n-E'f(') 


K / 


/—I 


/ 1 


If R 


Note that Irom (B 11), il y £ F then all /’s such that y C 
belong to J 7 Based on this tact, and arguing similarly as in 
the proof of Lemma 4 1 [relations (A 27), (A 28)], it easily 
follows that 


i <10 

= r i oy/b + D 1 '* ( <i 


l)'! 2 


\<J) i ')- 1) 
\ 

J 


y/lHii(t 7 y/n+'*Dl/')-l) 


< 10 


/£max{A',(0- W))Eh 


(B 21) 




IfrR 


(B,8) 


Since lim,^ x /(ry( r ) - 1) = x, we have that loi P large 
enough, there exists c n >0 such that 

1 


<b- 


^D^^D + cD^)- 1) 


-fio<-Mi 24) 
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m 


Prom (B.23) and (B.24), we have for large D that 

B rr 

£xfM<c l0 Vb-c n D 3/4 . (B.25) 

i=i 

It is clear from (5.1) that if D is large enough, then the 
right side of (B.25) becomes strictly negative, and the lemma 
follows. 0 
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Stability of Queueing Networks 
and Scheduling Policies 

P. R. Kumar, Felloe, IEEE , and Sean P Meyn, Memhei . IEEE 


Abstract —Usually, the stability of queueing networks is es¬ 
tablished by explicitly determining the invariant distribution. 
Outside of the narrow class of queueing networks possessing a 
product form solution, however, such explicit solutions are rare, 
and consequently little is also known concerning stability. 

We develop here a programmatic procedure for establishing 
the stability of queueing networks and scheduling policies. The 
method uses linear or nonlinear programming to determine what 
is an appropriate quadratic functional to use as a Lyapunov 
function. If the underlying system is Markovian, our method 
establishes not only positive recurrence and the existence of 
a steady-state probability distribution, but also the geometric 
convergence of an exponential moment. 

We illustrate this method on several example problems. For an 
example of an open re-entrant line, we show that all stationary 
nonidling policies are stable for all load factors less than one. 
This includes the well-known First Come First Serve (FCFS) 
policy. We determine a subset of the stability region for the 
Dai-Wang example, tor which they have shown that the Brownian 
approximation does not hold. In another re-entrant line, we show 
that the Last Buffer First Serve (LBFS) and First Buffer First 
Serve (FBFS) policies are stable lor all load factors less than 
one. Finally, tor the Rybko-Stolyar example, for which a subset 
oi the instability region has been determined by them under 
a certain buffer priority policy, we determine a subset of the 
stability region. 

I Introduction 

U SUALLY the stability of a queueing network is eslab 
lished by explicitly determining an invariant distnbution 
Outside ot the relatively nanow class oi queueing networks 
admitting a product form solution Joi the invariant distribu 
tion, however, such explicit solutions are rare Consequently, 
stability results are also larc 

Heic we develop a procedure for establishing the stability 
ol a queueing network operating under a scheduling policy It 
is based on just solving a linear program on the coefficients of 
a quadiatic form Alternatively, a nonlinear program can also 
be used The goal is lo programmatically construct a quadratic 
Lyapunov function on buffer levels that has a negative drilt, 
whcnevei the mean number ol pails in the system is large 
I his allows one to deduce the stability in-the-mean of a 
system, even if it is not Markovian For Markovian systems, 
such stability is equivalent to the existence of a steady-state 
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distribution, l c , positive recurrence Moreovei loi Maikov un 
systems, our method also establishes geomeinc convenience 
of an exponential moment 

Such stability results are important for a vanety ol reasons 
First, they are a precursor to more fine grained questions ton 
cermng the perfoimance levels ot various scheduling policies 
Second, several unstable scheduling policies have iccently 
been discovered Kumar and Seidman (2| and Chase and 
Ramadge |T] provide examples of deterministic systems which 
arc unstable under all clearing, i e , exhaustive service, poli¬ 
cies Lu and Kumar [ 11 provide an example ol a re-entiant line 
with deterministic piocessing limes, and deterministic bursty 
airivals, lor which a certain buffer pnonty policy is unstable 
Rybko and Stolyar [4J provide an example of a stochastic 
network which is unstable under a certain buttei priority 
policy Recently, Seidman [5] has demonsiraled the instability 
ol the well-known First Come First Seivc (FCFS) policy, 
also loi a deterministic model Bramson (6) has recently 
constructed a stochastic re-entrant line that is also unstable 
under the FCFS policy Third, there has been much iccent 
inteiest in the use of heavy traffic Biowman approximations 
to construct scheduling policies for queueing networks, see 
Harrison |7] and Harrison and Wein [8| Clearly, to establish 
heavy traffic limit theorems, it is necessary to establish the 
stability of Ihe queueing networks involved Dai and Wang 
1121 have constructed a counterexample where the Brownian 
approximation does ncol hold, see also Whitt | IT] and Dai and 
Nguyen [141 Indeed, heavy traffic limit theorems appeal to be 
only available for systems that are already known to be stable, 
sec Reiman [9], [10] and Peterson [111 

Quadratic Lyapunov functions find widespiead use in lineal 
system theory Foi stochastic systems, Kingman [ 1 ^1 has used 
a quadratic Lyapunov function to analyze a random walk 
con Fayolle [16] has used general quadratic Icoims to 
charactcn/e ergodicity of random walks on Piecewise 
linear Lyapunov functions are used in Fayolle et al\\l\ for 
establishing the stability ol Jackson networks |I9] Meyn and 
Down [181 use the square of the workload to establish the sta¬ 
bility of generalized Jackson networks, where the assumptions 
on the arrival processes and service times are lclaxcd Coffman 
ct al [20| have used linear programming to find both linear 
and quadratic forms with negative or positive drift and thus 
the stability or instability of a certain bm packing algorithm 
Then linear programs test tor a drift of the appropriate sign 
at all the states on the boundary of a prescribed hypercubc, 
our approach may be less computationally complex Recently 
Bertsimas et al [21] and Kumar and Kumar [22| have used 


(X)18-9286/9S$(J4 00 © I99S IFFF 
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Fig. I. The basic open re-cntranl line. 

quadratic forms to obtain performance bounds for queueing 
networks and scheduling policies, provided the system is stable 
and has a bounded second moment. 

II. The Basic Open Re-Entrant Line 

To expose the idea in its simplest form, we begin with the 
treatment of an open re-entrant line; see [23]. 

The network consists of S machines {L 2, ■ ■ ■, 5}; see 
Fig. 1. Parts arrive as a Poisson process of rate A to buffer 
i>i, located at machine a( 1) 6 {1, •■■,£}. Upon completing 
service, they proceed to buffer b 2 located at machine <j{2) € 
{1, Let hi at machine <r(L) be the last buffer visited. 

The sequence {cr( L), ■ • - , <r(L)} is the route of the part. Since 
one may have rr(y) = a(j) for some pairs i and j with i ± /, 
we say that the system is a re-entrant line. Let us suppose that 
parts in b L require an exponentially distributed service time, 
with mean —, from machine a(i). 

Let I(i) := \j \ rr(j) = rr(/)}, i.e., the set of indexes of 
buffers which are located at the same machine as Thus, 
the buffers with indexes in /(/) are in “contention” for the 
same machine. 

Let us denote 

x,(/>) :=■ Numbers of parts in buffer b, at time /, 
including any in service 
and 

w,(1) := I if machine <r(/) is working on a part in 
buffer b, at time t, and zero otherwise. 

For simplicity, we suppose that a machine works on only 
one part at any given time. 

The key problem in “scheduling” such queueing networks 
is to determine which part in which buffer the machine should 
serve, i.e.. which w,(t) should be one. Clearly, an optimal 
choice for reasonable criteria will depend on the location, i.e., 
the buffer, occupied by the part. When the service priority 
depends on the buffer (i.e., “class" of a part), however, 
the steady-state distribution, if any, is not known, and, as 
mentioned earlier, neither is stability. As an example, the well- 
known FCFS policy can be stable or unstable for particular 
systems, when the //,'s are not the same for all buffers at a 
machine. Similarly, buffer priority policies can be stable or 
unstable for particular systems and values of parameters. 

III. Copositive Matrices 

Let Q be a symmetric L x L matrix that gives rise to u 
quadratic form which is nonnegative on the positive orthant 


in /? L , i.e., 

Q = Q t and (yi , • ■ ■, y L )Q(v 1 . ■ ■ • , Ul) t > 0 
whenever 

lh > 11 for all i. (1) 

Such matrices are called symmetric copositive matrices. As 
will be shown in Section V, our methodology will auto¬ 
matically confine itself to the subclass of symmetric strictly 
copositive matrices. These are symmetric copositive matrices 
Q for which additionally 

0/1. ■ ■ •, UL)Q{y\ ■ ■ • ■. */l) 2 > o, if y . > o for all i, 
and y ( ^ 0 for some i. 

Copositive and strictly copositive matrices have been exten¬ 
sively studied; see Cottle et al. |24]. They are characterized 
by the signs of certain determinants (see [24]). 

Keller s Theorem: A symmetric matrix is copositive if and 
only if each principal submatrix, for which the cofactors of 
the last row arc nonnegative, has a nonnegative determinant. 

A recent algorithm for testing copositivily can be found 
in Andersson et al [25|. The determination of copositivity is 
NP-Complctc; see Murty and Kabadi [26]. 

The procedure we advocate below could be used with any Q 
satisfying (1). For concreteness, we will confine our attention 
to the following special types of copositive matrices. 

It is easy to see that any symmetric, nonnegativc matrix, 
i.e., one for which Q r =- Q = [q 1f ], with q,, > 0 for all /, j, 
is copositivc. Also, any positive semidcfimte matrix, i.e., a Q 
for which, Q - Q l and .t 1 Qx > 0 for all .r, is copositivc. 
Moreover, any convex combination (or linear combination 
with positive weights) of such matrices is also copositivc, since 
the set of symmetric copositive matrices is a convex cone. 

IV. The Basic Characterization 

We shall rescale time so that 
L 

X +Y,i‘' = 1 (2) 

i=i 

and resort to “uniformization,” see Lippman [271. That is, we 
shall suppose that there is always either a real or a “virtual” 
part that is being served at every buffer b,. Let {r„}, with 
t { ) = 0, denote the sequence of all arrival and service times, 
real or virtual, and let denote the rr-ficld generated by 
events up to time r„. 

Let./ (/) := ■ ■ ■ ^t'L(l)) 1 denote the vector of queue 

lengths. In accordance with terminology of Markov Decision 
Processes, wc will call ./■(/) the “state.” A policy whose 
action at any time I depends only on .r(t) is called stationary, 
again in accordance with the terminology of Markov Decision 
Processes. Under a stationary policy, the system is described 
by the Markov chain {.r(l)}. 

We will treat a larger class of scheduling policies than sta¬ 
tionary policies. We will consider any scheduling policy which 
takes a constant action in intervals of the form [r n , r n +i), and 
call such a policy noninterruptive. (The term noninterruptivc 
should not be confused with the term nonpreemptive.) As 
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lig 2 State transition diagram for basic open re entrant line 

an example, the well-known FCFS policy is noninterruptive 
Note that any scheduling policy that does not change actions 
between real transition epochs is noninterruptive Of couisc, all 
stationary policies arc of this form and are hence automatically 
noninterruptive 

We will allow preemptive priority at an epoch, H the 
scheduling polfcy calls foi it 

let (, - (0 ,010 ()) r be the /th unit vector The 

state transition diagram of the network is as shown in Fig 2 
lot any noninteiruptive policy 

Let us consider the quadialic form r 1 (r tl )Q i (t„) Note 
that since i(t„) can grow no taster than linearly in n, the 
conditional expectation E[i 1 (r„+iK,M T Nfi) I ] exists 
Fiom Fig 2, for any noninienruptivc policy wc obtain 


Hence 

f,[l 7 (T,.4lKMT r , 41 )|^r ] 

< ' r (T n )Qi{T„)-\ 2\<{Q,[T,) 

l 1 

+ 2^/x (",(r n )((, n f )'( t M 

/ = J 

- 2)11 U'; (r„ )<[Ql(T„) + 1/ 

Let us suppose Thai the initial condition is duemiinistii (oi 
more generally, a bounded random vanable oi even n.oic 
generally, has a finite second moment) As noted eailiei 1 1 ) 
grows no laster than linearly in n Hence / ( i ’( r,,)) exists tot 
every n By taking the unconditional expectation vse ohuin 

M ,7 ( r »4l)SM r r«tl)] 

< H[i 1 (T„)^((r„)] t- IX, [QI [i(r, )] 

L-1 

+ 2^2/1,1 [»,(r„)(f , + i f,) 7 rlx, )] (4) 

- 2ft L i;[wi (r„)»}^i(r„)] 4 M 

Lei us denote 

)'j( 7 ») (5) 

Using (*>), (4) can be written as 


I I' 7 (t,M l)SMn.+ l) I A ] 

= X(i(T„) 4 ( \) r Q(i(r„) 4 < i) 

i i 

4 II,It’ (t„)(i(t„) - (, 4- <4h ) 1 Q(i(t„) - <, 4- <, hi) 
+ I'l "’I (r„)(i(r„) - 11 ) 7 (}( i(t„) - (i ) (1) 

i 

H <i ,(t„))i 7 {t„)Qi(t„) 

I- 1 

Using (2) and the symmctiy of (J as in (1) wc obtain 

F[r‘ (t,i+i )Q r (x„_(-1)] | T t 1 

■ i 1 4- 2X< { Qi(t„) 4 A 1\ Qi ) 

L 1 

+ 2 (t„ )(f , 4 i -i ,) J Q f (t„ ) 

I 1 

l -1 

+ ,+ \ -<,) 1 Q((, + 1 - (,) 

l-\ 

- l\\ L Wj (r 1t )( \ Qi(t u ) 4- Ilf It'i (T n )( 7 l Q( 1 

Note now that since u',(n ( ) = 0 or l, all the terms not 
iealunng /(r 7J ) above are bounded above by a constant, l e , 

L -1 

Af 1 Qi ! + ^2 »i)(f Ml - < / M - ft) 

1 

+ HLU'LiTn^jQ* I < Af <K +-X. 


7 \\)Ql (t„4 1 )] 

^ / (/ 1 

i 

t i 

1 1 L 

•In'll ( n(T„)) 

, I ,-i 

/ 

- 2ii l ^2 , Ii i 1 ( i /( r »)) I- M 

i i 


By summing ovei n , and telescoping, wc obtain 


— 1 y 

A 4- 1 1 


^<11,1(1 ,{?„)) 

I I 

l 1 I 

- 1 - 1 ' 11 


-l y-J 
1 


+ in ^2'nii'( ii( r ,,)) 


1 1 


(6) 


l 


- /;[l 7 (rv + i)^l(TX4l)]) 

4 y £ M' < 4 v lor all \ 


In the last inequality above, we have used the nonnegalivity ol 
; r (r A -|.i)( 1 ?r(Tv-Hi), which is guaranteed by the copositivity 
condition (I), since r(r\+i) lies in the nonnegative orthant 
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Now note that if we can find a 7 > 0 so that 

L 

j =1 

L-1 £ 

- r /u) £ ( 2 ->( r «)) 

.=1 j=i 

L L 

- < -1 53 e ( j 'i ( r ',)) 

j=i j=t 

then from ( 6 ) we would have stability-in-the-mean, i.e., 

, TV L 

■jy — 1 53 E(xj(r n )) < M" < +00 tor all N. (7) 

n =0 / —1 

Before we pursue the issue of finding such a 7 , we point out 
certain consequences of stability-in-the-mean for stationary, 
nonidling policies. In the rest of this paper we will restrict at¬ 
tention to scheduling policies that are nonidling, i.e., whenever 
one of the buffers at a machine is nonempty, then the machine 
cannot stay idle. For stationary, nonidling policies, {./-(t,,)} 
is a time-homogeneous, countable state, Markov chain, which 
has a single communicating class that is aperiodic (since the 
origin can be reached from every stale, and the system can stay 
at the origin for two consecutive time steps). The condition (7) 
then guarantees positive recurrence, i.e., the existence of an 
unique steady state probability distribution. To sec this, note 
that if the chain is not positively recurrent, then the probability 
that the chain is in a fixed finite set of states converges to 
zero as v —♦ oo. Then, that is so even for the finite set of 
states {,r: ,v t < M", and all J\ > 0 and integral}. This 

contradicts (7). Moreover, the Markov chain has bounded first 
moment, and the mean total number of customers converges 
to a finite steady state valqe. In fact, we will show in the next 
section that it even establishes the geometric convergence of 
an exponential moment. 

Let us now see how to assure (7) for some 7 > 0 . We will 
actually work at assuring that the inequality (7) holds without 
the expectation being taken, i.e., 
l 1 L 

^ ^ ^ "t *} (Tfi ) +■ ft I ^ ( h ] )%ij ( T n ) 

1 < = 1 

l L 

-/a £///./ J i./( r J < (8) 

j =-1 /=1 

Let us now motivate the reason lor restricting our attention 
to nonidling policies in our tests for stability. Note that the 
coefficients A rjij of the x,(t„Ys on the left-hand side (LHS) 
above arc all nonnegative, while the corresponding coefficient 
(- 7 ) on the right-hand side (RHS) is negative. Clearly, to 
assure ( 8 ) it is necessary that there exist some choice of 
constants for which 

L L 1 . 

f *U Z i 

J~ 1 J -1 

Focusing on a fixed index /, one will in particular need 
j'j(T n ) to be bounded above by some linear combination of 


{ Zlj{Tti )': 1 < / < L}. This can only be assured if some 
machine is guaranteed to be working whenever .77 > 0 ; hence 
the restriction to nonidling scheduling policies in the sequel. 
Let us return to ( 8 ). For notational convenience. Jet us define 

(Jl+ij : = 0 for all j = 1 , 2 ,* ■ ■ ,L. 

Focusing on a fixed value of the index j, it is clear that ( 8 ) 
is assured, if 

L 

Mi,?,{*„) + 5^/^,(</.+],, - q,j) z,j{t„) 

1 = 1 

< - 7 ) for all j = L 2, • ■ ■, L. (9) 

Grouping the terms by machine, i.e., using {1,2, ■■■,!} ~ 
(J^ {/: cr(i) = rr}, we see that the LHS of (9) satisfies 

L 

j l L \ ( f A + 1 j ~~ ( h 1 1 i T n ) 

S 

= Mi ,-Mr„) + 53 53 /'-('/< + !.)- <iu)~n( r n) 

1 {* rr(i) = rr) 

.S 

< Mlj- r ii T n) + 53 , in;ix /<,((/,+ !,J - </,,) 

“1 0 *(0=*) 

x 53 z n( T ") (><)) 

= ^ij./ j(r n ) + max u 

0 C7(<)-fT( /) } 

x y z,,{t„)+ y max 
{« "(*)*“"(/)} \ <T ‘ "'^0)1 
* /0(<//+J y ~ 7//) ^ 

rr (f) ( t ’\ 

Wc now investigate how lo assure that the RHS ol ( 10 ) can 
be bounded above by — 7 . 17 ( 7 “,,). For nonidling policies 

53 ■ r <( T ») > o=» 53 - 1 

(< rrr?)—<rJ {, fr(i) -rr) 

and so 

■rj(T„)= 53 «'i(r„)j-,(r 11 ). 

0 rr(0 = rrf;)} 

Hence 

■ r j( T ")= 53 

Moreover, since other machines need not be working when b } 
is nonempty, we have the nonidling inequalities 

■r,{r„) > 53 for all a' (r(j). ( 12 ) 

fi. rr(i )=rr / } 
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Employing (11), (12), in (10), we obtain 


L 

Xqijtj(Tn) + Y^^‘+ij- jM 


£ Xq,+ 

+ E 

{<r <f fa (/)} 


(■ 


max 

'KO^OU 




(/i 7) ) * / ( T n ) 
1 + 


max /i,(<j, + i,-</u) 

{i tr(<)- rr ) 


'i(tVi) 


/-I 

+ E 

<=. i 

X \Z( r (T„) -f, + r 1 + i) r Q(i(r, ) ( i, 

+ f‘LWL(T n )\J(r(T„)- ( [ ) T Q{ I [ I , 

+ E' 1 '^ - "<( T > , ))/^ r ( T > , ) f v ) '(i) iU) 

I 1 


Above, [y] + = m«ix{y,0} denotes the positive part of 7 The 
sign needs to be taken into account, stnee a negative sign of 
/i,(q,+i j - q,,) may reverse the inequality 
Hence we see that it we can find an appropriate set of j q tJ } 
ten which 

X(]i, 4- max IIq,j) 


< -7 tor j = I 2 / 

for some 7 > 0, then ( 8 ), and hence (7), is assured Thus we 
have arrived at the following theoiem 
Thcotem J—The Bush Chat cu tei nation Consider the ba 
sic open re entrant line Suppose there exists asymmetric 
copositivc matrix Q = [q tJ ] which satisfies the following 
conditions f or every y - 1 2 I 


£ 

\a a ^er(i)) 


max 

(i rr(i)—a 


/'.(</<+] 1 -<!>,) 


Xq\ 1 + dlix /i.(f/, + i , q,,) 

{' ”(i) «▼(/)) 

f 

<" 0 (H) 

(Above 7/41 y - 0 lor all j) Then, every nomdling, 

nonmierruptive scheduling policy is stable in the mean, 1 e , 
there exist constants i C such thal 


E 

{0 a ^rr( /) J 


in.ix Ih(q,+I J-<!,,) 
{' } 


n 0 1 1 


' Hi'(QKMC))) 

N 


+ (' for all N 


Moreover, il the scheduling policy is stationaiy, ihen there is 
in unique steady stale probability distribution 


From the concavity of the square loot we obtain 


< [ ,7 { T n)Ql (t,i) 4 2\'[Q,(t ) 


1-1 


+ 2 ^ ,)((,+] -(,)'(}!( T„) 


1-1 


- 2/i L wi (r„)f J(Ji(t„) + AT}'** (15) 


Now suppose the conditions of Theorem 1 are met Then, from 
( 8 ), the RHS above can be bounded as 


RHS of (15) < 


/ 


/ 


t /2 


'(TjQffr,,) --^E + 

7 i 

(16) 


Now from Theorem 14 2 2 of [28] by taking r there to be 
the first hitting time of the origin it follows that there exists 
a b > 0 , such that i r Qi > lor all large enough \\t\\ 

in the positive orthant [ (Or in the case where all q n > 0, 
the fact that q n > (i follows trivially from inequality (m) of 
Theorem 1) Hence, theie exists an ( > 0 small enough so that 


RHS ol (16) < [i 7 (t„)Q'(i „)] W2 -f 
whenever 

S;i l (r„)> A/ w 
1=1 


for some large M'" 


V From Stabii ity to Glomf-tric 

CONVtRGFNCh or AN TXPONhNTIAL MOMFNl 

In fact, for a Markovian system, the above Lyapunov based 
negative drift argument actually establishes the geometric 
convergence of an exponential moment (defined below) Thus, 
in particular it establishes the hnileness of all (polynomial) 
moments and their geometric ergodicity 
To see this, we simply work with the square root of the 
earlier Lyapunov function From Fig 2, just as we obtained 
0 ), we obtain 



Letting W'(t„) = yji (r„) 7 Qi ((r„)) we have thus 
shown thal 

H^(Tnu) \f(r )]<W(T n )-< If t(T n ) lies 

outside a compact set, 

and is bounded when i(r u ) is in the compact set Moreover, 
the state can jump by only a bounded amount at each tiansition, 
and hence W{r n] \) - W(r n ) is bounded From these two 
facts, it follows thal the Markov chain representing the evolu 
tion of the system has a geometrically converging exponential 
moment, see [28, Theorem 16 11 ) 


= Ay/(j(r ri ) + <i) 7 4K*(t-„) + <i) 


'Thus Q is actually strictly copositive 



2$6 
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Theorem 2—Geometric Convergence of an Exponential Mo¬ 
ment: Consider the basic, open re-entrant line. Suppose that 
the scheduling policy is stationary and that all the conditions of 
Theorem 1 are satisfied, Then the Markov chain {:r(r r) )} has 
a geometrically converging exponential moment, 2 i.e., there 
exist ( > 0, r > 1, and C < oo, such that for any function 
/ satisfying \f{y)\ < cxp(f||y||) for all y , and any initial 
condition j(r 0 ) = x, 

oo 

£r"|£[/(T (TO )]-£/(y)>r(y) |< rcxp( f |k||) for all x. 

0 U 

Above, ir(y) denotes the steady-state probability of the stale 
y. Hence, in particular, the Markov chain admits a finite 
exponential moment. That is, for some C f < oc 

£'[exp(f||ir(r n )||)] < C 9 exp(f||jp||) < oo for all ti. 

The reader is referred to Meyn and Tweedie [29| for 
estimates of the rate of convergence. It is worth mentioning 
that the uniformization procedure is just a way of com¬ 
puting the drift Ax T Qx , where A is the extended gen¬ 
erator for the unsampled Markov process. Thus one actu- 
ally has, for some /> < 1, | E[f(r t )} - f{y)n{y)\ < 

Cp f V(x) for all x. and all t > 0, i.e., a similar geometric 
convergence for the original unsampled chain. 

VI. A Linear Programming Characterization 

As noted earlier, if Q is a symmetric nonnegalive matrix, 
then it is copositive. Note now that the LHS of (13) in Theorem 
1 is homogeneous in Q. Hence, if (13) is valid, then by 
multiplying Q by arbitrarily large positive numbers, one can 
drive the value of the LHS of the inequality (13) in Theorem 
1 to — tc. This allows us to provide a sufficient condition for 
stability in terms of the unboundedness of a linear program. 

Theorem 3—A Linear Programming Characterization: 
Consider the basic, open re-entrant line. Suppose that the 
following linear program has an unbounded solution 

Max 7 


subject to the constraints 
^0i,+ r j+ *<r', y + 7<0 for all j 

(it' 

Tj>M,(rMi,y - <h,) for all / £ /(./), and for all j 
r ,j > //i(7f+i,j - 7i/) for all / with rr(/) = a, and all j 
Qn-fJji for all j 
1 , 7=0 for all j 

q tJ > 0 for all 

Xtrj> 0 for all a, jf 

Vj unrestricted in sign. 

Then, every nonidling, noninterruptive policy is stable in the 
mean. Moreover, every nonidling, stationary policy has a 
geometrically converging exponential moment, 

2 This property is called ‘‘» , xi>(F||,i’||)-uniform ergodicity" in [281. 



Fig. 3. Example ol Sections VII and X 


The number of variables {</, ; , r>, ,, 7 } in the above linear 

program is + /, + (S - l )L + 1 . (Note that the 

variables q tJ with i > y, and are not really 

needed.) The number of constraints is L + |/(,y)| 4 

£j=i£„ a^ 0 )\{ i: a (') = ^}|- It may be possible to 
rewrite the linear program more economically. 

We find it convenient to slightly modify the linear program 
in Theorem 3 by bounding 7 by I. Thus if the value of the 
linear program is 1, then one deduces stability (rather than 
from the unboundedness of the value as in Theorem 3). 

Corollary I — A 0-1 lest of Stability Consider the same 
linear program as in Theorem 3, except that we impose the 
additional constraint 


7 < 1. 

If the linear program has value one, then every nonidling 
noninterruptive policy is slable-in-the-mcan. Moreover, every 
nonidling, stationary policy has a geometrically converging 
exponential moment. If the value of ihc linear program is zero, 
however, then no conclusion can be drawn regarding stability 
or instability. 


Vll. Example: All Nonidi ing Policies Stable 


Consider the system shown in Fig. 3. Then, to show that 
there exists a (J = Q T satisfying (7), it is sufficient to find 
<hj - <l,i 5 0. so that 


[Ar/i 1 , Ac/i-j. Ar/ui] 




+ [~/'l7ll + 0l<ll2> ~l<\<h2 + null2, -P\<lU + /M72L 


- P2<lu + P2<hs* 


-/'2</22 + P‘2<}2A- ~b2<l2A + //27J.H 
-h(T’n)' 


- PPtU< “/LK/2J, ~P.\<1 jj] 


| -1 2{Tn) 

-21 (t„) 

= 22 (^ 1 ) 

|223(T„) 
«dl(Tri) 

< - 7^1 (r n ) 4 J- 2 (r n ) 4 .r 3 (r n )]. 


By the nonidling condition, = z u {t u ) 4 

J‘2(r n ) = -22(^1) and .r A (r n ) = zi 3( t „) + 333(7-,,). We 
thus see that it suffices to show that one can choose 
{ 911 ^ 12 ^ 713 . 922 ^ 23 ^/ 33 }. all nonnegative, so that A q n - 
Pi911 + /M912 < lit -pl<ll 2 4 Pl<l 22 < 0. -/11913 4 P] <123 4 
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l ie 4 Dai Wang example ol Section VIII 

V/H < ^ + L l 2<]\ i < 0. + //2'/2i + A(?12 < 0 

+ /'27JJ < ~/U7l3 + Af/u < 0 , j < 0 , 

//j(/13 “H ^7ii < 0 

Let us suppose that //i = p z //) = /*. and p = ^ Then 
the above ts equivalent to, (1 - p/2)q n > 712 712 > < 122 , 
(J - p/2)qi 3 > f/ 2 j 712 > 7u* 722 > 4- 723 , 72 1 > 

7u </n > P<l\\/2, 723 > 0, 7ij > pqu/2 
It is easily checked that if p <. 1 then one can choose 
7 t ^0 which satisfy the above conditions 
Hence, we conclude that if //1 = p z — p\ ~z p and 
p = ~ < 1 , then all nonidling, noninlerruptive, scheduling 
policies arc stable in the mean This includes the FCFS policy, 
which is already known to be stable since //] — p j see 
Kelly [W| Moreover, every nonidling stationary policy has a 
geometrically converging exponential moment 


and (t(l) = ) As a consequence 

T j( T n) = ^ z lf ( /, ) 

{’ • */(/) *(/)S*(i)) 

- 2 <;( T ») l° r / IT , 

{l r7(l)=* } 

Recall horn Section IV that our goal is to deiumm 
symmetric copositivc Q satisfying 

■V/i i r J {T„)+ 

>€/(;) 

+ 53 5Z /'-('/Ml, «f„) A ) 

{it rr #nr(y)} {i T(i)=rr } 

< -T ,(t„) tor 1 = 1 2 1 

foi some 7 > 0 ( This can be seen by rewntmg the I HS ol ( 9 ) 
as in the hrst equality of ( 10 )) Using (17), (18) the conditions 
required to establish stability in Corollary 1 can be lelaxed 
Theorem 4—Stability of Buffer Priority Policies Consider 
the basic open re-entrant line Let {0(1) 0(7)} be a 

peimutation of {1 L) Consider the preemptive butler 

pnonty policy which gives pieference to b, ovci b, il 
0(0 < 0(/) and both share the same machine Then the buttci 
priority policy 0 ( ) is stable, with a geometrically converging 
exponential moment, if the following linear piogiam has 
value one 


vril THr Dai Wanc. Fxampi p 

Dai and Wang f 1 2 ] (see also 114)) show that the system of 
Fig 4 does not have a Brownian approximation It is a basic 
open le-entiant line with service rates pi = 10 p z — 20 p\ - 
10/9 p[ = 20 , and p t — 5/1 Let (> = X/pi H-A/// 2 +A ///5 - 
V/M + X/pi be the load factor on the two machines Our goal 
is to determine whether the system is stable for all p ^ 1 

Fust note from the equations involving A in (H) and 
( orollaiy 1 that if the value o! the linear program is one tor 
some A' then it is one for all A A' Hence there will be 
critical value A ( ,„, such that the linear program has value one 
lor A < A c ,, t and value zero foi \ > A, nt Equivalently, there 
exists such a /> iu1 So we wish to see if p lTlt -= 1 

Investigating the linear program horn Corollary 1, we hnd 
lhat its value is one for A <" 0 55587 (approximately) and 
/eio tor 0 55587 < A l Thus we can only assert stability 
lor p < 0 9 r )(0 55587) = 0 528 

IX Bupffr Prioriiy Pouch s 

Consider the basic, open re-entrant line Suppose that at 
<veiy machine a there is a rank ordering of the set ol buffers 
\b, (t(i) = a) served by the machine according to which 
preemptive priority is given by the machine To describe such 
a buffer priority policy moic concisely, let ( 0 ( 1 ) 0 (/>)} 

he a permutation of {1 2 L), with preference given to 

b, over bj if 0(r) < 0 (/) and both b t and b, share the same 
machine, 1 e a(/) = <r(y) The policy is nonidling, stationary, 
and preemptive 

Then, j,(t„) > 0 => u; y (r, f ) = 0, if 0(0 < 0(0 and 
rr(0 = cr(j) Hence z }l = U'j(t„) 1 r (r n ) - 0 , if 0(/) < 00) 


Max 7 


subject to 

A7i/+ max 7,,) 

{, .t i{,) and 0 (f)l#O)} 

- 

111 fix p,(q,+i , ~ 7m) 4 7 

(| ff(i)=n } 

<0 for f = l 2 L 

7^1 

7,7 - q Jt 2 0 for 1 < /,) < 1 


* E 

{fT fT J(T (j)} 


X Exampi f LBFS and FBhS arl Siabi h 

Consider the system of Fig 1 Unlike in Section VII we do 
not lequire that all the p, \ are equal 

Fust let us examine the class of nonidlmg policies Can wc 
prove that all nonidling, noninterruptivc policies art stable ; 
To investigate, we consider the following lincai program 

Max 7 ( 19 ) 

subject to the constraints 

Ay n + iiulx{/(]( ryu — f/i 1) - IH<h \} 

+ j - r; u j 0} 4 7 < <> (20) 

Af/i 2 + liiiqt j - 1 m) + iu.i x{iti{<hi ~ ( lu ) 

-/MC72J 0} +7 < 0 (2D 

Agn + iii.ix{//i((/2) - <]n) —// jf/n} 

4- nicix{// 2 (f/jj — 0} + 7 < 0 


(22) 






Fig. 5. Value of LP for all nonidlmg, noniniemiptivc policies in exumple of 
Section X. The figure on the right is a more detailed view of one comer of 
the figure on the left. 



Fig. 6. Value of LP for example of Section X Joi the LBFS policy and lor 
the FBFS policy. 

7 < 1 (23) 

(24) 

Let p t := and consider p\ = p$. Fig. 5 plots the value 
of the linear program as a function of 2p\ and p 2 . It shows 
that there is a region where the value of the linear program 
is 0 , and for such values of pi,/> 2 *P.i(= /^i), no conclusion 
regarding stability or instability can be drawn. 

Now consider the LBFS policy. The corresponding linear 
program is the same as (19)—(24), with excepiion that (22) is 
changed to 

Af/13 - + lliax{// 2 (7'13 - <] 26 ), 0} + 7 < ()• (25) 

The plot of its value as a function of 2p\, and f >2 is given 
in Fig. 6 . It shows that the system is stable for all values of 
2pi < 1 and p 2 < L Hence, for //] = p 3 , the LBFS policy is 
stable in the entire capacity region. 

Now tum to the FBFS policy. Its linear program is also 
almost the same as (19)—(24), except that (20) is changed to 
the following 

■W 11 +/n(< , /i 2 -'/ii) + uiax{// 2 ((/ij-(/i 2 ). 0}+7 < 0. (26) 

The plot of its value is the same as that of LBFS, as shown 
in Fig. 6 . Hence it is also stable in the entire stability region, 
when pi = p 3 - 
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XL A Nonlinear Programming Characterization 

Note that every symmetric positive semidefinite Q is copos¬ 
itive. Thus, in our stability tests, we could use the class 
of symmetric positive semidefinite matrices. Every such Q 
possesses a square root A, i.e., Q = A T A. Hence, one may 
search over the unrestricted space of a lf ' s, rather than the 
space of q tJ ' s. This yields the following Theorem. 

Theorem 5—A Nonlinear Programming Characterization. 
Consider the basic, open, re-entrant line. Consider the non¬ 
linear program 

Max 7 

subject to all the constraints of Corollary 1 , except that 
every q tl is replaced by Ylk-\ a kt (i k } , and the nonnegativity 
constraint on the g,/s is removed. If this nonlinear program 
has value one, then, every nonidling, noninierruptivc, sched¬ 
uling policy is stable in the mean. Moreover, then, every 
nonidling, stationary policy gives rise to a Markov chain, with 
a geometrically converging exponential moment. 

Also, one can extend this Theorem lo search over con¬ 
vex combinations of a nonnegalive matrix, and a positive 
semidefinite matrix. This yields the following theorem. 

Theorem 6—A More General Nonlinear Programming Char¬ 
acterization Consider the basic, open, re-entrant line. Con¬ 
sider the nonlinear program 

Max 7 

subject to all the constraints of Corollary 1, except that every 
</,, is replaced by r// -f uki<Hj, and ihe nonnegativity 

constraint on the (/,/s is replaced hy nonnegalivity constraints 
on ihc q* tJ ' s, while the (/,/s are unrestricted. If (his nonlinear 
program has value one, then every nonidling, noninterruptive, 
scheduling policy is stable in the mean. Moreover, then, every 
nonidling, stationary policy gives rise to a Markov chain, with 
a geometrically converging exponential moment. 

Both these theorems can be extended in the same ways as 
Theorem I, to treat various kinds of systems and scheduling 
policies. 

To go beyond Theorems 5 and 6 and obtain the most 
powerful test obtainable through our approach, one could 
simply cheek whether the value of the linear program in 
Theorem 3 is one, without imposing any sign restrictions on 
q ir i.e., after removing the constraints r/, y > 0. If the value 
is indeed one, one can then test whether the obtained Q is 
copositive, using an algorithm as in |25]. As noted in Section 
III, however, the lest of copositivily is NP-Complete and may 
therefore be computationally complex for systems of large 
size. 

XII. More General Queueing Nftworks 

Consider a queueing network with S machines and L 
buffers. Lei us suppose every buffer b, has an exogenous 
Poisson arrival process of rate A». A part leaving buffer b, 
goes to buffer b t with probability p n and leaves the system 
with probability (1 - P*j)‘ The service lime of parts in 

buffer //, is exponential with mean Lei us rescale time so 
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f ig 7 Stale transition diagram for general queueing network ol Section XII 


that ^ + Ylj =1 l l j = 1 The sla,e transition diagram is 
shown in Fig 7 Hence 

r(i T (r n+1 )gi(r, M1 )|^ r ,) 

L 

= f f <) 

1 -i 

l L 

+ -t, + <,/ 

<-i i=i 


x C^( / (t m ) r | -|- r |) 


L 

+ i 1 '"’ 1 ( r ,*) 

i=i 



('(tv.) - ',) J 


i 

X Q(i(t„) - l,) + ^M,(l W,(T„))| T ( 7 „)yi(T (l ) 

»-J 


Pioceeding as in Section IV, one may obtain the following 
thcoicm 

rheot cm 7— Stabiht\ of a Gena a\ Nt twoik Consider the 
more general queueing network above operated undei any 
nonidling, nomnterruptive policy Then it is stable in the 
mean, provided there exists a symmetric copositive Q that 
satisfies the following inequalities tor / = 1 2 L 








Machine 1 


Mai l nr ^ 





hg S The Rybko-Stolyar example ol St u inn Xlll 



Hg 9 Value ol ihc linear piogiam loi Rybko- Slolyai example ol Sunon 
Xlll The figure on Ihc nghi is a mo tl detailed uuv ol one portion ol the 
hgurt on the lei l 



fig 10 Snhlt unsidble anti undetermined ugions loi R\bkiv Stulvai tx 
ample 


Ihe service time requirements meet the capacity condition, 

i/m + l 

The following linear progiam tests the stability of the system 
for all p i and f> 2 


l 


max fj, 

(T{l)=<7{ l)\ 


+ 




£ 

* ?M/)} 


max 

{< fT(l) IT 



Max 7 

subject to ihe constraints 

Max[Af/]i + Ar/1 J fi)<{ nd //1712 Ary| j + A(/m - P i<i\ i] 
+ Max[0 i 4- Patfn] < 7 


This can also be written as a linear program if we restrict 
(j ff > 0 lor all i /, or as a nonlinear program if wc take 
Q ~ \ 7 A + [r/JJ, with nonnegativc ry^\ Moreover if the 
policy is stationary, then the system is a Markov chain with a 
geometrically convciging exponential moment 

XIII Thl Rybko-Stoi yar Exampie 


Ary 12 4- Ar/jj - piUn 4- Max[0 > H b 7 

MaxfAryn 4- Ar/jj — p 2 q 2 i ^7Ji4-Af/*j pp/n 4 ft \(J 11] 
4- Max[0 -h // 1 <i 2 t] ^ 7 

Ar/ii + A *4 - p P/u 4 Mnx[(), - /M7n 4- I 1 O/ 44 ] j* 0 


Consider the system shown in Fig 8 The arrivals to buffers 
b\ and hs are Poisson of rate A The service times are all 
exponentially distributed, with mean 1 ///1 at buffers b x and 
and mean l/p 2 at buffers b 2 and b 4 Consider the buffer 
priority policy with ordering {/j 4 , b 2 b s b \} 1 e , with priority 
given to buffers earlier in this list Let us define pi = A/pi, 
and p 2 = A jp 2 

Rybko and Stolyar |4| have shown that this system is 
unstable for A = 1, if p 2 > 1/2 and p\ > 0, even if 


7 ^ 1 

< h ) > 0 7 > 0 

Fig 9 plots the value of the linear piogram loi 0 p\ < 1 
andO < pi < 1 (The region pi 4 P 2 > 1 should he disregaidec! 
as it lies outside the capacity region ) 

The following points are salient hirst, the value of the lineai 
program is zero in the region p 2 > 1 /2 and thus noncontra¬ 
dictory with Rybko and Stolyar’s icsult Second, as shown in 
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Fig. 10, for most of the rest of the capacity region, the system 
is stable, since the value of the linear program is one. There is 
a small region, however, where the value of the linear program 
is it ro; thus the stability remains unresolved there. 

XIV. Concluding Remarks 

We have provided here a programmatic procedure for estab¬ 
lishing stability of queueing networks and scheduling policies. 

There are several interesting questions which arise. First, it 
would be useful to study the structure of the linear or nonlinear 
programs and thus directly establish the stability of policies. 
We have done so analytically for the example of Section VII. 
Second, in all the examples tested by us, any Q giving a 
negative drift was always found to be a nonnegative matrix. It 
would be useful to delemnne whether there exists an example 
of a system where Q is copositive but not nonnegative. This 
should show that the more powerful tests of Section XI are 
in fact valuable Third, it would be useful to implement a 
multi-step drift version of the above results. Finally, it would 
be useful to carry out a similar development for “instability" 
results, as in Fayolle 116] and Coffman et al , [20|. 
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Second-Order Properties of 
Families of Discrete-Event Systems 

Rajandran Rajan and Rajeev Agrawal, Membet , fEEE 


Abstract — We consider discrete-event systems (DES) whose 
logical component is characterized by a constraint set and whose 
lemporal mechanism involves synchronization of the clock se¬ 
quence with a master clock. We are interested in determining 
sufficient conditions on the constraint sets of a family of such 
synchronous DES that ensure that the event counting process of 
one system dominates the convex combination of the event count¬ 
ing processes of a collection of systems Our point of departure 
is a result due to Glasserman and Yao [ 16 ], which established 
a sufficient condition based on characteristic functions. First we 
show that the characteristic function condition is equivalent to a 
simpler condition on the score spaces themselves. As both of these 
(equivalent) conditions arc rather strong, however, we introduce 
cocvality to obtain weaker sufficient conditions. To demonstrate 
the scope of these two results, we prove the near-concavity of the 
throughput in various parameters for min-linearly constrained 
DES. This not only covers various known concavity results 
for tandems, cycles, and fork-join networks of stations with 
general blocking and starvation, but also establishes new ones for 
certain classes of networks which involve splitting and merging 
traffic streams. These results are finally extended to the class of 
generalized min-linearly constrained DES. 


I. Introduction 

A timed discrete-event system (DES) has two aspects — 
the logical and the temporal. The logical aspect of the 
DES deals with the order in which events can occur. This is 
completely specified by a language which is the collection 
of all allowable strings of events. The temporal aspect of 
the DES determines which one of the allowable strings is 
actually realized, and the times of occurrence of the events 
m that string. Several mechanisms, such as state machines [11, 
Petri nets [2|, and finitely recursive processes [3], have been 
proposed for generating differenl classes of DES languages. 
In this paper, we consider languages that arc generated by 
constraint sets, T c Z T j‘, in a manner to he made precise 
in Section II. In that section, we also specify the temporal 
mechanism through the event counting process \D(t) = 
(Dj(/), ■ ■ ■, A„(/))}J1 0 where D n (t) counts the number of 
occurrences of the event up to time /. We are interested 
in structural properties of the event counting process D r as 
a function of the constraint set T. The main result that we 
obtain establishes sufficient conditions for D r to be concave 
as a function of T, in the following sense: If T l \ 0 < p < r. 
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by Associate hdiloi, S. Laforlune. This work was supported in part by NSF 
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satisfy T° D 




, where 


x>^] »{[iv' 


x 1 ’ €TM 


< }> < / 


} 


( 1 . 1 ) 


for some q,, > (),/>= 1,and - 1. then 

the corresponding event counting processes I) p - D 7 ' 

0 < p < satisfy 


D°(1 ) > ). for all limes / > 0. (1.2) 


While this definition of concavity is not standard, it serves to 
ease the exposition. 

The above structural result may be used to deduce the con¬ 
cavity or near-concavity of the throughput of certain queueing 
networks in various parameters such as the buffer configura¬ 
tion, initial job configuration, and blocking parameters. This 
has important implications for optimal design and control 
problems arising in such networks. In particular, it allows 
one to establish the convergence of various tk local search” 
based optimization algorithms to the globally optimal solution. 
Further, when combined with other structural properties in 
specific queueing networks, it enables the identification of 
the optimal solution or the reduction of the search space for 
optimal allocations (see f4|—[ 11]). For example, consider the 
problem of selecting the optimal job population in a closed 
cycle of /M/l/A- stations with communication blocking. For 
this problem, concavity (see [6|) along with a certain symmetry 
property (see [12]) implies that the throughput is maximized 
when the total job population is equal to half the total buffer 
capacity in this cycle (see 113J for some generalizations of 
this result). Example 5.4 in Section V-B of this paper provides 
another instance of how concavity may be combined with a 
symmetry property lo solve for the optimal job configuration. 
For an application to optimal control, see |14, Example 6.3a], 
where a generalization of the dominance result (1.2) is used 
to extend a result on optimal routing due to Ephremides ct 
ai [15]. 

In the next section, we develop the terminology and notation 
that we will require for the rest of the paper. We present 
a self-contained introduction to synchronous discrctc-cvent 
systems (SDES), with an emphasis on treating these systems 
through a language/scorc space approach, as opposed to one 
that emphasizes the state evolution mechanism. 


0()J8-9286/95$04.00 © 1995 IEEE 
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Our point of departure in Section III is a result due to 
Glasserman and Yao 116, Theorem 7.2], a generalization 
of which is presented here as Theorem 3.2. This theorem 
establishes that the dominance of the characteristic function of 
one SDES over the convex combination of the characteristic 
functions of a family of others implies a similar relationship 
among their event counting processes. The first general theo¬ 
retical results in this paper are Theorem 3.3 and Lemma 3.5, 
which establish that dominance of one characteristic function 
over a convex combination of others is equivalent to a similar 
relation among the constraint sets of the SDES, and hence the 
latter too implies the dominance of one event counting process 
over a convex combination of others. The chief merit of this 
theorem is that the dominance condition for constraint sets 
(1,1) is much easier to verify than the dominance condition 
for characteristic functions. 

Theorems 3.2 and 3.3 require unnecessarily strong condi¬ 
tions on the score space for the desired properties between 
event counting processes to hold. We redress this by intro¬ 
ducing the notion of coevality in Section IV. Theorem 4.1 
uses coevality to give a weaker set of conditions on the score 
spaces, that are still sufficient to establish the concavity of the 
event counting processes. 

Section V-A introduces the class of min-linearly constrained 
SDES that model a wide variety of queueing networks. Such 
systems correspond to forward conflict-free Petri nets, i.e., 
Petri nets in which each place is followed by at most one 
transition (see 117]). We apply Theorem 3.3 to show the near¬ 
concavity of the event counting processes as a function of 
the parameter sets that define these systems (Theorem 5.3). 
In Section V-B we extend the class of min-linearly con¬ 
strained SDES to that of generalized min-linearly constrained 
SDES. Such systems arise from Petri nets (not necessarily for¬ 
ward conflict-free) where conflict resolution is managed by a 
switching mechanism (see [181). Subsequently, we generalize 
Theorem 5.3 to obtain near-concavity in both the parameter 
sets and the switching mechanism. Many existing results on the 
concavity of the throughput of queueing networks in various 
network parameters, such as the configurations of buffers and 
jobs, fall within the framework of min-linearly constrained 
SDES. In particular, these include cyclic, tandem, and fork- 
join networks of queues with general blocking and starvation. 
In addition, the min-linearly constrained SDES framework also 
allows us to obtain similar results for a much wider class 
of queueing networks which involve splitting and merging of 
traffic streams. We also present several examples that illustrate 
the scope of these models and of the earlier theoretical results. 

II. The Set up 

In this section we describe the logical and temporal aspects 
of SDES. The first part of this section develops the terminology 
of languages and score spaces, while the second defines the 
associated event counting process and sequences of event 
occurrence times. The setup as described in this section is 
essentially the same as that of Glasserman and Yao [J6J, 
with the difference that we focus directly on languages rather 
than on an underlying state evolution mechanism such as 
generalized semi-Markov schemes. 


A. Languages and Score Spaces 

The logical aspect of a discrete-event system A(£) is 
described by its language £ which is the set of all feasi¬ 
ble sequences of events. The alphabet or event set A := 
{1,2,---, m} is the list of all possible events that can occur 
in the discrete-event system A. A string a of events from A 
is given by rr = r>i * - * rv TI , ru E A 1 < fc < n, n > 0. 
When n = 0, we have the null string which we denote by 
0. The universal language ,4* is the collection all possible 
strings from A , i.e., .4* := {ct = r*irv 2 • • : <\ k E A 1 < 

k < ii, n > 0}. Any subset C C A*, with 0 E £, is called 
a language. Given two string cr,p E A*, we say that p is a 
prefix of <r, denoted by rr > p, if there is another string p such 
that a = pp (// concatenated to p). The score of a string a , 
denoted by [rr], is the vector x E Z+ whose /lh component 
,f, counts the number of occurrences of Hn a, 1 < / < rn. 
The event i E A is enabled for the string <r E £, iff m E £, 
i.e., the event i can occur after the string rr has occurred. Let 
the event list for the string o E £ , <f(rr), be the set of events 
enabled for the string rr. 

Definition 2.1; A language £ is said to be 

1) prefix-closed iff a E £, p < a => p E £. 

2) permulablc iff a, p E £, [p] = [a] =* f (p) = £(cr). 

3) noninterruptive iff i\ E £(rr), rrp E £, [//]„ = 0 => 

(* E <f(rr//). (Here, [p] a denotes the ath component of 

( 4 ) 

4) an antimatroid with repetition ill it is prefix-dosed, 
permutable. and noninterruptive. 

Let A denote the set of all antimatroids with repetition, 
i.e., A := {£ C A* : £ is prefix-closed, permutable, and 
noninterrupuve} (see [19] or 116]). 

The subset S(£) of Z™ given by 5 - {[rr] : a E £} 
is called the score space of £. As 0 E £, we always have 
0 E S(£). Given any subset T of Z+\ with 0 6 T, we can 
recursively define a language M(T), as follows: 

• 0 E M. 

• rr E M and [rr] + e" E T => cra € M. 

Here, e a is the unit vector in the rdh direction. T is called 
the constraint set for the language M(T). This is equivalent 
to saying that a string rr belongs to M, iff the scores of all 
its prefixes lie in T. 

Definition 2.2: A subset T of Z"‘ is said to be 

1) maA-closed iff x,y E T and z - x V y := (j-\ V 
//I,---, r m v //,„) then z E T. 

2) accessible iff for each x E T, x ± 0, there is some 
) = i(x ), 1 < / < in , such that (x - e l ) E T. 

Let (-) :=■ (7 C Z+ : T is accessible and max-closed}. 

Given a set T C Z+, we can construct a language M(T) C 
A*, and its score space S(M(T)) C V ". Similarly, starting 
with a language £ C A* we can construct the score space 
5(£) C Z™, and from that another language A4(5(£)) C A*. 
Next, we examine the relationship between the various sets and 
languages constructed through the above operations. 

I^emma2.3: 1)1 f T E C"), then M(T) E A and 
S(M(T)) = 7. 

2) If £ E A, then $(£) E « and M(S(£)) = £. 

Thus, there is a one to one correspondence between 0 and A. 
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Proof: l) That M(T) is prefix-closed and permutable 
follows from its construction. Its noninteruptiveness can be 
easily deduced from the max-closure of 7. Again, by con¬ 
struction, S(M(T)) C 7. Conversely, observe that for any 
point x € 7 that is accessible from the origin, there is at 
least one string a £ M(T) such that [a] = x. Therefore 
S[M{T)) D 7. 2) As £ is prefix-closed, <S(£) is accessible. 
The max-closure of S(£) follows from the pemiutability 
and noninteruptiveness of £, as shown in [16, Section 2.3]. 
Alternatively, let x £ 1+ be a minimal vector and a£ £ 
be two minimal strings, such that x -- [a] V [i^], but there exists 
no string p such that a/i £ £ and x = [ait]. Without loss of 
generality (WLOG), assume that a = cvi ■ nt p ^ </>. Note 
that the minimality of a forces 

> H ( 2 . 1 ) 

Let a' := rv A By the minimality of x there exists 

a string // such that aft' £ £ and [rr'///] = [a'] V [//] 
x - e 0/ ’. Note that (2.1) implies that [//,'] (Vp = 0. By the 
noninteruptiveness of £, o'a p £ £ =» a'fi f n p £ £. Next, 
compare a and rr'///, and apply (2.1) and noninteruptiveness to 
deduce that aft' — a f a p (£ £ £. Finally, conclude that [<r///] — 
x , which contradicts the earlier assumption. Consequently, 
S(£) is max-closed. 

Observe that as £ £ A, a string a £ £ iff the score of all 
its prefixes lie in S{£). It is easy to verify this of A4(«5(£)) 
too, from its construction. Hence £ — „M(S(£)). □ 

Remark 2.4: In fact, we can find a set A' of prefix-closed 
and permutable languages satisfying a weaker condition than 
noninterruption such that there is a one-to-one correspondence 
between this set and the set 0' of all accessible subsets of Z"\ 
We shall not examine this more general relation further, as the 
main concavity results hold only for the classes A and (-). 

Lemma 2.5: For any 7 C Z+, let 5 be its maximal 
accessible subset. Then, 

1) M(T) = M(S), and consequently S(M(T)) = 
S(M(S)) = S. 

2) If 7 is max-closed then so is S. 

Proof: Assume WLOG that 0 6 7, as 5 is empty 
otherwise. I ) S C 7 implies that A4(S) C M (7). By 
construction, the scores of all prefixes of any string in M(T) 
lie in 5. Consequently, A4(c?) D M(T). 2) Let U — \x £ 
Z™ : x = y V z; r y,z £ S,x & 5}. Let u be some minimal 
clement 1 of U. Let (v,w) be a minimal ordered pair such that 
v,w £ S and u — v V w. Automatically, u € 7. By the 
accessibility of 5, there is some /, 1 < i < rn , such that 
v = v - e! £ S. By the minimality of (v,w) and u, if we 
define u := v V w, then u = (u - e 1 ) £ S. This implies that 
u is in the maximal accessible component 5, a contradiction 
that establishes that U is empty. □ 

Assumption 2,6: In this paper, we shall focus exclusively 
on DES A(£)whose language £ is generated from a max- 
closed constraint set 7 C Z"\ i.e., £ = M(fT). In view of 
Lemmas 2.3 and 2.5, it follows that £ £ A and that S(C) is 
the maximal accessible subset of 7. 

1 All operators on ordered tuples urc taken componentwise. 


B. Synchronous Discrete-Event Systems 

A SDES, A(£), specified by a language £, is ;i r^ ]v . 
that takes a sequence of clock times v ( ;. V x 

{«<}.£ i) to an event counting 
{D(t,u) = (Di(t,w), •••', D m (t,u)) : /; > ()}. We simwiinvs 
suppress dependence on u) for ease of notation. The n, rc -, sc 
mechanism by which D can be obtained from uj is described 
below. First, let {DJ(0 : t > 0} be the master couming 
process for event i, defined by 


T*( := "> 0 . 

Jt=l 

D*(t) niaxfu > 0 : T*(n) < /}, t > 0. (2.2) 

Take /. 0 = 0. Let t n be the nth jump time and o* the 
corresponding event type (direction of jump) of the process 
{/;*(/) = : /■ > o}. Call a 4 " = 

a*a* 2 ■ • ■ (*n master string at time t n induced by u;. The 
string a n £ £, defined inductively below, is said to be the 
realized string in the SDES A at / T) , n = 0,1. • ■ * 

a {) ~ (j) 

if < +l *£(*"). 

<+!• if«r,+iGf(ir"). 

The event counting process l)(t) is given by 

D(l) = [<7 n ] for t n < t < /: n+1 . 

Note that if £ — A m , then a 11 = rr* 7 ' and D(t.) - D*(t). The 
synchronization mechanism described here for obtaining the 
process {D(t.) : t > 0} is identical to the one described by 
Glasserman and Yao in [16], 

Let {A p = A(£ p ) : p ~ 0 , l, ■ ■ ■ ,7 }, r > 1, be a family 
of SDES defined on a common alphabet .4. Let M r be the 
unit simplex in IR r . i.e., 

r 

M r :={(q 1 ■ • ■ •. q r ) € IR*': = 1. q v > 0. />=].• 

(2.3) 

Let q £ M 1 . In this paper, we investigate sufficient conditions 
for the event counting process of one system to dominate the q- 
convex combination of the event counting processes of several 
others, i.e., for the relation 



P~ 1 


V clock sequences ur 
and times / > 0 (2.4) 


to hold, in terms of conditions on the corresponding score 
spaces, {$ p = S(£ p ) : p = 0,1, ■,/ }. 


III. Score Space Concavity 

Our point of departure is a result by Glasserman and 
Yao [16] which establishes that if the characteristic function 
(defined below) of one system dominates the “average” of the 
characteristic functions of two other systems, then the event 
counting processes of the corresponding systems satisfy the 
same relation (2.4). Below, we restate a generalized version 
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of their theorem. Even though the proof in fl6J continues to 
hold for the following theorem with only minor modifications, 
we reproduce it here for the sake of completeness. 

Defininition 3.1: Given a language C E K define its char¬ 
acteristic function x = (Xi«• ■ ■, \m) : ${C) —> Z™ by 
X*(ae) = Xi + l{i€f(*)>. 1 < i < m, where 1^ is the indicator 
function of the set A. Note that if C has been derived from a 
set T C Z+, then x( x ) = max {a; + e p E T : 1 < p < m}, 
where the maximum is taken componentwise, as usual. 

Theorem 32: Let {A* : 0 < p < r}, be a family of 
SDES with corresponding score spaces : 0 < p < r}, 
characteristic functions : 0 < p < ?•}, and event counting 
processes {D p : 0 < p < r}. Let q E M r be such that 


3) »° > x° = |-£;=1 qp*r 1, W° e5°, xP e 5 P ,P = 

=* x°(»°) > £p=i q P x p {x p )- 

Then, (I) <=> (2) -«► (3). 

Proof: (1) (2) Fix v, 1 < ? < m. For 1 < p < r t 

define E Z!£ by 

j- p •= (*'’)• if j = '■ 

’ \.rj, otherwise. 

Note that x v E 5^ for 1 < p < r, by definition of the 
characteristic function. Next by Assertion (1), x° € 5° and 


±° 


T.1^ 


1 


€5°. 


r 

x° > ]p 9 ,,x , \x f ’ e = 0,1, ■•■,r 

P=1 

(3.0 

P =i 

Then 


F=1 


V clock sequences w, 
and times t > 0. (3.2) 


Paw/; The proof is by an induction on the successive 
jump times [t n }^= i of the master counting process \D*(I) : 
/ > 0} (see discussion around (2.2) for notation used in this 
proof). The induction hypothesis is that (3.2) holds for tj , 
0 < j < k. This is trivially true for k - 0. Next, let / := (\* k 
be the event that occurs in the master counting process at 
4+i. Note that 


D p (1 k+ 1 ) 


for i? I, 
\xWy(h)) for > = l. 


In either event the induction step follows easily. □ 

We now show that the dominance of one characteristic 
function over a convex combination of others is equivalent to 
a similar relationship among the corresponding score spaces as 
subsets of Z™. This makes the above result more transparent 
and easier to apply, as we shall show in Section V. 

For any q E M r we denote 


Y, ( b sr 



: x v E S p , for 1 < p < r 


where \x] denotes the ceiling function applied component¬ 
wise, for x E IR+. 

Theorem 3 J: Let {A ;> : 0 < p < r}, be a family of 
SDES with corresponding score spaces {<S P : 0 < p < r) 
and characteristic functions {x p ; 0 < p < r}. Let q E M r . 
Consider the following assertions: 


£/j=i ipS 1 ' 

C 5°, i.c., x" = 

Ep=i <iv xP 

SP,p= 

, r => x° 6 S 

) 


2) x 11 = f£^ =1 (/pX p ], xP € SP,p = 0, l,--,r =>• 

*V) > £; =] qpX p {* p )- 


Clearly, x° < x° < x° + c'. From the definition of the 
characteristic function, x°(x°) > ® u . Therefore 


(*?(*“)) >■*? = 




;»=J 


I < i < m. 


Since i was arbitrary, it follows that x°(* n ) > 
q v X v ( xP )* and consequently Assertion (2) holds. 

* r -i 


(2) =» (1) Assume that 


Ylj ,=i { h 


S v 


\ s° is nonempty 


and let x be some minimal clement of this set. Lei 


(cr 1 , a 2 , • • •, a 1 ) E C l X C 1 X ■ ■ x C' be a minimal ordered r- 
tuple of strings (in the prefix-ordering taken componentwise), 
such that 


x ~ 


iz «pK] • 

F=1 


Let us write rr J> as i\ , where t p G A, J < j < s }) , 
1 < p < r. We allow ,s ? , = 0 if n v =: 0. As 0 G 5°, 
WLOG, wc may lake r/i > 0, Mi > 1 and = 1. Define 
t= and let 


y := 


r — 1 




/'=2 


By minimality of x and (rr ] , a 2 . • ■ •, o '), wc have y = x - c ] 
and y E S {i . Since x £ 5°, it follows that x?(y) = /yi ■ 
Moreover, xJ([/a j ]) = + 1 = [rr 1 ]!. From this we have 

m i 

7i vKfp 1 ]) + 5^ 7;.A i(Kl) > X! ( lp[ nf ‘h 

p-2 p~\ 


= ^1 
> yi 
= x?(w) 

which contradicts (2). Hence the result. 

(1) Sr (2) => (3) This follows directly from the noninterup- 
tiveness condition, which implies the monotonicity of the 
characteristic function, i.e., 

»°.x°6 5°,tf°>x 0 ^xV)>xV)- 


(See [16, Theorem 2.3J for more details). 

(3) => (2) Obvious. □ 

In light of Theorem 3.2, any of the above equivalent 
conditions implies (3.2). 
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Remark 3.4: Letting r = 1 in Theorem 3.3, we see that the 
event counting processes of SDES are monotone as functions 
of the score spaces. This result is the monotonicity of the event 
counting processes across schemes proved by Glasserman and 
Yao for synchronized GSMS 120, Theorem 6.4]. For the case 
of r > 2, the above theorem provides a score space concavity 
result analogous to the score space monotonicity result. 

In the lemma below we show that it suffices to check 
Assertion 1 of the above theorem, with the constraint set T p 
instead of the score space S r \ where T p C. Z” 1 generates the 
SDES A p . 

Lemma 3.5: Let {T v : 0 < p < r) be subsets of T" and let 
{ 5 /J : 0 < p < 7 } be their corresponding maximal accessible 
subsets. Then 


Thereom 4.1: Let {A ;j : 0 < p < ?•} bo a r arm ) ... , 

with corresponding score spaces ) S p . 0 < fl 
characteristic functions {x v : 0 < p < r j. 1.01 ^ 
Consider the following assertions: 


>) * w = I E)l=' <b z ‘‘ 

x* e M. 

2) i° = [e!,=i <4v xV 


(x ] . ■ ■ ■ ,x r ) t .S' 


5 u ’-’ r 
3) jf° > x° = 


*° 6 5°. (x J . 

=* x°(*°) > E;,=] Vpx" 1* p 1 

r -1 


IZ.J <lv* P 


y {) £ S l \ 


. X j *. 




5 1 ' 2 • ^ *V) > hWi 

4) => x" > q„x'’ 


7° D 


E"” T " 


S° D 


Z«» s " 

,1=1 


Then, (!)<=> (2) (3) => (4). 

Proof: (T) => (2) Let (x 1 , • ■ ■ ,x r ) e .S 12 By As- 

6 5", hix /., 1 < /. < m. For 


,0 _ 


Proof: Assume that x r £ S r \ x [ 


componentwise minimal tuple satisfying this. Without loss of 


generality, assume that x 1 - e 1 £ S 1 . Let y {) 


in(x ] - 


,;1 ) + EL 2 'if* 1 ' 


Jl ... 


. By the minimality of (x .x 1 , • • ■ ,x ; 


y " = x" -- e 1 € 5°. By accessibility, x° € 5°, which 
contradicts our earlier assumption. Hence the result. □ 


sertion (I), x" := £p=, q v x v 


if 3 = i. 

rf, otherwise. 

By the definition of the characteristic functions, it is easy to 
sec that (xV • ■ £ 5 1,2 ' '. Therefore by Assertion (1) 


e;=i 

<b xF 

£ 

;\ 1 ' := { 


) are 

a 

2 l 


Z** 

i'=i 


e 


(4-1) 


Clearly, x° < x° < x° + e\ Again, from the definition of the 
characteristic function X "(x") > *°. which implies that 


IV. COEVAUTY 

Theorems 3.2 and 3.3 require unnecessarily strong con¬ 
ditions on the score space to obtain the dominance of one 
event counting process over the convex combination of several 
others. There, we require (3.1) of Theorem 3.2 to hold for 
all choices of {x p £ S p : 0 < p < r}. Note, however, 
that we really need to verify that condition only for those 
\x p £ S p : 0 < p < / } that can occur simultaneously. In 
this section we introduce the notion of coevality to exploit 
this basic idea and to obtain weaker conditions for one event 
counting process to dominate a convex combination of others. 

Let { A p :()</;< r} be a family of SDES. We say that 
(x (l , x 1 , ■ ■ ■, x r ) -is coeval, iff there exists a clock sequence 
u) and a lime / > 0, such that the string <r p realized in 
at time t has score [cr p ] = x p (See Section II-B). Then, rT*, 
the master string at time /, is called an enabling string of 
(x () ,x\ ■ ■ • ,x r ). Let 

:= {(x 0 ,* 1 ,- 

€ 5° x 5 1 x • ■ ■ x S r : (x 0 .* 1 , • • • ,x r ) is coeval) 


and let 

5 1 ' 2 ’ {(x 1 v,x’-) 

E S l x •■ • x S r : (x 1 , • ■ • ,x’’) is coeval). 


x!V) > ■ (4 ' 2) 

F=l 

Since / was arbitrary, it follows that x'V") > 

S;i=i ( lpX v i xV )’> an d consequently Assertion (2) holds. 


(2) =» (1) Let U = {(x 1 , • ■ ■,x r ) £ S h2 '-' r : 


l { lv 


& 5°} be nonempty, and let (x 1 , • • ■. x r ) be some (com¬ 
ponentwise) minimal element of W, and s = ojo? ■ • ■ n* be 
some (prefixwise) minimal enabling string of (x 1 . ■ • • .x'). Let 




= be the enabling string of (x L ■ • ■ .i r ), 

for x p £ S p .“ 

First, note that (x 1 ,*--,® 1 *) £ S 1 ' 2 ' ' r , and = 

for all p. Further, the minimality of (x 1 ,-**,x r ) £ U 
and implies that 


x° := 


(4.3) 


( iv xp — 5Z ( h ,xi I 

i>=i 

and that x° £ 5 U . Using the fact that x° -f we 

conclude that 




v -1 

r 


by (4.3) 


Z ( i '’) 

,.=i 
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which contradicts Assertion (2). Therefore U is empty and 
Assertion (1) holds. 

(1)&(2) =» (3) This follows directly from the noninterup- 
tiveness condition. 

(3) =» (2) Obvious. 

(3) s» (4) Assertion (4) is a restatement of (3.2) of Theorem 
3.2. The proof ts very similar to that of Theorem 3.2. □ 

The simplest situation in which coevality proves itself 
strictly more general than the score space result (Theorem 
3.3) is when the DES A has a nonconvex score-space, S , 
i.e., when the ceiling of a convex combination of scores 
from S falls outside S. This happens, for example, when 
S s= {a? G 1\ : ,r 2 < ^.ri}. Then, x — (0,0), y = (2,1) G S, 
but z = |"(x + y)/ 2] = (1,1) £ S. Since 5 0 (S + S)/ 2, 
it is not possible to use the score space result to prove the 
obvious result that D(t) > |( D(t ) + D(t)). But in this case, 
the coeval sets are easy to characterize, and therefore we can 
show from Theorem 4.1 that relation (3.2) holds among the 
event counting processes. 

It is not usually necessary to explicitly characterize coeval 
sets across different systems. Often, Theorem 3.3 is inapplica¬ 
ble because of the existence of tuples (a: 1 , ■ ■ ,x'), x v G S p , 
p = 1, ■ • •,r, whose convex combination falls outside 5°. If 
it can be shown that these tuples are noncoeval, then Theorem 
4.1 can be used to prove that one event counting process 
dominates the convex combination of others. 

V. Applications To Constrained SDES 

In Section II, we explained how SDES may be generated 
from constraint sets T C Z" 1 . Typically, the constraint sets 
that we shall examine will have the form 

T = {x 6 Z;':x </(*)} 

for some increasing constraint function, / : I 7 " —► Z+. Such 
constrained SDES arise naturally in many queueing networks, 
as may be seen from the examples to follow. In Section V-A, 
we introduce min-linearly constrained SDES and show that 
the structure of the event counting process of such systems 
relates naturally to the structure of their constraining family 
of functions. Subsequently, in Section V-B, we extend this to 
generalized min-linearly constrained SDES. 

A. Min-Linearly Constrained SDES 

Let 


A := {a,;*. : I < i,j < rn, 1 < k < 7/}, 

B := { bji, : 1 < j < w, 1 < A’ < n } 

be two parameter sets of nonnegative real numbers. Consider 
the set T C Z+ defined by a min-linear constraint function 
based on the above parameter sets 


transition I 



( Place OW 

A % 


transition I 1 


, ls 


I'jk 



transition | 


Fig. 1. Portion of a forward conflict-free Petri-net 


It is easy to see that 0 G T and that T is max-closed. 
Consequently, by Lemmas 2.3 and 2.5, its maximal accessible 
component S(A,B) can be used to define a prefix-closed, 
permutable and noninterruptive language C(A,B). The SDES 
A(A,B) defined by C{A n B) is said to be min-lincarly con¬ 
strained. 

Remark 5.1: When n = t n, a, ,k = 1 if./ ^ i - A-, 
I < L./\ k < m, and a lf k = 0 otherwise, the set 7 C Z"' 
is given by 

T(A,B) = {xeF? if, <n+h jk A <j,A-<ih, ;**■}. 

(5.2) 

This corresponds to the class of all conflict-free Petri nets 2 
(also called decision-free), i.e.. Petri nets where each place is 
preceded and followed by exactly one transition. Baccelli and 
Liu [2] have shown the concavity ot the throughput of such 
Petri nets in the initial marking. Several important classes of 
networks for which concavity results are already known, such 
as tandems and cycles of queues with general blocking and 
starvation [ 21 ], |13|, and fork-join networks [ 21 ], are special 
cases of the above. In such networks, the constraint in ( 5 . 2 ) 
can be interpreted as restricting the number of jobs processed 
by server j from exceeding the number of jobs processed by 
any other server k , by more than a fixed quantity b^. 

The class of min-linearly constrained SDES includes the 
much larger class of structurally forward conflict-free Petri 
nets, i.e., Petri nets with at most one transition following each 
place, but with no restrictions on the number of transitions 
that may precede a place (see 117J). Thus, in such a Petri- 
net, a place preceding a transition j cannot precede another 
transition i ± j. WLOG, we assume that exactly ti places 
precede each transition /, and denote each such place by the 
pair (j , A), k — 1, ■■ •, n, where j = 1, ■ . m are the possible 

transitions. Let there be rrijk tokens present initially in place 
(7 A ). We assume that the firing of transition (occurrence of 
event) i deposits tokens in place ( /, A ), and that transition 
j requires r t * tokens from the place ( 7 , A ) to fire. Thus, when 
the score is x = (.i’i, • • - ,.r 71l ), i.e., transition t has fired .r 7 
times, / = 1. •, w, then the transition j is enabled iff 


(■ r 1 + 1 ) 


r r 

< min < 

-^ L l=l 


, I + tll,K 


Am 


T(A.B) = 

m v 

* e Z’| l : X, < a " kJ ' + 1 ^ j ^ m } 

(5.1) 


It is easy to see that the constraint set of this system is 
precisely T(A, B) given by (5.1) with a tJ k = and 

l>,k = Wjk/'jk- 

2 Petri nets are 4 special class of discrete-event systems, with transitions in 
Petri nets corresponding to events in SDES. 
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In the next theorem, we apply Theorem 3 1 to investigate 
the effect ot varying the parameter set B (with a fixed A) on 
the event counting process D ot the min-hncarly constrained 
SDES A(j4 B) For this we need to define the slack s (q) 
associated with any q t M 1 as 


*(1) f 


11 

p=i 



e z+ p= l 



In the following theorem we give an explicit lormula for the 
slack 

Pwposition 5 2 Let q = (qi q,) 6 A/' 

1) It q is rational, i e , it q p - ^ wheic and b { art 
co-prime integers for p — J r then 


s(g) = 1 - 


_I 

I c m (/)] 


*,) 


Here, 1 t m (/q b 1 ) denotes the least common mul 
tiple of b 1 

2) If q is irrational, i e r/ 7 is irrational for some p — 
J t then s (q) = 1 

P/oof See Appendix □ 

Thcotem 5 ? Let {A(A B 1 ) 0 < p < t} be a family 

of mm linearly constrained SDFS with corresponding score 
spaces {S ? <> />< '} and depaiture processes {D f 0 < 

p ^ t } Lei q e A/' be such lhal lor all 1 ^ m 1 * 

/ < n 

i 

h<) ,k > + <s 4) 


Then S" D 




and consequently 


w) > ^ q) D 1 {f u) V clock sequences u; 

and times / > 0 


(5 5) 


In addition ll {o,jK 1 < / < m} arc all nonnegative inlegeis 
for some (y A) 1 $/</// 1 < A n then foi those (y / ) 
(5 4) can be replaced by the weaker condition 


/ = t 

oof In hght ol Theorem 1 
enough to show that T(A B°) D 


(5 6) 


3 and l^emma 1 5, it is 




Let 


£ 7(A /J p ) p — 1 / and dehne y° = 

Then for any / 1 < y < m 


e; 


</: - 


,.=i 

a 

S'* IS 


P =i 


< nun 

l<k<n 


L /-1 


S^ 

;>=1 


(5 7) 


Praprocesaing 


Supply o f 
raw pahs 


Station 1 


^ I lo 4 • 


Legend || * raw job ^ a preprocessed joh ^ ft hoi 
tig 2 Closed welding type join nttwork 


Moreovei, for any A, 1 < A. <" m 


- i ill -.r m 

S'//S"" i, ^ +/ ''* ^S'/' S" 




'/ +//, 


;-l s = i 


I s (9) 


p 1 h — 1 
m 1 

~S"'j < S'/''/NS'/' / ''/* m«) 
<-> ; 1 (1 
III 

- S"•»* < h v 4) 8) 

/ l 


Thus j/ 11 fT(/l B°) 

If {u,,* 1 ^ / £ m} aie all nonnegativc integeis foi some 

A 1 < A < //, then we have 


r » i rn 


r HI I 


S'/'IS""*'/'' +/ ’/' - S""*S'/''/! + S '/' v ii 

J i-l i 1 /-I 


-1 h-1 


til r 1 


-S""* S'/-'/' + S'* 6 ** 

-J /I '( 1 

in 

<Yl (1 '> k ^4^'* (by (5 6)) 

-i 


In this case too # y n f z T(A B {] ) □ 

Note lhal most ot the concavity results obtained so far in the 
literature foi specific SDES fall within the scope of Remark 
5 I and condition (5 6) of Theorem 5 } We now present an 
example to illustrate how Theorem 5 1 can be used to obtain 
ncai concavity results for more complex systems 
E\ampli 5 4 (Weldim> Station With Constant Wot A hi 
Ptoyess (WIP)) Consider the two-station cycle shown in 
Tig 2 This models a production line with constant WIP 
where each pair of successive jobs that depait from station 1 
is welded together at station 2 As soon as a |ob leaves station 
2 two parts enter station 1 simultaneously We assume that 
station / / - 1 2, has a buffer ol size A, with 0 < J < A, 
jobs initially Server t is blocked (communication blocking) if 
the buffer downstream is full Lei D^t) denote the number ot 
jobs that complete service at station / upto time t tor / - 1 2 
Note that at any time t t D\(t) cannot exceed twice 
by more than the number of jobs initially piesent at station 

1 or the number of empty spaces initially present at station 

2 Similarly, £>2(0 cannot exceed one halt D\{i) by more 
than the number of jobs initially present at station 2 or the 
number of empty spaces initially present at station 1 From 
this, we can see that the constraint set T that specifics the 
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SDES corresponding to this network is given by 
T = 6 “l\ : < 2r2 + ((A*2 - ^ 2 ) A Ji) 

^<^j + ^((A j -./ 1 )A.7 2 )J. 

This is a min-linearly constrained SDES with appropriately 
defined parameter sets A and B. 

Let {A(j4, B p ) : p = 0,1, ■ ■ •, r} be a synchronized family 
of such systems. Then the conditions (5.4M5.6) are satisfied 
with q € Af r , if 

- r 

*2-4 > 5>(*£-4) 

r=i 

5(*? -4>> -(«>. 

4> Ixvrl 

/'=! 

}4>£ ^4+ •’(«)• 

P=1 

We may deduce the near-concavity of the throughput jointly 
in the number of jobs and in the number of empty spaces. 

Remark 5.5: The slack s(q) obtained through Lemma 5.3 
cannot be lightened in general, in the sense that there do exist 
families of systems where that slack is the best we can do. 
For instance, take r = 2, J® = 1, J* = 0 and J'f = 2, and 
J 2 = In this case no departures take place from 

station 2 of the systems A° and A 1 , while departures do occur 
in the system A 2 . Consequently, the D l) (t) does not dominate 
a g-convex combination of the other two, with q = (|. \) 
(say). On the other hand, for certain families of mm-linearly 
constrained SDES the slack can indeed be tightened by more 
careful analysis. 

B . Generalized Min-Linearly Constrained SDES 
Let the parameter set A be as before, and let 

1 = {Ijk :R-f = 1, ■ ■, rn} 

be a set of nondecreasing switching functions. Consider the 
set X C Z+ defined by a generalized min-linear constraint 
function based on the parameter A and the switching function 
I as follows 

T(A,l)-.= 

z6Z+ :ir, < 1 < J < mj- 

This represents an extension of min-linearly constrained SDES 
model, as the constraint °ij^t + bjk has been replaced 
by Ijk rt > 7 fc-r,), where I is a nondecreasing function. 
A SDES which is defined from the maximal accessible subset 
of the set T(A,I) is said to be a generalized min-linearly 
constrained SDES. This allows us us to model a bigger class of 
Petri nets — those for which structural consumption conflicts 


are resolved through a predefined “switching'’ mechanism 
(see [18]). 

We have the following near-concavity result for generalized 
min-linearly constrained SDES. 

Theorem 5.6: Let {A(A,/ 7 ') :()</>< r}be a family of 
generalized min-linearly constrained SDES with correspond¬ 
ing score spaces {5 /J : 0 < p < r}, and departure processes 
{ D p : 0 < p < r}. Let q £ Al 1 be such that for all 

x° = ( b xP * xV € S v , 1 < p < ?, and for all 

1 < j < m, 1 < A- < // 

m p j m -i 

> I>L4i(5>^)J • < 5 - 9 > 

i=i //=i *=i 


Then 5° D | [ q t ,S p , and consequently, 

i 

D ( \Luj) > ^^q p D p (l. uj). V clock sequences u 

p-i 

and times t > 0. (5.10) 
Proof Proceeds on the same lines as Theorem 5.3. Let 
y p e T (A. I v ) % p = J, ■ ■ • , r and define y {) = q p y p . 

Then for any /, I < j < rn 


■j= E 






- 1 p=[ L V~1 


Thus, y u G T(A.I U ). □ 

Note that the above result strictly generalizes Theorem 5.3. 
The following lemma can be used to deduce the near-concavity 
of Petri nets with conflict resolution in the initial markings. 
Lemma 5 7* Suppose that for some j, A* 

':»(•) = u w+ b ^ 

for some nondecreasing function H p k (-), and that there exists 
a constant r jk such that 

r 

i>u jk > Yl + *(«) + f i* > 

p^i 

and that for all 2 °, z 1 , ■ ■ •, z’ e 1R + , with z° > ( q p z p , 

r 

lI%(z°) + r jh >J2^ , l \(~ P )- (5.1D 

P =l 

Then the {F h : p = 0,1, ■ • ■, r} satisfy (5.9). 
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Station 2 

hb« 


swion i ^—i j in 

_ j jobs \ Station 4 

ABES* 


\ ip \ Closed network with splitting and mcigtng 


Station 2 



Jojoba 


f ig 4 Open nelwoik with splitting and meigintr 


Pwof Lei i 1 ' £ S '. I < /> i i, and i° — 
Thcn 

ft (£«.,*= + C 

/ ' ,n \ 

> 11 p (E 7 ' 

\-l /l 7 


■+ E 'll + '(9) (• 

/-l 

^E^^fE" 1 ^'') 

/ =1 \ J 7 

I 

+ E ( h ''/* + 

1 1 

E'/, 


□ 

/* u//?i/>/c * (( /rmd Network u ith Splitting and Men \>ini>) 

Consider the closed network shown in Fig T The /th job 
to depart station 1 is sent to station 77(f) 6 {2 where 
//( ) is some routing (splitting) mechanism Define 7/,(r) = 
i J { a/(i/) j} / ~ 2 J The output streams horn these two 
stations arc merged into station 4 from where they cycle back 
to station I 

We assume that there is no blocking, l e all buffers have 
unlimited capacities Let J - (J\ /> h h) be the initial 
configuration of jobs in the system The constraint set T that 
speuhes the SDLS corresponding to this network is given by 

r = {Tel\ i j £ 11 + J\ 

r 2 £ II >( 1 1 ) + h 
1 j £ W } (/i) T Ii 
/1 < 12 + / i 4 /1} 


M 1 be such that 



))=] 


i 

/” -^2'h ,J j +'(<!) + <-, t ai i 

v~\ 


i n> 


Then, (5 11) is satisfied with (, - i m} + a<, for y - 2 f as 
ioi dll J» . ' € K+ with ?" > E; = i'/, ' 

Wj( °)+L, + « f / > /^"+«M/ 

r 

> E ij 1 f ° f /) 

/.-I 

I 

> E r/ j // ' (v ' ) 

/ 1 

Thus from Umma S 7 and Theorem 5 6 wc may deduce 
that the corresponding depailure piocesses satisfy D > 

For the case of round-robin touting, the muling functions 
as defined at the beginning of this example are given by 
IIA r) = \i/2], and //,(r) - |_»/2j Note that the functions 
U 2 (i) = i jl + 1 / 2 , and //,(i) = //2 also lead to the same 
constraint set T Furthci this latter choice ol functions gives 
us a stiongcr result as (S 12 ) is satished with 

/y,=/< 2 =I - <2 -a( 2 =- and - r, =-or i - 0 

z z 

L \ample 5 9 (Open Network with S plittwi* and Mu tpng) 
Consider the network shown in Fig 4 that was used in the 
previous example, with the modification that thcie is an 
unlimited supply of jobs at station lie J[ ~ ^ This 
corresponds to an open network where the first server models 
the exogenous arrival process The initial configuration of jobs 
in this open network is given by J =■ (h h h) anc * l h e 
constraint set is defined by 


Let the routing mechanism satisfy 

flji -t t < ll,(i) < i € Z+ ) - 2 J (S 12) 

This is a generalt 7 ed mm-linearly constrained SDES with an 
appropnately dchned parameter set A and switching mech¬ 
anism 7 (/j(/) = H,{r) -f J n i t Z +i j ~ 2 3) Let 
{A(J P ) 0 <//</} be a synchronized family of such 
systems, parameterized by the initial configurations Ixt q € 


T - 6 1\ i2< 7/ 2 (/i) + h 

M £ 7/j(/i)-t- /t 

m£ ; i + A | 

Now consider a synchronized family fA(./ /) ) /> = 0 1 

r} of such systems parameterized by their initial conhg 
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urations. Let q £ M r be such that 3 

■/°> (5.M) 

p-1 


It is easy to see that whenever (x 1 , • ■ ■ ,x r ) are coeval, and 
x° := rSp=.i then J - ? = J"i = ■ • ■ = ■ Moreover, 

for j = 2.3 


«) + -0 


= H l (x° l ) + 

< Hjtf) + . 


E"» j : 


p~ i 
/?. 


/>=i 

X>(* 

p--j 


Thus, for the (symmetric) system with a symmetric rout¬ 
ing mechanism and identical servers at stations 2 and 3, 
E[N J (t)}< t > 0, is convex and symmetric in (J 2 ,./ 3 ) with 
J 2 + ^3 = const, and J 4 = const.. Thus E[N^(t)], l > 0, is 
increasing in I -/2 - and therefore a more balanced load is 
better than a less balanced one (in the sense of majorization). 
So if at any time one has the option of “jockeying” jobs 
between queues 2 and 3, it is better to transfer jobs from the 
longer queue to the shorter one. 

Appendix 

Proof Of Proposition 5.2 

1) a is rational , i.e., (\ p = p 1 where r /M Aj, E 2 + are co¬ 
prime, f) = 1. ■ ■ ■, r. Let A = l.c.m.jAi, ■ ■, <$ r }. Then, it is 
easy to see that g.c.d. {“S ■ ■ ■, = 1. (If the above g.c.d. 

is 7 > l, then the l.c.m. will be ~). Now consider the set 


It is easy to see that x 4 also satisfies the required condition. 
Thus, Assertion (1) of Theorem 4.1 is satisfied, and hence 

r 

vr > 0. (5.15) 

p~' J 

Moreover, we also have 

D i i(t) = D\{t)= - = D r i (t). V/> 0. (5.16) 

The total number of jobs, N(/), in the system comprising of 
stations 2, 3, and 4. at time i > 0, is given by 

N(t) = h + -h + Ja + Di(t) - D 4 (l). (5.17) 

Now if we assume that J\ 4 ./J 4- ./J = J (const.) for 
p = 0,1. ■ ■ ■, r, then by (5.15), (5.16), and (5.17), we get 

N°( 1 ) = 4 + .7“ 4- .7“ +1^(7) - n° 4 (t) 

r 

<J"+ Ji’ + J 4 V !>?(/)-£ vW 

= E^(' ; 2 + ^ + J 4)+E f 7p^(7)-^^''(7) 

p~l p=l J)= 1 

r 

=E^(o. 

p=i 

This establishes that the number of jobs in the system is convex 
in the initial configuration (J 25 •/.]« J 4 ) under the constraint that 
J 2 4- J* 4* Ja = J (const.). 

The results obtained above arc pathwise. We now consider 
a probabilistic routing mechanism that is symmetric, i.e., 

{/ 2 (j-);.r> 1} = {h(f);jr > j}. (5.18) 

Also assume that the servers at stations 2 and 3 are identical, 
and J\ - ,/|, J\ = J\, ,/■{ =■ J\. Then because of the 
symmetry 

{N'ity,! > 0} = {N 2 {tyi > 0}, (5.19) 

3 Nole that (5.14) is a strictly weaker condition than (5.13) which is whai 
wc would have required if we had not made use of coevality. 




Clearly, D is closed under addition, and g.c.d. {D) = I. Then, 
it follows that D contains all sufficiently large positive integers 
(see [22, Lemma 1 -66]). Thus there exists k\ j 1 , - ■ ■, x f E 
Z 4 , such that 5Z),=j x* ^ ^ 4 1. TTiis implies that 

£p=, p = * + {■ 



v 



(A.l) 


Moreover, for any x 1 . ■ • •, x' E Z + , we have € 

Z+, and hence 



(A.2) 


That .s(rv) = 1 - j s follows by combining (A.l) and (A.2). 

2) a is irrational. Assume, WLOG, that r*i is irrational. It 
suffices to show that 


sup{ [r*i/| - r>ix : .r E Z + } s = 1. (A.3) 

Since rvj is irrational, (A.3) is equivalent to showing that 

inf{<rj.r - : x E Z+.x > 1) =: c = 0. (A.4) 

Let C := {«].!■ - [rrjxJ : x E Z+,.r > 1}. Now assume that 

v > 0. Let l := min{A: : kv > lj. Note that (/ - J)r < 1 < 
Ic < 1 4 c. Let e := l + c - Ic. By the definition of r, we can 
choose a d E (7, such that c<c/<r+y. It is easy to check 
that if deC, then L A,rJ ^J € C for any // E Z+, n > 1. But 

1 < Ic < Id < Ic 4 c = 1 + r\ and thus Id - [ZrZJ — Id - 1 < c. 

This gives us a contradiction, which establishes that c = 0. 

□ 
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Technical Notes and Correspondence 


Regional Pole Placement of Multivariable 
Systems Under Control Structure Constraints 

S. Sathiya Keerthi and Makarand S. Phatak 

Abstract —Many controller realizations are structurally constrained. 
Some typical examples are static output feedback, constant gain feedback 
for multiple operating points of a system, two-controller feedback, and 
decentralized feedback. A general class of problems of regional pole place¬ 
ment of multivariable systems with such control structure constraints is 
Considered and a unified numerical method is given to solve them. First 
a problem in this class is converted to a problem of solving a system of 
equalities and inequalities. This system is then solved by using a modified 
homotopy method. 

I. Introduction 

Consider structurally constrained controllers, such as static output 
feedback, constant gain feedback for multiple operating points of a 
system, two-controller feedback, and decentralized feedback. With 
such constrained controllers it is in general not possible to place 
the eigenvalues arbitrarily in the complex plane. For satisfactory 
dynamical behavior ol a system, it usually suffices to place the 
eigenvalues in some desired stability region, S in the complex 
plane, i.e., to 5-stabili/e the system. With structurally constrained 
controllers 5-stabilization is quite possible. In this paper we consider 
a general class of problems of 5-stabilization (or regional pole 
placement) of multivariable systems with control structure constraints 
and give a numerical method to solve them. This work is a nontrivial 
extension of the authors’ earlier work [6] to the multivariable case. 
While doing the extension the possible overlap between 161 and this 
note is kept at a minimum by omitting the most repetitive details. 

The problems mentioned here have also been considered in many 
earlier papers from which we chpose references somewhat arbitrarily 
and point to [3J for the case of static output feedback, [8| lor the 
case of constant gain feedback for multiple operating points of a 
system, [9] for the case of two-controller feedback, and [5] for the 
case of decentralized feedback. Our method is fundamentally diff erent 
from the above methods and treats different problems under a single 
framework. 

In our method wc formulate (Section II) each of the regional 
eigenvalue placement problems as a problem of solving a system of 
equalities and inequalities, which we solve (Section Ill) by employing 
a modified homotopy framework. An example is presented (Section 
IV) to show the efficacy of our method. 

II. Problem Formulation 

Consider the multivariable dynamical system 

■r(f) = A,i[t) + Bu{1l y(t) = C,r(t) (2.1) 
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October 20, 1993, and March 2, 1994. 
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Fig. 1. An example of a popular stability region S in the complex plane. 

where ,r 6 /?", u € i/ £ and A. D. (' arc constant 
matrices of appropriate dimensions. We will assume the following 
throughout: system (2.1) is controllable and observable, rank B = m 
and rank C' -- jl The feedback we consider is of the form 

u(t) - -K.r(f). ( 2 . 2 ) 

Our objective is to S-stabihzc, i.e., to place the eigenvalues of 
( 4 - BK ) in some desired stability region, S in the complex plane 
under given control structure constraints. S comes from spccilicatmns 
of the desirable dynamical behavior of the closed-loop system. In 
particular we focus our attention on the popular stability region, 5 
shown in Fig. 1. Control structure constraints are imposed by the 
way feedback is realized. For example, ihe in case of static output 
feedback the constraint is that rows of A must belong to the range 
space of rows of C so that stale feedback (2.2) can be realized as static 
output feedback. To solve this constrained problem we formulate a 
system of inequalities and equalities 

r/(r)<() (2.3) 

/#(r) = 0 (2.4) 

where r is some set of intermediate variables. It is worth mentioning 
here that a neat transformation of our control problem to an instance 
of (2.3M2.4) is by no means trivial; so our way of doing it is 
interesting in its own right. Once a r satisfying (2.3M2.4) is found 
the gain matrix required for stabilization can easily be found. In 
Section 11-A wc present details of how r is chosen and how to 
formulate the inequalities. The equalities will be derived in Section 
II-B. 

A. Choiic of Variables and Formation of Inequalities 

Given a controllable pair {A. B) the feedback matrix K which 
gives a specified characteristic polynomial for (.4 - BI\) is not 
unique. A', however, is determined uniquely by a specified matrix 
polynomial P{s) |]]. P{s) is defined by 

P(«) = 7 , («) + n-(*) (2.5) 

where T is a constant (i.e., independent of *) w x n matrix 
P(m) = diag [a" 1 . ■ ■ ■, a" m ]. 


(X)l8-9286/95$04.00 © 1995 IEEE 
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l (w) = block diag[(l s s'' 1 V, , (is s” T1 ~ J ) r ] 

md Hi* i^n« arc the controllability indexes of the pair ( A, B) 
he A which satisfies 

del (*T - 4 + DR ) =- del (/ J ( *)) (2 6) 

is given by 

A = f-'(A+rr> (2 7) 

where the matnces ( (nonsingular and uppei tuangular), k and T 
(nonsingular) depend on ( 4 B) see |l| lor details o( the dcpcn 
dcncy Since the relationship between A and l is afhne as given by 
(2 7) T is an attractive choice toi the set ol variables, i However 
the problem of finding conditions on the elements of F so that the 
set of eigenvalues, \ = A( 1 - ZJA) C S is very hard On the other 
hand suppose we denote 


dei(/ , M) = dcl(/’(s) + n (h)) =_ 



+ a s + h ) ( s 


V) 


(2 8 ) 


Piopoution 2 2 Suppose that Assumption 1 | u 1 1 
is equivalent to the system of h equalities gm 

m[ block del (V(C ) + H (C ))] = () , 

det(/ > (A) + n (A)) = 0 

where 

n =[1«] <'-['! ‘ and 

~f - II 

block del(P(( )) = det(P(s))| 

Proof We can eliminate s in (2 8) by substituting u h> ( 

/ = 1 2 / and \ so that the right-hand side of (2 8) is /uo 
by Cayley Hamilton's theorem Then we get an initnnedian. system 
of equations which is the same as (2 12) except thal the leirn < \ 
is absent This intermediate system of equations contains a total of 
4/ +1 = (n + 21) equations Out of these 21 are redundant ind when 
they aie eliminated we get (2 12) Thus (2 8) implies (2 12) 

Wc now show using Assumption 2 1 that (2 12) implies (2 8) 
Let p (s) be as in Assumption 2 1 If we show that for each 
/ = 1 / + 1 

i> (s) divides det ( P{ s ) + n ( s )) (2 13) 


where I ["/-] integer part of n/2 and the last term (s - A) 
occuis when n is odd Then the following proposition shows that the 
constraint \( 4 — Bk ) C S can he easily reformulated in terms of 
the elements ot - where 

-Ml (2 9) 

Piopoution 2 1 Let S be as shown in Pig I then \( 4 - Dh ) C 
S is equivalent to 

n v — a /2 < nj b < (1 -|- tan tt)(a /iy 


n | -f o | n -f b -> 0 n“ + o_n -p /) > 0 V/ £ {1 /} 


< A < n, (2 10) 

By using the results on lepresentation ol general stability regions 
in terms ot inequalities 1 1 p 298-299] Proposition 2 I can be easily 
proved We omit the details for the sake of brevity 
The use of quadratic factors in (2 8 ) avoids the need to work with 
complex numbers We define the sci of variables i by 

t = (F } ( 211 ) 

It is clear that the elements of i aie not independent The reason 
for choosing such an overdetermined set, <, is given later in Remark 
2 2 In Section Il-B we develop equality constraints which must be 
satisfied by the elements of F and - to ensure (2 6 ) and (2 8 ) I he 
system of inequalities (2 10 ) define 7(1 ) < 0 (truely </(- ) < 0 ) in 
(2 3) 

B lqualities 

We first formulate the equalities which are duecily associated with 
the definition of the set of variables, t in (2 11 ) 

Assumption 2 1 Let p,('t) = s‘ + «,s + />,,/ = 1, and 
y>H 1 ( s ) = s - A For each /. / € {1 / + 1}, t # j P>( •*) and 

Pj(s) are relatively prime 


then we are done because of Assumption 2 I and the tact that the 
degree of det {P[ s) + 1 \ (sj) is equal to n By the second level 
equation in (2 12) it dncctly follows thal (2 13) holds for / — J + 1 
Now lake / £ 11 /[To show (2 13) we need to consider two 
cases ( 1 ) \| = A_ (2) ^ A where \| and \ are the loots of 

s -p a s + b = 0 

In case I, s -f a s -P b = 0 bas a root \i ol multiplicity two 
We have 


■ 1 

0 


1 ■ 

■ 1 

o' 

> 

J 

0 

y 

A, 

1 


and 

block det( P(( ) + H (C )) 

' 1 O' 

" [\i 1. 

[dct(r(\,) + n (A,)> det<7’( \i)) + n <a,i 

0 del l?(\,) + n (A,)) 


From (2 12) and (2 14) it follows that 

del (T( \j) + 1 ) (A,)) = 0 -^-del(7-(\i)) + n (\i) = 0 

(I A1 

and hence (2 13) holds A similar (actually easier) prool holds for 
case 2 This completes the proof ol the proposition ■ 

Rcmai k 2 l Equations in (2 12) are independent I he proof ol this 
is implicit in Lemma 3 2 which is stated and pioved in Section 111 
Runatk2 2 In the single-input case A is dcleimmcd uniquely 
by ^ and can be expressed directly m teims of I he elements of 
* as [using factorization of the scalar polynomial Pi**) in (2 8 )] 
A = 7 ,[fl, — 1 ( 4' t + <h 4 + /*,/)]( 4 — A/), where 7 is as defined in 
111 So (2 12) is not required in the single-input case [61 In the multi 
input case A is determined uniquely by T and cannol be expressed 
directly in terms of the elements of 1 and some other variables 
required for its unique determination This is because ot the lack 
ot a facton/ed parameterization of the matrix polynomial f'M So 
we define the overdetermined set / in (2 11 ) and the constraining 
equations (2 12 ) for the multi-input case 



IEEE TRANSACTIONS ON AUTOMATIC CONTROL. VOL, 40. NO. 2, FEBRUARY 1995 


274 


The block determinant in (2.12) and its partial derivatives with 
respect to the elements of r, which are required in the numerical 
solution described in Section III. can be obtained by using a transfor¬ 
mation of a polynomial matrix to Hcrmite form [2]. This transforma¬ 
tion uses elementary operations to reduce a square polynomial matrix 
to Triangular form. First det (/>(«)) is obtained from the Hermite 

form of P(s) as det ( P{*)) = V* -f ii\ ,s" -J 4-h ii„. Then block 

detP(C) is obtained by evaluating the above polynomial at C, i.e., 
(subscript / in C* n,* and h t is dropped in the following expressions 
for notational simplicity) block det P(C) — det ( P{s ))|,~r = C n + 

/Jj C" ~ 1 H-*f /inl' 2 - Partial derivatives of block determinant with 

respect to elements of T are obtained as follows. From P(s) equal 
to the expression found at the bottom of the page where are 
the elements of F; the partial derivative of block deiP(O) with 
respect to, say, f 1 , is nothing but C ,,1_l [adj P > ( ■•»)] 11 U=r ■. where 
[adj /*(*)] n is the (1. 1) element of the adjoint of P(*). The partial 
derivatives with respect to other ~, J t k s are similarly obtained. Partial 
derivatives of block determinant with respect to elements of C are 
obtained from the partial derivatives of power of C which are given 
below 


ir~T. c 


J -I 


On 


[C')C" 





1 <!•<!/ 


where 



0 0 

II 

S- 

CC| 

0 o' 

0 -1 

dir | 

-1 (J 


Remark 2.3: Assumption 2.1 can be enforced by simple inequali¬ 
ties, using 2 x 2 McDuffc resultants [2|. These inequalities along with 
(2.10) actually define //( r) < 0 in (2.3). In the actual implementation, 
however, wc have preferred to omit these extra inequalities arising 
from enforcing Assumption 2.1. We have also observed, on all the 
examples tried, that this omission does not lead to any difficulties 
in obtaining the final solution. We will discuss more about this in 
Section III (Remark 3.1). 

Remark 2.4: An important observation is worth mentioning here. 
Our algorithm for solving the constrained stabilization problems, 
which will be discussed later, starts from an initial guess point and 
varies a, and h, continuously, while satisfying (2.12), so as to reach 
a solution. Since (2.12) allows a pair of repeated roots, a continuous 
change between a pair of real roots and a pair of complex conjugate 
roots is possible. 

Now we consider the equalities which arise from structural con¬ 
straints on state feedback. By appropriately appending (2.12) to these 
equalities we get (2.4). We demonstrate the setting up of the equalities 
Only for the control structure of decentralized state feedback. Details 
for the other control structures mentioned earlier are straightforward 
and can be found in |fi|. The I\ given by (2.7) is function of T, a pan 
of i\ and so we denote K by K (r). The control structure constraints 
on I\ are treated as constraints on i*. 


Decentralized State Feedback 

Let r be the number of subsystems of a large scale system 
for which a decentralized feedback is to be realized. The state 
feedback gain matrix, K typically takes the block diagonal form: 
I\ = block diag [A'i The control structure constraints 

corresponding to this form can be expressed as 

Ej‘K( c)E“ - 0. i =■ 1. 2. • ■ •. r (2.15) 

where the matrices Ej\ E" are constant matrices of appropriate 
dimensions which select all blocks in the ilh block row except the fth 
one. Sometimes there exists an overlap between the state variables 
of the subsystems. In such a case K cannot be expressed in block 
diagonal form. The structural constraints can still be represented 
as in (2.15), however, with an appropriate choice of E] and 
i = 1. Together (2.15) along with (2.12) define (2.4). 

Remark 2.5. The equations in (2.12) involve characteristic poly¬ 
nomials and determinants, quantities which are known to have 
bad sensitivity properties. In other words, in the presence of finite 
precision arithmetic, the difference between the set of eigenvalues 
of (.4 - BK) and the set of roots of (2.8) can be large even when 
the errors in the satisfaction of (2.12) are small. Therefore, when a 
K is determined using the numerical method proposed in this paper, 
it is necessary to evaluate the eigenvalues ol (.4 - BK) and check 
whether they belong to *S'. If this check fails then it indicates that our 
method has suffered from numerical instability. If computations arc 
done with a high precision this would not happen. See lor instance, 
the example of Section IV. 


Ill. A Modified Homoiopy Mfmiod 


We solve the system of equalities and inequalities obtained in the 
previous section by using a modified homotopy method. Homoiopy 
methods are popularly used to find zeros of a square system ol non 
linear equations. Oui main reason for preferring homotopy methods 
over the usual iterative methods is that it has been observed in practice 
that the domain of attraction of a solution point for iterative methods 
is usually much smaller than that for homotopy methods. A good 
survey of homotopy (or. continuation) methods together with guiding 
references on theory, numerical methods and applications is given in 
[7], We devise a modification of a homotopy method to solve the 
system of equalities (2.4) (which is typically underdetermined) along 
with the inequalities (2.3). 

Let H{t\ /) be a function with the following properties: a) when 
t — 0, the system ol equations H ( t\ 0) = 0, <j{ r) < 0 is trivial in the 
sense that it is easy to find a v satisfying them, and b) when t — 1, 
//(r, 1) = h(v). Thus when / = 0 we know that f is a solution 
to H(t\ 0) = 0 , </(f) < 0. In a homotopy method one starts from 
this known solution (F, 0), and moves on the solution hypersurface 
defined by {(/•,/): H(i\t) ~ 0. f/(r) <().()</< 1} in an 
attempt to reach a solution r*, if it exists. Our choice of the H 
function is 


V(D + (f- l)// l ff) 
/»■-’(r. -) 


(3.1) 


where: the h l function represents the given control structure con¬ 
straint, see for example, (2.15) for the case of decentralized stale 
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feedback; the h 2 function is the left-hand side of ( 2 . 12 ); and T is 
chosen as'explained' below. Recall that v = {F, 7 }. When t ~ 0. 
we obtain a solution v to ff(r, 0) = 0 as follows. Choose a T so 
iat (jil) < 0. Note that it is easy to get such a 7 because of the 
Jecoupledfiature of the inequalities; see (2.10). Once a 7 is chosen, 
choose a T so that /i 2 (F, ?y) = 0 . Again, it is easy to choose such a 
F; for example one possibility is to choose F which corresponds to 
a diagonal matrix P(s). Then, v is nothing but {F. 7}. 

To move on the solution hypersurface from / = 0 to t = 1 we 
devise a numerical method by using an optimization set-up. Here our 
description of the optimization set-up will be very brief; see [ 6 ] for 
its detailed discussion and the algorithm based on it. 

Since f = 1 has to be reached for the homotopy method to be 
successful, and 1 = 1 has to be reached before crossing over to 
t > 1 , (.1 - t) is a good choice for the objective function to be 
minimized. So we define the following optimization problem 

minfl - 1) subject to H{t\ t) = 0, g(v) < 0 . (3.2) 

It is important to note that, with (1 — t) as the cost function, the 
aim is not to solve the optimization problem (3.2). It is formulated 
only to drive the motion from t = 0 to / — 1 . It is easy to maintain 
feasibility with respect to the inequalities </(r) < 0 simply by using 
a barrier method |4j and replacing (3.2) by 

min (T — /) 4 - D{ r) subject to II(v. 1) = 0 (3.3) 

where B{r) is a barrier function which is smooth and which tends 
to x , i.e., establishes a barrier, as r approaches the boundary of 
{r: Hi 1 ') < fl} and a is positive. Problem (3.3) is repeatedly solved 
for a sequence of a values (which monotonically decrease to zero). 
This solution process is terminated as soon as t = 1 is reached and 
there is no need to go for an actual minimization of (3.2). Therefore, 
unlike usual barrier methods, our method requires that the solution 
of (3.3) is repealed only for a few a values. In our numerical 
implementation we have used a logarithmic barrier function [4J. See 
[ 6 ] for the rationale used to choose the decreasing sequence {c*}. In 
all the test examples it was observed that the initial choice n = 0.1 
worked well in that it was sufficient to solve (3.3) only once, i.e., 
t = 1 was reached while solving (3.3) with c\. 

Suppose that the 5-stabilization problem does not have a solution. 
In that case the method will reach a stage where the solution trajectory 
comes close to making one or more of the inequalities active and, as 
a result / gets stuck at a value less than one. In that case the region 
S can be expanded by appropriately changing the active inequalities 
and the procedure can be restarted. 

Problem (3.3) is only an equality constrained problem. We solve 
(3.3) by a continuous realization of the gradient projection method 
[4, 61. Let 

{( 1 -. t): H(\\ t) = 0} (3.4) 

and V/|,(r. t) be the projection of the gradient of / = (1 — 
t) -I- r k .B(r) onto the tangent space of Z at (v. t). In the gradient 
projection method one moves on Z along the trajectory of the vector 
field defined by -V/,,. Careful tracking of the solution manifold 
Z using the vector field -V/ J( can be done by using a simple 
modification of any state-of-the-art ODE solving package [10]. Such 
a tracking allows continuous check on any deviation from the solution 
manifold caused by the propagation and accumulation of numerical 
emirs and applies a correction to get back to the solution manifold 
whenever necessary. 

Let (’ denote the curve on Z starting from (7\ 0 ) and defined using 
the above process. The tracking of C requires Z to be smooth. The 
smoothness of H does not automatically guarantee smoothness of Z. 
The following result describes the smoothness of Z. 


Theorem 3.1: Let Z be as defined in <741 
{(*', t): Assumption 2.1 holds) C 7? v . Suppose ,v.» 

Jacobian of h 1 with respect to F, has full ww - ih \, 

(r, t) € Z nr. Then Z H r is a differentiable manifold. 

Remark .3.1: Typically C cuts 0V, the -boundary r,{ 1 
sally and so there is really no need to worry about ar.v i! ( . ... 
imposed by any crossing of 0V (if at all it occurs) during s‘v 
of C. In doing all the numerical examples we have simp), U i ;i . 
Assumption 2.1 and yet did not face any difficulties in reaching a 
solution ( v *, 1 ). 

Remark 3.2: The assumption that hf has full row rank usually 
comes directly from the way the control structure constraints are 
formulated. 

To prove Theorem 3.1 we use the following lemma. 

Lemma 3.2: Suppose v is such that Assumption 2.1 holds and 
h 2 (v) = // 2 (F, 7) = 0 , where h 2 is as in (3.1). Then h\{]\ -), the 
Jacobian of h 2 with respect to 7 , is nonsingular. 

Proof: Let *(*) = det (P(s) 4- FV'(»)). With straightforward 
algebra it is easy to see that h:f F. 7 ) has block diagonal structure 
with ith (i 6 { 1 , ••■./() block given by 


(0 - 1 ) 
(-1 0 ) 


On; 
Obi 


Gj{C.)=e? + ajCi +hjh 
G,+ i(C.)=C-A/ a . 


l+l 

j~ i. ,//< 

./ = L ■ ■ • / 


The last ((1 + 1 )th) block is 0t{X)/d\ = + M. 

If Assumption 2.1 holds then for each j, G, ( C>) (which is the 
McDuffe resultant associated with p, (*) and p,{ s)) is nonsingular 
[2] and A 2 4 - a ,A 4- h, is nonzero. Hence Lemma 3.2 follows. ■ 
Proof of Theorem 3.1 : To prove Theorem 3.1 it is sufficient to 
show that the Jacobian J of II with respect (F. 7. 1) has full row 
rank for every (v. t) = (F. 7, f ) 6 Z D \ \ We have 


J = 


hi 0 

h'U r. 7) ^<r. 7) 


h l (D 
ir( r. 7) 


(3.5) 


Then from (3.5) and Lemma 3.2. Theorem 3.1 follows directly. ■ 
It is very important that C connects the points ( T . 0) and \ 1) 
(if v* exists). This connectivity condition depends on the choice of 
II. For any of the 5-stabilization problems which are considered 
in this paper choosing an II such that the connectivity condition is 
satisfied is a very hard problem. We believe that it can only be solved 
when complete system theoretic solutions to these problems become 
available. Our choice of H given in (3.1) is not guaranteed to satisfy 
the connectivity condition. Tests on several problems have shown, 
however, that the homotopy method based on the H as in (3.1) and 
the optimization formulation (3.2) has good empirical success. On 
all the problems tried we were always successful in reaching t = 1 
without encountering any local minima of (3.2) or any unboundedness 
on the elements of F. 


IV. Example 

We carried out numerical tests on several constrained 5- 
stabilization problems using the VAX-88 computer. Here we present 
only one example to demonstrate the effectiveness of our approach. 

Power System: We consider the example of seven-stale, two-inpul 
model of load frequency control of a two-area power system. The 
numerical data for the A , B matrices are taken from [51. Here wc 
also take into consideration integral control action which is employed 
to reject step disturbances due to changes in load. Thus we append 
the state vector by two more variables which represent the integrals 
of the area control errors defined as + A f\ and --Vn, + -A/V* 
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TABLE T 

Solution for Power System 


| Initial value* (t » 0) j 

A 

{ -2±>1, -3±», -4±Jl. -5 ±>1, -2 } 

w 

^ pn(s) ■ a 9 + 12a 4 +,59s a + MS*' + 190* & 100 

put*) * o 

Pn(s) = «* + + 123s a + 3fts + 442 

EfKE* 

EfJi'E,* = 0.084 1.127 0.480 0.517 
£(■KE” = 0,234 0.078 0.000 0.884 

: 


- - - r • 

Number of integration step# = 1954, cpu time = 736.15 set 
_ 11 __ 


taiuiible solution values (1 — 1) 1 

A 

(-1.47087 ± >0.691058, *»2.72493 ± >0.992018, 

-4.42945 ± >1.49412, -6.27487 ± >3.05654. 

-1.99674} 

P(») 

pli(s) = + 13.9234s* + 82.3513s 3 + 271.316s 3 + 338.132s + 234.420 

Pn(j) * -3.85005s 3 - 40.9614s a - 101.740s - 108.530 
pji(s) - -3.27041s 3 - 58.2824a 1 - 10.7900a - 146.782 

Paa(s) « s 4 + 17.8735a 3 + 116.771a a 4 289.408s + 269.346 

Et-KE? 

000 0] ’ 
EjfKEp = (000 0} 

K 

(fci, *i»)= [ 0.238848 0.959565 1.84323 0.643260 - 0.58OL20 ] 

(* 3 », Ufa) *[ 2.70128 0.637658 1,621895 2.159241 1.07738 ] 


see |5] for the physical meanings of — Ap*i»* and A/,, i = 1, 2 and 
the other state variables. The feedback is decentralized with some 
overlap. The 2 x 9 gain matrix K has the following structure 

x x x x x 0 0 0 0 

0 0 0 0 x x x x x 

k ] k I ri 0 

0 A'2<i A’2 

where the x’s represent the free gain elements. Results obtained 
by our method on this example are tabulated in Table I. We used ihe 

stability region of Fig. I with ui = -1. a < 2 = -10 and 0 = 45°. For 

doing homotopy curve tracking we used relative tolerance of 
In view of Remark 2.5 we compared the eigenvalues of (.4 - Bh ) 
and the roots of (2.8). They matched up to three decimal digits. 

V. Conclusion 

In this note we have given a numerical method for solving a general 
class of multivariable regional pole placement problems with control 
constraints, which has worked well on many examples. Given that for 
general control structure constraints the problem is very hard, such 
a method should prove very useful. 
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Continuous Robust Control Design for 
Nonlinear Uncertain Systems Without a 
Priori Knowledge of Control Direction 

J. Kaloust and Z. Qu 


Abstract —In this paper, a robust control scheme is proposed for a class 
of nonlinear systems that have not only additive nonlinear uncertainties 
but also unknown multiplicative signs. These signs are called control 
directions since they represent effectively the direction of motion under 
any given control. Except for the unknown control directions, the class 
of systems satisfy the generalized matching conditions. Nonlinear robust 
control is designed to Identify on-line unknown control directions and to 
guarantee global stability of uniform ultimate boundedness without the 
knowledge of nonlinear dynamics except their size bounding functions. It 
is also shown that the proposed robust control can be made continuous 
through utilizing the so-called shifting laws that change smoothly and 
accordingly the signs of robust controls and that, no matter what time 
constants and gains of the shifting laws are, global stability is always 
ensured. The analysis and design is done using Lyapunov’s direct method. 


I. INTRODUCTION 

Robust control of nonlinear systems in the presence of nonlinear 
uncertainties has been studied extensively. The important classes 
of stabilizable uncertain systems and their robust control laws can 
be found in [I], [3], [6|, |9]-|11|. The uncertainties in those sys¬ 
tems can be general nonlinear functions and input-related and/or 
input-unrelated. The input-related uncertainties studied so far in the 
previous results, however, are not only sign-invariant but also have 
known signs. These signs, called control directions, represent motion 
directions of the system under any control, and knowledge of these 
signs makes robust control design much easier. The objective of 
this paper is to develop a robust control design procedure based 
on Lyapunov's direct method that achieve global stability but does 
not require a priori knowledge of control directions. For practical 
implementation, designs of both continuous and discontinuous robust 
controls are considered and compared. It is shown that continuity of 
robust control can be achieved by designing the so-called shifting 
laws that change the signs of the robust control in a continuous fash¬ 
ion. The shifting laws are based on the results of on-line identification 
of unknown control directions. 
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It is worth mentioning that the problem of unknown control 
iilections has been studied for the last 10 years in the area of adaptive 
ontrol. Most results in that area are for linear time invariant systems 
ith unknown parameter and unknown high-frequency gain (that is, 
ontrol direction). The first result was proposed by Nussbaum |8) 
m which the adaptive control uses the so-called Nussbaum gain, a 
transcendental function whose sign changes an infinite number of 
Limes as its argument tends to infinity. Using the same principle in [8], 
Mudgett and Morse [7| developed adaptive control for general linear 
systems without knowledge of high-frequency gain. An alternative 
method called correction vector approach was proposed in |5J. 
Recently, efforts have been made to extend these results to nonlinear 
systems that have unknown control directions. In [2], [4] two adaptive 
control schemes have been proposed for simple first-order nonlinear 
systems. 

Compared with these existing results (especially the two on non¬ 
linear systems), the results proposed in this paper have several 
advantages. First, the class of systems studied here is second-order, 
and the basic idea and approach used can be extended to second- 
order vector and higher-order systems. Second, systems considered in 
this paper are general nonlinear systems with general nonlinear time- 
varying uncertainties, while those in [2], |4| are time-invariant, have 
only unknown constants, and satisfy the Lipschilz condition. Third, 
the robust control laws provided here can be selected to be continuous 
in contrast with discontinuous adaptive control schemes formulated 
in |2|, |4|. Fourth, the proposed robust control scheme guarantees 
global stability of unilorm ultimate boundedness in the presence 
of general nonlinear uncertainties. Finally, instead of stabilization 
problem formulated in [2|, |4J, outpul tracking problem is solved in 
this paper. 

The outline of this paper is as follows. In Section II formulation 
ol our robust control problem ol nonlinear uncertain system is 
presented. Robust control and shifting laws are developed in Section 
III, followed by a simulation example. Finally, the conclusion is 
drawn in Section IV. 


II. PROBLEM FORMULATION 

We shall consider a second-order nonlinear uncertain system to be 
described by 

■i'i = /1 (.f‘i ■ t) + A/i (./*i. i/i. t) + Hi (.n ■ a i .i Vi < 0 (1) 

■i'j =/j(-r J ,.o./) + A/ i (.r 

+ ( 2 ) 

where x ~ [rj .rj] 1 € is the slate, ii(-) € W is the control 
input, the variables i/\ and //_> represent bounded, uncertain time- 
varying parameters, and f/i and are sealed (in ihc sense that 
|ri]| = \d 2 \ = 1), constant parameters which characterize directions 
of controlled motion. The control directions, signal ,] and 
are unknown. The 1 unctions fiiJi.t) and />(.ri,.o.f) are known, 
functions A/i (.rj, #/i, 1 ) and A/iLn . xj. ;/ 2 . t) represent uncertain¬ 
ties, and the functions r/i (.rj. «i x- 2 - /)i ■ t) and m( -n. .rj. rr^iM/j. 0 
may contain uncertainty in addition to c /1 and ai as well. 

The class of uncertain system is given by (1) and (2) is a 
special case of the class of nonlinear systems in [101. These systems 
satisfy the structure specified by the .so-called generalized matching 
conditions. The difference between [10] and this paper is that the 
above systems has unknown control directions. For simplicity of 
presentation, subsystems (1) and (2) are assumed to be scalar. 
Extension can be made to higher-order and vector systems in which 
the control directions a, are diagonal and constant matrices of proper 
dimensions. 


We introduce the following assumptions lor s\m< n . 

(1) and (2) 

A.l) Under a continuous control it(j.t), snss w ■ 
a classical solution. 

A.2) The uncertainties A /1 and A/* aie bounce,' v 
continuous, and locally uniformly boumleu m * . 

|A/i (.i*i , tf \. f )| < pi (.i'i), and jA/u, , 

P 2 U 1 , .ra), for all i and t}j in some puscuUvt > , , . 
sets and for all (.r,, x - 2 ./). 

A.3) For the functions //,. there exist known umiimious, poMiive 
definite functions /i, and nonnegalive funenons r , such »h,u. 
for / ^ 1,2 and foi all (.ri,.r.». a, 


(|.i'j|)( 1 + (.ri, a i.i j. f)) < «i.rjf/i (a i . (/1 ,i j. //,, p, 

^2 (| u |) ( 1 + i | . I J.iljli. i/:,t). 


Moreover, ihere exist known continuous positive definite 

functions o, such that inequality j)I 

holds for / = 1.2 and lor constants < 2 > 0 and l ., <2. 

A.4) The functions have the property that f/i(.i ) ,rf l t 

t) — </i(<n )i;i(.r|..r'.».»/i.l) and /yj(.i _~ 

<12 () <U ( x 1 . . 1 2 . a . II 2 . t ), where »y i (■) and ry 2 (•) are known, 
odd and normalized functions with «y,(l) — 1. The inverse 
functions of q,{') arc assumed to be locally well defined 
Moreover, 0y\/Q.rj ^ 0 if x 2 ^ 0, and, if the sign of ,r> 
is fixed, the sign of the partial derivative is fixed as well toi 
all .i'i. ij i and /. 

A.5) The functions in the system can be bounded by known 
functions that arc uniformly hounded with respect to 
time and locally uniformly bounded with respect to the 
state. That is, for all < 

f\ (.n )■ 0| < f‘A{f [■'!>). |//,(.i \. u i j j. i/i,0| 

< 5i(.ri,.r 3 ), and ,. x- 2 , <i 2 u. i/j. f)| 1 fh{ i i , x 2 . n ). 
Also, the functions in the subsystem (1) are differentiable 
once and their partial derivatives are bounded by known 
functions as, for all (.n u. if ,, »/.»• U 


Of 1 

+ 

0±f, 

Of 


(It 

Of, 

4- 

0\f, 

Ox, 


"0.n 




Oil i 
()/' 

Oh i 
0.V I 

<2[h 

0X2 


< h t {.l l.J'j). 

< h ij (.I’], .Ij. t). 

< A - 1 


It is worth mentioning that Assumptions A.l)-A.3) come directly 
from the generalized matching conditions 110|. Assumption A.4) is 
somewhat implied by Assumption A.3). Assumption A.5) is needed 
for identification of control directions. With these assumptions, we 
can proceed with continuous robust control design in the next section. 


III. ROBUST CONTROL DESIGN 

Along the line of nonlinear robust control design tor uncertain 
systems, the main approach is Lyapunov's direct method togcthei 
with using deterministic bounding functions lor uncertainties. There 
are several classes of uncertain systems in 11 ], |3], |6|. |9| |11| 
for which global stabilizing robust controllers have been found if 
bounding functions of uncertainties are available. One of the common 
features among these results is that stability analysis and especially 
generation of robust controller is done through a Lyapunov argument 
in which robust control is chosen to compensate lor uncertainties 
by dominating them in both magnitude and sign. In those results, 
control directions are known in the sense that, after choosing robust 
controller, the sign of robust control determines the sign ol the time 
derivative of the state and the Lyapunov function. 
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In this paper, robust design is studied for systems with unknown 
control direction. Like the existing results, the process of designing 
a successful robust control is to use the Lyapunov argument and 
the concept of control dominating unknown dynamics. Since control 
direction is unknown, an initial guess is made on control direction, 
and the robust control is designed whose direction is changed by a 
so-called shifting law. The key results in this study are that unknown 
control direction can be identified using the domination concept and 
that, since continuous control can produce a belter transient response, 
smooth transition of the direction of robust control can be achieved. 
Intuitively, the basic idea emanates as following: begin with some 
guess a, of a, to design robust control in terms of and identify 
a, on line (both parts use the dominalion concept in a slightly 
different way); then formulate shifting laws d, so that robust control 
is continuous and guarantees global stability. 

The following four-step procedure is used to design a continuous 
robust control without a priori knowledge of control direction: 
On-line identification of control directions, shifting laws, state trans¬ 
formation and robust control design using the recursive procedure in 
HO). 

Step I: The first step is to determine sif?n[ai] and .\i^/i[«a] on 
line. Without correct identification of these signs, any control will 
fail to ensure stability. As explained earlier, if a robust control 
works for a given uncertain system, the robust control must dominate 
the uncertainties in the system in both sign and magnitude. This 
domination provides us with a mean of determining the unknown 
control directions of the system. 

Let us first investigate the subsystem (2). If the robust control u 
is designed properl). the function <u will dominate in magnitude 
und sign the uncertainty A/j. Due to the unknown control direction 
<i 2 , a successful robust control must identify the control direction, 
and the identification can be done by designing robust control such 
that the function <j -2 dominates the control-unrelated part of dynam¬ 
ics, /j 4- A/j, that is, \ih{.r\ . .»■„*. hjii.i/v. t )| > i. J'j. *)| 4 
|A/j(.r i. .r». ifj. t )|. This inequality can guaranteed if </ is chosen 
such that 

'/.*(•» n-r-2. \tt\. ijj.t) ,> [/,-Ui,.i\i) 4 i. (3) 

Note that, although the above inequality appears conservative for 
choosing robust control, the condition is necessary in the worst case 
that both the known and unknown dynamics }> and A/_» may be 
unstable and therefore, the control has to dominate. 

Integrating both sides of (2) yields, for all [ti.tj] C [/«,/] 

) = / [jLfUi,.i >. r) 4 A/j(.r| . .1 j, //■», T) 

■ll\ 

4 !lj(-r i, j 2 - f i 2 ti• tf-j . 7-)]Jr. 

Note that the integrant functions are all continuous. Under condition 
(3), it follows that, if ;h (•) does not change its sign for r 6 
[U^], )] = *i. .i- 2 . tnn. i}>, r)]. Using 

Assumptions A.3) and A.4) we have, for any [f i. fj] C [f», t] 

=5 j < .r,, .r>, ii. i;j. ) - .r- (/ 1 )] 

= sign[u{T)]sif*n[.rAt 2 ) - )] Vr € [/i. fj] (4) 

as long as condition (3) holds and h(t) docs not change its sign. 

There are two important observations needed to be made about the 
identifications of si$n[<i 2 ]. First, since we can and have to choose 
robust control to dominate uncertainties to achieve stability, .vig/tfu-j] 
can be identified almost instantaneously after system (1) to (2) runs. 
Second, since a 2 is a scaled constant, its identification needs to be 
performed only once. That is, condition (3) is required to be true 


only at one time instant. This makes it possible for us to smoothly 
transfer the robust control with potentially wrong control direction to 
that with correct control direction. 

Applying the recursive design procedure introduced in [ 10], we can 
design robust control it through designing a fictitious control for sub¬ 
system (1), but the design of fictitious control requires identification 
of control direction sign[a } \. Differentiating (1) yields 


Of 1 , 1 , Ojh 

Of + 01 Of 


4 


Ofh . On 1 

^—-I'L 4 z — 
O.r 1 d.rj 


■r'j 


Of 1 OAfi 
lln 0.r\ 


which implies that, after substituting (1) and (2) into the above 
equation 


/i = /#(-#■! - //i. f/j. /■) 4 


Ojh (-i* 1 . ni-i'j. f/i. Q 

O.rj 


X ft'2 ( r I - J - <l'2tK lj>. t ) 

= M-n . rj . //I. //2,/) -4 <l\{(t\ ) 

0 (J\ (.I’l , #/!-/■) 


0.V 2 




(5) 


where function h(-) is defined and, by Assumption A.5), bounded 
as follows 


h(.r\ ..rj, 1 / 1 . tfj.t ) 



Of 1 OAfi Oy ] 

_i_ 

Of 1 OAf 1 Oil 1 


IF + 01 + of 

1 

O.r 1 O.i 1 Or 1 


x ( f[ 4 A / 1 4 m ) + vr— (fi + A fi ). 

OJ'2 

|/»(.i 1. o| 

< /. f ( ./• 1 , ./ >) 4 h ,, (./1. .1 2 ) [fi (■»' 1) 4 p 1 ( j i) 4 h 1 ( i' 1 ■ 12 )] 

+ 1 . ‘t ‘2 ) If 2 ( -l‘ 1 ■ ■» 2 ) 4 - !>• ( ■'“ 1 , .I’J )] — Jl(.l I . j ‘2 ) ■ 

Similar to the process of determining a we can hnd stf*n\<i \ ] 
if robust control u is designed such that the sign of right hand side 
of (5) is decided through dominalion by the term containing //*>(■)■ 
That is, robust control is designed such that 


\iu (-i 1. «i'j . it j 11. t/j. M| 

0tj 1 (.r 1 . n 1 .( j, 1 / 1 ,/) 


O-f 2 




or more conservatively 




0<h 

Of* 


hU 


.0). 


( 6 ) 


Recall from Assumptions A.3)-A.4) that (j-j , . 1 - 2 , u.tfjJ) has the 
same sign as that of u. Under condition (6), we know from (5) that r 1 
has the same sign as that of q\{<t\ )<ii{(t 2 )if[t?</i (.i’i, 2 * 2 , // 1 . t)/0.r*). 
Then, over any time interval in which tt and dtj\ /O.t '2 do not change 
their sign, the sign of n is fixed, which in turn implies that the 
solution .rj has a fixed concavity or convexity. Hence, it follows 
from this geometric property that 


«gn 


■I'l (f 1 ) 4 ) 


2 '* 1 

l 2 )\ 


= si^n[d r 1 (r)] 

Vf I < T < t‘2, 
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Thus, we have that, whenever no sign change occurs over the interval 
/, < t < h for u(r) and dr/i(.ci. ,r 2 , r/i,r)/cfcr 2 


sign[ni\ = sittn[a 2 ]sign 

X sisn[u(r)]si^n 


■Cl (f 1 ) 4 .Cl (#2 ) 

,(*> + f A] 

2 ’ r 

'V 2 JJ 


Off] (.Cl , .»‘2, */l , t) 
dj'2 


(7) 


provided that condition (6) holds. The Eqs. (7) and (4) show how 
to identify on line the control directions. Once the are 

determined, they can be used to construct the so-called shifting laws 
which direct continuously (fictitious) robust controls with guessed 
control directions to those with correct control directions. This will 
be done in the next step. 

It may appear from (5) or (6) that there is a singularity problem 
when \0(ji/d.i2\ becomes arbitrarily small. Although this singularity 
problem does make it impossible for us to choose robust control u 
to satisfy (6) at the time instant that singularity occurs, it does not 
present any obstacle for identification of control direction a j. This 
is because condition (6) needs to hold only for a very small period 
of time and because singularity cannot present persistently since u 
chosen to dominate uncertainties and known dynamics will cause 
j ‘2 to increase in magnitude. This reasoning will become much more 
obvious after we state robust control laws and basic stability analysis. 
Thus for now, we proceed with our analysis by assuming 


Off i 

0.V2 


> c„ 


(K) 


where C„ is a constant chosen by the designer. 

Step 2: After both siyn[n,] are determined, one has two options to 
implement robust control. The first option is to change the directions 
a, of robust controls instantaneously (at most once for each direction) 
to the correct values. If this option is adopted, the results in [10] can 
be readily applied to generate robust control and to conclude global 
stability. Since instantaneous changes of signs exist only in theory, 
sign changes should be implemented as fast as possible. Although 
this option is used sometimes in practice, it is not our choice because 
of the following problems. First, it is unclear how fast the change 
of control directions must be to ensure stability. Second, the rate of 
change of control directions is usually limited by control bandwidth 
and, even if possible, sign change being very fast is not desirable 
since it may excite high-frequency dynamics neglected in the course 
of modelling. 

Our choice, the second option, is to transfer robust controls 
smoothly from possibly wrongly guessed control directions to the 
identified, correct control directions, which will be accomplished by 
formulating shifting laws. It will be shown that, as long as direction- 
independent robust control laws u, and m, are designed properly 
and implemented, global stability is guaranteed no matter what lime 
constants and gains of the shifting laws are. This result provides a 
theoretical guarantee of the freedom for the designer to choose the 
gains in the shifting laws to achieve good transient performance. In 
fact, our choice gives a complete answer to the stability question 
posed for the first option. That is, in order not to cause saturation 
of actuators or to excite unmodeled dynamics while avoiding large 
overshoot of the state, shifting laws should be chosen such that their 
gains are reasonably large but not too large. 

Let the initial guesses of the control directions a i and n 2 be 
denoted by ai (to) and rr>(fo), then the shifting laws are given by 


( 0 until condition 

(8) becomes valid 

-A\ci - k,„ i |<7i(.ri, ) thereafter 

(9) 

( 10 ) 


where |«,(M = l.f; = b,-n,.b, 4 (he v 
and u v will be chosen in steps three and four 
«■> = fh *(M,"' (‘) represents the inverse fumi:^ 
k.,k nt > 0 are the gains of the shifting laws, and l. / 
constant. Both A * and can be freely chosen b\ \\w • 

variables h,(f) will be used as multipliers in robust , f 
The above shifting laws require identification re Mill* f;- n, o Ml ; 
(3). The initial values |ri,-(/ 0 )| = 1, as well as the pan 

by = 0 in (9), are chosen such that domination of robust controls 
with a, are in effect at least initially to perform identification of 
control directions, The choices of (9) and (10) arc made so ihu:, as 
will be shown in the proof, stability is warranted during the transition 
of control directions of robust controls. 

In the subsequent design and analysis, the state will be shown 
to be uniformly bounded. More importantly, it is worth noting that, 
even though a large control of the wrong sign may be applied, the 
transient response of the state will not have a very large magnitude. 
This can be seen from the following facts. First, the singularity region 
is small in the state space. Second, whenever the state escapes from 
the singularity region, sign identification is almost instantaneous. 
Third, the shifting laws makes r, converge to zero exponentially 
(and possibly in finite lime); robust control will have the corrected 
sign whenever |n| < 1 and |c 2 | < 1; and, in a finite time interval 
that can be made arbitrarily small by adjusting A,. \t,\ will be small 
enough so that robust control has its full strength. This implies that 
the temporary but adverse effect of large control of the wrong sign 
on the state can be minimized by choosing a reasonably large A,. 

Step 3: In this step, state transformation is performed to introduce 
a fictitious control variable u\ This allows us to design robust control 
u recursively and to facilitate stability proof. 

Let .c'/ be a bounded, smooth desired trajectory for the output .n 
of the system to track. Based on ./ '/ and .ri (0), we choose a smooth 
“perturbed desired” trajectory y,t. to avoid singularity. The exact way 
of choosing //,/ will he explained in step 4. Define the new state 
i] = .ri - y,t, then (1) can be rewritten as 

-i = /1 (j‘i. /) 4 - A,/j (j . i)\ ) + ff\ (.1*1. nj ./■;*. i) i. t ) - iiti 
= — A* -1 4 A/[ (-i i - .1*1. »/i. f) 4- g\U\,aiu\rpJ) 

4 - [//i(.ri,r/|.r 2 .r/i,f) - cp (.r \, a \ ir. ip. t )] (11) 

where k > 0 is a feedback gain chosen to be the designer, and 
the lumped uncertainty A/[(■) is A/[(. ,r \, ip . t ) - /i(cj,0 4 
A/i (.ri. i/i, t) 4- A"~i - <),/. Similar to classical observer design, k 
should be selected such that A** > k for good transient performance. 
It follows that 

|A/,'(»M>| < /i(.n ) +/'.(■<•]) + 0.5(1 + *■*;?) + M 

= •*■.). 

It also follows that there exists a known function c(-) such that 

\ff\(j ‘\. <h J‘* 2 . ffi.t) - (/j (.ri. r/i HM/j./ )| < | -r a - HfUi ■ »«'). 

The choice of fictitious control u: will be done in the next step. It 
will be shown that ir = ir(.n, .vi. a i). Now, define the second new 
state : 2 = j'> - ir. Then, we can rewrite (2) as 

h — hi r i. J'2 , t ) 4 A h (.ci. * 2 •»/-’■ f ) 

- fi'j 4 . .r 2 . a 2 f/. 1/2 J) 

4 A//»( ~i, . 1 * 1 , ^ 2 . iV2- m’« VJ' f) 

4 ffi («i* 1 «. 1 * 2 , (i 2 v « *12 ,t ) (^ 2) 

where the “lumped uncertainty” is A/if-j,.rj. - 2 , .r-j- «’■ 'ri- 1 1 
h(*'i'- r 2-t) 4 A/afii-i.j-a, 1 / 2 - 1) 4 k^ 2 4 «')|:i |A7gn[:a] 


f -2 = -A i *e 2 - A:,„ sif>n[c 2 ]\~-2\(j2{.c\< J'l.u,) 
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The uncertainty can be bounded by a known function pi(-) as 


< /a (■•*!■. , u-j) 

+ 0.5(1 + k 2 ij) + <’(.ri..r-j. )|~i | 


c hr 

I a" I ( 

Die 

da j 

■l«ll + 

D~i 

Die 

flj-i 

■MS 

i>'A- 


(13) 


where | i 11 , |.r 11 and |uj | can be easily bounded, and the partial 
derivatives of w with respect to its arguments can be bounded as 
well after tr is explicitly designed in the next step. It has been 
shown in [101 that bounding functions and robust control can be 
made differentiable. Therefore, the bounding function ^(■) in (L3) 
can be found. 

Step 4: In this step, robust control it and fictitious control tr are 
designed to guarantee global stability. Based on stability analysis 
presented in the appendix, they are chosen respectively to be of the 
form 


ii’( :i, ,ri. a \) = wi i O) (:i. j’j )0j (pV(- i ■ .i'i)) * (14) 

tt(*i. j*i . a- 2 ) = — rij£# 2 ( :j, ,rj, ) 

^ -t/ 2 », (15) 


where 


, /<{(:i..ri) + f 

) - r—-■ ■—r//i(:i..n ). 

/ti( ) =- :i/>7(*i.•»■] ) 

, + f 2 

: 2 ..r a ) - 


x /oUi..r,. -j..r_>), 


/<2( -1. -ri. = :-///(:i..f|. ij.-o) 


where 0 j(-) and 02(0 are functions defined in Assumption A.3), 
f > 0 is a design parameter. Note that \p,\ < 2 and. if |/*,| > f, 
> 1. The bounding functions //(*) and /»"(•) should be chosen 
to be continuous and to satisfy the following conditions 

). //,'(•. i..d)|^, 0 #0. 

/»a(ii ,j-| ^ 0 . 

(>j(~ i, J’i. ^..O) > mux |/'j(-1.i. iai-rj). 

[/a( j'i.. r.i) + /»j(-<■ i. J’a)]. —~ j- 


Implementation of the above robusl controllers requires identi¬ 
fication of control directions by domination. The domination can 
be ensured by choosing y t i and *■ as follows. First, select yAto) 
such that Zi(to) ^ 0 (which can be done by adjusting -i(fo) 
through choosing i/,/). Then, it follows that, at time to, robust 
control u is nonzero. Note that |<M/u)| = 1. Also note that r 
can be chosen freely and that, as will be shown later, the smaller 
r the better the tracking performance. Thus, we can choose f such 
that, at lime t = to, |//i(-i,.ri,- 2 ,.i\ 2 )| > r, which implies that 
1 S IffaUi- J*i ,- 2 . J j )| < 2 at time to. Therefore, we know from 
Assumption A.3) that 


. , i , ^ .WW) MpiQ'Ap’i)) 

•tAsi.Tt.Mv-*> > — = —^— 


^ 02&Api)p% n , 

> -^-= Pil-Ufl* - 2 ..^) 


which, based on the choice of pi (■), guarantees both inequalities ( 6 ) 
and (3). And these two inequalities can always be satisfied for any 
finite interval if t is small and if yAl) is perturbed from .i ,f /(f) such 
that (t) is not zero in that interval. This concludes the four-step 
process of designing smooth robust control law. 

Global stability of the system under robust control laws and shifting 
laws are analyzed by Lyapunov's direct method using Lyapunov 
function L(z i, ai. a- 2 ) = L i (:i,rii) + L 2 ( 12 , " 2 ) where 

= + 

Li(~-i.n 1 )= jL(«i<n(«i)- 1 )" + 1 )"- (16) 

n nt A’nr 

The following theorem guarantees global uniform ultimate bounded¬ 
ness of the system in the form of ( 11 ) to (12). The proof can be 
found in the Appendix. 

Theorem , Suppose that condition ( 8 ) is valid. Then, under the 
robust control laws (14) and (15) together with the shifting laws 
(9) and (10), the state of system ( 11 ) to (12) is globally uniformly 
ultimately bounded. Moreover, the system output .r 1 tracks any 
smooth, bounded desired trajectory V/( 1 ) with error that can be made 
arbitrarily small in the sense that 

Mm siii>|j-| (r) - .(•'/( r)| < l/—7-)-—' 

I—V kk n , 

Although the above theorem excludes the singularity problem in its 
statement, one can extend the result of the theorem to a more general 
case through the following argument. First, nonsingularily condition 
( 8 ) needs to be valid only for 011 c instant or a small period of time. 
Second, C n is a design parameter and can be made smaller 10 make 
singularity region in the stale space smaller. 'T hird, since uncertainties 
as well as known dynamics are bounded by well defined functions, .i | 
is finite at time to for any initial conditions. Thus, when the system 
does start in the singularity region, robusl control u dominates all 
dynamics and therefore, by choosing a small enough t'„, can make 
|j'i | large and out of the singularity region quickly enough to identify 
the control directions. One may argue that this scheme does not work 
when subsystem ( 1 ) has arbitrarily small finite escape lime if the slate 
starts in the singularity region. However, the case does not exist since 
it contradicts ./1 being finite. And, if this case would exist, there is 
no control, continuous or discontinuous, of finite magnitude that can 
stabilize the whole system. 


Illustrative Example 

Consider the following second-order nonlinear system 


j' 1 = rtf-rj -F -Yf. 


J‘2 ~ 


where the uncertainty A/ = .rf cos(/) + 0.5sin(0.5/). Robust 
controls (14) and (15) arc chosen with the design parameters r = 0 . 1 , 
k — 2 , C n = 0 . 2 , and with the following bounding functions 

ftl = ft\ = .rf -f 0.5 + ™(1 -f ). 

h = 0.25 +0.25 + 1^1*. 

H - + l-i 1-2 + 12 7; + ■'■'{ + l"i I*l J ‘a|*] 

1 /' 11 T r ! 


+ 


/*! 

Li/‘.r , +^ 


jl-il^ 


- ll 


«i 


i'i + f 2 


I'I Tt| | | H 
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Trajectory error 



secs 


(b) 

1 ig 1 Robust control with A =11) and A =1 

The simulation was tamed out using SIMMON® with the following 
choices ( 1 ) desned trajectory is tj 1 = , 1 = 0 ( 2 ) initial conditions 
are n( 0 ) - / ( 0 ) = 1 o\(ti ) = -1 n (/o) = 1 (3) unknown 
control directions arc chosen to be «\ = 1 and nj = — 1 

Ihc simulation results ol the states and the slutting laws are shown 
in Figs 1 and 2 respectively tor two sets of choices 7 and A It 
should be noted that there is no large overshoot in the state during 
and after sign identification 

IV CONCLUSION 

Robust output tracking contiol of a class of nonlinear uncertain 
systems is studied A system in the class satisfies the generalized 
matching conditions but contains one or several parameters of un 
known signs used to charactenzc control directions Without a pnou 
knowledge oi control directions a four step design procedure is 
proposed which consists of identification of control directions and 
construction of robust controls and shilling laws The system may 
contain nonlinear uncertainties, and both continuous and discontin 
uous iobust control can be designed using only bounding functions 


Trajectory error 



secs 


(b) 

Fig 2 Robust control with A = 5 and A =01 

ot unknown dynamics Under the condition that the subsystem!s) 
not directly controlled has no arbitralily small finite escape time the 
propostd robust contiol guaiantees global uniform ultimate bounded 
stability wilh arbitrarily small ultimate output tracking errot 

APPTNDIX 

Proof of the Ihtonm Iaking the time denvativc of I \ ( i <> \) 
in (16) along the trajectory of system (11) and (12) under control 
(14) and (15) we get 

7 i = 2 i { — A i + A /j ( i r i i/i /) ■+■ /i ( i i u " '/i U 

+[<7i ( n "i ' 2 »/ i t) - <i\ (11 o i ii >/i f) \\ 

+ 2 {-A > - i (1 1 i* ii )| i|wi »n[ ] 

+a/:< , i, 2 1 2 " >1- 7) 4- (I ( I 1 « I II II t)} 

£ — 2ATi( i ~j) + 2|/fi( j 11 )| 4- 2|/i (-i ii ' >1 

+ 2 i'/i(ii u i/i /) + 2~jr/>( 11 i a u i) 1) 

+ 2|-i ||f i|r/j( i | it ) + 2| *||f f i i. ii ) 
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Using the same arguments in [10], one can show by Assumption 
A.3) that -,</,(■) < 0 and that, if > f, -,</,(■) < -|/i,<-)| for 
i = 1,2. Therefore, we have that, for all (:,..r,,0 

ii < —2kL\ -f 4r + 2|-i ||(’i |f/i(./'i. ) 

+ 2|-a||r ,, j|Sa(.ri. .fy. u, ). (17) 

Now we proceed by taking the time derivative of Li in (16) along 
shifting law (9) and (10). It follows from si^n[q t {o ,)] = ) = a, 

that 

2 2 

la(oi.fia) = v—«i(«iVi("i) - l)n + - l)r* 

"fri 

< —2 -r—Li{(i i, <ii) — 21ci ||f’i \fji (*i*i, ir, ) 

S’lM 

- 2|c*||r 2 |jj 2 (n, j- 2 . ii,). 

Combining the above inequality with (17) we have that, for all 
(j 1 1 , , j* 2 , ,'.y, t ), L) ■+■ Li 5s "~2kL\ — 2h\Li/k,„ ■+■ . The above 

inequality shows that the state of system (II) to (12) is globally 
uniformly ultimately bounded. Solving the inequality and noting that 
|« # (0)| = 1, we can show the ultimate bound on the output tracking 
error in the statement of the theorem. 
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On the Ordering of Optimal Hedging Points in a 
Class of Manufacturing Flow Control Models 

George Liberopoulos and Jian-Qiang Hu 

Abstwci —The optimal flow control policy of a single-product unreliable 
manufacturing system that must meet a constant demand rate is known to 
be a threshold type policy: safety production surplus levels called hedging 
points (thresholds) are associated with each discrete stochastic capacity 
state of the system and serve to protect the production process from 
uncertainty in future capacity availability. This correspondence extends 
and generalizes previous results on the ordering of optimal hedging points. 
Our method is based on examining special properties of the Bellman 
optimality conditions of the underlying stochastic control problem. 

I. Introduction 

Manufacturing flow control models address the dynamic allocation 
of stochastic capacity among competing products in a just in time 
manufacturing environment. Several of these models assume that 
medium-term demand rates for a number of products are constant, 
while short-term production rates are continuous over time and 
must live within capacity constraints. These constraints are dictated 
by the configuration of operational machines in the manufacturing 
system modeled. When the suite of machines (e.g., operational/failcd) 
changes at random points in time, optimal policies are characterized 
by generally intractable dynamic programming Bellman equations 
[ 111 . To dale, most analytical manufacturing flow control models 
have assumed thal the machine state process is homogeneous and 
uncontrolled, implying that the failure rale of a machine does not 
depend on its production rate or its age, i.e., how long that machine 
has been operational since its most recent failure. 

Akella and Kumar |2J were the first to perform rigorous analysis 
on a one-product, one-machmc (with two machine slates, up and 
down) manufacturing system with a discounted cost criterion. For 
the same system, but with a long-run average cost criterion, Biclecki 
and Kumar [4] derived the steady state probability distribution of 
production surplus. Sharifnia 11 7 1 extended this approach to one- 
product, multiple-machine-statc systems, and Algoel |3| derived 
partial differential equations for the joint steady state probability 
density function of production surplus for multiple-product systems, 
but did not solve these equations. Caramanis and Sharifnia [5| used 
the results of [ I7| to design near optimal control policies for multiple- 
product systems by decomposing them to many analytically tractable 
one-product systems. Malham£ and Boukas |I 3 [ and Malhame [14| 
demonstrated the Markov renewal nature of one-product manufactur¬ 
ing systems under hedging point policies and studied the ergodicity of 
such policies. Liberopoulos and Caramanis 1 12], Hu ctal. [7], and Yu 
and Vakili j 19] explored the optimal control structure for one-pmduct 
systems with production dependent failure rates. Hu and Xiang [ 8 ] 
demonstrated the equivalence of a one-product, two-machinc-statc 
manufacturing system with a queueing system. Hu and Xiang [9], 1 10 ] 
showed that the ordering of optimal hedging points of certain one- 
product systems can be found analytically. Sethi et at. [16] analyzed 
the structure of turnpike sets that characterize optima] production 
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surplus levels when the demand is a Markov chain; they showed 
that the turnpike sets exhibit a monotone property with respect to 
capacity and demand. A more comprehensive list of references on 
manufacturing flow control may be found in a recently published 
look by Gershwin [ 6 ]. 

A key feature of the optimal control structure common to all models 
is the hedging point. A hedging point is a production surplus threshold 
level associated with each discrete machine state /. toward which the 
production surplus must be guided as quickly as possible while the 
machine state remains fixed at i. A hedging point is also referred to 
as a safety or buffer production surplus and is carried to protect the 
production process from the uncertainty in future capacity availability 
or other uncertainties (e.g., demand and processing time). 

This correspondence mainly deals with systems similar to those 
studied in [14], 117|, having one product with constant demand rate, 
and many machine states with constant transition rates. II is known 
that apart from a one-product, (wo-machine-statc system [2], [4], it is 
practically impossible to derive explicit expressions for the optimal 
hedging points. With this in mind, the goal of this correspondence is 
to uncover properties on the relative ordering of optimal hedging 
points. Such properties have many important applications. They 
can be used to develop simple numerical and heuristic methods 
for obtaining optimal or near-optimal control policies, since they 
dramatically reduce the search space of optimal hedging points. Also, 
for systems where machine state transition rales arc age dependent 
(age being the time elapsed since the most recent machine state 
change), they can be used to study properties of the corresponding 
optimal age dependent hedging curves or turnpikes. This can be 
done by approximating age dependent transition time distributions 
with phase type distributions, each phase having associated with it a 
hedging point. This correspondence extends and generalizes results 
reported by Hu and Xiang [9), [ 10 ], who showed that for some 
special systems optimal hedging points are ordered monotonically. 
The method wc use to establish the relative ordering of optimal 
hedging points is based on the Hamilton- Jacobi-Bellman optimality 
conditions of the underlying stochastic control problem. 

The remaining of the correspondence is organized as follows. In 
Section 11 the manufacturing flow control problem is formulated as 
an optimal control problem whose optimality conditions are given by 
a set of Hamilton-Jacobi-Bellman equations. In Section III certain 
properties on the ordering of optimal hedging points are derived based 
on ensuring consistency of signs on both sides of the derivative of the 
Hamilton-Jacobi-Bellman equation. In Section IV these properties 
arc applied to several special systems, and in Section V several issues 
and extensions are discussed. 


II. The Manufacturing Flow Control Model 

Wc consider a manufacturing system with the following features 
[17]: 

1) The system produces a single part-type for which there is a 
constant demand rate denoted <1. 

2) The capacity stale (also called machine state) of the system is an 
finite-stale continuous-time irreducible Markov chain denoted n(f). 
with state space S. 

3) The production rate of the system at time /. denoted i/(/). must 
belong to the closed interval l = {/i: 0 < u < r n ( 0 }. where /■, 
is the maximum production rate of the system at machine state / 6 S. 

4) The production surplus of the system at time t , denoted .r(t), 
is the difference between the cumulative production and cumulative 
demand up to time t. 

5) The production surplus cost rate, denoted <y(.r). is a strictly 
convex function in .r and has a unique minimum at .r 4 which implies 


that y{jr) x as .r —► ±cc\ r/(,i ) represents lu 1 , 
backlog) cost when ./■ is positive (respective^, n, ■ ,* , 

6 ) The transition time from machine stale i > > 

is exponentially distributed with constant transii,., l 
J )■'<./ € 5, with (j(t\ / ) = - f f (i, r i\>. 1V * 

O.i ^ j and q(/\ i) < 0./ e S.) 

The last feature implies that the machine siate pi M , 
homogeneous and uncontrolled. With this in nimj. j ,j/, . , 

is a stochastic process characterized by the follow my ditlen.iui.il 
equation: 

= </(/) - J. s.l. 1) < i « , , 

The objective is to find optimal production rales ir,Y», t / l0 
minimize the long-run expected average cost 

J = ton /rj(l/f)^ r/(.i , (.s)l</s|. (2) 

For the above limit to exist it is assumed that in the long-run the 
demand is strictly feasible, i.c., that the demand is less than the 
equilibrium mean of the capacity process. Under suitable regular¬ 
ity conditions imposed on the control [15],' the optimal feedback 
control of the system is characterized by the following Humil- 
ton-Jacobi-Bellman (HJB) dynamic programming equation |6]: 

•J* = //(.»') + min {V/(.r)(u - </)} 

u ()« l( *-» I , 

+ *' J i (•*’) ~ ^ )]■ < F S. (3) 

l/' 


where J* = infi„ ( , () j .7. is the optimal value or differential 

cost-to-go funtion starting from slate ( r. /). and ' denotes the 
derivative w.r.t. r. According to (3). the optimal produclion rale 
u ,*(.r) satisfies u*{.r) = arg min { : 0 v. it < r,]. 

Theorem I: V,(.r) is strictly convex in .t and has a unique 
minimum at 6 S. 

Proof Outline The proof that I ,(.i) is strictly convex is similar 
to that of Theorem 5.1 in 1181 and we shall omit it. We only note, 
however, that the strict convexity of V,(.r) follows from the strict 
convexity of < /(,r) and the convexity of the sets l ,.i € S. In fact, 
exact solution of J’,(.r) for small systems [2], [4|, |16] suggests 
that the value function may be strictly convex even if //(./ ) is not 
strictly convex. For the results obtained in [18] to hold, it must be 
assumed that under any admissible control law some state (.in. A) is 
a regeneration point of the process \.v(t). r#(/)} (tor example k may 
be any machine state and .r n any point such that ■’ u < jt* ). Denote b> 
T„ successive visit (regeneration) times to (rn. k). The value \ ,(.r) 
is well defined for regenerative control laws as 

Vj(.r) = min e{ f (</(.r(.s)) — .7) r/.s | .<'(())- .r. n(U) — A. 

(4) 

Thai I',(.!■) has a unique minimum then follows Irom its strict 
convexity and the fact that </(.r) —> -x as ./ —> ± x . since, by 
(4) this fact implies that V,(a*) —* "x as .r -+ ±\. QU.D 

Theorem 1 implies that 


17(.r) > 0. V/'(.r) > 0 for .r > 

r/(.r)<n, r/'(.r)>o for .»■<„. (-5) 


In view of the above, the optimal control policy is given by. 

{ v, if ,r < 

min (fl. r ,) if .r — (b) 

0 if .r>: ( . 

1 As we consider the infinite horizon case we also require that admissible 

control laws be restricted lo those for which ihe process {/(/). n(/i] is 
asymptotically stationary ergodic. 
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Point ~r is known as the optimal hedging point associated with 
machine state /. Unfortunately, the value of the i £ S« can not be 
explicitly calculated except for a system with one machine that dwells 
in two states (up and down) \2], \4], For systems with rrtore than two 
machine states, however, the optimal hedging points can be obtained 
numerically: first, the long-run average tost can be computed for each 
tentative choice of hedging points, i t .i 0 5, as follows: 

/■+ ^ 

./■ a (.r)] = j <7(.r)/ (.r 

where / U) is the steady-state probability density function of r and 
is readily available as an implicit function of the i,J 0 S [17]. Then, 
the optimul values of the can be found numerically by minimizing 
J using a gradient search algorithm. 

111 . Hedging Point Ordering Properties 

Recognizing that it is practically impossible to derive explicit 
expressions for the optimal hedging points, we instead seek to 
uncover properties of the relative ordering of hedging points. To 
this end, we assume that the function J',(./*)./ 0 5. exist and 
are continuously differentiable/ and that the optimal control is 
Characterized by (3). If V, (,<■)./ 0 5. is continuously differentiable, 
r/'(.r) is defined almost everywhere (except possibly al and the 
points where //(.«■) is not defined). Differentiating (3) w.r.t. .r yields 

0 = r/(.r) + r/'(.r(u,*(.r) - r/) 

4- j)[V/(.r) - r/(,r)], a.c. i 0 .S’. (7) 

The proofs of the properties that follow are based on ensuring 
consistency of signs of quantities appearing on both sides of (7). 
Namely, the right-hand side of (7) cannot be strictly positive (or 
negative), since the left-hand side is 0 . 

In what follows it is assumed that r, ^ d. for all / 0 S. 
This assumption simplifies matters but is not essential. Under this 
assumption, there are two types of machine states, feasible and 
infeasible. State / is called feasible if r, > d. and infeasible if r, < d . 

Moreover, it is assumed that there is at least one infeasible machine 
slate. Otherwise, d can be met at all times, and there is no need to 
hedge beyond ./■*. the minimizer of i/(.r). If all states are feasible, 
all hedging points are equal to .r\ 

Property /: .r* < min/, a< s {:a ). 

Proof: Suppose the opposite is true, i.e., suppose thai .r* > z, — 
inim Kf s ). Then, for .r = :, + . the first two terms on the right- 
hand side of (7) are strictly negative, the terms in the summation 
are nonposilive, whereas the left-hand side is 0. Therefore, .r* > 
mim Afs { :a ( cannot he true. Q.E.D. 

In most manufacturing systems .r* = 0 , i.c., the cost rate is 
minimum when there is neither positive (inventory) nor negative 
(backlog) production surplus. In these cases. Property I stales that 
safety production surplus levels must always be nonnegativc. 

In most of the work reported to date 117], 113], [14] it has been 
assumed, without proof, that optimal hedging points associated with 
infeasible machine states are larger than the largest of the hedging 
points associated with feasible machine stales, This means that in 
infeasible machine states it is best to always produce at the maximum 
allowable rate. Although intuitively this seems reasonable, it has yet 
to be proved; however, wc can show the following properties. 

2 For 1 ,(.r) to be continuously differentiable, one must further assume that 
f/(.r) is Lipschitz continuous. Continuous differentiability of the V,(.r) has 
been proved for special manufacturing flow systems |2|, |4| with piece-wise 
constant cost rate function, as well as general piece-wise deterministic systems 
with jump Markov disturbances 115|; it has also been proved for systems with 
random demand 1 16|. 



Fig. I. System 1. 


Property 2: If /*, > d . for some / 0 S, then < 
maXj ,(, ,)m> {i,}- Equivalently, if > max,.,, (l . l)j>n {i, (, 
for some / 0 S . then r, < d. 

Proof: For the first part, suppose the opposite is true, i.e., 
suppose that > max, , j(i ,,) {:,}. Then, for .<■ — the first 

two terms on the right-hand side of (7) arc strictly positive, the terms 
in the summation are nonnegative, whereas the left-hand side is 0. 
Therefore, > max, , /(/ 7) , 0 j:,) cannot be true. The second part 
is dual to the first part and follows immediately from it. Q.E.D. 

It is noteworthy that the inequality < max, r/fli ,) } 

(respectively, > max, f/(<<7L so {:,}) in Property 2 becomes, 
< max, >0 {~ y | (respectively, > max, f/(f , ,,,.o {}), 
if either > .r* or y(jr) is continuously differentiable. A corollary 
of Property 2 is that the largest hedging point belongs to an infeasible 
machine state. This is stated as Property 3. 

Property 3: max, , t {:, | < max, ). 

The following properly concerns the ordering of any two hedging 
points. 

Property 4: For i j.i. j 0 S. define 

"= max {.a} 

k <,[. ky>,,(, n kf, , 

(T — if{A*: »/(/. A) > < A-), k f /, j } - ^), 

:= min {:a} 

A ■/(<. A )' A) k?i, ; 

(i- x.if{A-: ij(i. A) < (j{j. A).A- ^ /, j } = ^). 

If r, > d. > T, and T < then ■„ < 

Proof: Suppose the opposite is true, i.e., suppose that 
vSublract (7) for machine state j from that for machine state /. After 
rearranging terms, the result is 

0 = V,"(.r)K‘(.0 - d) - Y"(.r )(«*(.r) - d) 

+ A*) - r/(j. A')]V/(.r) 

k/, , 

- q(i. A•)U/(.r) + r/(.y, A , )U / , (.<‘) 

4- h(/\ j) 4- q(j , i)][V;U) - U/(.r)]. 

Then, for .r: max {“. :, [ < .<■ < min {^. (. all but the terms in the 

summation on the right-hand side of the above equation are strictly 
positive, the terms in ihe summation are nonnegativc, whereas the 
left-hand side is 0. Therefore, cannot be true. Q.E.D. 

There may be little intuition about Properly 4 at this poinl; more 
intuition about this property may be gamed when it is seen in the 
context of the special system examples in Section IV. It should be 
pointed out that Properties 1-3 hold even if the 0 S are not 

strictly convex. 

IV. Special System Examples 

Hu and Xiang |9|, (I0| studied optimal hedging point ordering 
properties for systems with special Markov chain structure under a 
discounted cost criterion. In this section we demonstrate how optimal 
hedging point ordering properties of those and other systems are 
special cases of the general properties developed in Section 111, for 
the average cost criterion. All the systems we study have only one 
infeasible machine state, labeled 0. 
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Hg 2 Sysiem 2 



Ti^ 1 Sysiem 1 

System 1 is shown in Fig 1 and is defined as tollows 

S \\Hm ! Definition S = {() 1 n } where in ^ <1 and 
i * tl i _ l 2 n The following transitions must be allowed 

/ —► / — 1 / = 1 n and / -* ii for some t < n The only other 

allowable tiansitions aie / —► y y / / 0 n — 1 

The characteristic ol System 1 s Markov chain is that any transition 
liom left to light is allowed whereas transitions fiom light to left 
must be skip free A special instance ol System I is when transitions 
are skip free from left to light too (death-birth chain) A death birth 
chain can be used to model a manufacturing system with n identical 
machines whose machine stale is characterized b> the numbu of 
machines that aic up System I has the following pioperty 

Property For System I i* < i £ < i < ( 

PiooJ By Properly 1 i* < / = 0 1 n 1 lie rest ol 

iht prool is by induction Fust by Pioperty 2 s. t Now 
suppose that £ i < < * i 1 / 0 < / < n We must 

prove that v i Suppose that the opposite is true i e suppose 
that -> j This implies that > max { i -t i } — 

max, ! , ){ ;} Consequently by Property 2 i < tl However 
/ " <1 is not tiue therefore v i cannot be true utliei Instead 

< i must be true 0 F D 

System 2 is shown in Fig 2 and is defined as follows 

System 2 Definition S = {() 1 n ) where /u <. <1 and 
; > (I i = 1 2 n The following transitions must be allowed 
/ —/ - 1 / — * 0 / - 1 ;/ and 0 —► n No other transitions 

are allowed 

A special instance of System 2 is when =0 and i \ - i — 
i I his can be used to model a system with one machine that 
may be either up oi down and whose down time is exponentially 
distributed and up time has a Coxian distribution with n stages A 
special case ol the latter sysiem is a system where the only transition 
allowed to machine state 0 is from machine slate l Such a system 
can be used to model an Filang up time distribution with n phases 
System 2 has the following property 

Property 6 For System 2 

1) If q(n 0) < q(u -]())< ^ q( 1 0) then i* < - < 

i £ £ i £ n 

2 ) I fq(u 0) > q(n -]())> > q( 1 0) then i' < 1 < 

< < - < -0 



Fig 4 System 4 



Fig 5 A system with two nomdcntical unreliable machines 1 he re ire tom 
possible machine stiles (0 0) (0 1) (1 0) (1 1) when, a 1 (icspcUocly 
0 ) in position i indicates that the dh machine is up (icspcclivch dimn) I hesi 
stales are denoted (I l 2 f respectively The I olure and rcpiui nits of iht 
dh machine arc ji and h re spec lively 

1 ) II q{n 0) — <({n - 1 0) = - </(l 0) then i* , - 

= -= < i 

Proof We only prove I since the proof toi 2 is very similar 
and 1 follows immediately from 1 and 2 By Property I i * < 
i — 0 1 n The rest ol the prool is by induction First by 
Property 2 i £ o Now suppose that i £ < i * 

o r 0 < i < n We must prove that + ( £ Thai 
follows immediately from Property 4 after substituting / y ” by 
/+! / i -x jespectively and noting that </(/ + ] 0) < /(< 0) 
Q C D 

Sysiem 1 is shown in f ig 1 and is defined as follows 
System 1 Definition S — {() 1 //} whtre m < I and 

i > (1 i = 1 2 n The following transitions must bt allowed 
; 0 / = 1 n No other tiansitions are allowed 

When 1 1 =0 and > i - i = i System 1 can he used 

to model a system with one machine that may he cither up or 
down and whose clown time is exponentially distributed and up tunc 
has a hyperexponential distribution with n slates Sysiem ^ has the 
following property 

Property 7 For System ^ i * v c / = 1 2 u Also 

il </(/ 0 ) < ?(j 0 ) (; /) € {1 2 n } then 

Proof By Property I i* £ / — 0 1 n By Properly 

* < i / = 1 2 n The last part follows immediately from 

Property 4 after substituting “by f -x. respectively and noting 
that q(i 0) < q(j 0) QF D 

System 4 is shown in Fig 4 and is defined as follows 
S \stem 4 Definition S = (0 1 n } where i tl and 

/>(//- 1 2 ii Hie following transitions must bt allowed 
/ «-* I) and i <-► n / = 1 ;/ — 1 No other transitions die allowed 

System 4 can be used to model a production system thu m ly enter 
one of many failuic modes before failing completely For example a 
system with two nonidentical unreliable machines is \ special case 
of System 4 with u "= 3 (sec Fig S) For System 4 wc have the 
following result 
Property S For Sysiem 4 

1) i* < < max* i a i { a I < ) 

2 ) It there exists j* £ $ such that q{ i* 0) 

max* j k -m/(A 0 ) and q{j* n) = mun i k qll 1 1 
then ,, = max* ^ k <r, _i {-a } 
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Proof: 

1) It folloWvS immediately from Properties 1, 2, and 3. 

2) Suppose the opposite is true, i.e., suppose that 

= max* 1 {~a }, where i* # j*. Since 

«) > ( Jij** '0-</('*< <>) < <l(j\ 0), and x* < < c 0 , 

following the proof of Property 4, 'it can be easily shown that 
i,* > v cannot be true; hence must be true. This 

implies that v = max* i<k„-i (■-*}* Q.E.D. 

V, Further Issues 

In Sections 111 and IV it was seen that for a system whose 
machine state process is modeled as a homogeneous and uncontrolled 
Markov chain, the relative ordering of optimal hedging points can 
he determined, if this Markov chain has a special structure and. 
in some cases, its transition rates have a certain ordering. It is 
noteworthy that this ordering of hedging points does not depend 
on the specific values of r, and <y(t, j)J. j £ 5. A question that 
arises is the following. Once the ordering of hedging points has 
been established using the type of arguments presented in Section III, 
can similar arguments be used to also determine the ordering of the 
differences between consecutive hedging points? Based on numerical 
examples, our conjecture is that the ordering of the differences 
between consecutive hedging points depends on the specific values 
of r t and q[i. j ), i. j € 5. 

An extension of the manufacturing flow control model presented 
in Section 11 is a model with age-dependent machine state process. 
By age dependent we mean that transition rates of the machine stale 
process depend on the age of the machine state, i.e., on how long 
the machine state process has been in a particular state. We first 
note that for a system with age dependent machine stale process the 
optimal control may depend not only on the machine state of the 
system but also on the age of the machine state. Consequently, the 
optimal control is no longer a hedging point policy. Instead, il is a 
so-called hedging curve or turnpike policy—a generalized hedging 
point policy in which values of hedging points depend on the age 
of the machine state. 

In [10] it is shown that any transition (failure) time with age 
dependent transition (failure) rate can be approximated arbitrarily 
closely (in the sense of weak convergence) by a random variable 
with Coxian 1 distribution with many stages. The stages of the 
Coxian distribution correspond to different ages of the transition 
(failure) time, and the transition (failure) rates between the stages 
are proportional to the corresponding values of the hazard rate of the 
original age dependent transition (failure) time distribution. Since 
a random variable with Coxian distribution is a Markov process 
itself, a system with age dependent machine state process can 
be approximated by a system with homogeneous Markov machine 
slate. This approximation provides a means to investigate ordering 
properties of the optimal control policy for a sysiem with age 
dependent machine state process based on results obtained on a 
related system with homogeneous Markov machine state process. 

Finally, an extension of systems with homogeneous Markov ma¬ 
chine state processes is systems in which transition rates of machine 
states depend on machine production rates. Some results on such 
systems have been reported in [71, [12], 1191. 
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Recursive Identification Method for 
MISO Wiener-Hammerstein Model 

M Boutayeb and M Darouach 


A Infract — A simple technique for recursive identification of the 
Wiener-Hammerstein model with extension to the multi-input single- 
output (MISO) case Is presented. We use a new transformation of the 
inpul-output difference equation where parameters to he estimated 
are those of each subsystem of the initial and unique realization. After 
that, a weighted extended least squares (WELS) method is employed to 
estimate recursively and separately parameters of the linear subsystems 
and the static nonlinear element. Convergence analysis of the proposed 
procedure is also studied. Finally, a numerical example is provided to 
show the efficiency ol the algorithm. 

I Introduction 

Modeling, identification, and control design of nonlinear systems 
have been the sublet ol many research activities in the last decades 
Indeed lor many dynamic systems (he use of nonlinear models is 
ollen ol great interest and generally chaiaclen/es adequately physical 
processes over then whole opciating range [ 8 ] Thus, accuracy and 
performances ol the contiol law increase significantly One of the 
nonlincai realisations frequently studied is the Hammerslein model, 
which is composed of a static nonlinearity in series with a linear 
dynamic system Seveial identification techniques ol this kind ol 
models with an extension to the MISO model weie pet formed the 
literature is abundant about this sublet and we rcler the reader to 
[I | [9| and the ictcrences therein However all recursive identih 
cation methods of SISO and MISO Hainmerstcin model (|5], [ 6 ] 
and 1111 ) cons si at first in transforming the initial icalization to an 
equivalent icpiesentation with common denominator and extended 
numerators Next to that classical iccuisive methods are applied 
The obtained parameter vector to be estimated by this appioach, 
is ol high dimension particulaily foi large scale systems, and 
then computational requirements increase with possible numerical 
instabilities Ihus, control design becomes veiy complicated and too 
crude to be ol use 

On the othei hand very lew ellorls have been done to 
extend these methods to the general repiesentation, called the 
Wicnci-Hammerstem model oi G model, particularly the MISO 
case The Cl model is dehned as a lineal system in cascade with a 
static nonlinear element followed by another linear system In [I] 
and 121, Billings it al have proposed an identification algonthm 
loi the Wiener Hammcistein model based on correlation analysis 
This was earlier suggested by Korenberg [ 10] Howevci in their 
algorithms some restrictive assumptions are required for the Wind of 
input sequences to preserve the separability principle On the othei 
hand, computational lequircmenls are considerable Moie recently, 
Yoshinc et al (18] have suggested another approach lor identification 
ol the G-model, this consists ol estimating, in the SISO case, impulse 
responses ol the linear subsystems and parameters ol the nonlinear 
clement 

In this paper, we present a recursive method to estimate parameter 
ol the linear and nonlinear parts of the G-model, which is an extension 



Fig 1 MISO Weiner-Hammerstein model 

ol our previous work in [5] We investigate a m w Innmil iiiou m the 
input output difference equation, from which a W 1 R| s algorithm 
is established Contrary to the approach developed m | 6 ] and |ll] 
the obtained algonthm has the advantage to estimate sepai aiel\ and 
recursively parameters of the linear and nonlincai subsystems ol the 
initial realization where parameter number is ol minimal dimension 
All these results aie extended to the MISO Wienci Hanmicisicin 
model Conditions for parameter convergence lo the actual ones art* 
established, it’s also shown that the weighted factors aic intioduced 
to enhance the convergence ol the ptoposed algorithm 

II Probifm Formulation 

Fig 1 shows the general scheme of the Wiener Hammeistem 
discrete lime model with / inputs The output signal m at time A 
is given by 


i/k = y\k + 4 '/ k 


(I) 


wheic if.K is the nonmeasured intermediate outpul related to the /th 
input 


with 


'/ k 


DM \ 
4 («r ') 


k 


1 * = (i 1 7 n 4- + f) M 7 k 


and 


7, = 


C ,($[_ 
D,(q 


■II k 


( 1 ) 

0 ) 

(4) 


The linear subsystems in the G-model are dehned as 


-1 or 

) = 1 4 0 1 q 

' + 

4 u q 

( 6 ) 

0,(7 1 

) = Go 4 Gi q 

' + 

4G, q 1 

(7) 

C,(.y 1 

) = < #0 4 ( j q 


4 tin q 

( 8 ) 

and 





D(q ' 

)= I 4 r/,i q~ 

' + 

+ <l.rv 1 

for / ' 1 1 (9) 

q 1 is the delay operator, 
instant A and orders w, 

a k is 

the /th input ol the system al lime 
and s, arc supposed to be known 


These can be estimated from the input output data |9| For simpliuh 
in the sequel, we note 4,( 7 —1 ) D,[q ~ 1 ) ( ,(</ 1 ) and D u/ 1 ) by 
4, B, ( , and D f , respectively, / = 1 / The linear lianslci 

functions D,{q ~ 1 )/ 4 ,(r/“‘) and ( ,((/"' )/D (</ 1 ) art assumul lo 
be stable 
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111 Somf Preliminary Transformaiions 
Before giving the mam results of this paper, let us make some 
important transformations which lead to a unique and equivalent 
realization of (1M9) 
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We remark thai there are an infinity of realizations, noted by 
Anft/iChfti, th'A equivalent to (A,. C, Dfj u ) for 
i =s 1 . * - ■ , r and j = 1, ■ ■ ■. m { . Indeed, the input output repre¬ 
sentation of the G-model may be written as 

Uk - .Via 4 - 4 y,k (10) 

with 


!l,k = -j-i 


(ID 

1 .id — k + ■ ■ 

■ 4 /a ( mi , Z' 1 '; 

(12) 

7 

Z,,d = — 


(13) 

, = 0, ,0 4 »!(/ 1 


(14) 

, = i\ ,o 4 c\, i q J 

1 H-f for i = 1 ,* 

■ ■, r 


(15) 


the obtained parameters are related to the previous ones by 

bij = tlb* 11 1 t'ij “■ fic, ij and (fij = j ^ (lb) 

for any I, and f, € i?*. Polynomials -4, and F, are those in ( 6 ) and 
(9). One interesting way to choose /, and t, is to set 

and t, = r <( ) (17) 

polynomials F, # and C fl are then in the following form 

B'Aq 1 ) = 1 4 b, ,\q 1 4 • - ■ 4 h t , ri q r ‘ (18) 

and 

C,A(/~ l ) = 1 4 + ■• • 4 for / = 1 . — . r. 

(19) 


,4=1 + ii i</’’ 

‘ + ' 

••+«»c 

(27) 

.0 = 1 + b t g~' 

+ •' 

• + K<f v 

(28) 

C = 1 + r |</ -1 

4- • ■ 


(29) 

0 = 1 + i/|i | _1 

l +- 

■ ■ + <//(/ 1 . 

(30) 


Into the difference form and by making use of (26) the output signal 
may be written explicitly as 

UK = F(0. u k , u A , Zk ). (31) 

9 contains parameters to be estimated of the realization (26) (30) 
and is defined as 

9 = (ai r, (32) 

F( 0 , //a - //a ) = -Ay a 4 <h BZ k + ■ ■ ■ + ;/,„FZr (33) 

Za = Cm -FZa (34) 

and 

.4 = n i r/ 4- ■ ■ • + 9 1 - D — d\q 1 -f ■ ■ ■ 4 <7 / q 1 . (35) 

We note that Za was omitted on the left-hand side of (33). Indeed, 
computation of Za is recursively done by (34) and will depend only 
on the inpur signal for all initial values Zn. --,Z/ (because the 
transfer function C/D is assumed to be stable). 

Recursive parameter estimation of the Wiener-Hammerstein model 
is given by the following theorem. 

Theorem /. A WERLS estimation of the parameter vector 9 of 
the Wiener-Hammerstein model is 


The final realization, where the index t is omitted, is summarized 
as follows: 


with 


and 


= yik H- 

4 u.k 

( 20 ) 

II 

, 

( 21 ) 

— .9/i Z,a 4 

■■•4r/„,, ( Z,7' 

( 22 ) 






(23) 

= 1 4 h, | q 

4 ■ ■ ■ 4 (f/yi, 9 P ' 

(24) 

= 14 r,\q~ 

1 4 ■ ■ • 4 ('i^q *' for / = 1, • ■ 

■. r. (25) 


^A + l =9 A + A A 1 (</A-H — F(9 A. I/a -j-1 • If A + I )) 

A a n ■= l\ fk 11 (fU i Fa /a + 1 4 Aa f i) 1 
Fa 4 i = (l - h a f i j/ , |) Fa 

with 

hF{9, I/A \ i, IIA f l ) 

/* + * =- - - 

{Aa h f L s a sequence of weighted factors that are positive numbers. 

Proof The WERLS estimation of 9 is obtained by minimi/ing 
the classical criterion function 

j * + ' 1 

■ A +1 = j - ( V / - .')/) 1 ( 36 ) 


We remark that the transformation in (16), (17) allows us to replace 
and e,,o by I, then parameters number to be estimated is n / 
instead of (nr A 2r) in the initial realization. On the other hand we 
can show easily that (20)-(25) is unique, this property assures the 
uniqueness of the identified parameters. Order it / is defined as 

U[ = / + II + III + A + jl 

With / = / 1 +-f /, , li = »i 4 -f/ 1,.111 = /lli-f- 1 - in,, s = 

•s i 4 ■ • - 4 *i and /> = j>i 4 ■ ■ ■ 4 ih ■ 

IV. Recursive Parameter Estimation 
of SISO Wiener-Hammerstein Model 

For the case of SISO Wiener-Hammerstein model, the output 
signal is 



with 


where tj, can be seen as a prediction of //, and is defined as 

if, = F{(i k . </ y , Uj) (37) 

where 9 A represents the parameter vector estimation of 9 by mini¬ 
mizing ./a. 

0 A.fi is then obtained by setting the derivative of ./a-h. with 
respect to 0 , equal to zero: 
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Then we can prove easily thal in the recursive form [ 14], wc obtain 


0*+i = 0 a + Am iUa-h - l (0* </h-j »A4i)) 

= A /a + j {fl+ 1A //41 H- V4-1) 1 
A+i = ( / - /w+.f*\,)A 


and 


/a-h = 

End of proof. 


<^r((9 (/n 41 w* 4 j) 


<S 0 


(41) 

(42) 
(41) 

(44) 


B 

1,1 ^ T 

K ,n 

( , 
Di " 

" + + '(v, 

;-) ') 




D j 

( C 

/r \ \ 

(45) 

4 

4 

4 1 


7 U"0 ) 

or 







n 

1 

1 (Ik 

- (v 1 / A 4 

1 

+ 'l s 1 ) 

(46) 

with 







F - 

- II 

4 , and / / = 

1 

¥ 

( 11 i — 7) 7 ii 

(47) 

As 1 1 

= 1 

1 

contains the term 1 

]^[ 1 is always in the form 

1 

n 

1 

A = 1 

4 4 

and F) = <1 \ q 1 

+ 4 

(48) 

\ is a polynomial nonlinear in parameters with a known structure 

Horn 

(48) and by making use ol (46) 

</a becomes 





in = r <0 (/a ti 

U ) 

(49) 

with 






0 = 

(ft 11 

<1 1 

L n 1 a b n 

6|y ! | h , 



< 11 

< 1 

1 ' , < (1 ll 

</i/i r/ 1 (If 



Vi 1 

Vh 

> , vi v ) ! 


(50) 

and 






1 (0 

1 /A Hk 

) = • 

-1 */a 4 r {q \Z 

1 

a 4 4 </ ^ ) 

(51) 

thus we 

obtain 

the 

same lormulation as 

in the nonlmcai SISO case 


Theoiem 2 II the assumptions 

.hm in,i, [A 'I - \ 

a — 

, wo 

'I" Two "' 


V Rfcursive Paramftfr Estimation 

OI MISO WlFNER HAMMrRSTEIN MODLL 

In this section a direct extension of the proposed method in the 
SISO case is presented At first wc introduce the signal vector as 
in 01) whcie 0 contains paiamctcis o( each subsystem ot the 
icalization (20)—(25) next wc use the same principle as in the SISO 
case 

From (20) the output signal may be written explicitly as 


(V 


06 ) 


1 - v^H-V < <n 41 < 1 H" \ A 
hold then the estimator given by Theorem I tnsui ihu 
Inn 0 * = 0 

A r*_ 

where 

i A 4 l ^ I } k \ 

-±k | l =1 -:-- 

fk 4 | Pk fk +1 4 

and ii m [A ] buA\ IA ] are the minimum and miximum ut’cn 
values ol [A ’] respectively oah will be determined lalu 
Proof Consider a quadratic function difined as 

W 41 ■= I A J | ^ + I (S7) 

with 

0/4 1 — 1—0 (58) 

By subtracting 0 from both sides ol (41) va obtain 

0*41 - 0 a 4 Pk fk h (tk \ \ Pk /* 4 1 4 \* j) 1 11 j ( 59 ) 

with 

r *4 1 = I/H 1 — I (0* Vmi (/ A 1 /) ( 60 ) 

On the other hand from (42) (44) we have 


Pk t\]ki 1 — V 41A /a 41 ( f* 41 Pk 1 k +1 f 1) 


(61) 


Substituting (61) into (SO) and (59) into (S7) respectively the 
quadratic function becomes 

. . "• 1 

1*4. = 


0A + T- Pk f I }k 41 ( A 4 I 

Aa + i 


r A + l 


1 


0* 4 A 1 // 4 1 ( A I 

'A 1 I 


(62) 


5 => A 4 - 1 - A 4 /u 1 ( M 1 

M4-I 


4 


— Li 


M 1 


From (42) (44) we have 


V 4 J 


(64) 


I lien (64) becomes 
ii+i •=»/n + 


s* + I 


with multiple inputs where 0 contains parameters ot each subsystem 
ot the initial realization The WERLS estimatoi of 0 is then given 
by Theoiem 1 

VI CONVLRGENCF ANALYSIS 

Several approaches have been proposed for the parameter conver 
gence analysis of RLS algonthms in ihe deterministic case Heie we 
extend the method developed in [ 19] to the general case ol polynomial 
leali/ation (49) 


4 T ~ — 0/ jk 4 1 1 A + J 4 v /*^4 1 Pk f I /a 41 ( k 4 I 

A* 4 J A J +j 

As // H 0* is an approximation of the innovation sequence w t 1 
we introduce an unknown factor a* H to correct this approximation 
such as 


/* 4 | 0 * = - 0*4 | f k 4 | 


( 66 ) 


for linear systems and in 119] we have /* — which is independent 
from 0 and then a* 4 , =1 Substituting (66) in (65) \*c obtain 


A+i = A 4 


iAfi/u 1 / 41 
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thus, if the condition (54) holds, we have 

r 2 ' -— + J—il* IA+. /*+. < 0 ( 68 ) 

Aa H f, 

and then 


bn < 1 A ■ ■ - < 1 n. 


(69) 


We note 


liu) I a = r. 

A —"x 


From the persistently exciting conditions (52) and (53), we have 

u ^ i .[/y ■]*/*» 


lt(P k ') Hi i m .^[P k 


> 0 


and then 


lint —■ . 

tr( PC') 


= ll>111 „.!=d 

*-"* »/ L.k [P k ] 

Jim h = 0 . 


(70) 

(71) 

(72) 


End of proof. 

Remarks: Hereafter, let us make some remarks. 

1) Implementation of the proposed algorithm in the case of 
Wicner-Hammerstein model with time delay. A particular case of 
the realization (l)-(9) is that 


b, {) = ■■•;= b, t -i =0 and b , ^ 0 
r |(J = • ■ • = < t | --0 and („ ^ 0 for/=.■■■. r 


this configuration depends upon the structure selection irom the input 
output data, where and u , are times delay related to the /th input. 

Polynomials /?,( 1 ) and become 

Ri(<J 1 ) = <1 1 (b , + b ,4 i q + ‘ ’ ’ + h v q l> ' ' ') 

C,(tJ ' ) ~ <1 +fr, «, iq~ ] + ■ ■ ■ -f t s,q~ *"') 

and in the proposed algorithm we replace simply Z ,a by Z ,a _ , and 
<Oa by u ,a - „ r . 

2) Initialization of the algorithm, the mean of the sufficient 
condition (54) for convergence, and how (52M54) can be ensured 
in the practical case. Indeed, most identification methods developed 
in the literature are based on a linearized model around a trajectory 
which is generally given by the last estimation. This approach is 
available only if the last estimate belongs to a neighborhood ol 
the actual parameter vector, the condition (54) represents then the 
validity of the linearized model In practice, if the proposed algorithm 
is adequately initialized (54) is generally fulfilled. Thus, a had 
initialization leads generally to various problems such as convergence 
to a local minimum, to a wrong estimate or instability [16], [17). 
Anyway, there is no approach which can be recommended as being 
universal to give an analytic solution to this problem. This fact is 
related to the many different unexpected features that ihe non linear 
relation can introduce. 

But. for the particular case of the Wiencr-Hammcrstein model, one 
way to overcome the initialization difficulty is to write the output 
signal as a linear parameter representation [51, [ 6 ] [ 11 |: 

f/k - (73) 



Furthermore, (52), (53) represent the persistently exciting condi¬ 
tions, some basic definitions are given in [17] and |I9]. However, 
in practice wc must take into account the constraints that physical 
limitations and safety may impose. Many strategies were developed 
to achieve a given input spectrum where the amplitude and variance 
arc constrained to a certain interval. Some experiment results are 
given in [71 and [10]. 

VII. Numericai Kxampif 

The numerical example considered here is composed of two 
subsystems with the nonlinear static polynomials 

In = <i\ i Zn -f (i\iZ\ * 

I 2A — !l2\Z>L -I- (UlZiL + <U \Z!k 


and the pulse transfer functions* 



Pi _ 

1 +/’ 

i 






4i 

1 +"!!'/“ 

1 + <i\ i<r 2 ' 






c. 

1 + < 1 1 9 

1 + Cl J q 






Pi ~ 

l + </l 1 q~ 

1 + d\i<r 2 






n, _ 

1 

Ci 

_ L±_ 

*'i\q 

1 



A, ~ 

1 + «JJ<7 

1 ' ih 

1 4- (h 1 <1 ' 

~ r +7i 

VI <] 

- 

with 

« II = 

0.5, <1,2 

= 0.35, <i>\ 

— 0.7./) 

1 1 — 

0.1, 

f 1 1 = 

-0.5 

.<'u = 

0.6.fj, : 

~ 0.S, r/ j i =■ 

0.G5. r/, 

2 = 

0.35, 

</.i = 

0.45, 

(1 21 ~ 

0.55.//j i 

- L2.au = 

— 1 .S, ryj 

! 1 = 

0.05. 

i <122 ~ 

1.3.-/,, = - 

-1.1. 







The input signals (i/u ) and {u 2 k ) arc zero mean white noise se¬ 
quences with standard deviations <t\ = 2 and a> — 2 . 5 , respectively. 
In order to show efficiency of the proposed algorithm, the sequence 
(Aa) is chosen large enough to fullfil condition (54) and then to 
enhance convergence of the algorithm, we take 

Ai+I = H )/7 A/* + 1 

where the initial estimates and Po are taken as 
iin = 0: A. = 10'J 1( ,. 


where o k is the data vector which contains only input and output 
signals and, where £a is an over parameter vector. Note that (73) 
is an equivalent representation of the model given by (33) with an 
overparameten/ation. Next to that the use of a least square scheme 
provides a consistent-like initial parameter vector which may initialize 
adequately the proposed algorithm, see also [3] and [19], 


Numerical results are given in Figs. 2-4 where the true values 
of all parameters are reached about 1800 samples. Fig. 2 shows the 
rate of convergence of parameter error norm: ||0a — 0||. Owing to a 
lack of space here, only the estimates of two parameters are given, 
for example: rai and qu which are represented by Figs. 3 and 4, 
respectively. 
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Fig. 4. Parameter estimation of <j\ i- 

VIII. Conclusion 

A simple method for recursive identification of MISO 
Wiener-'Hammerslein model was performed. It is shown, by 
means of an attractive transformation, that parameters to be 
estimated are those of each subsystem of the initial and unique 
realization. Sufficient conditions are given to assure the convergence 
of the algorithm where the sequence {A a } is introduced to enlarge 
the interval [1 - 1 + n/Ah + i] and then to enhance the 

convergence of the proposed procedure, this was illustrated by 
means of a numerical example. Finally, some important remarks to 
deal with the initialization problem, in order to ensure condition 
(54), are given. 
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Production Rate Control for Failure-Prone 
Production Systems With No Backlog Permitted 

Jian-Qiang Hu 


Abstract —Previously, the problem of optimal production rale control 
for failure-prone production systems has been studied exclusively under 
the assumption that backlog is permitted. It is well known that when 
backlog is permitted, the optimal control is usually the hedging point 
policy. In this note, we consider systems in which backlog is not allowed. 
We show that the hedging point policy is still optimal. For systems 
with backlog, it is usually quite straightforward to show that their 
optimal cost-to-go functions are convex—a key property that is needed 
for the hedging point policy to be optimal. With no backlog permitted, it 
becomes much more difficult to establish the convexity property, and the 
explicit formulas for the optimal hedging point and the optimal cost-lo¬ 
go functions have to be obtained, based on which the convexity property 
can then be verified. The method we use in this note to derive these 
explicit formulas is mainly based on an interesting relationship between 
the inventory process of the system under the hedging point policy and 
some stochastic process which is welt studied in queueing theory. 
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I. Introduction 

In this note, we consider the problem of optimal production rate 
control for failure-prone production systems. This problem has been 
studied extensively by many researchers in the literature (for a 
review and a comprehensive list of references on the subject, see 
a recent monograph by Gershwin [4)). A key assumption used in 
all the previous work is that backlog of demands is allowed. This 
assumption usually implies that the instantaneous cost function on 
inventory level (negative inventory being backlog) is convex (in most 
cases, it is either piecewise linear or quadratic function of inventory 
level), which can immediately lead to convexity of optimal cost-to-go 
functions based on a simple sample path argument. Using the convex 
property and the dynamic Bellman equation for the optimal cost-to- 
go functions, one can then easily show that the optimal control is a 
hedging point policy, in which a (nonnegative) production inventory 
level of part types is maintained during times of excess capacity 
availability to hedge against future capacity shortages brought about 
by machine failures. Intuitively, the hedging point is safety production 
inventory and is carried to protect the production process from the 
uncertainty in future capacity availability. 

Although the backlog assumption is reasonable for many systems, 
it may not hold for a large class of production systems in which 
unsatisfied demands are lost. One such example is a production line in 
which machines are linked in tandem. In such a system, the demand 
rate for each machine (except the last one) is simply equal to the 
production rate of its downstream machine, and obviously, in this 
case, production inventory level between any two machines can never 
become negative (i.e., backlog is not permitted). This motivates us to 
consider the problem of optimal production rate control for systems 
in which backlog is not permitted. As we shall see, for a system with 
no backlog permitted, it becomes much more difficult to establish 
convexity for its optimal cost-to-go functions. This is simply because, 
in this case, the instantaneous cost function on inventory level is no 
longer convex, hence the simple sample path argument used before 
when backlog is permitted cannot be applied. Instead, the explicit 
formulas for the optimal cost-to-go functions have to be derived. 

To derive the optimal cost-to-go functions, we first focus on 
hedging point policies and try to obtain the optimal hedging point 
policy and its associated cost-to-go functions. Clearly, if the optimal 
control policy is a hedging point policy, then the optimal cost-to- 
go functions are simply equal to the cosl-to-go functions associated 
with the optimal hedging point policy. To obtain the optimal hedging 
point policy, we use a method which has recently been developed by 
Hu and Dong [51. |6| to study the systems with backlog permitted. 
The basic idea of the method is to establish a relationship between 
the inventory processes of failure-prone production systems and some 
well-studied stochastic processes in queueing systems. In our case, wc 
show that the inventory process of the system under a hedging point 
policy is related to the workload process of a single-node queueing 
sysrem with limited workload. Based on this relationship and existing 
results in queueing theory, we can obtain the probability distribution 
function of the inventory process, from which we can then find the 
optimal hedging point. Finally, with the optimal hedging policy and 
its cost-to-go functions, we can use the verification theorem to verify 
that the optirpal hedging point policy is indeed optimal overall. We 
point out that it is also possible for one to obtain the probability 
distribution function of the inventory process for our system based 
on a different approach used in |l], (2), [8|, in which a system of 
two differentia] equations needs to be solved under two boundary 
conditions. However, the method used in fl], [2], |8J can only be 
applied to Markov systems (i.e., both machine up and down times 
have to be exponentially distributed), while our method is applicable 
to non-Markov systems as well. 


The remainder of the note is organized as follows. In Section 11, 
the problem of optimal production rate control for the failure-prone 
production system with no backlog permitted is formulated as an 
optimal control problem whose optimality condition is given by a 
dynamic Bellman equation. In Section Ill, we establish an important 
relationship between the inventory process of the system under a 
hedging point policy and the workload process of a single-node 
queue with bounded workload. Based on this relationship, we obtain 
the probability distribution function for the inventory process. We 
then proceed in Section IV to derive the explicit formulas for the 
optimal hedging point, the optimal value of the cost function, and the 
optimal cost-to-go functions. In Section V, we present the verification 
theorem, based on which we prove that the optimal hedging point 
policy obtained in Section IV is the optimal control policy. Finally, 
some discussions and future research directions arc presented in 
Section VI. 

11. Problem Formulation 

We consider a production system which has a single machine and 
produces a single part type. The system tries to meet a constant 
demand rate <1, and backlog is not permitted, i.e.. unsatisfied demands 
are lost. The machine has two states: up and down. When the machine 
is up, it can produce at any rate between zero and a maximum rate /■. 
We assume that r > <I . The machine state changes in continuous time 
according to a homogeneous Markov process: the stale changes from 
down to up at a rate and from up to down at a rate q\ (i.e., both 
machine up and down limes are exponentially distributed with rates 
f/i and qu , respectively). We use i € {1. 0) to denote the state of the 
machine, where one corresponds lo up and zero to clown. Denote the 
production inventory at time t by ,r(f)(> 0). Lei /(/) be the state of 
the machine at time t. and let //(f) he the controlled production rate 
of the machine at time t. Then ./■(/) is characterized by the following 
differential equation: 

<lr{1 ) _ fn(f)-r/. if /(/) = lor ,v[t) > 0 

dt jo. iT i(t) — 0 and .r( t) = 0 

where u{t) 6 [0. r\ when i(1) = l(//(f) F [</. r] if ,r(f) - 0), 
and u[f) — 0 when /(f) - 0. The objective is lo (ind u stationary, 
feedback control law, //(.r, /), so as lo minimize the following long- 
run expected average cost: 

./ = Km e|( 1/D jf (<■.,•(/) + <- 0 ](r(/) = 0. i(t) = <)))rff j 

( 2 ) 

where r is the unit holding cost for inventory, m is the unit shortage 
cost for unsatisfied demand, and 1( ) is the indicator function. It 
should be pointed out that the instantaneous cost in the objective 
function, </(./•, i) = r.r -I- 1 (./■ = 0. i ~ 0), is not a convex function 
with respect to j\ 

Under suitable regularity conditions imposed on the control (c.g., 
[7J), the optimal feedback control of the system is stationary and 
characterized by the following Bellman equations: 

mill { - - U - <l)\ - qi [I i (■»') - Ur ){.!•)] -I- r.r - J* = l). 
i«nc,i<, <Ij- J 

for ./■ > 0 (3) 

- flto(r 0 |r) - !’,(.•)] + C.v - .7* = 0. 

for .r > 0 (4) 

where V,(.r) (# = 1,0) are (differential) cosl-to-go functions and 
J* is the minimum long-run expected average cost associated (i.e., 
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- minJ). According to (3), if Vi(.r) is convex with minimum 
point A then the optimal production rate satisfies 


0, 

it .r > and i = 1 


(1. 

if .r = : ‘ and i = 1 

(5) 

r, 

A 

§ 

o. 

It 



wliJ'h is hedging point policy, and is the hedging point. If the 
instantaneous cost function were convex, then one could easily show 
that I i (.r) is also convex based on a simple sample argument ([ 10J). 
However, as already mentioned above, the instantaneous cost function 
in our case is not convex, hence the explicit formula for \ i(.r) has 
to be obtained in order to establish its convexity. 


*(t) 



III. Hedging Poini Polic y 

In this section, we focus exclusively on the hedging point policy 
defined by (5) with hedging point :. We want to derive the steady- 
state probability distribution function for the inventory process under 
the hedging point policy. Our method consists of two basic steps. 
First, wc establish a relationship between the inventory process and 
the workload process of a single-node queueing system with bounded 
workload. Second, we use this relationship to derive the probability 
distribution function for the inventory process based on existing 
results in queueing theory and a level crossing technique. This method 
was first used hy Hu and Dong \5\, |6| to study systems with backlog. 
It is worth pointing our that a different approach proposed in (1], |2J, 
[K] can also be used to obtain the probability distribution function in 
our case. The basic idea ot their method is lo solve a set of differential 
equations. However, we note that the method ot 111, [2|, |K| can only 
he applied to Markov systems, while ours is applicable to non-Markov 
systems as well (i.e., the machine up and down times do not have to 
he exponentially distributed). 

To start, we lake a close look at a sample path ot the system under 
the hedging point policy. For simplicity, let us assume i (0) = : and 
>(()) = 0. Denote hy t.i „ the length of the nth down time and hy 
t„ a the length of the nth up time. Therefore, t' f i „s and „s arc 
u.d. exponential random variables with rates </ ( , and q\ Let f j „ 
be the beginning ol (he epoch of the nth time the machine is down 
and 7 „ „ he the beginning ol the epoch of the nth lime the machine 
is up, i.e., 

ti- I 

T,i 1 1 — / -F ,) and 7 „ „ -- T,i „ -F t,i (b) 

i i 

To simplify notation, wc denote 

l(f/ „ = . r (T<, „ ) and .r (l „ a{T ( , „ ). 

We then have the following recursive equations for .i,/ „ and .r„ M : 

„ „ - max (.r,/ ,, — 1<i ,,(l • 0) (7) 

j',i „ n = min(■. a „ „ + tn »(* - <D) (8) 

with .r,/ i = Equation (7) represents the unique dynamics when 
the machine is down, where “max" means that the inventory level 
cannot fall below zero (i.e., backlog is not permitted). The “min" in 
(X) represents the hedging point policy, namely, when the machine 
is up, the inventory level increases at rate r — </ until either it hits 
the level ; where it remains until the machine breaks down, or the 
machine breaks down before it hits the level The trajectory ot 
is illustrated in Fig. I. 

Define the following iwo random lime transformations: 

o,/(f) - [ 1('(") = 

Ju 


*u(t) 



Fig. 2. The trajectory ot .i r /(f). 


= [ l(/(*) = 1M* 

Ju 

and their inverse functions 

T,i(t) “ oj’ff) - inf { t: vAt) > /( 


r„(t) = v„ 1 (/ ) = inf { t\ (.j„(t) > t ). 

Define also two processes {.r,i(/): / > ()( and 1 > D} as 

.!-,/(/) - .r(and .r„(0 = .i(r„(/)). 


Intuitively, [respectively, r/> #J (f)] is the total down (respectively, 
up) time of the machine during the time interval [0. /]. The process 
{.r,i(0: t > 0} (respectively, {.!■„(/): t > l)[) corresponds to the part 
of the process {.r(/): / > 0} when the machine is down (respectively, 
up). The process t > 0} is illustrated in Fig. 2. 

It is not too difficult lo verify that ./*,/(/) is in fact the same as 
the workload process of an M/M/ 1 queue with hounded workload, 
in which the inlerarrival times arc {t,i M : n = I. 2.’*}, the service 
requirements are {t„ „(r — r/ ): n = 1, 2. ■ • ■ (, the service rate is </, 
and the upper bound on the workload process is Hence, the steady- 
state probability distribution function ol the process {./,/(/): / > (H 
is given by 





■i > : 

0 < j- <r : 
. 1 * ' 0 


(9) 


where 


J_ _ 

r - J 


and (> = 


quit' ~ J ) 
q \<1 


(Sec Takacs [9].) Next we use F\ H (r) lo calculate F\ (.»'), the steady- 
state probability distribution function of the process 1 ■£ 0}. 

Wc use f\ (.r) and ) to denote the density functions of l \{ >') 

and F\ rf (.r), respectively. 

Define D, (t) to be the downcrossing counting process of .«(/) at 
level .!■ during the interval [0. /], and F,(t) the uperossing counting 
process. Define to ~ 0, and 


t„ = inf {f: .»(/) = /(/) = 0. / > i 1 for " > 
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By definition, it is clear that t > 0} is a regenerative process 
with regenerative points {/„. »/ >0). Based on the regenerative 
theorem, we have 


Fa (.!•)== 




m 


Note that 


s: 


1 (,r(s) < ,r -f ) r/s 


-/V 


(a) < .r) ds 


is the total time that the process .r(f) takes value between the interval 
[j 1 . .*■ + ^.r) during [0. t\]„ which is also equal lo 

r - d <1 

We further note that U,(t\) = D, (f j) for 0 < .r < Therefore, 
we have 


IV. Optimal Cost-to-Go Functions 

To obtain the optimal cost-to-go functions, let us assume through¬ 
out this section that the optimal control is the hedging point policy. 
This will be verified later in the next section. Let i* be the optimal 
hedging point, then (12) tells us that it must satisfy the following 
equation: 

r.-^r [rf- ( r + rfl, 

1-p (l-/') 2 

+,>lr + = 1) (13) 

(this is simply due to dJ /</: — (),) and furthermore. 


J-* ■= (■■:* H-^ 


<1 1 + 'll 


(14) 


^-e\I l(.r(s) <.,■),h] = £•[/},(/,)]. 

dr L./ n J (r - (l)<1 

This gives us 

f , r] _ dF.sir) _ >■ F[P,(/ ,)] 

/U ' £[/,] 

_ /• ZT[<?w( /1)] F[A, (/ ij[ 

r - rl rf/ij <IE\o,i(U )] 

wx* 1 "o> 

r - d qu + '/i d )] 

The above method, which we used to derive (10), is the standard 
analysis of level crossings (e.g., see Cohen |3, p. 363-3731). Using 
the same method, we can also show that the most right-hand side of 
(10) is equal to f\ tl ( ,r). Therefore, we have 


It is worth noting that :* is the unique solution of //(--) = () (because 
<1h{ i)/<h > 0). However, / ~ i) if the unique solution of /;(;) = 0 
is negative (which is the case if /»(()) =• nl - cm/i > 0). Since we 
have assumed that the optimal control is the hedging point policy, 
wc have J* = , and furthermore, the Bellman equations (3) and 

(4) reduce to 


d\’{.r) _ f .4i V(.r) - b\(J* - r.r), 
il.v ’ -n). 


for 0 < .i ^ 
for .i > / 


(15) 


where 


/a U)- 


1 ~ 


-/.(i -,.i« 


/ _II_ _ JU_ \ /_ 'LL ILL 

I 4, = f 'i 

l nt -in r - l in -in 

V ,i ,i / \ ,i 

wilh boundary conditions 

(‘n ~ J -3“ (/o (3 j () — \ n (())) = 0 


V — d rjn -f ryi 1 — /a 1 -/*) 


lor 0 < J' < :(). 

( 11 ) 


We now calculate F\ (0) and F\ (:). Again, using the regenerative 
theorem, we have 


(16) 


(17) 


F\ (0) 


= -i 

_ E[o ,,(/,)] - 0)</<^,(s)] 

F[t\] Flt^iit j)] 


<■:* -r — </! (Ti (; *) - lh(/)) = (). 

Solving (15), we have, for () < .r < :\ 

«V), £*••)> m* (% - iy,(r. - > r ‘) 


V>, 


.7*0 + -!_(/“ - i) 

M 


(IK) 


7i 


7o + 71 


Fv ,,(())= 


1 -/> 


and 


F\(~)= 1 - F.\ (0) — / /\ (.r) r/.r = - 

./ii (/( 


(/n + i/i 1 - /« c> 

>/,i (!-/»)«- 1,11 

+ 7i 1 -/m = 


Finally, we calculate the long-run expected average cost .7. asso¬ 
ciated with the hedging point policy. Based on (2). it is quite clear 
that 


where k\ — //(1 - />). and for > 

V,,) = (/ - -^ + ')h=-» 

- ^.7*(.r - :*)- J, 

where - - (r/i> + </i )/</. 

Based on (IK) and also using (16), we have for 0 < .i < :* 


(19) 




t = r f ,rf\ {.r)d.r -f r () F.\ (0) 
Jo 

1 


r,(.r) =. r,(U)-t- 


71 


(70 + 71 )(1 - /« -!•(!-/<)-) [ 


^,71 (1 - />) 


A’i (r — f/)70 
7o + 7i (74 1 

" Mirr-rf-jU J - 2 " ) 


(.7* -!•»)(. -1) 


71 r 


+ c f > 


1-P 


-(-/» + </,)v(-' ,n -' ,): |. (12) 


; . ( 12 ) 


Af !/(/■-(/)* 


Li,*e 


- |.7’(1 -i 1 ')- c.r + f-(( 

«‘J 


1) 


( 20 ) 
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and 


d\ i ( 0 

<l\ 


__ ( ki 


Air/o(l -rf)* 

■rTr^i 

A i f/(i - a ) 


+ 


(/ - d)qo 
tqi i 


i —iI) 2 




( 21 ) 


Substituting (q and J* from (11) and (14) into (21), we obtain 


d\\( i) _ < (go + <1 1 )(i - d ) , j ( — a i( 

(I i itfd(\ - p) 2 

f (go + gj ) , * _ , 

A , d (/-,/) 1 


loi ()< i < 


( 22 ) 


systems is to decompose them into many simple subsystuns 
systems with only one machine The interaction among these simpi, 
subsystems can be formulated through the lotal demand iuic to eadi 
subsystem Clearly, Pui some subsystems backlog ot demands will 
not be permitted, especially it they are suppliers to othei subsystems 
One example is a production line in which machines ne linked 
together in tandem In such a system the demand rate 1m each 
machine (except the last one) is simply equal to the pioduction late 
ot its downstream machine hence backlog vs ill not be permitted 
Out results can give optimal instantaneous production rates tor those 
subsystems in which backlog is not peimitted Hence, they can be 
used to develop optimal or near-optimal designs on production rate 
control lor complex systems when combined wilh other methods 


Similarly, by combining (19), (17), and (14), wc obtain lor i > 


li(D = li( *) + 


r - 


A id 


k ) 


-1] 


<(r 


( ') ) - r<, - *) 


( 21 ) 
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Rimrinus 


and 


r/li( 

di 


i) 


k ( 


go 4- gi 


) -l)+-r(l - 
<1 


(24) 


V Thf Vfrimcation Thlorim 

Iii this section we verily that the optimal hedging point policy wc 
obtained in the previous section is (he optimal contiol We fust need 
the following erihcation theoicm 
Unfidition riuoum For the system undei considciation sup 
pose the control u*(i i) is defined by (S) where * is given by 
(II) If there exist continuously dillerentiable functions 1 (i ) on 
i e (0 "v ) and a constant V such that the Bellman equations (1) 
and (4) are satisfied and 1 (i) < (whcie a and b arc two 

constants) then n*( < 0 is the optimal contiol which minimizes the 
long run expected average cost defined by (2) 

I he proof of the veuhcation theorem is essentially the same as the 
one given in [2| which uses the Dynkm foiinula Hence wc will not 
lepeal it here For the optimal cosl-lo go Junction I i( i ) winch we 
obtained in the previous section we can easily verity [based on (22) 
and (24)| that it is a convex function with minimum 4 and also it 
is continuously dif leienliablc Hence based on the above verification 
theoicm the optimal hedging point policy is indeed optimal 
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VI CONCIUSION 

We proved that even when backlog is not permitted, the optimal 
control foi the failurc-pione production system remains to be a 
hedging point policy We demonstrated that the traditional sample 
path argument is no longer valid in proving that the optimal cost 
to go functions are convex functions instead the explicit formulas 
have to be obtained To obtain the optimal hedging poml policy and 
the optimal cost to go functions, we first established a relationship 
between the invenlory process ot the system undei a hedging point 
policy and the workload process of a smgle-node queueing system 
with limited workload, and then used it to obtain the probability 
disinbution function of the inventory process based on some existing 
results in queueing theory and the level crossing technique 
An immediate application of our results m this note is to lailure- 
prone production systems with many machines connected in network 
One way to study the optimal contiol problem for these complex 
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Relative Stability of a Linear Time-Varying Process 
with First-Order Nonlinear Time-Varying Feedback 

J. S. Ansari 


Abstract—A method of establishing sufficient conditions for the stability 
of a system consisting of a time-varying linear process and a first-order 
nonlinear time-varying feedback is presented. Rate of decay of unforced 
response and BIRO stability can also be determined. The governing 
equation of the system Is represented in the form of an integro-differential 
equation, so that a modified form of (he Lyapunov-Razumikhin-type 
condition can be applied. The first part of the note deals with stability of 
vector functional equations. The results obtained are applied to feedback 
systems. An example is worked out to show the applicability of the results 
obtained. 


I. Introduction 

Methods such as circle criterion and Popov s criterion are available 
for establishing sufficient condition for the stability of systems 
consisting of a linear process and zero-order nonlinear feedback 
fl], |2|. No general results arc available for linear time-varying 
process with first-order nonlinear, time-varying feedback systems. 
The method presented here is applicable to such systems. In particu¬ 
lar, linear processes with an arbitrarily varying parameter can also be 
considered. As such, the results may be useful for studying stability 
of adaptive control systems. 

The note is in two parts. The first part presents a method of 
establishing an upper bound on the norm of the solution of a vector 
functional differential equation. The results obtained are then applied 
to a linear time-varying process with a first-order nonlinear, time- 
varying feedback. 

The note presents two points. It shows that the Lyapunov-Razum- 
ikhin theorem, which is essentially applicable to functional equations, 
can be applied to feedback systems governed by ordinary differential 
equations. This is done by rewriting the system equations in integro- 
differential form. Some systems which cannot he analyzed by other 
methods can be studied by this method, The other contribution is that 
the Lyapunov -Razumikhin theorem has been extended to included 
inputs and, furthermore, a method of establishing the rate of decay 
of solutions is also included. 


II. Lyapijnov-Ra/umikuin-Typf Thporem 
L et 


Lyapunov-Razumikhin sufficient conditions for stability require 
that [4] 

Vltp(t)) < -n < 0 

for all continuous test functions, j>, that satisfy the inequality 

V("< !>{*)) <V(1.p(t))-s£l(t). 


A modified form of a Lyapunov-Razumikhin-typc condition is 
presented here in the form of a theorem. The theorem gives sufficient 
conditions under which V{1* yit)) < H(t). for all / > t„: where 
//(/) is a function of our choice. By choosing H as an exponentially 
decaying function, we can establish the rate of decay of the response, 
which is not possible through the original condition. Moreover, the 
Lyapunov-Razumikhin theorem had not been used to study stability 
of finite-order linear processes with nonlinear feedback, because it 
is not applicable to ordinary differential equations. In order to apply 
a Lyapunov-Ra/umikhin-type condition, we shall use convolution to 
represent the governing equation in an integro-differential equation 
form. 

Let //(/) be a solution of equation (I), and V(L i/(/)) be a scalar 
function fulfilling the following conditions. 

i) 

R'(Ill/ll) < V(L //) < r(||//||) (2) 


for ||</|| < 1„. / > —7(fn) where <r(.s) and 1 (s) are increasing 
functions of * for 0 < .s <. Y (t . and ir(0) = /’(()) - 0. 

ii)V(f. n) is differentiable with respect to t and by all elements y, 
of the vector //. We denote 

/ 0Y (IT 0Y\ 0} 

\ Oil, ()</■_. + i)f/n ) ()}) 


iii)<)V /(It and all OV/Oy, are continuous with respect to t and if,. 
for t > t„ and ||,v|| < //„. 

Theorem. If for a continuous function 77(f) i(||// fl ||), with 

continuous derivative, 


lim 

S -f 


0\'(s, j>) 0\ 

Os + Oil 




< flit) 


V/ > (3) 


For all test functions p, belonging to set / 7 (f), where Pit) is the set 
of all vector continuous functions for which the following conditions 
are satisfied: 


Y(t.p(t)) = H(1) 


(4) 


it(t) - /(,*//. t) (!) 

represent a vector functional differential equation of the retarded type 
|3), where //(/) is an (n x 1) vector, and y t in the argument of 
/ implies dependence ol / on r/(.s), t - 7 (t) <. .s < t. Function 
[t - T[t)] is nondeercasing. Interval [t - T(t). f] is represented by 
lit). ||f/|| represents Euclidian norm in IV'. Equation (1) is such that 
y = 0 is a solution. If y[ s) is given in the initial interval, /(f„), then 
Hit) for t > t„ is determined by equation (I). It can be shown that 
the solution is continuous. A scalar function \ (/, y) is chosen such 
that « (||i/||) < Y(t. y) < i’t||i/||), where n» and » are increasing 
functions, and ir(0) and r(0) are zero. If Y tends to zero with time, 
then it follows that \\y\\ also lends to zero. 

Manuscript received May 4, 1993: revised October 20, 1093. 
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Y(s.p(s)) < His). for t-T(t) < s < 1 (5) 

then 

V(/.!/(/))< Hit). for all t > t„ (6) 

provided the initial conditions are such that 

Vis. vis)) < H{t„). for * G 7 it,,)- (7) 

Proof: It is given that initially 

V(i„. y{t „) | < Hit,,). ( 8 ) 

Suppose there exists a 1 1 , such that f \ is the smallest value of f > fu 
for which 

)) = //(/,). (9) 

It follows that 

H/. ij(1)) < H[t). t u - Tit,,) <t<ti (10) 


(X) 18“9286/95$04.0O ® 1995 IEF.E 
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and that y(t) belongs to the set P(t i), where P is dehned by 
conditions (4) and (5) Hence, we can substitute t/ in place of p 
in condition (3) to get 


Inn 

»t 


cM (s y) 
ds 



j('h *) 


< Hit i) 


( 11 ) 


According to (1), /(</„ s) is equal to i/( s ) Hence the left hand side 
of inequality (11) is equal to \ and inequality (11) can be written as 


lmi \ (» i/( s)) < H(t I) (12) 

—'i 

At the same lime it follows from relations (9) and (10) that 

Urn \ (s i/ ( s )) > /T(/, ) (H) 

" 'i 

which contradicts inequality (12) Hence assumption (9) is not true 
and 1 is bounded by H 


III Appi k ation To FYedback Systi ms 

Consider a system consisting of a time varying hneai nth order 
asymptotically stable plant coupled with nonlinear time vaiying teed 
back Let the plant be governed by 

u(t) = I q(t t + .<(/ 1 ) (14) 

whcic <i is the impulse response function and <>(f 1 ) is the initial 
condition response and I is ihe (n x 1) vector representing the 
initial state For convenience we shall write (14) as 

il-tlidt 1 ) (14a) 

Let the feedback be given bv 

i/ + nt u) = \ it 0 - 1/)) (h) 

whcic / is the icleieiicc input Functions F(/ i ) and \(/ i) 
are nondecreasing with respect to < and \ (t 0) and T(/ 0) arc 
identically zero 

Combining (14a) and (IS) gives 

u = -h(f 1 (i - tjim } ))) (16) 

Let us choose 


1 (/ «) = a 2 (17) 

and 

HU) - n , '' (18) 


Since the linear plant is asymptotically stable we can hnd a ${t) 
such that the initial condition response 

\o(1 } )| <6 *(f) t > t (19) 


if 


Hi II ' 

Let the input by bounded by 

|'(/)|<^ t>t» 


( 20 ) 

( 21 ) 


Coiollaty I If 


±D '< x '\ 


+ tint) + nI' |i/(/ i )| ( Xl // j 


< — A -f 


b(t ±lh x ') 
±Bt K ' 


(2^) 


then 


|»(/)|<- n, " i <i r 

where T can be inhnity provided 

||) || < f |# (f)| < A and \u{t)\ * B lor t(-J(t 


Cok)IIui\ 2 II loi any B > 0 and b - () wc Lan lincl positive 
A and t to satisfy condition (22) for all / "> / then the unioiced 
solution ly = if - () is asymptotically stable 
Cotollats 1 If foi any B ^ 0 and \ — 0 wc can hnd posmvc 
b and f to satisfy condition (22) lor all / ^ t then the system is 
stable in the sense ol hounded input bounded output (BIBO) 
Rcmaik Corollaries follow immediately and obviously Irom the 
theorem Therefore proofs are not included 


A b \amph 
Lei 


V + (u + M V + <ibil — ( m 


(2T) 


and 

// + /(/ u) - \ (1 (i - iy)) (24) 

where h > a > 0 # > 0 \ (t D) - T(/ ()) - 0 b (f i) and 
A (/ i ) are nondecreasing with respect to * I el initial conditions 
be such that 


|n<0)|' B ||I (0)||<* 

when ) (0) = (</(()) i/(0)/M' Let |;(f)| _h I Ml 

1 valuating f/ and <P and substituting in inequality (22) wc gel 


t 6+ B< ex - p ( ~ N 
\<i\ ) b — a 


-\t) 
\) 




(2S) 


If ti = 2 h - 3 i = 1 and 
/ (t u) . 


10 r J t * 0 


and we choose \ = 0 A b -= 0 < — B/G^/2 then inequality (2^> 
becomes 


\{t ±i) 
±i 


17 


l- 0 62")r 1 11 + 2 7"if 


(20) 


wheie 


i — 0 20(>Hi 0 ' — 0 1G2 j/7. *'+()7<U. 11 

If inequality (26) is satisfied foi all / > 0 then 

Mol D< ' ' 


Application of the above theorem to (16) shows the following 


t > 0 



m 
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BIBO Stability: If the values of n, 6, c, and F are as chosen above, 
and we choose A = 0, h = J3/2, then inequality (25) yields 


A’(L ±j-) 15 

±.r < l+r- 3 '(') 


(27) 


where 


.v = h + (D/G) + (2D t ~' u /3). 


If condition (27) is satisfied, then 


Properties of Optimal Weighted Sensitivity Designs 

Kathryn E. Lenz 


Abstract —?Weighted sensitivity designs for a class of slngle-lnput/single- 
outpuL linear, time-invariant systems are shown to have the property that 
the optimal controller has fewer right-half plane poles than the plant 
has right-half plane zeros. Weighted complementary sensitivity designs 
are shown to have the property that the optimal controller has fewer 
right-half plane zeros than the plant has right-half plane poles. Effects 
of various choices of weighting functions on the optimal solutions are 
described. 


|ti(/)| < U />() 

provided 

|/*(/)| < D/2 />(). 


B. Comparison to the Result Obtained Through the 
Original Lyapunov-Razumikhin Theorem 

It is of interest to note that the application of the original Lya¬ 
punov-Razumikhin condition to (23) and (24) gives the following 
sufficient condition for asymptotic stability of the unforced solution: 


±/r‘.v 


±cD /&-</ 
h — a \ ah 


( 


-fit 


(I 



< 


F(±D , t) 
±D 


— n 


x VD. B„ > n > 0 


(28) 


where n is positive and as small as we like. 

For asymptotic stability with infinitesimal rate of decay, A in 
condition (25) can be taken as infinitesimally small, and h is zero. It 
can be seen that as A and a tend to zero, conditions (25) and (28) 
approach each other. Thus, the result obtained through the original 
Lyapunov-Razumikhin theorem is a special case of the result obtained 
through the modified theorem. 


IV. Conclusions 

A method of applying the Lyapunov-Razumikhin sufficient condi¬ 
tion for stability to feedback systems governed by ordinary nonlinear 
differential equations is presented. The results obtained show that the 
meLhod is useful in that it is applicable to some systems which cannot 
be analyzed by other known methods. 

A modified form of the Lyapunov-Razumikhin condition is pre¬ 
sented. which establishes rate of decay of unforced response. Inputs 
can also be considered and BIBO stability studied. 
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1. Introduction 

Using a result due to Poreda [71, we derive basic frequency 
domain qualitative properties of the optimal .solutions to weighted 
sensitivity minimization and weighted complementary sensitivity 
minimization problems for single-input/singlc-output. linear, time- 
invariant systems. We show thal under certain continuity assumptions 
applied to the plant and weighting function, weighted sensitivity 
optimal controllers have fewer right-half plane poles than the plant 
has right-half plane zeros. Conversely, if a given stabilizing controller 
has fewer right-half plane poles than the plant has right-half plane 
zeros and makes the magnitude of the weighted sensitivity function 
flat across frequencies, then Ihe controller is optimal. 

The analogous result for weighted complementary sensitivity de¬ 
signs is thal (he optimal controller has fewer right-half plane zeros 
than Ihe plani has right-half plane poles. Conversely, if a given 
stabilizing controller has fewer right-half plane zeros than the plant 
has right-half plane poles and makes ihe magnitude of the weighted 
complimentary sensitivity function flat across frequencies, then the 
controller is optimal. 

Certain weighted sensitivity and weighted complementary sensitiv¬ 
ity problems are not optimized by a stabilizing controller, even though 
the optimization is done over the set of all stabilizing controllers. 
We describe situations in which this can happen for plants wilh j f? 
poles or zeros. We also show that if one wishes to obtain a stabilizing 
controller as the optimal solution to a weighted sensitivity or weighted 
complementary sensitivity optimization problem, one should not use 
a weighting function with a j ft zero. 

II. Notation, Dfhnitions, and Assumptions 

Denole the imaginary axis of the complex plane C by j ft and the 
imaginary axis together with infinity by j ft. Wc say thal * is in the 
right-half plane, t’ + , if Re(.s) > 0. Correspondingly, .*• is in the left- 
half plane, C , if Re(.s) < 0. The subset of functions in L % on j ft 
that are also continuous on j '\ft will be denoted by 7,, v . The space of 
functions thal are analytic in C 4 and extend lo functions in L' on 
j ft will be denoted by II ' . The functions in II v having continuous 
extensions from to C + U j ft will he denoted by .\ v . 

For M a function of the complex variable s, M* will denote 
M(—s). A function M £ II x is inner if M*.\( = I a.e. on j ft. A 
Blaschke product 7J(.s) of degree w is an inner function of the form 
B(s) = h IT/! | (1 -T r s )/(1 + ijs), where |A’| = I and £ C* for 
t y = 1, 2, ■ ■ m. For B a finite Blaschke product, D* = IV ‘. 
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A function -V € H *■ is outer if the closure of XH 2 is H 2 , where 
jj : is the space of functions that are square integrate on j ft and 
analytic in C ¥ . A function F £ H* is a unit in H v if F" ] £ II x . 

If there exist functions A', r, V, D £ H X/ such that P — XD"\ 
and V X +1 'D = I, then we call X 1 a coprime factorization of P. 
We say that A’ stabilizes P under negative feedback if the functions 
(1 + PK )“ 1 , PA'( I + PI O” 1 . A’( 1 + PA' ) _1 , and P( 1 + PA')' 1 
are in H x . If such a A’ exists for P, we say that P is stabilizable. 

Suppose that C has a coprime factorization C = PY 1 such that 
I S + VP = I, where P = A'P” 1 . and ALT. V. D 6 . Then C 

stabilizes P under negative feedback. The Youla parameterization 
of the set of functions that stabilize P under negative feedback, 
parameterized in terms of C = TV -1 , is given by 

£(P) := {A: A = (K - 1)Q)(V A XQ)~\(J £ II' 
and Q / -VA '" 1 }. 

Assumption l The plant P(.s) is stabilizable and has a coprime 
factorization P = XD 1 = V, A „D~\ where A, is a finite Blaschke 
product, A" f . is an outer function, and D is a rational function with 
inner/outer factorization D = D,D„. 

Let a generalized zero of a Lcbesgue measurable function / on a 
Lebesgue measurable subset of ./ft be, as defined in |3|, a point on 
/ft for which the magnitude of f is not essentially bounded troni zero 
on any neighborhood of the point, II j is continuous at a generalized 
/ero, the generalized zero is a zero of f. Denote the set of generalized 
zeios of / on ./ ft by Z( /). 

For a I unction f continuous and non vanishing on ,/ ft and mcro- 
morphic in l’ 4 , the winding number (or index) of /(,/-) may be 
computed as wno(/) = • / - />/, where - / and />/ denote the number 
of C r zeros and C 1 poles of / respectively, counted according to 
their multiplicities. 

For a given contiollci (' and plani P. denote Ihe number of t 1 + 
poles of C by p ,. the number of f 4 zeros of C by , the number 
ol C 1 poles of P by p,,. and the number of C " /eros of P by 
counted according to their multiplicities. We show in Section 111 that 
the optimality, as described in (3.3) and (3.5). ol ( 1 £ A( P) depends 
in part on the number ol V 1 poles and V 4 zeros of P and (\ 

Lemma 2 1. II //(*) £ and is nonvanishing on j ft, then //(.s) 
has at most a finite number ol C 4 zeros. 

Ft oof Suppose that //(s) £ \ 0 and is nonvanishing on ./ft. 
Since ;i{ s) is analytic and not identically zero in , each (3 4 zero 
of a(s) has a finite order and the set ol all zeros of <y(.s) has no limit 
point in (3 4 . Suppose that r/(.s) has an infinite number off 4 zeros. 
Let <j(,s) = (s - l)/(.s -t- I) = The function n(.s) conformally 
maps (3 4 onto the open unit disk. The function 1 is analytic 
in the open unit disk and continuous and nonvamshmg on the unit 
circle. Since <i has an mhmte number of zeros in V 4 , 1 has an 

infinite number of zeros in the open unit disk. But this sel ol zeros 
has no limit point in the open unit disk. There must be a sequence 
of these zeros with a limit point on the unit circle. Therefore gm> 1 
has a zero on Ihe unit circle. This implies that <j( s) has a zero on 
,/ft, which is a contradiction. 


III. Quauiaiivf Properiils of Optimal Solutions 

Both weighted sensitivity and weighted complementary sensitivity 
optimization problems can be transformed into the onc-block general 
distance problem of finding an optimal norm - and if possible an 
optimal Q £ such that 


(3.1) 


When R £ the infimum in (3.1) is achieved for some (J t // 

1 4, p. 1351. If R € I<2 , then the optimal Q is unique and |/? m 
is constant a.e. on /ft [5, p. 196]. 

From Poreda's Theorem 1J7. p. 250] we have that lor U comm 
uous and nonvanishing on jft 

iuf \\P-Q\u = 11*11. I.U) 


if and only if \R\ is constant on and wno(7?) < 0 . From Poieda's 
Corollary ([7, p. 2501 see also [ 1 . p. 52]). if R is also mcmtnorphic 
in <3 4 , (3.2) holds if and only if R =■ 7 Pf/? 2 , where B\ and Bj are 
Blaschke products with dcg(Pi) >deg(Z? 2 ) and is a constant. 

Throughout this section, suppose that P - XD 1 satisfies As¬ 
sumption I and a controller C is given such that C — M' 1 lor 
l\V £ P \ and IX + YD = I. Also assume that neither P nor 
( is the zero function. 

Define the weighted sensitivity transfer function A(.s, A ) to he 
\(A ) = [ 1 V|/(I + PA)]. Assume that the weighting function IV| is 
analytic in <3 4 and has no zeros in <3 4 . Assume that W\D, Y,, £ H v 
has at most finitely many generalized zeros on ,/ft and is continuous 
at each point of Z(\\\D 0 X„). Further assume that ITi D„\ £ A u . 

Parameterize the sel of all stabilizing controllers yi(P) in terms 
of ( \ Then the weighted sensitivity optimization problem associated 
with A consists of finding ■)] and K such that 


-„ = ||A(A-)|U= juf |i( 


]V 


— inf 
(Kin 


I + Pl< 

•,OT’ + ir,DVf)|K 


(3.3) 


Define 


-2- iuf (3.4) 

qk n * 1 


To equate the optimization problem (3.3) with the one block general 
distance problem (3.4) we rule out the possibility that • j < - 1 . To do 
this assume that > |TVj D„\'(j*')\ for every £ Z(\\\D„X V ). 
Then, |2, p. 12], 7 ,» — A i. This assumption need only be checked 
when P has a generalized zero on j ft. With this assumplion C is 
optimal for ( 3 . 3 ) if and only if Q = 0 optimizes the right-hand side 
of (3.4). 

Let ]\ > be analytic in f 4 and have no zeros in Assume that 
TVjA/D (J £ ,4,, has at most finitely many zeros on ./ ft. Also assume 
that WiXuV £ A 0 . Define the weighted complementary sensitivity 
transfer function (-)(.s, 7v ) to be (-)(A’) = \U\>PK/( I + PA )1. 

The weighted complementary sensitivity optimization problem 
associated with B consists of finding 71 and A such that 


-IK-MA’)||, 


inf 

MMl'l 


1 \jPh_ 
I + PK 


rz inf \\w,xr -UjXDQW^. 

QCH' 


(3.5) 


Define 


7 1 = inf^ \\D;W2XJ--Q\U- (3-6) 

To equate the optimization problem (3.5) with ( 3 .h), assume that 
7 j > H 2 XJ’ijju') for every j£ Z[\Y>X„D 0 ). Then, [2, p. 12], 
71 =7 >. This assumption need only be checked when P has a pole 
on j ft. With this assumption C is optimal for (3.5) if and only if Q 
- 0 optimizes the right-hand side of ( 3 . 6 ). 

Theorem 3.7 . A controller C £ 1!(P) is uniquely optimal lor the 
weighted sensitivity problem if and only if |VK')| is consianl a.e. 
on ./ft and p, < 


7= inf \\r-q\u =n*-yiK. 

Qi H * 
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Proof: Parameterizing D(P) in terms of C * UY~ l we have 
that C is uniquely optimal for (3.3) if and only if Q = 0 is uniquely 
optimal for (3.4). Assume that Q * 0 is optimal for (3.4). Then 
since X*W\D if ] r £ £7* Q 53 0 is uniquely optimal and |A(C)| 
» |A7JTj D U V | is constant on jH, Therefore X*\Y]D t A’ does not 
vanish anywhere on jit and [7, p. 250], wno(A7U”i D a V) < 0. The 
product X;X\\n o V is meromorphic in C ¥ since it can be expressed 
as the quotient of analytic functions in C 4 . Since X, is a finite 
Blaschke product and A7lT’i D n Y does not vanish on jit, H'i D V Y 
does not vanish on jH. Then by Lemma 2.1, H'iA.V has a finite 
number of C 4 zeros. The C 4 zeros of X\\D„Y are the C 4 poles 
of C and the C 4 zeros of A', are the C 4 zeros of P. Therefore 
wnoCY/TVi D 0 Y) = p, - Thus > p ,. 

Conversely, suppose that and |A(C)| is constant a.e. 

on jH. Then |A7iriAA | is constant on jit. Since X?]Y\D„\' 
is continuous and not identically zero on jit. it does not vanish 
anywhere on jW. Then wno(A7ir 1 D V Y) = p r - < 0. Therefore 

[7, p. 250|, Q = 0 is uniquely optimal. 

Theorem 3M: A controller C £ £(P) is uniquely optimal for the 
weighted complementary sensitivity problem if and only if |0(O| 
is constant a.e. on j^t and < Pi- 

Proof: Parameterizing IC(P) in terms of C 1 = P\ r we have 
that C is uniquely optimal for (3.5) if and only if Q = 0 is uniquely 
optimal for (3.6). Assume that Q = 0 is optimal for (3.6). Since 
n:\V 2 XJ' £ 17, Q = 0 is uniquely optimal and |0(O| = 
(Dril’aA'.T’l is constant on j$. Therefore D*WoX ( ,L' does not 
vanish anywhere on jit and wno(DrHj XJ T ) < 0. The product 
Dj\Y' 2 X v V is meromorphic in C 4 since it can be expressed as the 
quotient of analytic functions in C + . Since D, is a finite Blaschke 
product and D*\Y 2 XJ r does not vanish on \V 2 X„P does not 
vanish on j'S. Then by Lemma 2.1, U 7Y„r has a finite number of 
C 4 zeros. The C 4 zeros of HVY„f r are the C 4 zeros of and the C 4 
zeros of D, are the (7 poles of P. Therefore v/no(D*\Y 2 X U P) = 
- p lt , Thus p,, > . 

Conversely, suppose that p ft > and |0(O| is constant a.e. on 
jH. Then l-DfHiW.ri is constant on j'it. Since D*\Y 2 X o r is 
assumed continuous and not identically zero on j ft. it is nonzero 
on jit. Then wno(Z7*H jA7T) = - p t , < 0. Therefore Q = 0 is 

uniquely optimal. 

Example 3.9: Let sequences {%} and | consist of the 
positive numbers satisfying cos % sinh - (ll = sin cosh and 
cos cosh Qn - 1. Assume the ordering Oj < vi, and lor 

j < k, Let P = X ti Di~\ where D, = (.s - 1)/(.s + 1), and 

Y _ 1 |^ x 1 4- ft* 4- 1 

( * + 1 ) 1 1 + f S + f* 2 0u ' 


One can show that A7 belongs to A,, and is outer [ft]. Let V(s) 
be the constant A7(l) _1 . Then V = (1 - VX,.)D~ l belongs to 
m and PA7 + YD t = 1. So C = I V 1 € £(P). Define the 
weighting function Hj = (0X7)’*. Since f7\7 has no poles or 
zeros in l’ 4 , 117 is analytic in <7 and has no zeros in C 4 . Also 
|0(O| = |n^A7f’| - I a.e. on jit. Since p lt = I, and - 0, by 
Theorem 3.8 C = f’V“ 1 is optimal for (3.5). 

Remark 3.10: Theorem 3.7 requires that P have at least one (T 4 
zero. Suppose that z t , - 0 and that P = A7 D ] 1 with D, inner and 
A’„ a unit in H*. Let C\ = kX ~ 1 . For k> 1, H(P) and 
M’i/(1 + PC\) * WiDfiD, + k). For each j^ 6 jit 


liin 

k — X 


111 

1 + PC\ 


lim 


1 -f kD, + kD ; + k’ 2 


0 . 


Thus infhtup) ||A(A')|U = 0. No I\ € D(P) achieves this 
ifimum. 


Remark 3.11: Theorem 3.8 requires that P have at least one C 4 
pole. When p P = 0 and P has no poles on j% K = 0 is optimal for 
the weighted complementary sensitivity problem. One can check that 
0 G E(P) and that ||0(O)|U = 0. 


IV. Weighting Functions 

In Section III we observed that the optimal controller makes the 
weighted sensitivity or weighted complementary sensitivity function 
flat across frequencies. In this section we explore the implications of 
this for various choices of weighting functions. Suppose throughout 
this section that the plant P and given controller C satisfy the same 
assumptions as in Section III. Let P = l T ,l r u and V = V,V„ be 
inner/outer factorizations. 

One might expect that a controller that optimizes a weighted sensi¬ 
tivity problem would have poor complementary sensitivity properties. 
This is not necessarily the case. 

Corollary 4.1: Suppose that C 6 ^(P) is the unique optimal 
controller for a weighted sensitivity problem. Suppose that C has 
no generalized zeros on j it. Then C is also uniquely optimal for the 
weighted complementary sensitivity problem with \\ j = n{XJ\ t ) 1 , 
where a is any nonzero constant, if and only tl < p v . 

Proof: This result follows from Theorem 3.8 after we observe 
that for \V 2 = o(A7P t |0(O| = \D;\Y,X 0 U\ = |n| on jR. 

Corollary 4.2: Suppose that C £ £(/') is the unique optimal con¬ 
troller for a weighted complementary sensitivity problem. Suppose 
that 1 1 , has no generalized zeros on j R. Then (’ is also uniquely 
optimal for the weighted sensitivity problem wilh TT| = i'(D„V,) 
where v is any nonzero constant, il and only il y\ < 

Proof: Observe that |A(( )| = \ X*W\ D„Y | = |nl on j R. Apply 
Theorem 3.7. 

In weighted sensitivity optimization the weighting 1 unction and 
the plant determine the optimal Nyquist plot of PC(ju') verses ^. In 
the case of a constant weight we can say precisely what the optimal 
Nyquist plot is. 

Corollary 4.3: Suppose that ll’i = //, for some constant v > 0, 
C G Y.{P), and p, < : /( . Then C is optimal for the weighted 
sensitivity problem if and only if for each G j R, PC \j^) satisfies 
the equation of a circle wilh center — ( - 1,0) and radius /• = oj -, 
where - is the optimal norm in (3.3). Furthermore, r < 1. 

Proof: By Theorem 3.7, C is optimal for (3.4) if and only if 
|A(C)| is constant on jit. Let PC'(j ^-) = .1 + jif. Then for each 
G j it, |A(C)| = -) if and only if i'“(l/((l + r) 2 + iy“)) = V- 
Equivalently, y 2 + (1 + .r) J = From the Maximum Modulus 

Theorem, since AtG'') is analytic in C 4 and A(C') = 1 / at every 
c’ 4 zero of P, ||A(C)||x > v. Since A(C) is nonconstant in C’ 4 , 
||A(C)|| > > v. 

Corollary 4.4: Suppose that \Y? = //, for some constant p > 0, 
C G 3](P), and <p P . Then C is optimal for the weighted 
complementary sensitivity problem if and only if for each tr G jit, 
PC{ju') satisfies the equation of a circle with center r t = {r tl , 0) 
and radius r = - p 2 ), where c tJ = -i‘ 2 /h‘ 2 - where 

0 > p is the optimal norm in (3.5). 

Proof 1 By Theorem 3.8, C is uniquely optimal for (3.5) if and 
only if |H(C)| — a.e. on jit for some constant From the 
Maximum Modulus Theorem, since B(C) is analytic in and 0(G y ) 
= p at every f 4 pole of P, -) > p. Since 0(0 is nonconstant in 
P + , **) > p. Let PC(jui) = .r + jy. Then for each jjj G >R, |A(C)| 
= -; if and only if p 2 ({Y 2 + y 2 )l({ 1 + j -) 2 + y 2 )) = *>“. Equivalently, 
y 1 + (,v + - p 2 )) 2 = yy/h a - p' 2 f. 

Certain weighted sensitivity and weighted complementary sensitiv¬ 
ity problems are not optimized by a stabilizing controller, even though 
the optimization is done over the set of all stabilizing controllers. 
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We describe situations in which this happens in Corollary 4 5 and 
Corollary 4 6 

Corollary 4 5 Suppose that H,D has a zero on jH Then 
C e £(P) is not optimal for the weighted sensitivity problem 
Proof Suppose that C € D (P) is optimal for 0 Then by 
Theorem ^ 7 |A(C)| is constant ae on / ft Since IT \D has a zero 
ai j+t 6 j ft, then | \(0| = 0 a e on ./ft But no A 6 £(P) can 
achieve this 

Thus when a plant P has a ./ft pole with multiplicity n no 
stabilizing contioller will minimize the weighted sensitivity function 
if n i docs not also have the pole with rnuliplicily at least // Tor 
example consider P = (s + l)/s Foi \ = 1 and D = s/(s + 1), 
Y I )~ 1 is a coprime factorization ol P For any TT , that does not 
have a pole at i = 0 problem (TT) will not be optimized by a 
stabilizing controller 

Also, Corollary 4 ^ shows that il one wishes to obtain a stabilizing 
controller as the optimal solution to a weighted sensitivity problem 
one should not choose M i strictly propel or with a j ft zero 

Corollary 4 6 Suppose that > 0 and TT * Y has a zero on / 1? 
Then C t 3D is not optimal lor the weighted complementary 
sensitivity pioblem 

Proof Suppose that C G L(P) is optimal foi (1 5) By Theorem 
18 |(-)(( )| is constant ae on /ft Since H Y has a zero at 
6 y I? |(-)(f )| = 0 ae on /ft But this contradicts the fact 
that since /> y > 0 by the Maximum Modulus theorem ||<-)( K )|| v > 0 
for all A e £(P) 

Thus when an unstable plant P has a y 1? zero no stabilizing 
controller will minimize the weighted complementaiy sensitivity 
lunclion if 11 P also has the zero Corollary 4 6 also shows that 
il one wishes to obtain a stabilizing controller as the optimal solution 
to a weighted complementary sensitivity optimization problem with 
an unstable plant one should not use a weighting function with a 
/ ft zero 
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A Periodic Fixed-Structure Approach to Mulliratc C ontrol 
Wassim M Haddad and Vikram Kapila 


Abstract—\n this note we develop an approach to designing reduced- 
order muJtlrate controllers A discrete-time model that accounts lor the 
multirate timing sequence of measurements is presented and Is shown 
to have periodically time-varying dynamics Using discrete-time stability 
theory, the optimal projection approach to fixed-order (It, full- and 
reduced-order) dynamic compensation is generalized to obtain reduced- 
order periodic controllers that account for the multirate architecture 11 
is shown that the optimal reduced-order controller Is characterized by 
means of a periodically time-varying system of equations consisting of 
coupled Riccati and Lyapunov equations. In addition, the multirale static 
output-feedback control problem is considered For both problems, the 
design equations are presented in a comist., unified manner to facilitate 
their accessibility for developing numerical algorithms for practical 
applications 


I INTKOIXK HON 

Mdny applications of feedback control involve continuous lime 
systems subject to digital (discrete time) control Furthermore in 
piactical applications the control system actuators and sensois have 
dillenng bandwidths For example in flexible structuie control it is 
not unusual to attenuate the low frequency high amplitude modes by 
means of low bandwidth actuators that are relatively heavy and hence 
able to exert high force/lorque to contiol the higher Irequency inodes 
Obviously, the high bandwidth actuators would require sensois that 
are sampled at high rales while low bandwidth actuators require only 
sensors sampled at low data rales As a consequence the use ol 
various sensor data rales leads to a mullirale control pioblem To 
properly use such data a mullirale controllci musi caielully account 
for the timing sequence ol incoming data The purpose of this note is 
to develop a general approach to full and reduced order steady state 
multitate dynamic compensation 

Multirate control pioblems have been of inteiesi lor many years 
with increased emphasis in recenl yeais |l| |1| |9J [1IJ |15| A 
common iealure ot these papers is ihe realization that the multuatc 
sampling process leads to penodically lime varying dynamics Hence 
with a suitable remteipretalion results on multirale control can also 
be applied to single rate oi mullirale problems involving systems 
with periodically time varying dynamics Ihe principal challenge of 
these problems is to arrive at a tractable conti ol design toimulalion 
in spite ot the extreme complexity ol such systems In order to 
account foi the periodic lime vjrymg dynamics ol mullirale systems, 
a periodically time varying control law aichitecture was proposed in 
[10] and |ll] which appears promising in this regard An alternative 
approach which has been proposed for the multuatc control problem 
is the use ot an expanded state space formulation [21 However this 
approach results in very high older systems and is often numerically 
inti actable Finally a cost translation and a lilting approach to the 
multiiale LQG problem has been proposed m 115J which docs not lead 
to an increase in the state dimension Specifically | \ S\ shows how to 
translate a multirate sampled data LQG problem into an equivalent 
modified single rate shift invariant problem via a lifted isomorphism 
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However, this approach results in an equivalent system involving 
more inputs and outputs than the original system. The interested 
reader is referred to [7], [8], [10], and [14] for further discussions 
on multirate and periodic control. 

For generality in our development, we consider both full- and 
reduced-order dynamic compensators as well as static output- 
feedback controllers. In the discrete-time case, this problem was 
considered in |4|, while single rate sampled-data aspects were 
addressed in |5]. The approach of the present note is the fixed- 
structure Riccati equation technique developed in [4], Essentially, 
this approach addresses controller complexity by explicitly imposing 
implementation constraints on the controller structure, and optimizing 
over that class of controllers. Specifically, in addressing the problem 
of reduced-order dynamic compensation, it is shown in |4] that 
optimal reduced-order steady-state dynamic compensators can be 
characterized by means of an algebraic system of Riccati/Lyapunov 
equations coupled by a projection matrix which arises as a direct 
consequence of optimality and which represents a breakdown of 
the separation between the operations of state estimation and state 
estimate feedback; that is, ihe certainty equivalence principle is 
no longer valid. The proof is based on expressing the closed-loop 
quadratic cost functional as a function of the design parameters, i.e., 
the compensator gains, and the utilization of Lagrange multipliers for 
optimization over the parameter space. Thus, this approach provides 
a constrained optimal control methodology in which we do not seek 
to optimize a performance measure per sc, but rather a performance 
measure within a class of a priori fixed-structure controllers. 

In the present note, analog-to-digital conversions are employed 
within a multirate setting to obtain periodically time-varying dynam¬ 
ics. The compensator is thus assigned a corresponding discrete-time 
periodic structure to account for the multirate measurements. It is 
shown that the optimal reduced-order multiratc dynamic compen¬ 
sator is characterized by a periodically time-varying system of four 
equations consisting of two modified Riccati equations and two 
modified Lyapunov equations corresponding to each intermediate 
point of the periodicity interval. Because of the time-varying nature 
of the problem, the necessary conditions for optimality now involve 
multiple projections corresponding to each intermediate point of the 
periodic interval and whose rank along the periodic interval is equal 
to the order of the compensator. Similar extensions to reduced-order 
multirate estimation are addressed in [12]. 


Nomenclature 


/r- 0i y h. 0, 

o f , ()-'.»(), i’ 
R. R' 

71. 777. h , Hi . h 

.r, g, j\ , u 


r x r identity matrix, r x s zero 
matrix, 0, 

transpose, inverse, trace, expected 
value 

real numbers, r x * real matrices, 

7J.X! 

discrete-time indices 
positive integers; 

I <n, < n: ii = n + n, 

77 ( -, m -dimensional 


vectors 

.4, ZJ, C(k) n x 71 , n x m , l k x n matrices 

.4, (k), B, (A*),C (fr), Di (k) n, X Hi , n, X l k , to X n, . m X h 
matrices 

i?i. It} iixii. 77i x m state and control 

weightings; i?i > 0. > 0 

J?ia u x mi cross weighting; 

R, - R I2 R-'R'{. 2 > 0 


II. The Fixed-Structure Multirate 
Static and Dynamic Control Problems 
In this section, we state the fixed-structure static and dynamic, 
sampled-data, multirate output-feedback control problems. In the 
problem formulation, the sample intervals In and dynamic com¬ 
pensator order u ( are fixed, and the optimization is performed over 
the compensator parameters [-4, (•). (*)- C (■). D ( (•)]. For design 

trade-off studies, h k and n, can be varied and the problem can be 
solved for each pair of values of interest. 

Fixed-Structure Multirate Static Output-Feedback Control Prob¬ 
lem: Consider the n th-order continuous-time system with multirate 
sampled-data measurements 

= A.v(t) + Bu(t) + //'I (/), t E [0, -sc ), (2. 1) 

H(tk) = C(t k )x(t k ) + tc 2 (t k ). k = 1, 2, ■ ■ •. (2.2) 

Then design a static output-feedback multirate sampled-data control 
law 


u(t k ) = D, (t k )\/(t k ) (2.3) 

which, with D/A zcro-order-hold controls i/(f) -- u(t k )*1 E 
[t k . t k + 1 ), minimizes the quadratic performance criterion 

JAD.(-)) = Half; /V r(.s) 

f Jo 

+ 2.1- 1 (s) B\‘ 2 ll(s) + 1/ 1 (.S ) ll> II ( s )] f/.S. (2.4) 

Fixed-Structure Multirate Dynamic Output-Feedback Control Prob¬ 
lem: Given the nth-order continuous-time system (2.1) with multi- 
rate sampled-data measurements (2.2), design an i/ r th-ordcr (1 < 
n, < n) multirate sampled-data dynamic compensator 

r,(t kf i) = A,(t k )j'i{U)-hB l (t k )i,(t k ) (2.5) 

ii(h) = C.(hU'.(t k ) + D,(t k )u(t k ) (2.6) 

which, with D/A zero-order-hold controls u(t) = u(t k ),t E 
\t k . + 1 ), minimizes the quadratic performance criterion (2.4) with 
JAD,{-)) denoted by J,(A,(-). B,(-)< (',(•). !),(■)). 

The key feature of both problems is the time-varying nature of the 
output equation (2.2) which represents sensor measurements available 
at different rates. Fig. 1 provides a typical multirate timing diagram 
for a three-sensor model. For generality, we do not assume that the 
sample intervals h k = /*,+ j - t k are uniform (note the sample limes 
for sensor #3 in Fig. 1). However, we do assume that the overall 
timing sequence of intervals [t k . f* + v], k — 1, 2, - -, is periodic- 
over [0, oc), where N represents the periodic interval. Note that 
h k + \ =■ /in, k = 1. 2, ■ - ■. Since different sensor measurements are 
available at different times t k , the dimension l k of the measurements 
H(t k ) may also vary periodically. Finally, in subsequent analysis, the 
static output-feedback law (2.3) and dynamic compensator (2.3) and 
(2.6) are assigned periodic gains corresponding to the periodic timing 
sequence of the multirate measurements. 

In the above problem formulation, ic j ( /) denotes a continuous-time 
stationary white noise process with nonnegative-definite intensity 
V\ E R n *", while ir-jff*) denotes a variable-dimension discrete- 
time white noise process with positive-definite covariance \ : 2 (t k ) E 
R lk * Ik . We assume u> 2 (t k ) is cyclostationary, that is, l : 2 (f* + \) = 
mtkl k = 1. 2.• ■ 

In what follows, we shall simplify the notation considerably by 
replacing the continuous-time sample instant t k by the discrete¬ 
time index k. With this minor abuse of notation, we replace x(t k ) 
by .!■(**), JCtUk) by J\(k), y(t k ) by */(*■), u(t k ) by u(k) t w 2 (t k ) 
by ir-jCA'), .4, (ft) by A,(k) [and similarly for D,( ), C( ), and 
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£> (.)], C{tk) by C(k), and Vi(U) by 1V(A\). The context should 
clarify whether the argument is *'fc” or ‘A/’ With this notation, our 
periodicity assumption on the compensator implies A r (k + A r ) — 
, 4 , (A ). = 1, 2, -. and similarly for /?,(■), CVLK and A ( ). 

Also, by assumption, C(fc 4 A r ) =■ C(k), for A- = 1. 2. ■ ■ 

Next, we model the propagation of the plant over one lime step, 
For notational convenience, define H(k) = j^ k ? A * <ln. 

Theorem 2.1: For the fixed-order multirate sampled-data control 
problem, the plant dynamics (2,1) and quadratic performance criterion 
(2.4) have the equivalent discrete-time representation 

.v{k 4 1) = A(k).r(k) + B(k)u(k) + w[(k). (2.7) 

y(k) = C(k)Hk) + W'Ak). (2.8) 

i h 

J — A. v . + lint -rrc’y^.r 7 (k)R](k)j'{k) 

^ i s _ x A 

+ 2.1-' (* )J?Ia(*•)»(*)+ « 7 (A- )/?2 (*■)«(fc)] (2.9) 

where 

A(k-) = r 4, ’\ /)(/.■) = H[k)B, 

ir\(k) = / f '''irilH*)!/*. 

Jo 



Mmpling 


^ h ^ ^2^1" hg ^1* h4 h<j h? 


Fig. I. Multirate timing diagram tor sampled-data control 


* 


Inn — tr 
a ^ 7\ 



4 ' * /?, 




.1 /"V 

/u./„ 


' 7 ?,. 


Ill, The Fixed-Structure Multirate 
Samfled-Data Static Output-Feedback Problem 

In this section, we obtain necessary conditions that characterize 
solutions to the multirate sampled-data sialic output-feedback control 
problem. First, we form the closed-loop sysiem for (2.7), (2.8), and 
(2.3) to obtain 


RvAk) = I \ A ' *Ih H(s)D(Is + -i -// 7 (k )Rn. 

hi Jo h i 

/?.(a-) = r, + -- / \n'H r (s)ihm»)D 

"i Jo 

+ ri l u H(s)D + 13 r H , (s)Rv 2 \'l° 

and ir\(k) is u zero-mean discrete-time white noise process with 

^ (A)»r', 7 (A-)} = r,(A) = /'. l 'V,. ' 

Jo 

The proof of this theorem is a straightforward calculation involving 
integrals of white noise signals, and hence is omitted. Note that by the 
sampling periodicity assumption, A{k 4- X) = A(k). k = 1. 2 • ■ 

The above formulation assumes that a discrete-time multirate 
measurement model is available. One can assume, alternatively, that 
analog measurements corrupted by continuous-time white noise are 
available instead, that is, y(t) = C.i(f) 4 wj(f). In this case, 
one can develop an equivalent discrete-time model that employs an 
averaging-type A/D device [5] y(k) = ( l/In ) + L jj(t)dt. It can 

be shown that the resulting averaged measurements depend upon 
delayed samples of the state. In this case, the equivalent discrete- 
time model can be captured by a suitably augmented system. For 
details see [5]. 

Remark 2.1: The equivalent discrete-time quadratic performance 
criterion (2.9) involves a constant offset Av 1 which is a function of 
sampling rates, and effectively imposes a lower bound on sampled- 
data performance due to the discretization process. 

'As will be shown by Lemma 3.1, due lo the periodicity of In . is a 
constant. 


.r[k+l) = A(k)x(k) + iHk) (3.1) 

where A(k) = A(k) + B(k)DAk)C(k). The closed-loop 
disturbance ir[k) = v’\[k) + B(k)D, (k)uAk). k = 1.. 2. • - ■. 
has nonnegative-definite covariance \’(k) \\(k) 4 

D{k)D r (k)} r 2 (k)Dj (k)l3 l [k), where wc assume that the 

noise correlation V, 2 (A-) = £[iv[(k)w'[ (A*)] = 0, that is. the 
continuous-time plant noise and the discrete-time measurement noise 
are uncorrelated. The cost functional (2.9) can now be expressed 
in terms of the closed-loop second-moment matrix. The following 
result is immediate. 

Proposition 3.1: For given /), ( ), the second-momenl matrix 
Q(k) = £[.r(k).v 1 (A 1 )], satisfies 

Q(k+ 1) = A(k)Q(k)A r (k) 4 Y(k). (3.2) 

Furthermore, 

JA O, (•)) = 4 lim yrtr 

a — > A 

A 

■ ^[g(A’)J7(A-)4 D!Ak)n2(k)DAk)Viik)] (3.3) 

A — I 

where R(k) £ Ri(k) + Bii{k)DAk)C(k)+C'\k)D! (k)By.(k) + 
C r (k)D, r (k)RAk)D,(k)C(k). 

Remark 3.1: Equation (3.2) is a periodic Lyapunov equation 
which has been extensively studied in [7] and |8j. 

We now show that the covariance Lyapunov equation (3.2) reaches 
a steady-state periodic trajectory as I\ —+ cc. For the next result, 
we introduce the parameterization, k = a 4 LV, where the index 
n satisfies 1 < n < N, and ;i = 1, 2. ■ ■ ■. We now restrict our 
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attention to output-feedback controllers having the property that the 
closed-loop transition matrix over one period (n) = -|- A r — 

1 )_4(o + N -2) ■ ■ - ii(o), is stable for = 1, ■■ ■, .V. Note that since 
A(') is periodic, the eigenvalues of 4 |t (n) are actually independent of 
o. Hence, it suffices to require that $,,( 1) — A(N ).4( A'-1) ■ ■ • ,4( 1) 
is stable. 

Lemma 3.1 112]: Suppose 4>,,( 1) is stable. Then for given D t ( k ), 
the covariance Lyapunov equation (3.2) reaches a steady-stale peri¬ 
odic trajectory as k -* x, that is, 

liw [</(*), Q(* + 1 ).■■■.«(*■ +A T - l)] 

A. — x 

= [QM< Q(n + 1 )-••-,Q(o +JV-1)]. (3.4) 
In this case, the covariance Q(k) satisfies 


P(r\) = A 1 (o )P{n -|- 1 )A{n ) 

+ Pn r (n)»7. , ('«)/ , «(o) 

+ [P«(n) +J? a »(o)A(f»)C(n)]' 

• 1 ?;„ , (n)[Jf* 1 (n)+ i? iu (n)A(M)C(«)]. (3.10) 

Furthermore, the minimal cost is given by 

J.(D,(-)) = * + Y - 2Aia(o)i7J u 1 (<i) 

r, . 1 

■ A(n)<^i)C’Vor 2 ; l (n)C'(«)]. (3.1D 


Q(o + l) = /i(r»)g(rtM'(o) + f(«). n = 1. ■ • ■, A" (3.5) 


where Q(X + 1) = C?(l). Furthermore, the quadratic performance 
criterion (3.3) is given by 


,7.( A (■)> = * + yir 
\ 

■ 5^W(f»)/?(ri) + D < ('0/fa(<0A(n)r 2 (o)] (3.6) 
0-1 


where 




For the statement of the main result of this section, define the set 


5, = {A (■): $„('') is stable,for o = 1, ■ ■ ■. .V}. (3.7) 


In addition to ensuring that the covariance Lyapunov equation (3.2) 
reaches a steady-state periodic trajectory as k —> -v, ihe set 5, 
constitutes sufficient conditions under which the Lagrange multiplier 
technique is applicable to the fixed-order multirate sampled-data static 
output-feedback control problem. The asymptotic stability of the 
transition matrix ) serves as a normality condition which further 
implies that the dual P(t\) of Q(n) is nonnegative-definite. 

For notational convenience in stating the multirate sampled-data 
static output-feedback result, define the notation 

/?a„(o) = B l [n)P(n + l)23(n) -F /iij(o). 


v ao (.i) -4 cv.)g<f.)c'(fO + i*(<o. 


Proof. The necessary conditions for optimality follow from 
standard Lagrange multiplier arguments. See [14) lor a similar proof. 

□ 

Remark 3.2. In the full-state feedback case, we take C(n) = 
L r>(n) = 0, and ft]>(o) — 0 for n = 1.- -..V. In this case, 
(3.8) becomes D , (<i ) - - RjiojD 1 (n )/ J (n + 1) 4(n), and (3.10) 
specializes to 

/*(..) = A‘ («i)/»(f. + l )■!(..)+ * /?,(«) 

- A 1 (n)P(n + l)0(n)ff,„ l (»)73' (..)/’('• + D M<>) (3.12) 

while (3.9) is superfluous and can be omitted. Finally, we note that if 
we assume a single rale architecture, the plant dynamics are constant 
and (3.12) collapses to the standard discrete-time regulator Kiccali 
equation. 


IV. Tur Fixed-Structurf Mm ) iratl Samihi d-Data 
Dynamic Om piii-Flfdha( k Controi Problem 

In this section, we considci the fixed order multirate sampled-data 
dynamic compensation problem. As in Section III, we first lurni the 
closed-loop system for (2.7), (2.8), (2.5), and (2.6) to obtain 

.r(fr+ 1) “ A[k).i(k)+ ir(h ) (4.1) 

where 


•4(A) = 


.4(A) + /?(/.-)Z>, (A)C(A) 
B,(A)C'(A) 


D(h)C.( A) 
- 1.(0 


J’„(o) = D 1 (n)P(i\ + l).4(o)+ 

QJn) = A(n)Q(»)C‘ (n) 

for arbitrary Q(o) and P(n) 6 R ,lXn and n = 1 .■■■. A\ 

Theorem 3.1: Suppose D, (■) £ 5* solves the multirate sampled- 
data static output-feedback control problem. Then there exist n x n 
nonnegative-definite matrices Q{<\) and P{<\) such that, for n — 
1, - ■ ■, j\\ D t (#0 is given by 

A(o) = R;,!{<y)P,AH)Qin)C''(<i)VZ'(») (3.8) 

and such that Q(n) and P(n) satisfy 
<?(•> +n = .4(o)g( f >).4 r (o) + r,(.i) -g u (n)r,; 1 '(a)g,|(«) 
+ W,(«) + B(n)D, («)!',.(«)] 

• 1V, 1 (<>)[g„(n ) + Art)D, (a n(«)]■', (3.9) 


4(A-f V) = .4(A). A=l. ’.•••. 


The closed-loop disturbance 


»<’( A) = 


ir'i (A-) 4- B{k)l), ( k)u'2(k ) 
n,(k)w,(k) 


*• = 1. 2. " 


has nonnegative-definite covariance shown by (x) at the bottom of the 
next page, where once again we assume that the continuous-time plant 
noise and the discrete-time measurement noise are uncorrelated, i.c., 
3"i:>(A*) = c [ii'I [k)ir] (A*)] — 0. As for the static output-feedback 
case, the cost functional (2.9) can now be expressed in terms of 
the closed-loop second-moment matrix. Specifically. Proposition 3.1 
and Lemma 3.1 hold for the dynamic-output feedback problem with 
.r(A-). .-i(A‘h V{k), and B{k) replaced by .»(/,*). A(k). V(A’), and 
/?(/. ), respectively, where B{k) is shown in (y) at the bottom of 
the next page. 

For the next result, define the closed-loop transition matrix and the 
compensator transition matrix over one period by = 4(n + 
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; V-l>A(tt+iV-2)-" A(o) and # CJ ,-(r») = ,4,(o+ .V - 1 ).4,(o + 

\ _2) - ■ ■ .4 r >(rv), respectively. Note that since A, {<\) is required to 

bf periodic, the eigenvalues of $ t7 ,(a) are actually independent of n. 

In the following, we obtain necessary conditions that characterize 
olutions to the fixed-order multirate sampled-data dynamic 
compensation problem. Derivation of these conditions requires 
additional technical assumptions. Specifically, we further restrict 
[ A.(-). ft.(-)- C(*L D f { : ) ] to the set 

5, = {(-4. (o), DAn). C,(o), A (o)): 4>„(.o) 
is stable and ($ f7 .(o), B n ,(<*), C, v (c\)) 
is controllable and observable, n — 1, ■ ■ ■, A } (4.2) 


for arbitrary P(a) £ R' 1 *" and o — 1. ■. A r . 

Theorem 4.1: Suppose [A r (■). D ,(•),(?.(■), A(-)] 6 5, solves 
the fixed-order multirate sampled-data dynamic output-feedback con¬ 
trol problem. Then there exist n x n nonnegative-dcfinite matrices 
Q(o'), P(o). Q ((\), and P(o) such that, for a = 1, ■ ■ ■, A’, A, (n ). 
B, (n ), Cr (rt ). and D r (a) are given by 

A,.(a) — P(n -I- l)[A( rv) - ZJ(<v)/?,J fl 1 (o)P (1 (n) 

- (?„(f>)r 2 ' l (rt)C(a) - D{(\)D r ((\)C{o)]G l (a). (4.5) 


where 

/)„„<o) = [A,(o + .V - l).4,(o + V - 2)•••.4,(0 + l)fl,(«). 
.4,(o + .V - l).4,(o + .V-2)--- 
• .4,(n + 2)A (o + 1 A(o + AT - ])). (4,3) 


TV (a + .V - 1 ).4,(o 4- .V — 2) • • • .4, (o) " 
G (o + .V - 2).4,(o + .V — 3) • • • .4,(n) 


C (a) 


(4.4) 


The set 5, constitutes sufficient conditions under which the La¬ 
grange multiplier technique is applicable to the fixed-order multirate 
sampled-data control problem. This is similar to concepts involving 
moving equilibrium for periodic Lyapunov/Riccati equations dis¬ 
cussed in [7] and [8]. Specifically, the formulas for the lifted isomor¬ 
phism (4.3) and (4.4) are equivalent to assuming the stability of .4(-) 
along with the reachability and observability of [A, (•)* #.(■)■ C ( ■)] 
[8], [10]. The asymptotic stability of the transition matrix $,,( 0 ) 
serves as a normality condition which further implies that the dual 
P{<\) of ^(n) = t’[.r(n ).r l (n-)] is nonnegative-dcfinite. Further¬ 
more, the assumption that [ < l\ / ,(n). P, 7 ,(n). C f/I (n)] is controllable 
and observable is a nondegeneracy condition which implies that the 
lower right n, x n, subblocks of (2(o) and P(n) are positive definite, 
thus yielding explicit gain expressions for .4, (a ). B, (o ). C, (rv), and 

Dr{ n). 

In order to state the main result, we require some additional nota¬ 
tion and a lemma concerning pairs of nonnegative-definite matrices. 
For details sec [6], 

Lemma 4.1: Let Q. P be n x 11 nonnegative-definitc matrices and 
assume rank QP — n ,. Then there exist n, x n matrices 6\ T and 
an n ( x n, invertible matrix A/, unique except for a change of basis 
in R "•, such that QP — G ] MV. TG 1 — Furthermore, the 
a x n matrices r = G 1 F. r± = /,, - r, are idempotent and have 
rank n r and n - n r , respectively. 

The following result gives necessary conditions that characterize 
solutions to the fixed-order multirate sampled-data control problem. 
For convenience in stating this result, recall the definitions of /?•_>„(■). 

Pi(-), and Q„(') and define the additional notation 


A/(o) ^ 


1 „ 

DAn)C(n) 


3/(0 ) ^ 


L-^iVoP.f/O J 


Brio) = T(o A l)[c^„ 1 (r>) A B[i\)D ( (o)]. (4.6) 

C, (rv) = -[R 2 ,! (o)P„(n ) A D<(a )C(o )]G' 7 (n), (4.7) 

A (o) = — R 2 I (o )P ri (n ) Q (n )C T {o ) \ 1 (n ) (4.8) 

and such that Q(a). P(o), C}(a), and P(n) satisfy 

Q ( o +1) = .4(o )Q(o ),4 7 (o) + 3,(0) - g„(ft)v;;'(o)Q 7 (o) 

+ Tj (o + 1 )[{.4(o) - D(<x)R,,!(<\)P„(o)) 
•Q[n){A(n)~ /3(o)/?.7„‘(o)n(o)} 7 
+ + B(o)D,(o )3 2 ,>(o)}Kj~ i (o) 

• {Q„(o) + C(o)D,(o)l 2 „(o) [ 7 ]rj (n 4- 1). (4.9) 

P(o> = .4 r (o)P(o + l).4(o)+ i/7,(o) 

- P, 7 (o)/7. 2 „‘(o)P„(o) 

+ t] (o)[{.4(o) - Q „(o)V 2 ” 1 (o )C'(o)} 7 P(o + 1) 

• {.4(o ) - A,(o) Vi“ 1 (o )C( o ) ( 

+ {PA a) + P>„(o)D,(o)C(o)} 7 /? 2 - , (o) 

• {Pn(o ) 4- P' 2 „(o )D, (o)f'(o ))]rj (o). (4.10) 

0(0 + 1) = r(o + 1)[{ .4(0) — /J(o)P-‘(o)P„(o)|0(<0 

■ {.4(o) - P(o)P 1 7„'(o)P,(o)} 7 

+ (Q..(o) + P(o)D,(o))'j„(o)}1' 2 ”‘ I (o) 

■ (Qn(o ) + jB(o)D,(o )V 2 „ (o) | 7 ]r 7 (o + 1). (4.1I> 

P(o) = r r (o)[{.4(o) -g,i(o)r 2 :, l (o)C(o!} 7 P(o + 1) 

• {.4(o)-g.,(o)\:.; , (o)C'(o)} 

+ { P, ( o ) + Bi„ ( O )Dr{ O )C'{ O ) ) 1 m'. ( o ) 

■ {P„(o) A P'2n(o )Dr{(\ )C(o)}]r(n ) (4.12) 

rank Q(n) = rank P(n) = rank Q((\)P(r\) - /#,. (4.13) 


V(A‘) = 


3 (/•-) + Z?(/■ )P, (i )13(A )P, 7 (A-) D r (A ) 

B, (A-) 3 ; 2 (A-)D , 7 (A -)// 7 (A 1 ) 


/?(A-)A (A-)12( k)Dj{ A’) 
A (A-)Vii(A-)ZJj(A) 


/7(A ) i 


Ri(k) + A )D,(A’)C(A-) + C 7 (A-)D, r (*•)/?/ 3 (A') 

+C T {k)DUk)RAk)DAk)Ca‘) 
C?Ak)RUk) + Cj. (A- )l?j (A)A (A-)C(A) 


/f. 2 ( A)C(A-) + C' r (A-)A r (A)P a (A)C,(A) 


(x) 

(y) 


Cj(k)BAk)CAk) 
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Furthermore, the minimal cost is given by 

(■). DA-).Cr(-), £>,(•)) = *+ ^lr 

/V 

)g(,>)A/ J (a) + Alr( f »)g(o)3/ / (f.)}A(r.)]. 

nr~ l 

(4. J 4) 

Proof: The proof is identical to the proof of Theorem 3.3 of 
[14] concerning reduced-order control of discrete-time linear periodic 
systems with suitable reinterpretations for capturing the equivalent 
multirate sampled-data model. □ 

Theorem 4.1 provides necessary conditions for the fixed-order 
multirate sampled-data control problem. These necessary conditions 
consist of a system of two modified periodic difference Lyapunov 
equations and two modified periodic difference Riccati equations cou¬ 
pled by projection matrices r(n), n = 1.■ * ■, V. As expected, these 
equations are periodically time-varying over the period 1 < n < X 
in accordance with the mullirate nature of the measurements. As 
discussed in [4], the fixed-order constraint on the compensaior gives 
rise to the projection r which characterizes the optimal reduced-order 
compensator gains. In the multirate case, however, it is interesting to 
note that the time-varying nature of the problem gives rise lo multiple 
projections corresponding to each of the intermediate points of the 
periodicity interval, and whose rank along the periodic interval is 
equal to the order of the compensator. 

Remark 43 ■ As in the linear time-invariant case [4] to obtain the 
full-order multirate LQG controller, set ?i, — it. In this case, the 
projections r(n), and F(n ) and G(o), for n = 1, ■. X, become 

the identity. Consequently, (4.11) and (4.12) play no role and hence 
can be omitted. In order to draw connections with existing full-order 
multirate results, set D< (n) = 0 and i?i‘j(o) = 0, o = 1, ■ ■ •, .V, 
so that 

.-Mo) = A(n) - fl(o)/?;„ , < ( .)fl'('i)P(o + l),4(n) 

-,/l(n)g(»»)C , (rt)r 2fl , (o)C , (rt). (4.15) 


V. Numerical Evaluation of Integrals 
involving Matrix Exponentials 


To evaluate the integrals involving matrix exponentials appearing 
in Theorem 2.1, we utilize the approach of [16]. The idea is to elim¬ 
inate the need for integration by computing the matrix exponential 
of appropriate block matrices. 

Proposition 5./: For n = 1, ■ ■ -, consider the following parti¬ 
tioned matrix exponentials 
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of orders (13// 4 ///) x (3/i 4- m ), (3// 4 ///) x (3// 4 w), and 3n x 3//, 
respectively. Then, for /» - 1, ■ • -, X, 

A(,\) = E>2D{n) = £ih, 11(n) = £/|£ji. 


»i(«0 = 7 -E^Er,. /?„(n) = 4 

tin On 


= A((\)Q((\)C il (n)Y> (l l [a). 


(4.16) 


liAn) = Ih+ E[D' Eiifn , + El, Ei ill - B 1 e/t/T.]. 


C (o ) = - )/? '(,. )P(n + 1) 4(<i) (4.17) 


iS = 





where Q(o) and £(n) satisfy 

Q(r> + l) = .4(r,)g(o).4'(f.) + r,(n) 

-j(n)g(a)C , i«)r i ; , («)C(«,)g(,iM , (<o. (4 .ih> 

P(n ) = .4' (r,)/»(« + 1 ).4(«) + /?, (a) 

— .4^ (n )P(n + 1 )J3(r, )/?i (1 '(o )D' + l)4(n). (4.19) 

Thus, the full-order multirate sampled-data controller is characterized 
by two decoupled periodic difference Riccati equations (observer 
and regulator Riccati equations) over the period n = 1,---,.Y. 
This corresponds to the results obtained in 110]. Next, assuming a 
single rate architecture yields time-invariant plant dynamics, while 
(4.18) and (4.19) specialize to the discrete-time observer and reg¬ 
ulator Riccati equations. Alternatively, retaining the reduced-order 
constraint and assuming single rate sampling, Theorem 4.1 yields the 
sampled-data optimal projection equations for reduced-order dynamic 
compensation given in [5]. 


The proof of the above proposition involves straightforward ma¬ 
nipulations of matrix exponentials. 

VI. Illustrative Numerical Example 

For illustrative purposes, we consider a numerical example in¬ 
volving a rigid body with a flexible appendage. This example is 
reminiscent of a single-axis .spacecraft involving unstable dynamics 
and sensor fusion of slow, accurate spacecraft attitude sensors (such 
as horizon sensors or start trackers) with fast, less accurate rate 
gyroscopes. The motivation for slow/fast sensor configuration is that 
rate information can be used lo improve the attitude control between 
attitude measurements. Hence define 
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Ry=E t E . J?a = l. 

Note that the dynamic model involves one rigid body mode along 
with one flexible mode at a frequency of 1 rad/s with 0.5% damping. 
The matrix C captures the fact that the rigid body angular position 
and tip velocity of the flexible appendage are measured. Also, note 
that the rigid body position measurement is corrupted by the flexible 
mode (i.e., observation spillover). To reflect a plausible mission, we 
assume that the rigid body angular position is measured by an attitude 
sensor sampling at 1 Hz, while the tip appendage velocity is measured 
by a rate gyro sensor sampling at 5 Hz. The matrix R\ expresses the 
desire to regulate the rigid body and tip appendage positions, and the 
matrix V't was chosen to capture the type of noise correlation that 
arises when the dynamics are transformed into a modal basis. 

Using the homotopy algorithm based on a prediction and a Newton 
correction scheme for periodic difference Riccati equations reported 
in [13], the following designs were obtained. For u, = 4 discrete-time 
single rate and multirate controllers were obtained from (4.15)—(4. J 9) 
using Theorem 2.1 for continuous-time to discrete-time conversions. 
These designs were compared using the performance criterion (4.14). 
The results are summarized as follows 

Measurement Scheme Optimal Cost 

Two 1 Hz sensors G5.G922 

Two 5 Hz sensors 53.9930 

Multirate scheme (1 Hz and 5 Hz sensors) 51.S0G1 

Note that Ihe improvement in the cost of the two 5 Hz sensor 
scheme over the multiraie scheme is minimal, which clearly demon¬ 
strates that the multiraie scheme provides sensor complexity reduction 
over the lwo 5 Hz sensor scheme. 
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Pursuing a Maneuvering Target Which 
Uses a Random Process for Its Control 

V. F. BeneS, K. L. Helmes, and R. W. Rishel 


Abstract —Since a pursuer pursuing a maneuvering target does not 
know what maneuvers an evading target will make, the maneuvers (the 
target's control law) appear as a random process to the pursuer. However, 
he has opinions about what the evader will do. From these, he can assign 
a prior probability distribution to the evader's maneuvers. 

For a linear pursuit evasion problem in which the evader's control 
law is modeled as a random process, in which the pursuer has partial 
noisy linear measurements of his own and the evader's relative position, 
and a quadratic optimality criterion is used, past results of the authors 
imply that Ihe optimal control is a linear function of the “predicted miss." 
Determining the predicted miss involves estimating the evader’s terminal 
position from past system measurements. Nonlinear filtering techniques 
are used to give expressions for computing the conditional expectation 
of the evader’s terminal position even in the presence of the random 
unknown maneuvers of the evader. 


I. Introduction 

Recently, Helmes and Rishel [5], |6|, and BeneS 111 have shown, 
for a linear system, 

d.i'i — (A.o + Dui)dl ■+ (I) 

whose driving noises .o may not be Gaussian, and for an optimality 
criteria which is a quadratic function of the terminal state plus a 
quadratic running cost in the control, that the optimal control is a 
linear function of a quantity called the “predicted miss.” 

The purpose of this note is to apply these results, and results from 
optimal nonlinear filtering and extrapolation, to describe the optimal 
control for a linear pursuit-evasion problem in which the evader’s 
control is modeled as an unknown random process. 

The main problem in computing the predicted miss is to compute 
the conditional expectation of the evader’s terminal position given 
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where 

i < J u '(Wyf)'4m,-(I/2) f o ' | WvfPrf* 


(45) 


and 






(46) 


Thus, from (36), 


Efafl.M,] 


EojiUjAjj = £n[yfa.rf.|Ml MAjj 

£„[\,]V(,] Ea[<>i.1i\Mi] Ea[n t i t \Mi] 

(47) 


Now, from (40) and (45), we see that i if and «/ are Mi measurable. 
Thus, they may be taken out of the conditional expectations in (47) 
giving 


£fo<|A4,] = V? + 


£o[y;.<i|.vit] 


(48) 


Now, again, since under TV t’t and m { are independent processes, 
the conditional expectations in (48) may be replaced by ordinary 
expectations with respect to the distribution of r ( giving (38). A 
similar argument gives (39), That i, is a solution of (42) follows by 
differentiating (46) using Ito’s rule. 

Remark I: Formulas (21) and (22) |or (38) and (39)] express the 
conditional expectations we are interested in, in terms of the expected 
value with respect to the distribution of the process c, of quantities 
which are solutions of stochastic differential equations driven by the 
measurements w t . It is not surprising since we do not know r t that we 
should have to take an expected value with respect to its distribution. 
However, evaluating this expected value in the general case could be 
prohibitively complex. For instance, to determine the quantities whose 
expected value is to be taken in (38) and (39), equations (40)-(42) 
must be solved. There is one equation—(41)—for each control path 
and the solution of this equation is a coefficient in (42) for i ( . Thus, 
(42) could be an infinite system of equations, one for each control 
path, which are drive by the incoming measurements m <. 

The difficulties mentioned in Remark 1 disappear in the following 
case. 

Finiteness Assumption: The probability distribution of the control 
i'i is concentrated on a finite number of paths 

i'?. (49) 


and is given by 


P[v, = «'!] = P\ i = l, (50) 

Theorem II, in this case, is easily seen to reduce to the following 
theorem. 

Theorem 3 If the finiteness assumption holds, then 


Remark 2: In Theorem 3, the solutions of the linear system (53) 
driven by the n known paths t'J could be precomputed to give the n 
functions tj\. Thus, (40) for yf and (54) for the n functions ,i\ are 
a system of n + 1 equations driven by the measurements ni, which 
must be solved to determine the quantities in (51) and (52). This is an 
implementable method for computing these conditional expectations. 

Remark 3: In the case of a general distribution for the paths of the 
control process *■*, the following could be considered as a numerical 
method for determining the conditional distributions in Theorem 2. 
Choose a finite partition of the path space of t't into sets 5 1 . ■ • ■. S„. 
Choose a representative element i>i in each set S,. Then form an 
approximate distribution P on the path space concentrated on the 
representative elements, by 

P[r/ = /’I] = m,). (55) 

The sets 5* should he chosen so that il u\ and t/j arc, respectively, 
solutions of (53) corresponding to an arbitrary element r< of S, and 
the representative clement i'J of S ,, the probability that i/J and y' t 
are far apart is small. Then for this finite approximate distribution, 
use the quantifies (51) and (52) of Theorem 3 as approximations for 
(38) and (39) of Theorem 2. 

How, in practice, could the previous results be used lo give a con¬ 
trol law for the pursuit-evasion problem? First, a finite representative 
family of control paths that the evader might use or nearly use should 
be selected and probabilities assigned to them. Then the formulas (51) 
and (52) of Theorem 3 would give the conditional expectations 

EMM,] and E[r.\M,\. (56) 

Then the conditional expectation of the evader’s terminal position 
is given in terms of the conditional expectations of (56) by (20). 
the predicted miss by (17), and the control by (7). This gives a 
computationally feasible procedure for computing (7). 

V. Conclusions 

Previous work (l| implied that the optimal control tor the pursuit- 
evasion problem could be given in terms ol a formula involving the 
conditional expectation of the evader’s terminal position. The Kallian- 
pur-Slriebel formula, a double conditioning with respect to both the 
past measurements and the values of the evader’s unknown control 
law, and Kalman filtering were used to express this conditional 
expectation in terms of expected values of given functions of past 
measurements with respect to the distribution of the unknown control 
law. When the unknown control law has only a finite number of 
paths, this gives a computationally feasible procedure for computing 
the optimal control law. In other cases it gives an approximation to 
the optimal control law. 


Rt-F'ERFNCES 


rr i u i # , E"_j P' y* ^ 

E[ui\M,} = yf + jyi ' p, JT 


and 


(51) 


(52) 


y p't‘ 

El "' lM,] = fepnf 

where yf is the solution of (40), y\ is the solution of 

fly', ~ [(f - -f dt: yi = 0 (53) 

and T, 1 is the solution of 

d il = ,l!(ffy}) f (thn, - Hyf <H)\ i ( \ = 1. (54) 
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A Subspace Fitting Method for 
Identification of Linear State-Space Models 

A Swindlehurst R Roy B Otteislen, and 1 Kailath 


Abstract — A new method is presented for the identification of systems 
pai amctcrued by linear state-space models I he method relies on the 
concept of subspact fitting, wherein an input/output data model param¬ 
eterized hv the state matrices is found that best tits, in the least-squares 
sense, the dominant subspace of the measured data Some empirical 
results arc included to illustrate Iht performance advantage of the 
algorithm compared to standard techniques 


1 INIROIHCTION 

Research in the identilication ol discietc time lintar systems has 
tocused m recent years on prediction erroi methods (PEM s) based 
on autorcgiessive and autoregressive moving average (ARMA) data 
models and then various derivatives (eg (l| [2|) Although linear 
slate space models arc common in estimation and control they 
have not been widely used m identification Implicitly ot course a 
given input/output model can be associated with various canonical 
slalt space realizations and thus a PEM might be ihoughl of as 
finding a state space model for the system One of the goals of 
tins note is to demonstrate however that there are some important 
advantages in explicitly considering more general state space forms 
m identification 

Several early approaches to the general state spate identification 
pioblem were based on examining the structure ol a Hankel matrix 
composed ol samples of the impulse response of the system [3| |5] 
More retenlly De Moor [6] [71 has developed a total least squares 
algorithm that exploits the same shift structure present in a certain 
input/output data model and that allows arbitiary input excitations 
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was supported by iht National Science I oundation under Grant M1P 9110112 
A Swindlehurst is with the Department ol Electrical and Computer Tngi 
ncenng Brigham Young University Provo UT 84602 USA 
R Roy is with ArrayC omm Inc Santa Clara CA 95054 USA 
B Otlersten is with the Department of Signals Sensors and Systems Royal 
Institute of Technology S I (HI 44 Stockholm Sweden 
7 Kailath is with the Information Systems 1 ab Stanford University 
Stanford C A 94305 USA 
fbFF 1 og Number 9406993 


Related methods have also been recently pioposed by Mount n 
18] and Verhaegcn et al [9] 

The method presented jn this note also exploits the mheiem slui 
structure in the data, out in a different way The motivation loj 
this new algorithm comes from some recent results m sensoi in i\ 
signal processing In particular it is shown how the idemihe m >n 
problem can be cast in the \uh\pace fitting framework whuc tin 
goal is to find Ihe input/outpul model which best his (m ilu It a I 
squares sense) the dominant subspace of the datu This appioach his 
been successfully applied by Otlersten and Viberg in the context ol 
narrowband dircction-of arrival estimation [ 10] Ol special mien.si is 
the facl that a weighting can be applied to the dominant subspace to 
emphasize certain directions wheic the signal to noise iaiio is high 
This weighting can provide an advantage in cases involving ncailv 
unobservable systems or an insufficiently excited state space 

II Daia Modfl and Assimhions 

Consider the following multiple input multiple output (MIMO) 
time invariant linear system in state space form 

Te +1 = Axk 4 Bu k 


y k = Ct k -4 Du k 4 m (1) 

where n e R u k 6 R y k € U 1 v k < U l and the system 
matuccs ABC and D are of consistent dimension The system 
input ua is assumed known and the output y k is coiruptcd by 
additive measurement noise in If several observations of the inpul 
and output vectors are available they may be grouped together into 
the single equation (e g see [6| and 111 ]) 

y - rx + HU + \ ( 2 ) 


where 
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x = [nan* i za- f; — i] and where U[nu x j) and V[h x )) 
are block Hankel matuces constructed exactly as Y but containing 
samples of the input and disturbance sequences respectively 

We will make the following assumptions concerning the data model 

of (2) 

■ The system is obseivable, and the block dimension / is chosen 
to be large enough so thal rank {P) - n This implies h > n 
• The input sequence u k has sufficiently eutted the system and 
j > n, so that rank ( X) = n 
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• The input is uncorrelated with the measurement noise (open-loop 
operation). 

• A nontrivial matrix U 1 can be found to satisfy UU 1 = 0. This 
requires j > mi , and leads to 

YU X = TXU x +VU X . (3) 

• rank (XU 1 ) — n, so that j > mi + u. This rank condition is 
satisfied for most choices of the input (e.g., with probability one 
if the input is zero-mean white noise) [6], [11]. 

With these assumptions, YU L is an li x p matrix, with n < p < 
j — mi. In most situations, p = j - mi > n, so that YU 1 is 
composed of a (low) rank n “signal” term rXU L that depends on 
the parameters of interest, and a “noise” term VU 1 that is (with 
probability one) full rank. 

It is convenient for us to also define here the “sample covariance” 
matrix 

fly, 1 = J (YU 1 )(YU X )' 

and note that it converges to its limiting expected value with 
probability one 

Ry — lini £ {/iy fM | = rR u j -}- (T*e 

where 

R u i = UxU x )(XU x ) 1 . a 2 E= -£ {(VU x )(VU L ) r ) 

and £ is normalized so that del (I?) = 1. It is easy to show that 
the generalized eigenvalues {A*} of the pair (H w l.E) satisfy 
Ai > ■■■ > A T1 > A„+j = ■■■ = A/, = rr 2 . Furthermore, the 
eigenvectors E = [ci • • ■ c„] associated with Ai, - ■, A„ satisfy 

EE = r(ri)T (4) 

for some n x n matrix T. We have written r as a function of a vector 
q, which contains the elements of the parameterization chosen for A 
and C . Just as there are many realizations or coordinate systems that 
can be used to describe the state space, there are many identifiable 
parameterizations ij that can' be chosen, each yielding a differeni T 
that satisfies (4). The subspace fitting method presented in this note 
is based on the relationship of (4). 

Note that with no measurement noise or infinite data, the model 
order n is revealed by simply counting how many of the smallest 
generalized eigenvalues are equal. With a finite collection of noisy 
data, a statistical test is required to estimate this quantity. This 
problem has been extensively studied in very general contexts, and 
many such tests have been developed (e.g., see [ 12]—[ 14]). We will 
thus assume throughout the remainder of the note that n has been 
correctly determined. 

III. A Subspace Fitting Approach 

Because of the effects of noise, only an estimate of the generalized 
eigenvectors E can be obtained from Ry / ± (or equivalently from an 
SVD of YU X ). Consequently, assuming an appropriate identifiable 
parameterization q has been chosen, we propose the following two- 
step identification procedure based on (4). 

1) Estimate 11 by means of the following weighted least-squares 
problem: 

r, =ai'griiiii|| EEW'^~ - r( V )T ||£ 

=HrguiinTr ( EEWE J E r ) (5) 

where Pjj = I - r(»/)[r(ij) 7 /’(if)] -1 J’(i|) 1 . The estimates of the 
system matrices are thus given by A(^),C(^). 


2) Estimate D and C using a noise-free version of (2) and the 
estimates A(q), C(ti). 

Some comments are in order concerning the two steps of the 
Weighted Subspace Fitting (WSF) algorithm described above. 

Step 1 : The choice of the weighting matrix W will be discussed 
below in Subsection A. A variety of possible parameterizations for 
7 / exist, depending on what (ir any) prior information about the 
system is available. For example, if the system is multiple-input 
single-output, a particularly appropriate choice is to assume that the 
system is in observer form, and hence that A is left-companion 
and C — [1 0 - - ■ 0 ). Alternatively, A could be assumed to be 
diagonalizable, in which case C is solved for directly, and the 
minimization of (5) can be simplified to involve only a search over 
the n pole locations. 

One of the advantages of the problem formulation considered 
herein is that any a priori information about the structure of A and 
C can be directly incorporated into the problem. In some instances, 
the dynamical model underlying the system may be well understood, 
but may depend on certain parameters that are imprecisely known. 
These parameters appear as unknown constants in the matrices A 
ami C of the system model, and could serve as the parameters in 
the minimization of (5) just as easily as the poles or the coefficients 
of the characteristic polynomial. Whether or not the set of unknown 
parameters can be uniquely identified is then problem dependent, and 
must be determined on a case-by-case basis. 

The minimization of (5) is identical in form to a problem in 
antenna array signal processing considered in [151. In fact, once 
an appropriate parameter vector has been identified, an algorithm 
essentially identical to that described in [ 15] may be used to estimate 
A and C. Additional information on the connection between state- 
space system identification and anay signal processing can be found 
in |16] and [17]. 

We note here that, even though E ^ l, consistent parameter 
estimates can be obtained by setting E = l if the noise is white 
and independent with equal variance among the / outputs, and if / is 
sufficiently large. 1 In other cases, however, ignoring E when E / / 
(as is done in [6], [7J, |9|) can lead to biased parameter estimates. 

Step 2: This step is also used by De Moor in his identification 
method [6], 17]. A description of the mechanics required to solve for 
D and D can also be found in [19], and will not be given here. 

A. Subspace Weighting 

In simple terms, the presence of the weighting matrix W in the 
minimization of (5) allows certain directions or dimensions of the 
low rank subspacc to be emphasized over others. For example, the 
generalized eigenvector e\ corresponding to the largest eigenvalue 
represents the component of the measured data where the signal 
energy is the strongest. The eigenvector e„ gives the direction where 
the signal energy is weakest but still nonzero. A measure of how 
“strong” or “weak” the signul energy is in a given direction can be 
based on how much bigger than a 2 the eigenvalues Ai, ■ ■ ■. A„ are. 
The difference A„ - <r 2 can be quite small in cases where r or X 
arc ill-conditioned (e.g., because the system is nearly unobservable 
or the input is not sufficiently exciting), and in such cases one would 
probably wish to give much less weight to c v in (5) than ci . 

The following “optimal” weighting has been derived for subspace 
fitting problems of the form (5) in [10]: 

Wns f =(A.-& i I) i A;' (6) 

1 How large / has to be for this lo hold depends on the magnitude of the 
noise. 
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Fig 1 Pole and zero estimates for simulation example 1 


wheie the diagonal matrix A contains Ai A as its diagonal 
elements, and <r 2 is an appiopnate estimate of <t 2 (obtained, for 
example as an average of the /# — « smallest generalized eigenvalues) 
The term “optimal” here means that, under certain conditions, the 
weighting ot (6) will lead to minimum variance parameter estimates 
The Hankel structure of Y and V in our problem violates one of 
these conditions, namely, that the columns of the data matnx be 
independent However, using Ww sr in (^) has been empirically 
shown to yield bettei results than using no weighting at all (W = 1 ) 


B Comparison to Other Techniques 

The principle difference between the WSF algorithm presented 
above and the methods described in [6], [7], and |9] is that the WSF 
approach can exploit the complete structuie of T rather than just 
a single “shift-invariance ” This advantage is especially evident in 
situations where a prion information is available about the structure 
of A and C On the other hand, the methods of |6J, [7J, and [9| admit 
a simple “closed-form” solution, whereas (5) can onl> be solved by 
means of a multidimensional search 




3(4 
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Fig 1 Continued 


Util ike standard ARMA-based PEM approaches, the WSF algo¬ 
rithm is just as easily applied to MIMO systems as m the single¬ 
input single-output case, has a built-in mechanism for model order 
determination, and can exploit structured system models with more 
compact parameter!/ations Even in the unstructured case, WSF 
tends to yield pole and zero estimates with much lower variance 
than PEM’s, as illustrated by the simulation examples ot the next 
section Recent work in identification and controller design for rapid 
thermal processing of semiconductor wafers [19], [20] has also borne 
out some of the advantages of using subspace-based identification 
methods over PEM\ 

IV Simui ation Examples 

In this section we considei two simple simulation examples that 
illustrate the advantage of the subspace fitting approach In the first 
example, the parameters of the following three state SISO discrete 
time system will be estimated 

0 30 -0 40 0 60 
A = 0 00 0 80 0 00 

0 00 0 00 0 79 



C — [1 30 0 40 -0 80] 

D = 1 

The poles of this system are located at 0 3, 0 79, and 0 8, and the 
zeros at 0 82, and 0 68 ± j() 33 Note that for this system, X will 
be ill-conditioned since the lower right 2x2 block of A is diagonal, 
with nearly equal diagonal elements, and the last two elements of B 
are identical This is also evidenced by rhe fact that there is a near 
pole-zero cancellation in the transfer function 


This system was simulated using a zero mean unit powei white 
Gaussian process as its input, and assuming a zero initial slate 1 White 
Gaussian measurement noise with a standard deviation ot 0 001 was 
added to the system output Using the noise free input and the noisy 
system output three methods were used to estimate the system poles 
and zeros These were the method of De Moor (6] [7| (which loi 
brevity, we refer to as the S4ID technique) the WSF algorithm and 
a PEM based on an output enoi model [1] The correct model older 
was assumed to be known in each case 

We conducted 250 Monte Carlo experiments with an independent 
measurement noise and input sequence generated for each trial The 
block dimensions of the Flankcl matrices were chosen as \ = 12 
and j = 50, corresponding to a total of 61 output samples for each 
trial The WSF method was implemented using the weighting ot (6) 
and assuming a diagonal parameterization for the system matux A 
Both WSF and PEM used the true parameter vectors to initialize 
their respective search routines The results of the simulations are 
displayed in Fig 1 The solid line at the right of each plot represents 
the unit circle, and the true pole and zero locations are indicated by 
the symbols x and o, respectively 

The variance of the poles and zeros estimated by WSF is clearly 
much smaller than that of the other algorithms This is especially true 
for the pole at 0 3, in fact all 250 estimates are so closely bunched 
together that they are almost indistinguishable from the x marking 
the true pole All three algorithms estimated the complex zeros very 
accurately, the individual trials are again indistinguishable from the 
true zero locations However, the variance of the real valued zero is 
much smaller for WSF than for either PEM or S4ID In addition, 
both PEM and S41D estimated complex poles on about 20'/ of the 
simulations, while the WSF poles were always puiely real (although 
they were not constrained to be so) One drawback of the subspace 
based methods is that, on a few occasions (5 for WSF and 11 for 
S4ID), they produced unstable pole estimates 

2 The effects of a nonzero initial state can be handled by assuming the 
presence of an additional input that is a unit Dirac impulse [6| 
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Fig. 2. Pole estimates for output error model, example 2. 
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Fig. 3. Pole estimates for WSF, example 2. 
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In the second example, we consider the following system used in 
the simulation studies of T9] 

4 [1.92 —0.9316 

* = 1 0 



C = [0.05 0.025] 

0 . 


The poles of this system are at 0.96 ± jO.Ol, and there is a single 
zero at -0.5. The same type of input and measurement disturbance as 
above were used in this case, except the variance of the disturbance 
was increased to one. This resulted in an average effective output 
SNR of only 4.4 dB. The dimensions of the block Hankcl matrices 
were chosen as i = 40 and j = 161, for a total of 200 samples per 
trial. As before, both WSF and the output error PEM were initialized 
with the true parameters, and 250 independent trials were conducted. 
The results of the simulation are plotted in Figs. 2 and 3. 

Fig. 2 shows the pole estimates for the output error model, with 
the true pole locations indicated by the crosshairs, and Fig. 3 shows 
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the ^ame for the WSF algorithm No separate plot is shown for S41D, 
since it gave results essentially equivalent to WSF in this case (with 
the exception of one outlier) The WSF approach agam yields pole 
estimates with a much smaller variance than the corresponding PEM 
The average mean~square prediction error for WSF in this case was 
0 985, compared to 1 403 for the output error PEM 

Because of the various optimality properties of PEM’s in general 
[1], one may be surprised by the superior performance of WSF 
m the above two examples Since both algorithms used essentially 
the same minimization procedure (Gauss-Newton iterations with 
identical initial conditions and termination criteria) the relatively 
poor performance of the output error PEM in these examples may 
be attributed to a propensity for convergence to local minima 

V Conclusions 

We have presented u new method for identification of linear 
systems parameterized by state-space models The method is based 
on the notion of weighted subspace-fitting (WSF), a problem-solving 
philosophy most recently applied to parameter estimation problems 
in sensor array signal processing Like other state space identification 
schemes, WSF is easily applied in the MIMO case allows tor direct 
incorporation of a prion physical constraints into the problem, and 
appears to have better numerical properties than methods based on 
input/output models In addition, the ability to appiopnately weight 
the signal subspace vectors used in the WSF minimization pro 
vides a degree of robustness over previous subspace-based methods 
when the system is nearly unobservable or not sufficiently excited 
Although implementation of WSF requires a multidimensional pa 
rameter search, accurate initial conditions for ihe search aie readily 
obtained 
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Consistency of Modified LS Estimation Method for 
Identifying 2-D Noncausal SAR Model Parameters 

Ping Ya Zhao and John Litva 


Abstract —Least squares (LS) and maximum likelihood (Ml) an the 
two main methods for parameter estimation of two-dimensional (2 D) 
noncausal simultaneous autoregressive (SAR) models MI is asymptoti¬ 
cally consistent and unbiased but computationally unattractiye On the 
other hand, conventional LS is computationally efficient but docs not 
produce accurate parameter estimates for noncausal models Recently, 
in [17], a modified IS estimation method was proposed and shown to 
be unbiased In this note we prove that, under certain assumptions, the 
method introduced in [17] is also consistent 

I Introdi chon 

I wo dimensional (2 D) signal and system modeling and parameter 
estimation have many applications such as 2 D Kalman filtering 11 ] 
image estimation and identification [2]-[4] image restoration [5| [6] 
multidimensional spectrum estimation [7] [8] direction finding [9] 
textuie analysis and synthesis [10] [11] multidimensional system 
identification |12] [13] etc Several kinds of models arc used to 
utc a few Gauss-Markov random field (CiMRF) models which 
are sometimes called conditional Markov (CM) models [14]-[16| 
simultaneous autoregressive (SAR) models [ 16] [17J autoregressive 
moving average (ARMA) models [181 [19] etc In this note we will 
concentrate on the problem of estimating the parameters {f//, /} of 
the 2 D noncausal SAR model in the form of 

//(' j) = - (,k M/( ' “ * v-0 + '(' j ) (l) 

(k Dc \ 
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vhere {< (/, 7) f is a field of independent and identically distributed 
indom variables with zero mean and constant variance \ is called 
it neighbor set of the model whose shape determines the causality 
l the model According to [20] the model is a noncausal one if \ 
l b the form 


’N C {(A /) - P < A < p -(/</< <j )\{(0 0)} (2) 


I his is the most general of all the models and is known to provide 
iI k best peitormancc For the same specific problem using the 
noncausal model will lead to simpki model form and more efficient 
i omputational schemes 

11 the model is noncausal howevci <{i /) will correlate with 
{i/(/ — / i - /) (A /) f- ’N [ and the convenlional least squaits 
(l S) will not be an unbiased and consistent parameter eslnnatoi 
One is then forced to use the computationally inefficient maxi 
mum likelihood (ML) estimator to dcteinune llie noncausal SAR 
model parameters accurately To avoid excessive computation ol 
the ML an iterative scheme was developed m [16] which gives 
ipproximate Ml estimates of the SAR model parameters As a 
means ol overcoming inaccuracy in ihe conventional 1 S a modified 
IS estimator was proposed in |17| The modified LS has been 
proved mathematically to be an unbiased estimator under some mild 
conditions and the expci iments in 1171 show that the modified LS 
md the appioxnntle Ml have the same estimation accuracy On 
ihc othei hand Iht modified LS is supenoi to the approximate MI 
(torn a computational point of view If the modified IS estimate 
is calculated directly from (14) u icquires fcwei operations than 
lor one iteration ol the approximate ML whose implementation 
usually icquues sevcidi delations More importantly the modified I S 
possesses some shift invariance properties [211 which can be used to 
develop a last implementation based on iccursion An order recursive 
algorithm with the computational complexity of 0{m ) where m 
is the number ol Ihc estimated parameters is given in 117] and 
a spalially recursive algonthm with the computational complexity 
ol I bin 1 / ■+ (){m) multiplications and divisions per iccursion 
(MADPR) is given in [22] Ihc spatially lecuisivc algorithm uses 
only iht local observations appearing in a sliding data window so 
it can be used to estimate 2 D nonstationar) and noncausal SAR 
model parameters It is believed (hat tht computational efficiency is 
an important issue and reeuisive algorithms usually lead to leduction 
in computational burden [21| |24| 

When further compai isons arc made to the conventional L S eslima 
lor the modified LS estimator is favored to havt another significant 
idvanlagc The convenlional L S normal equations are somtllines 
ill conditioned which leads to numerical instabilities However the 
coefficient matrix Of tht modified LS noimul equation has a dillerent 
stiucture with a lower piobability ol being singular It thus can 
effectively avoid the instability problem which is very important in 
real implementations ol the paramctei estimatois 

Yet consistency is an extremely essential chdiacterislic for an 
estimator which can serve as a very important criterion lor selecting 
an cstimatoi lor a particular application The mathematical proof 
of the consistency of the above mentioned modified l S estimator 
has not been presented anywhere in the literatuie The purpose of 
this note is to present a proof for the consistency of the unbiased 
modified l S estimator This proof corioboratts computer simulation 
results that the mean square errors ol the modified LS decrease 
at their greatest rate when the sizes of the data sample windows 
increase and that the mean square error of the modified LS estimator 
has a rate of decrease which is much greater than the approximate 
ML 


11 The Modified LS Estimation Mi ihod 

When the model is a causal one the conventional LS is ui unhi isul 
and consistent estimator The basic idea of the modilud 1 S is the 
translation of the noncausal SAR model parameter estmiaiion problem 
into a problem based on a causal model and on anticausal model This 
is achieved by translatng (1) into 

% <>'/(' J) = - 51 '/a /(/('-A ;-/) + *{/ j) (T) 

(A /)(-\ 

7o oV(* y) = - 51 '/a /'/('-A y-/)f< (i /) (4) 

(A Oc x 


where 

% i + % ( ““ 7o o = l (S) 

'('/) + ( (/ y) = t (i i ) (6) 

A C. {(A /) 1 < A < p -q < /<,(/} U {(() l) 1 < / i (;} (7) 
^ C {(A I) -p< A < -1 -q < I 5 I/} 

U (( l ) /) -q < / < - 1 } ( K ) 

provided that such a decomposition exists Minimization ol the 
modified LS obicetivt function 

Ha, ) - 5Z U' <' J*] + 1' (' J*] 2 ) + ■?/>(</„, + </ - i) 

( ;U < 

= y. {[< (' /)] + [« (< ;)] ) + iii, (Hq , i + - ]) 

i in < 

W 


with rtspecl to results in the modihed LS estimator where G is a 
2 D data window n ( is a number of data points contained in (> and 


a, = [a , 1 mi'i 'i a. 


hi 


1, = 


<U o 

column [(if, i (A /) F ’N } 
q ( 

column {qi i (A l) e \ [ 


D _ a ( (j — 


Ei , m 1' (' >)] _ E, 


/)] 


% 0 


The resulting modified LS cstimatoi is 


g, (Ci) = '(C. )z t 


( 10 ) 

( 11 ) 

( 12 ) 

I 

(H) 

(14) 


where 


R,U «) = 


jn,(G)j 


0 \\< 


0 


n< 

0 


R,(6)J 


d^) 



JI* 

j) 

(isjyea \ 

X *pU<j)* a r {hj) 

(i,mo 

[»(«./)' 

r i/i*- j)' 

Vp(». JT) = column {;i/(i - k. j -l): (*■, /.) e A" } 
j) = column{</(/' - fr. j - l): (k, l) 6 A"'} 
*(*. i) = [#(«• j + q)• ■ ■ V(i< j ) • • ■ y(i- j - g)] T 


K<G) = 

*p(*! i)~ 

*>.» = 
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(16) 

(17) 

(18) 

(19) 

( 20 ) 
( 21 ) 
( 22 ) 


Proof: By the same procedure as that in [17], it can be shown 


that 


fl P (G)[ffp - S,,(G)] = 


H(,.j)eo !/('’ j)t"(>d) + D 
0 

Eo-mcM*' .»>'(»• + D I 

. Vp<'- J) J 


(25) 


Let 


JMG)k - fl P (G)|[ff„ - g„[G)} r Rl(G) = S (26) 


where 



Su 

5j2 

0 

5 M 

5l5] 


*? r 

S‘2’2 

0 

5 2 4 

5*25 

5 = 

0 

0 

0 

0 

0 


Six 

S‘24 

0 

544 

545 



5 2 a 

0 

51s 

5 t r ) 5 - 


(27) 


z v = (0 • ■ • Otic 0 ■ ■ ■ 0] T . (23) 

Boldface 0 stands for a matrix or vector with all zero entries, J 
is a variable dimension square exchange matrix with ones along 
the antidiagonal and zeros everywhere else, and g p [G) denotes the 
estimate of g p based on the use of the data {y(i, (/, j) E 2?}, 

and D D G is another data window, such that for (i\, ji) € G, the 
elements of vectors ji ) and xJ(tV ji ) are defined on D, i.e„ 
if y(i % j) is an element of a£(f'i, j \) or *J(*j, ji ), then (?'. j) E D. 

Using the same procedure as that in [17], we can prove that g p {G) 
is an unbiased estimate of g if 

1) R V (G ), defined by (15), is a positive matrix and independent 

of y v - a,,(Gy. 

2) the decomposition of (1) into (3) and (4) and e(t\j) into 
t r (i\j) and e"(i\ j) which are independent and have zero 
means and constant variances exists, and (13) is valid for 
sufficiently large G; and 

3) j) and c a (i.j) are uncorrelated with f y(i - k. j - 
./): (*, /) € V } and '{y(i - k, j - /): (k, l) E ;V“), 
respectively. 

Because the same modified LS objective function (9) is used, 
g v {G) is actually the same estimate as that in [17] except that D is 
factored into nod and rf, instead of D, is used as the central element 
of 9 P (G). From (13), we see that d is approximately a constant and 
D is proportional to no if {?' ( i, j)) and {*“(■*. j)} are stationary. 


The purpose of the proof is to show that the absolute values 
of all elements of matrices £{5n}, £{5n}, and £{5s5}. vec¬ 
tors £{5i2), £{Si 4 }> £{525.}, and £{54s}, as well as scales 
£{522}. £{£ 34 }, and £{5-4 -i} 4 do not exceed O(na) as nr; —<■ oc. 
Let us begin by studying £{5n}. 

From condition 2). we conclude that e a (i, j) has zero mean, and 
from condition 3) that r“(?\ j) is uncorrelated with y a p (i. j). Further, 
from condition 5), we conclude that c a {i, j) is uncorrelated with 
y(i - k, j - l) for (k\ l) g N U {(0, ())}. It follows, by using 
condition 4) for {ki . /]), {fa . /a) E A a that 

\E\y(ii-ku j] ~l\ )y(h~fa. j-i~h)(' <l (h. 72)} I < M 

(28) 

it (i 1 - ji ) = ( i'2> j‘2 )4 (/j — i'2 4- fa , j\ — j‘2 4- h) F A r U {((). 0)}, 
or (12 — /1 + A’i, j -2 - ji 4- / 1 ) E N c U {((). 0)}; otherwise, the 
left-hand side of (28) equals zero. Thus, the absolute value of the 
[ms - \h (2</ + l)4-/i| +1. ms - \h(2q + 1 )4-/-j| 4- T)th element 
of £{5n) can be expressed in the following form: 

£ j 3/(' 1 “ ' £ ~ h )c ,a ( M - j\) 

z ~ ~ i' 2 ) e<l C 1 ' 2 ' j 2 ) I 

J 

< Y, \E{v{ix-k,.jy-!,) 

( 1 1 - j 1 ) - (• 3 1 j 3 ) C ; C! 

(il+ 7l-J2+fj)€N ,: Uf(0. »)} 


111. Consistency of the Modified LS Estimator 

Lemma: Given a stationary image [y(uj): (i. j) E £}, the 
absolute value of any element of E{R,,{G)[g v - g v [G)\[g v - 
g v {G)} 7 Rl(G)\ does not exceed O(na) as net —+ 00 n where nu 
is a number of data points contained in G, if the conditions 2) and 

3) are valid, and 

4) there is a positive number M such that for ( 11 , 71 ), 
(^2* j‘ih l in . j: 1) E B and k] + k2 + fa ~ 4, 

|£{[tf(n- j, )J*■'F*«*. h ('•■• h )]*'}| < AT: (24) 

and 

5 ) e' {L j) and e n (i, j ) are uncorrelated with y(i - k. j -l) for 
{k. I) £ N U{( 0 , 0)}., where A r is the neighbor set used in ( 1 ). 


■ '"(/‘i* ji )yU‘2 ~ fa, j-2 - hy'[t2. J2)}\ 

+ 53 |£7{i /(/1 - ki. 7 i - /i) 

(«1 ’ >1 - J2 )£G 

(•2-M +tj • J2-Jl + 'l)€A7‘-U{(U, 0)) 

■ j\ )y( i ‘2 - k 2 . J 2 - h) r a (hi h)} \ 

4- 53 h - h ) 

('iwi)er; 

■ ?/0l - A’2, ji - /j ) [f " ( lj , 7 I )] 2 }| 

= 5 < 2(ni,v 4* 1) n r; A/ + nc;M (29) 

where m a is the number of the parameters contained in N a or N r . 
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Similarly, the absolute value of the ( m \ - |A i ( 2q 4-1) 4- /, | -f 1 )th of F f5i a } is 
dement of F{Su} is 


W Y V(<1 -1| J| -/jV"(h ji) 

I [(-l 

j Y v *'-’ Ji ^ r *' 2 J-) + D 

L< J )<■<• 

< |^{(/('i —Ai ji — /1 )< (/1 ji) 

(l i 1 M * i J Kf 
( 1 . 1 N {( )} 

'/(' - A Jf (/ 2 M}| 

4- 52 IMvM ” A i 71 — /j) 

( L 1 ( *)£ < 

( j +aj j i i ( m 

< ('I )\)'/(<> J.)f (ij J )}l 

£ 2(m \ 4- 1 )ii( M (30) 


ri ]L 71 " ,i)f 7>) 

I [(m ii)fT 

r y>b G* j J + 

[( i i x ' 

< 52 IM'/G.-Gy /,) 

i i ( ><< 

< i j s > 

< (m ;i)>/(' j 2 )( (i. i )|| 

+ Y m Mill - It - l\) 

1 Jl ( < 

H‘i -Ml 'V 1 f ] 

« ('i /i)v(" h)< (I /2)}I 

< 2 (m/v + Due V («) 




Since f (t y) and t (t y) have zero means and aie uncorrelated 
with /yG - A y - /) fot (A /) £ A U {(0 ())} < (i j) is uncor 
rtlaled with y f G j) and < (/ y) is uncorrelated with y f '(f y) lor 
(A i /i) € ’S and (A /J € ^ respectively then 


|F{i/(/i A i 7* — /i )r (/1 /i )</(/ -A yj-/ )r (/ y )}|< U 

( 31 ) 

d ( 1 1 — / 4 A 2 j i - y 4* / ) F 3 U {(0 0)} oi G — 1 1 + A i / - 
yi 4 /i) E 'N U {(()())} otherwise the left hand side ot (31) equals 
/cio Thus the absolute value of the (w \ - |A i (2ry 4- 1) 4- l\ | 4- 
1 |A (2«y 4 1) 4- / |)th element ol F{£i } can be expressed as 


r 


I e 

(< i i 


y{i \ — 1 1 j i — M» Gi yi) 


52 <y(' - A )2 - I )i (* y ) 

( v >M ; 

< 52 |F{i/(/i - A I yi - /)) 

i i 2 

i M i n c v { ) 

( Gi )\ )y{i 2 - A j M'('/ y )}| 

4- 52 |F{</(>i - A i /i — /1) 

( i i ) 1 

I HA I 1 /j N t 0)j 


< (n yi )'/('2 - A. y - / b (/ y )}| 
< 2(m \ 4- 1)/^r 3-/ 


(32) 


By a similai piocedure we can prove that loi (J| /|) (A /J £ 
A the following statements aie true 

1) the absolute value of (|/ (2f/4-l)4-/ | )th clement of / {S; } * 
2(?n \ 4- 1 )ii( 1/ 

2) the absolute value of (|A (2i/4~l)4d |Hhelement ot F { S ( 

2{m \ 4- 1)»( 3/ 

3) the absolute value ot (|A i (2ry 4-1) 4-/i | |/ (2ry + 1) 4- L|)th 

element ot £ {5 } < 2(m \ 4- 1 )>u 3/ 4- to 3/ 

4 ) IMS }| < 2(m \ + 1 )n< M 4 tt( 3/ 

3) j F{ S i }j ^ 2(ni \ 4- 1 )»T( 3/ 4- n ( 37 and 

b) |F{ Sa i}| ^ 2(ni\ 4- l)T?r 3/ 4- tu 37 

To summarize the absolute value of any element ol i {5} docs 
not exceed 0(n f ) as n< —♦ *v 

Because the maximum absolute value ot all elements of a matrix 
is a norm ol the matrix under the conditions ol the above lemma 
the norm ot / { R, (fi )[g t - g } (f»)][^ - (C T )] 7 R\ (d)| docs not 
exceed ()[nc ) as n< —> x 

Tluoum Given a stationary image {iy(/ y) (/ y) e Z7J the 
modified LS estimator defined by (14) is a consistent estimatoi d the 
conditions in the lemma are valid and (34) (found at the bottom ol 
the page) is a positive definite malnx 
PtooJ Foi large value of iu 


*,«.) = n< Q + 0("r ) OS) 


where Q is defined by (34) and &{ n< ) = R, { G) — in Q is a matrix 
satisfying the condition that 


and the absolute value of the (w \ — |A i (2r/4-1 ) + /i 14- J )th element 


F{& J (n f )}=()( n< ) (36) 


JF{x/(t y)x ; 1 (t j )}J 0 

1 


Q = 


o 


0 


0 


0 


(34) 


0 


1 

0 


F{x y (/ y )x t 1 (/ j )} J 
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Therefore 

E{n v (G)[ ai ,~g p (G)][g p -g f (G)] T R T p (G)) 

— E{[ncQ + 0{na)][/,, - g p {G)][g p ~[»<iQ + O(na )] 1 } 
*»lQE{\g p -i p {G)]\f p -»„(CV)] T }g + 0(».^ 2 ). (37) 

The application of the lemma, which says \\E{R P {G)\g v - 
^ F (C?)][fi,—(C7)}|| does not exceed 0 (tir;) as tu; 
leads to 

QE{\g p - 9 ,,(G))[g,, - g„(G)] T }Q — 0 as uci — x (38) 

and thus the consistency of the estimate g p (G) follows because of 
the positive definiteness of Q . 

IV. Concluding Remarks 

In this note, we have proven analytically the consistency of the 
modified LS estimator recently introduced in [17], which can serve as 
a very important basis for choosing the best estimator. The conditions 
for the consistency are 2)-5). There have been some discussions 
on conditions l)-3) in [17]. Condition 4) can be valid in practical 
applications where {y( /, j )} is bounded, and therefore {r' (#', j )} and 
j)} are bounded. Condition 5) means c* (i, j) and /*(/, j) 
are uncorrelated with y(k\ /) far away from the point (i< 7 ), which 
is also very natural in real applications. 

Extensive computer simulations with charging size data sample 
windows have been done. The results of the computer simulations 
show the following: 

1) The mean square errors for the modified LS estimator decrease, 
and the modified LS parameter estimates converge to their 
corresponding true parameter values with the greatest rate when 
the sizes of the data sample windows increase. 

2) The rate of decrease in the mean square error for the modified 
LS estimator is about seven times greater than that for the 
approximate ML. 

3) The conventional LS estimator has the least rate of decrease in 
its mean square errors because, in this case, the conventional 
LS is not theoretically a consistent estimator. 

The computer simulation methods and results are omitted from this 
note due to the space limitation, but they are available upon request. 


[9) R. R. Hansem, Jr„ and R. Chellappa, “Noncausal 2-D spectrum esti¬ 
mation for direction finding,’* IEEE Trans Inform Theory, vol. 36, pp. 
108-125, 1990. 

[10] R. Chellappa and R. L. Kashyap, “Texture synthesis using 2-D noncausal 
autoregressive models,” IEEE Trans Ac oust,, Speech, Signal Proc essing, 
vol. ASSP-33, pp. 194-203, 1985. 

Ill] P. Y. Zhao, A. R. Figueiras-Vidal, and J. R. Casar, “Color texture 
analysis and synthesis,” Rep., Universidad Polilecmca de Madrid, Nov 

1991, 

112J P. Y. Zhao and Z. Y. He, “Multidimensional system identification and 
applications,” Advances Model Simul., vol. 18. no. 1. pp. 15-26, 1989. 

[13J P. Y. Zhao and D. R. Yu, Multidimensional System Identification 
Shanghai, China: Shanghai Scientific & Technical Publishers, 1994 (in 
Chinese). 

[14] J. W. Woods, “Two-dimensional discrete Markovian fields,” IEEE Tiarts. 
Inform Theory , vol. IT-18, pp. 232-240, 1972. 

[15] __ “Markov image modeling,” IEEE Trans Automat Conn , vol. 

AC-23, pp. 846-850, 1978. 

[16] R. L. Kashyap and R. Chellappa, “Estimation and choice of neighbors 
in spatial-interaction models of images," IEEE Trans Inform Theory , 
vol. IT-29, pp. 60-72, 1983. 

[17] P. Y. Zhao and D. R. Yu, “An unbiased and computationally efficient 
LS estimation method for identifying parameters ol 2-D noncausal SAR 
models,” IEEE Trans Signal Processing , vol. 41, pp. 849-R57, 1993 

[18] R. L. Kashyap, “Characterization and estimation of two-dimensional 
ARMA models,” IEEE Trans Inform Theoiy, vol IT-30, pp. 736-745, 
1984. 

[19] P. Y. Zhao and D. R. Yu, “Spatially recursive algorithm to estimate 2-D 
ARMA model parameters" Sci Lett , vol. 36, no 8, pp 578- 581, 1991 
(in Chinese). 

[20] A. K, Jam, “Advances in mathematical models for image processing," 
Proc IEEE , vol. 69, pp. 502-528, 1981 

[21J P. Y. Zhao and Z. Y. He, "A fast spatial recursive algorithm for 2- D 
adaptive LS filtering and prediction,” Ads am es Model Simul . vol 13, 
no. 1, pp 57-64, 1988. 

[22] P. Y. Zhao and D. R. Yu, “Spatially recursive algorithms for adaptive 
estimation of two-dimensional non-causal and non-stalionaiy simulta¬ 
neous autoregressive model parameters.” Ini J S\st Su , vol 23, pp 
1033-1049, 1992. 

[23] Y Uetake, “Realization ol noncausal 2-D systems based on a descnplor 
model,” IEEE Trans Automat Conti , vol. 37, pp 1837-1840. 1992 

[24] Y. Uetake, “Optimal smoothing for noncausal 2-D systems based on a 
descriptor model ,"IEEETians Automat Conti , vol 37. pp 1840-1845, 

1992. 


References 

[1J J. W, Woods and C. Radewan, “Kalman filtering in two dimensions,” 
IEEE Trans Inform Theory , vol. IT-23, pp. 473-482, 1977. 

[2] H. Kaufman, J. W. Woods, S. Dravida, and A M. Tekalp. “Estimation 
and identification ot two-dimensional images,” IEEE Ttans Automat 
Conti , vol. AC-28, pp. 745-756, 1983. 

[31 J. W Woods, S. Dravida, and R. Mediavilla, “Image estimation using 
doubly stochastic Gaussian random field models,” IEEE Trans Pattern 
Anal Machine Intel . vol PAM1-9, pp. 245-253, 1987. 

14] F. C. Jcng and J. W Woods. “Compound Gauss-Markov random fields 
lor image estimation,” IEEE Tians Signal Processing , vol. 39, pp. 
683-697, 1991. 

(51 H. C, Andrews and B. R. Hunt, Digital Image Restoration, Englewood 
Cliffs, NJ: Prentice-Hall, 1977, 

[6) A. K. Jain and J, R. Jain, “Partial differential equations and finite 
difference methods in image processing. Part IT Image restoration,” 
IEEE Trans. Automat Contt . vol. AC-23, pp 817-834, 1978. 

[71 J. W. Woods, "Two-dimensional Markov spectral estimation,” IEEE 
Trans Inform. Theory, vol. IT-22, pp. 552-559, 1976. 

|8] B. F. McGuffin and B. Liu. “An efficient algorithm for two-dimensional 
autoregressive spectrum estimation,” IEEE Trans Acoust, Speech, Sig¬ 
nal Piocessing , vol. ASSP-37, pp. 106-117, 1989. 



'■EE TRANSACTIONS ON AUTOMATIC CONTROL. VOL. 40. NO. 2, FEBRUARY 1995 


A Robust Hybrid Stabilization Strategy for Equilibria 

John Guckenheimer 


Abstract —For an equilibrium of a general dynamical system, the 
domain of stability of a linear feedback controller is enlarged by the use 
of a general “hybrid” or “switching” strategy. The strategy is illustrated 
for numerical simulations of an Inverted double pendulum on a cart. 


1. Introduction 

Linear control strategies provide a means for the stabilization 
of equilibria under general hypotheses. When applied to nonlinear 
systems, the effectiveness of these strategies depends upon the size of 
the domain of stability that is produced for the stabilized equilibrium. 
If this domain is small compared to the accuracy of the measurements 
or the size of disturbances within the system, then the linear controller 
is likely to fail within a short period. Failure of the system can 
be catastrophic, with the system wandering far from the desired 
equilibrium. We present here a general procedure to recapture stability 
of a linear controller when a trajectory leaves it region of stability. 
By using a hybrid strategy based upon discrete switching events 
within the state space of the plant, the system returns to the region 
of stability for the linear controller from a much larger domain. 
The control procedure is robust and remains effective under large 
classes of perturbations of the underlying system. We illustrate the 
effectiveness of our technique by applying it to the control of an 
inverted double pendulum. 

The stabilization of unstable equilibria is a fundamental problem 
for the control of engineering systems. A sufficient condition for 
stability of an equilibrium point of a smooth dynamical system is 
that the eigenvalues of its linearization lie in the left-half plane. 
This is easily proved by several means, for example, by defining 
a quadratic Lyapunov function in a neighborhood of the equilibrium. 
Control theory addresses the questions of when stabilization is 
possible with the modifications (controller) that can be built into 
the underlying system (plant). Over the past 50 years, an extensive 
theory of “linear control" has developed comprehensive procedures 
for determining when the stabilization problem is solvable and for 
the design of controllers that implement stabilization. This theory 
is widely employed in engineering for the design of controllers 
in communication systems, chemical process control, avionics, etc. 
However, linear teedback control is not a complete panacea for all 
control problems, even ones of stabilizing equilibria when complete 
state-space information is available at all times. One difficulty that 
is encountered in some applications is that the domain of attraction 
of a controlled equilibrium may be small. This leads to unacceptable 
constraints on system performance. Small random disturbances in 
the environment or the inability of the actual physical system to 
implement its model idealization lead to failures of the controller. 
The results of the failure can be catastrophic in terms of the design 
objectives. For example, in the double pendulum example we describe 
below, the failure of a linear controller leads to large motions of the 
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pendulum, and in the presence of damping, the pendulum evenni.il> 
falls to rest at its naturally stable hanging position. 

The goal of this work is to provide a simple, effective mums 
of recovery from the failure of a linear controller. We \mm i> 
design a “safety net” around the (small) domain of attraction ut ,» 
linear controller, so that if a disturbance moves a system outside 
this domain of attraction, it will be guided back into the domain 
by the application of a different control strategy. The strategy that 
we describe is very general. It can be applied to any system that 
meets conditions of controllability and accessibility. Moreover, the 
computations that are required for the design of a controller are 
based on the linearization of the system at its equilibrium, just as 
with linear controllers. Verification of the effectiveness of a particular 
design requires more extensive simulation, but the design guidelines 
are based upon accessible information. 

The framework in which our control strategy is implemented 
has precursors in the literature [2], [3J, [11], [12], [15]. The terms 
“switching” system, “variable structure" system, and “hybrid” system 
have all been used to describe piecewise smooth vector fields in 
the context of control, but there does not yet seem to be an 
effective, systematic theory of such systems. We shall use the terms 
switching system and hybrid system interchangeably. One of the 
essential aspects of our work is the presence of “hysteresis” in a 
piecewise smooth system: there is a discrete component of the state 
of the system used by the controller in addition to its location in 
the underlying state space of the physical system. We recall the 
description of hybrid systems that we have used previously [1] and 
adopt here. 

The problem domain is a disjoint union of open, connected 
subsets of R u , called charts. Each chart has associated with it a 
vector field. Inside each chart is a patch—an open subset with 
closure contained inside the patch. The patch boundaries are assumed 
piecewise smooth. The evolution of the system is implemented as a 
sequence of trajectory segments where the endpoint of one segment 
is connected to the initial point of the next by a transformation, 
although the transformations are trivial in the examples studied in 
this note. However, states of a system have both a continuous and 
discrete part, and switches that change the discrete part of the system 
state do occur. Time is divided into contiguous periods separated by 
instances where a stale transformation is applied at times referred to 
as events. 

We end this section with a few “philosophical” comments un¬ 
derlying our approach to nonlinear control. Structural stability is a 
useful concept for dynamical systems, stating that perturbations of 
a system remain equivalent to the reference system by continuous 
changes of coordinates. In implementing hybrid control for stabilizing 
equilibria, we have sought to maintain this type of robustness to 
perturbations of the system itself. Nothing in the controller should 
be subject to the choice of exact values of any parameters. In 
particular, we have avoided the use of sliding modes or switches 
that must be implemented exactly to be effective. To a large extent, 
this strategy involves trade-offs to attempts to achieve optimization 
of some cost function in the control because that is likely to push 
one to select parameter values in a system that arc borderline for 
a property. If the property is one that involves the stability of the 
system, then small perturbations cannot be relied upon to maintain 
the efficacy of the controller. Here the emphasis is squarely upon 
reliability rather than efficiency. It is quite possible that modifications 
of the strategy described here will lead to improved performance by 
returning a system to the stability region of a linear controller more 
quickly. 


0018-9286/95$04.00 © 1995 IEEE 
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II. Barriers to Unbounded Motion 

This section discusses a strategy for maintaining the motion of a 
trajectory in a bounded region of an unstable equilibrium point with 
a piecewise constant control, Consider the following linear system 
as a model example: 

i‘1 = AiU’i - r) 

,i’2 = A a (jr a - c). (I) 

In (I), r represents a "control" that moves the equilibrium point 

of the systems along a line. While we have chosen a particular 

form for this system, most planar vector fields with real eigenvalues 
can be transformed to this representation by a linear change of 
coordinates. Such a transformation to a "normal form" exists if the 
system is controllable. Necessary and sufficient conditions for this 
are that 1) the eigenvalues are distinct and 2) the control moves the 
equilibrium along a line that is not an eigendirection. We assume that 
Ai > Xj > 0, so that equilibrium point is a source. 

The goal is to define a feedback control r(j’) so that the motion ol 
the system remains within a moderate-sized bounded neighborhood 
of the origin. We approach this by defining a hybrid system with 
two patches that arc half-planes 7i+ and H _ defined by y > x - (, 
y < .r + r, respectively. The boundaries of ff+ and H. arc the lines 
L± defined by y = .»■ ^ r. These lines are parallel to the control line. 
In each of the two patches, the control r(.r) takes a constant value 
c ±. The values of c are chosen with the object of making trajectories 
in the overlap strip H+ fl H- stay in a bounded region of the origin. 
This defines a hybrid system with parameters r-* and t . The goal is 
now clear: to choose values of these parameters to create a trapping 
region surrounding the origin. 

We can compute readily that the trajectories of the system ( 1 ) are 
defined by 

J'i(t) - c + exp(/A, )(.n(0) - r) 

J , lt(f) = r + exp(/A 2 )(.r i (0)-r). 

Given r, we would like to find r± that creates a trapping region 
around the origin in the strip -r < j 2 - t 1 < c . Along a segment 
of the right boundary L *. of the stop, we would like the vector field 
associated with H+ to point to the left, toward the interior of the 
strip. Similarly, we would like the vector field associated with II- 
to point to the right on a segment of the left boundary /,_ of H . 
These segments are to be chosen so that the flow carries one into the 
other. See Fig. 1. To choose with the desired properties, we argue 
as follows. For simplicity, we shall assume that = — < so that 
the system has a symmetry. The symmetry streamlines the analysis, 
but is not essential to the argument that wc give. 

Regard the value of r as fixed for the moment. We shall determine 
the conditions we would like it to satisfy. Along £+, define the point 
p f to be the point where the vector field has slope l. The point 
is obtained by solving the equations 

.r, = ,r i - c 

Ai(./’i - r) = X’Aj'2 - c) 

whose solution is 

Above on L+, the vector field points to the right of /*+. Below 
on the vector field points to the left of £ + . The trajectory 
starting at p+. should lie above the trapping region in the strip. For 
example, we might want the intersection of the .i a - axis with the 
strip to lie in the trapping region. For this to occur, it suffices that 
the trajectory with initial condition intersect the strip at a point 



Fig. I. The barriers L± and region in which trajectories switch back and 
forth belwcen the boundaries. The curves segments show trajectories, with 
arrows located where they encounter a palch boundary 


TABLE 1 


A 

c/e 

1.5 

9.444 

2 0 

5.828 

2.5 

4.614 

3 o 

4.0 

3 5 

3 G29 

5.0 

3 063 

10 

2 518 

100 

2 065 


with a nonnegative value of .r|. A lower bound for < ^ satisfying 
this criterion is given by the value of i for which the trajectory with 
initial condition passes through the point (0, < ). This yields an 
implicit equation for r/< in terms of the ratio A — Ai/\j 

(r) M _ ( A, - 

C-r-) xI ‘ Aj> 

Representative values of the solution of this equation are given in 
Table I to three-digit accuracy. This discussion leads to the following 
theorem. 

Theorem 2 l Let /< be larger than the solution ol the equation 
(fP = (A, - Aj) (V| - X -' (A..) V -’ 

and set r_ = — . Denote by \Y± the trajectories of the vector 

fields X± defined by 

.i i = AI (./■ j — < £ ) 

h = A Aj‘2 - '4 ) 

passing through the points (0, ). Assume A,> < Ai. On the lines 

l £ defined by .r? = .n ^ », consider the segment m± with endpoints 
at ( 0 , ) and the intersections of with l ±. Then trajectories of 

A'+ with initial conditions on 1+ intersect /_, and trajectories of A_ 
with initial conditions on /_ intersect /+. 

To prove this theorem, we still need to verify that the trajectory of 
A“_ with initial condition at ( 0 , r) intersects the segment s f . From 
A\ , we obtain the equation 

d‘ l ’2 _ A 2 (.r-j H- O 
fl-r i Ai(.n + O' 


(2) 
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Equation (2) gives the trajectory of A'_ with initial condition (0, <) 
in a form parameterized by ,v j: 

^i, -r+ (r+ c)(l + y) ^ 

with A = A^/Ai. Similarly, the trajectory of A. f with initial condition 
(0. c) is given by 

(.n. / + (r- r)(l - y) A )- 

These two trajectories intersect at one point in the right-half plane if 
the trajectory of A'- passes below the point (c. r). Setting u = xj /r 
and ft = r/c\ we want to determine when 

-1 + (1 + 6)(1 + «) A = 1 - (1 - 6)(1 - «)'. 

This equation is readily solved for h in terms of ?/ 

i, = 2 - a - ») A - (i + ») A 

(1 + «) A - (1 - u) x ' 

At the solution to this equation, we want to have -l-|-(l + fc)(l + 
u ) x - ii + b > 0, so that the intersection point of the two trajectories 
lies above the line /+. This yields the requirement that 

h (! + //)-(! + «)* 

(1 + ii) - (1 + w)' ’ 

Substituting the value for h at the intersection point into this inequality 
and simplifying leads to the requirement that 

? - «(1 + i/) A + i/(l -/,)'- 2(1 - //-’) > 0. 

This inequality holds throughout the unit square in the (ti, /) plane, 
which is the domain of interest. We conclude that the intersection of 
the trajectories of A’+ and X with common initial point (0, t) lies 
above the line I + since we assumed A < 1. 

Recall that our hybrid system X applies a mode switch from A"- 
to A"^ when a trajectory hits /+ from the left and, similarly, u mode 
switch from A’ f to A'_ when a trajectory hits /_ from the right. The 
theorem implies the following corollary. 

Corollary 2.1: With the notations introduced above, define If to 
be the region bounded by \\ r ±, the trajectories of X± with initial 
conditions at (0. ) and .s^. Then trajectories with initial conditions 

in If remain in R for all forward time. 

Less formally, we say that R is a trapping region for the hybrid 
system A’. We can say more still about the dynamics of X in R. There 
are passage maps 0+ that map trajectories with initial conditions on 
*s± to their intersections with s T . The maps 0± are monotone, so the 
composition 0 is a monotone map of the interval * + into itself. 

It follows that this return map has a stable fixed point, representing a 
stable periodic orbit for the hybrid system. Additional computations 
lead to the conclusion that the return map is a contradiction and has 
only a single fixed point. To carry forward these computations, we 
use ihe coordinates which scale r to 1, writing u = .ri /r and b = e/e 
as in the proof of the theorem. The trajectory starting on the line /_ 
with initial value it = no is given by 

(,,o. -1 +(l+ * + "„) (}^) j. 

The intersection of the trajectory occurs at a value of u — ti\ 
satisfying 

— 1 + (l + ft + u o)(-r ~— ^ — i/1 + b = 0. 

\1 + "0/ 


We want to estimate <lu i /<•/ 1 /o from this equation. From the cqi.i 
equation 

(l + »o)* _ (1 + II! r 

1 + I# o + ft 1 -(- i/i — b 

we deduce that diii/duo < 1. The right-hand side of the Iasi cqu.ii, 
defines a function of i/i which intersects the function of </„ on the len 
hand side crossing from above to below while decreasing. Thcivfoie 
implicitly differentiating the last equation gives du \/du» < I Horn 
this, we conclude that the return map of oui hybrid system has 
derivative smaller that 1 and is a contraction. 

Theorem 2.2; With the same hypotheses of the pres ions theorem 
and corollary, there is a stable limit cycle for the hybrid system that is 
globally attracting for all initial conditions in the trapping region R. 

Remarks: 

1) Due to the symmetry of the hybrid system, the stable limit cycle 
is symmetric with respect to the origin. 

2 ) In systems with equilibria that are saddle points with two- 
dimensional unstable manifolds, the procedure described above can 
be applied with switching surfaces that are hyperplanes tangent to the 
directions spanned by the stable manifold of the equilibrium and the 
tangent to the control curve. We have not investigated systems with 
unstable manifolds of dimension larger than two, but the strategy 
might work there as well. In that case, one can try to use switching 
surfaces that are tangent to the directions spanned by the tangent to 
the control curve and all but the largest two eigendireclions. 

III. Multiple Barriers 

The results of the previous section can be extended and improved 
in a number of ways. We describe two. 

The barriers described in the previous section can be combined 
with linear controllers. If one knows a region around the equilibrium 
that lies in the domain for a linear controller, then one can define a 
hybrid system with three patches: the system described in the previous 
section, and a domain cut from these two patches in which the linear 
controller will be applied. If the stable limit cycle of the switching 
system intersects the domain in which the linear controller is applied, 
then the barriers and switching system serve to guide the system to 
the domain of the linear controller from initial conditions between 
the two barriers. To prevent the system from exiting the domain of 
the linear controller, distinct boundaries can be defined to switch the 
linear controller on and off. 

The second extension of the controller described in the previous 
section is to place multiple barriers in the system parallel to one 
another. Consider, for example, a planar system with six barriers 
that arc parallel lines /,, / = 1. ■•,G, The lines I, divide the 
plane into seven closed “strips’' S,. t = Si and S’: are 

half-planes. From the 5,, we form six overlapping patches D, - 
S, U S, -I- 1. In each of these patches, define a constant control 
that increases in magnitude as one moves away from the control 
line. The transition conditions are defined so that if one crosses a 
patch boundary moving away from the control line, then the control 
setting of larger magnitude is applied. If one crosses a patch boundary 
moving toward the control line, then the control value changes to 
one of smaller magnitude. The effect of these barriers is to guide a 
trajectory back toward the origin from farther away from the origin, 
while at the same time decreasing the amplitude of the control when 
feasible. Combining these multiple barriers with a linear controller in 
a neighborhood of the origin allows one to recover from disturbances 
of large size that move the system outside the region of stability 
for the linear controller. See Fig. 2 for an illustration of the patches 
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Fig. 2. The geometry ol the Male space for the syslem with three sets ol 
barriers and a region in which linear feedback control is used. 

associated with a system that has three pairs of barriers and a domain 
where the linear controller will be applied. 

IV. An Example: The Double Pendulum on a Caiu 

We describe an example—-a frictionless double pendulum on a 
zero mass cart whose acceleration along a track can be controlled. 
The object is to keep the pendulum in the fully upright position. The 
control of a pendulum on a cart has been a frequently studied problem 
[41-|10], [131, |14|. This example provides a good illustration of the 
effectiveness of our stabilization strategy on a nonlinear system. 

The double pendulum consists of two point masses w t and mj 
with body 1 attached to a fulcrum and body I attached to body 2 by 
masvsless rigid rods of lengths /| and lj. We want to include within 
the system of equations the additional effect of applying a horizontal 
acceleration. See Fig. 3. Choose units for which the acceleration of 
gravity is 1 , let the magnitude of the acceleration be n, and set 
p = l 4- w\/ni 2 . Then the equations of motion are given by the 
following vector field X as shown in the equation at the bottom of 
the page. Here, 71 , <12 are angular coordinates and pi, pj are the 
conjugate momenta. The angles 71 , 72 are measured with respect 
to vertical rays pointing down, so the stable equilibrium with Ihc 
pendulum hanging down is given by 71 = 72 = pi - pj> = (). 
The vertically upright position that we want to stabilize is given by 
7 t = = 7 r and pi = pj = 0 . 



The vertically upright position is an equilibrium of the pendulum 
equations (without horizontal acceleration) that has a two-dimensional 
stable manifold and a two-dimensional unstable manifold. Therefore, 
we are in a situation for which the theory described earlier can be 
applied. To do so, we need to construct a linear controller, a region 
in which the linear controller will he applied, barriers parallel to the 
hyperplanes spanned by the control line and the stable manifold at 
the vertical equilibrium, and control values for each of the patches to 
he used hy the controller outside the patch of the linear controller. 

The linear controller is defined by making the acceleration of 
the pendulum fulcrum a linear function of Ihe location of the 
pendulum in phase space. For convenience, we shall use coordinates 
(— sin (71). —sin (72), pi. pj) near the upright equilibrium. We seek 
a vector - = (//1. //j, 71, 71) so that setting n - 717, 4* 7^7. 4- 
//ipi 4- pipy makes the upright equilibrium stable. Controllability 
of the system linearized at the upright equilibrium implies that wc 
can lind - to place the eigenvalues of the Imeaily controlled syslem 
anywhere in the complex plane. We describe one approach to solving 
this problem. Treating the eigenvalues of the linearization as functions 
of the control coefficients </, gives a system ol equations that can be 
solved for the y,. Let A,. / — 1 - - ■ 4 be ihe desired eigenvalues lot 
the controlled system. Denoting the Jacobian matrix of the vector 
held by .1 and r = we seek vectors u , /l) and * so that 

( 4 . r 1 ), r< — \ t „ ' m 

If A 4 - A,/ is invertible, rewrite this equation as 
— A,/)“'••=- H\. 

It follows that m, is a multiple ol ( l — A,/) 1 r and 
*((.-1 - A,/) * i.) = -I. 

As / varies in { 1 . 2 . 3 , 1 }, this yields a system of linear equations 
for If the system is nonsingulai, then it uniquely determines * in 
terms ol the eigenvalues A,. 

We have investigated a numerical example with parameters 
/, ^ i/2, (> = :\/ 4 , m, ^ 2, m, = I, and 


fjt = P i 
hi =■ Pii 

■ - s i n (4D) c °-M f 7 1 zJiLi t sin [g±Lz tJiHi cos ^^ z ~ ^ 

in Mp - cos 2 (71 - 72)) 

+ n ( -p cos (71) 4 - cos ( til ) cos (71 - 7-j) 

Mp - Cos ’ 2 (71 - 7j )) 

• - P (sin ( 71 )cos (71 - (jj) - sin( 72 )) 4- (p/ipjf 4- Jjpj cos (71 - <u))sin{<i\ - 7 j) 
Pl ~~ Mp - cos 2 (r/i - 72 )) 

+ o(~P cos(72) 4 - pc oMti )cos(7i - 72) 

D(P - cos 2 (7t - q 2 )) 
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q i 

(a) 



(b) 

Fig 4 A typical hybrid trajectory convuging to the upright position with 
magnifications showing increasing detail near the equilibrium position The 
initial point of Iht trajectory is (2 5 16) near the left side of ( i) 


{A, A. A, A*} = {-0 1 —0 5 -0G -0 7} This gives 

(exactly) ^ = (-1 305 2 41G —1 1 1 2305) Using these 

paramctei values, we investigated numerically a model of the double 
pendulum with the hybrid switching control strategy described in 
the Iasi section The only modification of the strategy was to defint 
two neighborhoods of the vertical pendulum on which switches to 
and lrom the lincai contioiler were applied This was done because 
many of the linearly controlled trajectories do not approach the 
equilibrium with monotomcally decreasing distance 
To test the effectiveness of this controller, trajectories were com 
puted on a grid of miiial conditions in the plane pi =- p 2 = 0 
The domain of stability of the linear controller in this plane contains 



(O 

fig 4 Continued 


a small elongated region aiound the equilibnum diagonal q = qi 
whose length along the diagonal is approximately 0 5 and whose 
width is approximately 0 07 Tor the switching system we used level 
sets of the function h — {q i - n)/\ - (q 2 - 7r)/H- pj /12 - p /G 
as the switching surfaces The function /; was computed so that 
it is parallel to the hyperplune spanned by the conti ol direction 
(0 0 2 0) and the stable eigenvalue at the fully upright equilibrium 
With switching surfaces given by h = -0 5 // = -0 i h — -0 1 
/,=_()] h “03 and h = 05 with concsponding values for 
a of j 3 1 -1 l -j there is a much larger domain of attraction 
for the upnght pendulum In the plane pi = pj = 0 U appeals 
that the square with vertices at the points (q\ q ) = (2 > 2 5) 
and (f/i q ) =- (3 7 1 7) is completely contained in tht domain ol 
attraction loi the upright pendulum See Fig 4(a) tor a piojcction 
of a typically tiajectory into the plane pi = p = I) f ig 4(b) 
and (c) shows magnifications of this trajectory dost to the upright 
equilibrium The width of this squaie is an ordci of magnitude huger 
than the width ol the domain of stability for the lincai contiollci in 
the plane pi = p - 0 If the lincai controllu is not used then 
the asymptotic stale is the limit cycle described in the first section 
Switching to and horn the linear controller when the square of the 
distance to tht upright equilibrium is 0 002 and 0 005 resptcnvdy 
appears to mbustly stabilize the pendulum at the precise upright stale 
Note that 0 005 ~ (0 07)’ so that the disk for switching the lineai 
controller off could not be chosen smaller and still remain in the 
basin of attraction of the vertical equilibrium foi the linear controllu 
Addition ot stochastic perturbations to the vector held does not appear 
lo significantly diminish the size of the domain of attraction tor the 
upright equilibrium 

As illustrated bv this example the hybrid oi switching stiategy 
that we have presented lor the stabilization of equilibria appears lo 
be robust All aspects of the strategy seem to be structurally stable 
and persistent with respect to very general types of perturbations 
Ll augments linear control for stabilizing equilibria by guiding a 
trajectory back into the domain of attraction lor a linear controller 
from a much larger region 
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Adaptive Control of Systems 
with Unknown Output Backlash 

Gang Tao and Petar V. KokotoviC 

Abstract —Adaptive control schemes for systems with unknown back¬ 
lash at the plant output are developed. In the case of known backlash, 
a backlash inverse controller guarantees exact output tracking. When 
the backlash characteristics are unknown, adaptive laws are designed to 
update the controller parameters and to guarantee bounded input-output 
stability. Simulations show significant improvements of the system per¬ 
formance achieved by such adaptive backlash inverse controllers. 

1. Introduction 

Backlash is common in many components of control systems, 
such as actuutors, sensors, and mechanical connections. A typical 
backlash example is the mechanical motion due to the imperfect 
contact of two gears. From the early days of classical control theory, 
the backlash nonlinearity has been recognized as one of the factors 
severely limiting the performance of feedback systems by causing 
delays, oscillations, and inaccuracy. 

The backlash characteristic is a nondiffcrcntiable nonlinearity 
which is often poorly known. Therefore, the control of systems with 
unknown backlash is an open theoretical problem of major relevance 
to applications. In [1] and |2], we proposed an adaptive control 
scheme for systems with unknown backlash at input of the plant, 
that is, in the actuator. In this note, we address the problem with 
unknown backlash at the plant output, that is, in the sensor, as shown 
in Fig. 1. Perhaps the most common example is a position servo: the 
block G(D) represents the power amplifier motor, and the backlash 
is in the position sensor such as a potentiometer connected to the 
motor shaft through a gear box. 

A contribution of this note is the construction of new adaptive con¬ 
troller structures for the output backlash problem which is essentially 
different from the actuator backlash problem. Our new controller 
structures can be initialized to achieve exact output matching when 
the plant is known. They lead to a linear parameterization of the 
closed-loop plant when the backlash is unknown. Such a linear 
parameterization is crucial for the development of an adaptive scheme 
to deal with the unknown backlash. Our approach is to develop an 
adaptive backlash inverse to cancel the unknown backlash effect. 
A feedback-feedforward controller structure is then combined with 
such an adaptive backlash inverse to achieve the desired tracking 
performance. 

The note is organized as follows. In Section II, we present the 
model of the backlash at the output of a linear part and formulate the 
control problem. In Section III, assuming that the backlash is known, 
wc present a backlash inverse and introduce the idea of backlash 
inverse control. In Section IV, we develop two adaptive backlash 
inverse controller structures when the backlash is unknown: one for 
ihe linear part known, and the other for the linear part unknown. 

Manuscript received April 5, 1993; revised January 28, 1994. This work 
wus supported by the National Science Foundation under Grants ECS-9203491 
and ECS-9307545, by the Air Force Office of Scientific Research under Grant 
F49620-92-J-0495. and by a Ford Motor Company grant. 

Ci. Tao is with the Department of Electrical Engineering, University of 
Virginia. Charlottesville, VA 22903 USA. 

P. V. Kokotovis with the Department of Electrical and Computer 
Engineering, University of California, Santa Barbara, CA 93106 USA. 

IEEE Log Number 9406996. 


00l8-9286/95$04.(X) @ 1995 IEEE 



IEEK IKAN5AC llClNS ON ALiUMAiK LUNlKOL. VOL. 40, NO 2, t tbKUARY 1995 



Fig. I. Plant with output backlash. 


We design adaptive laws to update the controller parameters, which 
ensure closed-loop signal boundedness. In Section V, we present 
two design examples and use simulation results to show major 
performance improvements achieved with our adaptive backlash 
inverse controllers. 

II. Problem Statement 

Let us consider the following discrete-time plani with a linear part 
G{D) and a backlash nonlinearity D(-) ai its output, as shown in 
Fig. 1: 

y(t)=- I3(-(/)). i(1) = G(D)[u](t) (2.1) 

where </(/) is the control input, y{t) is the measured output, G{D) — 
krTTiT)' *'/• is a constanl ’ £(#) antl ^(D) are monic polynomials, 
and the symbol D is used to denote, as the case may be, the 
translorm variable or ihe advance operator: D[.r](/) = ./*(/ + 1 ). 

The backlash characteristic B(-) with input i(i) and output .</(/) 
is described by 

- n) for z(t) < :/ 

for -(/)>:, (2.2) 

!J(t- 1) for < :(/) < 

where ni > 0, r,, n are constant and - ' , “ l) + n, = 
+ r, are the :-axis values of the intersections of the two 
parallel lines of slope w with the horizontal inner segment containing 
ij{t — ]). This backlash model is the discrete-lime counterpart of the 
continuous-time backlash characteristic modeled in [ 1) and [21. 

In our control problems, the backlash parameters m, c, , n are un¬ 
known, and the internal signal :(/) is not available lor measurement. 
The control objective is to design a feedback control for the plant 
(2.1), which stabilizes the closed-loop system and makes the plant 
output y(1) track a given bounded reference signal // T ,i(0. 

We will consider two designs: one for the plant with a known 
linear part G(D) and the other for an unknown G{D). We will use 
the model reference control strategy and therefore make the following 
assumptions: 1) G{D) is minimum phase; 2) the relative degree n* 
of G{D) is known; 3) the degree n of P{D) is known; 4) the sign 
of is known; 5) ^ > mo for some known constant in o > 0, 
r, > 0, ci < 0. Assumption 5) will be used to project the estimate 
of — away from zero. Without loss of generality, in view of 4) and 
5 ), wc further assume that k,, ~ 1. 

Our approach will be to estimate the backlash parameters m % r ,, 
ci so that an adaptive backlash inverse can be implemented to cancel 
the backlash effects. With such an adaptive backlash inverse, a linear 
controller structure will be designed to generate the control signal 
u(t) to achieve the control objective. 

III. Nonadaptive Backlash Inverse Controllers 

In this section, we introduce the idea of backlash inverse control 
and develop a backlash inverse control scheme when both the linear 
part G(D) and the backlash /?(■) are known. 



(a) (b) 


Fig. 2. Backlash inverse: (a) block diagram; (b) Ovaiacienslic 
A Backlash Inverse Control 

A graphical inverse of the backlash characteristic in Fig. 1 is shown 
in Fig. 2, which contains vertical jumps. This backlash inverse will be 
described with the help ol the indicator function \[A’] of the event X : 

1 if _Y is true 

0 otherwise. 

To characterize the upward and downward changes of the reference 
signal //,„(/), we define 

Vi.n(f) = \[?/r„(M > y„,{i - 1) 

or </„,(/) = y,n(t - 1- 1) “ 1] (3.1) 

1 ) 

or !/„,(/) = y,»(t - 1), \n,.(f - 1) = 1] (3.2) 

and require that {t) and \j,..(0 be initialized as \ M „(fo) + 
\i,„(to) = 1, \t m(fn)\/m(fn) ~ B. A consequence ol this definition 
is that \ ITI ,U)+ \h„(t) - 1 and )\/,„(f) = 0 foi any 1 > t u- 

The output of the backlash inverse in Fig. 2 is 

:,„(t) = + \ in ,{t)c, + \h„[t)ch (3-3) 

HI 

The mapping BI( ): y,„(t) —* - m{t) defines a backlash inverse [I] 

D(BHy m {T))) = y„,(T) =► B(BT {//,„(/))) = V/ > r. 

To introduce the idea of an output matching control law with a 
backlash inverse, we first consider the simplest case when G{ D) in 
Fig. 1 is an //"-step delay, that is, ;( / 4- n* ) = u(t). 

Proposition 3.1 I3j: Suppose that G(D) = D " . Then the 
control law u{t) = ~,„(f -F O, / > /o, guarantees that y(t + ”*) = 
y„,{t -I- n*) for any t > to , provided that //,„(/n + n*) = </(/» + »»*). 

In this proposition, the condition that y„ t (fo 4- n*) = U ( f o + n *) is 
used to initialize the backlash inverse controller for output matching, 
which can be practically achieved by redefining the initial value ol 
y m (t). For this simple G{D) and with the backlash characteristic 
known, the measurement of :(/) is not required. 

Next we consider G{D) of a general form with known backlash 
Z?( ), but we now assume that the signal i(t) is available for 
measurement. This unrealistic assumption is made only to help us 
introduce a nonadaptive controller whose structure will be of interest 
in our adaptive designs in which this assumption will be dropped. 
Using the notation 

= <i{D)[u](1). a{D) = ■ ■ ■. D J ) / 

(#) = ft(0)|:lm. = 1)' 
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we choose the parameters #* € if" ' and 0? € if" to satisfy the 
matching equation 

6 ?o(D)P(D) + K 1 b(D)Z(D) = P(D) - Z(D)D"' (3.4) 

and define our nonadaptive controller as 

+ t > to. (3.5) 

Proposition 32 13 /.* The control law (3.5) guarantees that ~(t 4 
n*) = + for any t > to. Moreover, z/(f4»*) = //,„(f4n*) 

for any t > to, provided that y T „{to 4 n*) = y(to 4 O- 
Our designs in subsequent sections will avoid the unrealistic 
assumption that the signal :(f) is available for measurement by 
replacing the term tf! 7 ou(f) in (3.5) with terms implementable 
without the knowledge of -.(f). 


IV. adaptive Control for Unknown Backlash 

In this section we propose two adaptive backlash inverse con¬ 
trollers: one for the design with G(D) known, and the other for 
the design with G(D) unknown. In both designs, the backlash /?(■) 
is unknown and the signal z[t) is not available for measurement. 

A. Design for D(-) Unknown and G(D) Known 

With c 7n (f), T(f) defined in (3.3), (3.10), introducing 9* u = ^ 
i9* = r,, 9% = r # and u j u (t) = W£)[//](f), ui.(f) = ft(/5)[ v , ](/'), 
u 0i(t) = b{D)[\i](t), u- 0* 1 U) u {t) 4 y u ,(t 4 n*), u>,n,(t) = 
^'w,(f)+ + «*), = ft: r ^i(f)+ \, m (t + »*), we 

express the controller (3.11) as 

«(f) = trJ^M) + ) + 0 ,*,,„(f) + 0,V'i„,(f). (4.D 


B. A Backlash Inverse Controller for G(D) and DU) Known 

To develop a controller structure which uses only the measured 
plant output </(/), we define 

\.(f) = v(</(f) > ,v(f- 1) or y(t) = y(l - 1). \,(t- 1) = 1] 

(3.6) 

\i(t)= \{u(t) < »(/-!) or y(t) =■ y(t-l). V /(f-l) = l] (3.7) 

\o(f) = \[u{t) = y(t- 1)1. (3.8) 

Similarly, in (3.6) and (3.7), \,(t) and \i(t) are initialized as 
\| (fu) 4 \i{* o) = 1, V- (fo)\/(fo) = 0 so that v, (t) 4 \i(/) = 1 
and v i ( t) \ i ( t ) = 0 for any t > to . 

Using (2.2), (3.6M3.8), we then express the internal signal -(f) as 

-(f) = Ol!l + x ,(t)c, + \,(t)c + u(t)H u (t) (3.9) 

ni 

where <1o(t) 6 [o, r,] is bounded. The term \u(f)rfo(f) is the 
part of the signal -(f) which, due to the backlash characteristic, is 
unobservable from the output y(t). In general, \o(f)rfo(f) may be 
nonzero even if the backlash B(*) is known. 

Since -(f) is not available, we choose its estimate to be 

u(t) 

-(f) — —4 \,(t)r, 4 \i(t)n (3.10) 

rn 

so that the estimation error is -(f) - 1(f) = \o(f)r/ 0 (f). 

Hence, when the linear part G{D) and the backlash £(•) are 
known, but the signal -(f) is not available for measurement, the 
controller (3.5) is modified as 

>,(t) = 0 ;/+ + »*), i>t n (3.1D 

where 04 (f) = fr( D)[':](f). Now, despite the estimation error i(f) - 
T(f) = \n(f)rfu(f), this controller can ensure the exact tracking 
e(t) = //(f) - w„,(f) = 0. 

Lemma 3.1 [3J: The controller (3.11) achieves the exact tracking, 
that is, i/(f 4/0 = //,„(/ 4 n*), for any f > to, provided that 
T(r) * ~(r) for r = f () - n 4 1, f« - « 42.---.fo - I, ?o, fo 4 
1. • ■ ■ ,fo 4 n* — 1. fo 4 it*, and //, r ,(fo 4 w*) = //(fo 4 n*). 

For the controller (3.1 1 ), the initialization for output matching is 
achieved through the condition: “(r) = :(r) for r = f 0 - n 4 
1, ■ ■ ■ 9/0 4 »*, and g fM (fo 4 h*) = //(fo + t#*), where T(ft> 4»') = 
-(f 0 4 n*) initializes the backlash inverse mapping: y(t) —> l(t). 
The condition T(r) = :(r) is satisfied if the motion of ;(r) does 
not stay inside the backlash, Since the backlash region characterized 
by c, and n is finite, a proper initial excitation of the system can 
always keep -(r) outside of the backlash region. 


We then proceed to use the adaptive version of (4.1) for 9 *,, 9*, , 
9*i unknown: 

«(f) = K 7 *'« ( t ) + e „, UU .„( t ) + ( f , l ( tU .„ At ) + V . i ( tU ' i , Jt ) 

(4.2) 

in which only the estimates 9, n (t), 9,, (f ), 9, /(f) of 9*„, 9 *,, 9* t are 
updated, because, with G{D) known, we can solve 9* and 9* from 
(3.4). 

To develop an adaptive law for updating the estimates 9,,,{t ), 
9,, (f), 9, /(f), dividing both sides of (3.4) by P{D) and operating 
the resulting identity on 1 /( f ), we obtain 

»(fi = #: J *',((i+#: r u' (o + -.(/ + »*). (4.3) 

Defining the parameter and regressor vectors as 91 = 

(*;„. 0 :,.»:,)'. -(.(/) = i 0 *'j„(/) + </(/ + »•). 0 -'-■,(')+ 

\, (/ + »i*), H* 1 + \i(T -I- »')) ; , and using (3.9), wc express 

1 /(f) in (4.3) as 

u(t) = 0* 7 u)„(f) + 0,:4„(f) + ,/,(/ + „*) (4.4) 

where is the unknown parameter vector, and r/i(f 4 //*) - 
4(/4»* )\q(/4»* )4^* 7 b{D)[<lu \oJ(f) is the bounded disturbance 
representing the unobservable part of the backlash input -(f). 

With the estimate 9 f ,(t ) of 9J } , we use (4.4) to define the estimation 
error 


n(t) = - «*) + el (t - - »*) - «(/ - n*). 


We then choose the adaptive update law for 9f,(t) 


9 h (t) = 9 b (t- 1)- 


")/ ; ^/,(f - ii*)f/,(f) 


H-u» 4 7 (f-n*U-4(f-»*) + /i 2 (/) 


(4.S) 


where 0 < -ys < 1. 6(f) = (0i,(f - 1) - 0i,(f - »*))' ^i,(t - n*). 
To achieve robustness of the parameter adaptation with respect to 
the bounded disturbance d](t), the design signal fi,(t) is to be 
chosen as one of the existing robustifying modifications |4]—[6J. 
For example, the switching <r-modification of [5] generates ft, (f) as 
f b [t) = -(r(9i,{t - 1), (t 0 , M},)9h(t - 1), where 


<r(9b(t - 1). (To, 



for \\9 b (t — 1 )||j > 2i\//i 
otherwise 


with 0 < (To < -^r*. Although not shown in (4.5), with the 
assumption 5) we use projection to ensure that 0,„(f) > iwo, 
9,,(t) > 0 and 0 f ,(t) < 0. 
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Fig. 3. Tracking errors with different control schemes. 


Fig. 4. 'Tracking errors of lhe second-order plant. 


B Design for D(-) and G(D) Unknown 
To develop an adaptive controller structure when both the linear 
part G(D) and the backlash B(-) are unknown, we define 0* = 
“0!, 0* = c,0!, o; = r/0!, and express (3.11) as 

»(/) = rj *-..(/) + + k t *>, (o + #r T u-,(«) 

+ #,*„!/,„(( + »*) + 0'r\r,u{t + »*) + +»»*). (4.6) 

Since the parameters 0*. 0j> 0*, 0?- 0 *m 0,%- 0,*/ are unknown, 
we use their estimates 0„(f). 0 w (f). 0, (f), 0|(/). 0 fl .(/).0,-» (f). 0,/(f) 
to implement the adaptive version of (4.6): 

«(() = + tf(tUAi) 

+ of (f )*-,(() + #,„( l)y,„ (/ + n* ) + #,., (f) \ ,,„(# + »*) 

+ #,-i(0\i. „(/ + »*)• (4.7) 


and use it to construct the update law for 0(f) 
mn-fHt n 


where 0 < ^ < 1, and /(/) is a design signal for robustness, e.g., 
f{t) - -tr(0(f - 1). (T 0 . A/)0(f - 1) with 0 < an < We also 
use projection to ensure that 0,„(f) > /no, 0,,(f) > 0, 0,/(f) < 0. 

Our adaptive control schemes ensure the bounded-inpul bounded- 
output stability, as follows. 

Theorem 4.1 [3J: All signals in the closed-loop system consisting 
of the plant (2.1), the controller (4.2) [or (4.7)], and the update law 
(4.5) [or (4.9)] are bounded. 

Since we have not yet shown that rhe tracking error c(f) - 
i/(f) - converges to zero, the tracking performance of the 

adaptive systems will be illustrated by simulation results. 


In this case, we need to update not only the adaptive backlash 
but also the feedforward and feedback parts of the controller 
structure (4.7). To develop such an adaptive update law, we 
introduce 0* = (0;, r . <9* 7 , Of. K ,. KiV. *(*) = 

UUt).J y (t). J’.'M.Jfit), #(( + »*), \At+n'). \,(t + n*)) 7 . 
and express u(t) in (4.3) as 

u(t) = 9 wT + r/j (f + n*). (4.8) 

With the estimate 0(f) of 0\ we use (4.8) to define the estimation 
error 

e(t ) = & 1 {1 - l)*j(f — n *) — i i(t — ti*) 


V. Examples 


We now use examples to illustrate our adaptive designs and present 
simulation results which show that our adaptive control schemes lead 
to significant improvements of tracking performance. 

Example 5.1: We consider G(D) — yy~-— where a i = 1.83. The 
backlash B(-) with in = 0 . 014 , r, = 22 . 5 , and rt = — 24 . / is 
unknown to the following controllers. 

a) Fixed linear controller with no backlash inverse for G(D) 
known: 


«m = 0:^:if(f) + c»...(* + i). 
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b) Adaptive linear controller with no backlash inverse for Gl D) 
unknown: 


"(f) = •,(*)*(*) +*™(Ojf»(f+ 1). 

c) Fixed linear controller with an adaptive backlash inverse (see 
Section IV-A): 

Hit) = 0,„(O(0-0(O + y,nV+ 1)) + H.AtM:\,lt) 

+ \,nAt+l)) + t>.i{t)lK\,lt)+\i„At + 1)). 


d) Adaptive linear controller with an adaptive backlash inverse 
(see Section 1V-B): 

Hlt) = »„lt)ul1) + 0At)\,lt) + #ilt)\i(t) + #„At)y„At+‘l) 

+0,At)\,„At + )) + tt.i(t)\,nAt + 1). 


The initial parameter estimates are # y (0) = 0 (0)0,,,(0), 
0,(0) = 0.(0)0,,(0), 0,(0) = 0 (0)0,,(0), 0 (0) = 2.5. 
(0,„(O). 0,,(()), 0,,(()))' = (70,10.-10)'. Their matching 

values are 0J = 0!0,*„, 0,* = 0!0,*,, 0,* = 0*0,*,, 0* = 1.83, 


.7)'. 


(0’„ 0,’,. 0,*,) J = (2S2. 22.5. -21.7 
Example 5,2 Wc now considei the second-order G(D) = 
7 T T" ~ T> 1 , r , = -1.3. «| = —0.3. The backlash /?(■) is 

the same as in Simulation l. For this G{D), we have ihe tollowing. 
Controller a). 


Hit) = Knit - 1) + 0;„0:' lilt - 1). nil))’ + K,V„At + 2). 


Controller b). 

mt) = 0, ,(#)«(( -1) + 0' (t)(j/(/ -1). nit )) 1 + 0,„(/)i/,„(/ + 2). 

Controller c): 

,,(f) = 0*i,(f - 1) + 0„,(t)(0*' (//(/ - 1). ./(())' + v„,(t + 2)) 
+ 0,,m(0* 7 (\,(t- 1), \, (f )) J + \,,„(M-2)> 

+ 0,i(f)(0*'(\i(/- 1). \,(f))' + \,,„(f + 2)). 


VI. Conclusions 

In this note we have developed two adaptive control schemes 
for systems with a backlash characteristic at the output. The first 
scheme is for systems with a known linear part and an unknown 
backlash. The second scheme is for a plant with both the linear part 
and backlash unknown. Our adaptive controller structures consist 
of a linear feedforward part and a hnear-like feedback part which 
incorporates an adaptive backlash inverse. These controller structures 
result in linear parameterization from which adaptive laws can be 
designed to update the controller parameters to ensure the closed- 
loop signal boundedness. Simulations results show that significantly 
improved system performance can be achieved by our adaptive 
control schemes. 
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Reciprocal Processes on a 
Tree—Modeling and Estimation Issues 

Robert W Dijkerman. Ravi. R Ma/umdai, and Arunabha Bagchi 


Controller d): 

ult) = 0,At)ult- 1)+ 0'(>)(,/(/-1). ,/(/))' 

+ »!lt)l\,lt- 1). \At))' +0,'(f)(\,(/- 1). \i(t))' 

+ B,,At)y„At + 2) + 0,,(f)v,, n (f+2) + 0,,(f)\,,„(f + 2)) 

The initial parameters are: (0„(O), (0). 0,„(())) — (H*. H* 1 0*,. 

Kt) for b); (0„.(U), 0,,(O), ^i(O)) = (M).3. 21. -21) fot c); and 
0 q (U) - -1.2. 0„(U) = 60.3# (0), 0,(0) = 24.9# (0). 0,(0) = 
-25.30,(0), 0 (0) = (-0.3.-1.6), (0„,(O). 0,, (0), 0,,(0)) = 
(80.5. 24.9. -25.3) for d). Their matching values are 0* — 

-1.3,0: = ( -0.39. — 1.99) / , (0*,, 0,*,, 0 ,*/) 1 = (™~. 22.5. 
—2-1.7) J . 

Typical responses of these conlrol schemes to //,„(/) — 

15 sin 0.07547, 7 =0.1 and <r () = 0.02 are given m Fig. 3 lor 
Example 5.1. and in Fig 4 for Example 5.2. The simulation results 
show that the control schemes a) and b) which ignore backlash lead 
to large tracking errors ((f) = t/(t) - ij, u (t). Our adaptive backlash 
inverse control schemes c) and d), which take into account the effects 
of the unknown backlash, lead to very small tracking errors 


Abstract —Motivated by multiresolution decomposition methods such as 
the discrete wavelet transformation, we introduce reciprocal processes on 
truncated N -ary trees. We discuss the relationship between such processes 
and nearest neighbor models. We show that we can derive a recursive 
description of the process, and that all reciprocal processes on N-ary trees 
reduce to autoregressive processes in the case of zero-valued boundary 
values at the bottom of the tree, corresponding to truncation of the tree. 
We then study the smoothing equations associated with such models. 
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] m 1 Simple tiuncatcd binary tree (with numbering ot nodes) 

1 Introduction 

Many multiresolution signals can be described on truncated \ 
ary trees For instance the discrete wavelet coefficients of an n 
dimensional signal can be identified with the nodes on a uuncated 
2 ary tree We show an example of such an \ ary tree in Fig I 
for A — 2 Each node on the tree except for the sets ol initial 
nodts in the top and the final nodes at the bottom of the tree has 
one parent’ node and \ offspring nodes Note that the simplest 
nee where \ = 1 can be identified with an interval \Z i Zj] on 
the set of integers 

In 14] Dijkciman and Mazumdar showed that toi a wide class 
ol second older processes there is nonzero correlation between 
w ivelct coefficients at all scales with shong decay in Ihc correlation 
wfiLn coefficients aic located further away from each other in the 
wavelet domain Wc art theicfore interested in the class ot stochastic 
processes on tiuncaied \ ary tices having a so called reciprocal 
property That is the conditional piobability of the value of a node 
on the tree mven the values of all other nodes in the tree is only 
dependent on the values ot the parent node and the offspring nodes 
Basseville ti al [1] introduced autoregicssive processes on A ary 
trees where a node is conditionally dependent only on its parent node 
Although causality is veiy well motivated in lime this is not really 
(lit cast in the time frequency domain The conditional dependence 
of a time frequency coefficient on the scale above it is not obvious 

Discrete time itcipiocal piocesses on an interval were studied in 
detail by Levy at al [8| Jn this note we closely follow their appioach 
and extend their results for the case A > 1 Wc like to mention 
as well the woik by Greene and levy \S\ who studied in detail 
several different solution procedures of the smoother problem tor 
discrete time icciprocal processes and the work by Levy Fabre and 
Benveniste [71 who studied independently a class of Markov random 
fields on trees Their tree structure is slightly more complicated 
but techniques and derivations similar to the work in this note 
rie reported They study a smoothing problem and discuss several 
iterative solution techniques We refer lo [1] loi a more detailed 
\crsion ot this hole 


II REC 1PROC AL PROCFSSI S AND 
Nl ARFST NNCiHBOR MODFLS ON TREES 

In this section, wc define the notion of reciprocity of processes on 
trees and establish the relation between such processes and so called 
nearest netghboi models 

Let us denote by t , t~ i \ , respectively, the parent and the 

^ offspring nodes of the node t in the truncated tree T where t 
does not belong to 7 the set of root nodes, and 7 /, the set ot final 
nodes of T for simplicity of notation we now consider processes on 
binary trees only, although the case where A > 2 is a straightforward 
extension of A - 2 We shall say that a stochastic process S( ), 
defined on a truncated binary tree 7, taking values in /?' is reciprocal 


if and only it 

P\S(t)<s{1) |SM = v(v) S €IV] 

= P[S(/) < nM|S(f)= s(f) $(/-,) = iff ,) si/ ) / 

th 

for t E mt(/') (l e, / £ T t / {T T\ |) where P( ) denotts tk 
probability 

Assume wc have a zero mean Gaussian icciprocal proctss S( ) 
We protect Sit) t € mt(T') on all other stochastic vanabics on 
the tree 

L[ 3 (f)|S(H) s e T\t] 

-= p (f)S(o+ P\(t)s\t i) + r (osu > o 

The projection implies that the so called lesidual process 

<Hl) = S(t)-(P (/)S(/) + /Mf)S(/^)+ P (f)S(f )) O) 
is orthogonal to S(^) t / s 

The relation O) speuhes a Nearest Neighbor Model (NNM) foi 
S( ) where the driving noise is the residual process r/( ) having the 
following correlation structure 

D{t) - r\<1(t)<1(t) J ] = cr /l0 
r['Ht)<m )'} =-rr\ n P'1,1- ) - r, , , ( 4 ) 

L[(/(f)rf(^) 7 ] = 0 •. ^ / / / i 1 

where a\ () is the covanance matrix of the random vector <1(1) Ihc 
correlation structure ol <l(t) tells us that it is a moving avciage process 
(defined on T) We shall discuss this later in detail Piopertv (4) 
shows that the projection man ices P ! , and the noise variance 

D ( ) cannot be specified independently ol each other Standaid 
Gaussian estimation theory implies that the existence and umcily 
of P i (/) is guaranteed if the covariance matrix of the vector 
[S(f)S(f i) S(/ )] is positive definite tor all / & inl(7) The model 
described by (T) specifics the process S( ) on nil(i ) The lesidual 
piocess (/(f) t 6 inti is uncorrelated with all S(s) s G {T I/} 
Ihereloic, we choose independently ol the lesidual pioccss a vectoi 
h of Dinchlet boundary values (corresponding to all boundary nodes) 
having an arbitrary covariance matrix which we shall denote B 
The truncation of a tree implicitly implies that the values of the 
nodes lower in the tiee are zero or independent of the truncated tree 
Thereloic as an extra assumption we could demand the following 
autoregressive properly to hold for / £ Tf 

(id) = sm - p (f)S(/) c^) 

with <1(1) orthogonal to S(s) for t £ *. That is the final nodes 
have an autoregressive property This is equivalent to the assumption 
of a reciprocal piocess on an extended tret T having a layer of 
zeio valued final nodes 

It can be verified following [81 that the model (T) with the 
specified noise structure and the Dinchlet boundary conditions is well 
defined There is a unique solution if the covariance matrix J? of the 
vector of interior nodes values is positive definite 

We show now that an NNM as in (1) with noise structuic as m (4) 
and the Dinchlet boundary conditions implies nettssary icciprocity 
Let us assume we have the lollowing Nearest Neighbor Model foi 
t 6 int(T) 

Uod)S(t)~M (t)S(t)-y i (t)S(1 ,)-U(f). S(t ) = <(/) (6) 

such I hat M-(1 ) ! — M,(t) for / — 1 2 being squatc matnces 

of size n We assume a Gaussian distributed vector ol root and final 
nodes values h having covariance matrix B The input noise <(t) is 
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a Gaussian process uncorrelated with the boundary process b and has 
the following correlation structure: 

rrMfMf)''] = Mow 

{£[r(t)r(h.) ,, ] = -*V.(f) , (7) 

^E[c(f)r(*) r ] = 0 x £ L /,/->!, ty t . 

Furthermore, it is assumed that the system is well posed, i.e,, it admits 
a unique solution. The relation between the original NNM and this 
model is easily established. Assume that S(-) is a reciprocal process 
with driving noise process //(■), such that D(t) is invertible for all 
t € inMT). The multiplication of (3) by JD(f)~ l and the definitions 
Mo( t ) = D(ty M-(t) = D{1)- [ P~(t), Af,(t) = D(t)~ 1 P,(f), 
and c(t) = D(/)~\/(f) results in (6). We can now easily verity that 
3/_(f~„) 7 = M t (t) and that (7) holds. 

This shows that any reciprocal process S(-) on T with /? > 0 
gives rise to an NNM of the form (6). Now we have to show that 
a well-posed NNM of the form (6), (7) is necessarily reciprocal. Let 
us therefore define the difference operator associated to (6) by 

\ = Mo(t)I - M-(t)Z~ } - M\(t)Z l -MAt)Z 2 (8) 


where Z ~ 1 denotes the backward shift on the binary tree Z^ ] f(t) = 
/(f), and Z\ i = 1. 2 denotes the forward shift Z l f{t) = /(f-),). 

The operator A maps n -vector functions defined on T into »-vector 
functions defined on the interior of T. Consider now the L 2 -space 
of //-vector functions defined over the interior of 7\ with the inner 
product {/. ;i) - ,„i (i) /(«)'</('>). In Appendix A, we show 

that the following Green's identity holds: 


(/. ASM)-<A/,$(•)> 


= £[/(')' /(hi)' f(ty,)')E} 

t€l, 


+ ^[/(0'/(G 7 ]E/ 


'5(f) 

5(f) 


' 5(f) ' 
5(h i ) 
Sdlj) 


(9) 


where the matrices EJ and Ej are defined as follows: 


e; = 


»! 

-M-(h i) 


Mi it) 
0 
() 


M-iU) 

0 

0 


and 


El = 


0 


M, {t) 

0 


The Green's function F is defined by 

f Ar(f. s) = It'd, x) t, x g int ( T) 
\T(t, *) = () t or.sg \T,.T,). 


( 10 ) 


(ID 


( 12 ) 


We show also in Appendix A that T(/, x) = T(*. t) 1 for t, x g T. 

Following exactly the work of Levy el al. |8|, we derive for the 
A r -ary tree that 


5(*')= 53 T(A-. s)e(x) + /-)! ) 

« 6 i m {l ) /£ I r 

+ l‘(*. ty,)M. (ty 2 ))S(i) + 53 H*. f).U-(f) J S(t). 

<* l t 


(13) 


Equation (13) can be used to show that the solution S(k) is orthogonal 
to r(*) for k ^ s 

F[S( k )< 7 <*)]= ^ l'^M)E\e(lU(x)'] = (AT(x.k)) r 

/fc" ml ( / ) 

= 77>(.s\A-). (14) 


Theorem 2.1: Let S(-) be a zero-mean Gaussian process on a 
binary tree, having covariance matrix R > 0. Then, S(*) is reciprocal 
if and only if it admits a well-posed NNM of the form (6), (7). 

Proof of Theorem 2.1: We have shown in this section that if 
5( ) is reciprocal, we can write the process in the form (6), (7). 
Now we must show that the solution S(k) of (6), (7), given by 
(13), is reciprocal. To do this, let T* be a subtree of T (having the 
same structure as T), and let / and k be two nodes of 7\ such that 
k G int(7') and l g T. 

To prove that 5( ) is reciprocal, we have to show that S{k) = 
S(k) - E[S(k)\S(t),t G jr/.r;}] ± S(l). The orthogonality 
principle derived above says that for s G int (T f ) and for t G 
{J/, T \}, r(*) -L 5(f). Therefore, using the solution of (6) on the 
subtree T' [similar to (13)], we obtain 


E[5(»r)|5(f). tt{T:.T' f )] = 53 (T(*. T )M dli) 

n i; 


+ r (k, tyy, T')M- (t- l2 ))5(f)+ 53 r(f* k - T')' S(t) 


(13) 


where I"( /.*. s: T') is the Green’s function for the model on the 
subtree T ‘. So 5(A) = ^ €l(ll( ^#) L(A\ v. V )r(.s), which shows 
that 5(A ) _L 5(/), and therefore that the process is reciprocal. This 
completes the proof. 


III. Recursive Description oi- a Retipkocai Process 

The NNM described in (6), (7), or (13) is nonrccursivc. We derive 
in this section a recursive description. 

We construct a moving average representation of the driving noise 
f{t). For this, we introduce ir(/), a white Gaussian noise (WON) 
process with intensity \V(t ), which is uncorreluied with the boundary 
process h ot the reciprocal process. Let us define ((f) by introducing 
the matrix A(t) in the following way: 

,(f) = H-(/)- | ir(f)-.4<h,) , U'(f-„) 1 II (f-1) 

- .■^(/■^) , ll'(/-,2) -1 u (16) 

for / G inl(r). Then, the relations (7) for the covariance of t (t) 
imply that the matrices .4(f) and IT(f) must satisfy, for 1 G int (7 ), 

Mod) = ir(() - ' + Ad- II)' d'd-i r'Ad-i) 

+ Adl2) , W(h i )-' Ad-, 2 ) (17) 

and for i = 1, 2. /, f*), G int(T) 

M.d) = -E[,d)((t'„) , } = A(h,)'ir(h.)“ l - (i«) 

We can now substitute, tor f, G int (7), A(t-),) = 

H'(f'), )M,(t ) 7 . We define, by choice of an arbitrary symmetric 
positive definite matrix lr(f'),), .4(f->,) = T!'(/->, )Af,(f) / for 
t~), E Tj. We see that I Y(t) must satisfy, for 1 G int(D, the 
backward difference equation 

l(•(/)“' = .Vo(t)-A/,(OH'(f>i)Jl/i(f) J -.3/,(l)ir(/^).lf,(0'. 

(19) 

Since we want the solution IV(f) to be a covariance matrix for each 
t G int(T), it must be a nonnegative matrix. We show in Appendix 
B, that this indeed holds. 

The noise representation in (16) satisfies (7) if we use (19). The 
NNM (6), (7) also implies that, for f G int(T), 

M-d) = (Ad) r H’d)~') r = Tr(fr'.4U). 


Now we can state and prove the following. 


( 20 ) 



TRANSACTIONS ON AUTOMATIC CONTROL VOL 40 NO 2 FEBRUARY 199S 


isider now the operators 

Sl = 1 - 4 (t)Z ' 

II = H (t)l (21) 

12* =I-A(h 1 ) , Z l - 4 (hi)' z 1 

hen we deuve, for / 6 int (T) 

12*11 -l ii (f) =■ 12‘U (1)~'u(i)=z ( (t) 

12*11 "‘123(f) = V 0 (/)S(/)- Ui(f)5(^,) (22) 

-Ui(/)S(f :)- U (f)S(f) 

mil thus the relation \S(f) = /(/) gives us 12*11’ 1 (12S{#) - 
i (t)) - 0 Define now foi / E 7\7 

Zd) = II '(125(0-»(0) 

= II (0“'(5(/)- 4(/)S(M - II (O) (24) 

Then we know that IT 7(0 — 0 ft int (7 ) Hus together with 
(2^) results in the following 
Pioposition f l The set of equations 

7(0 = 4 (I'i) 1 /(t ,) + !(/ ,) 1 7(1 J for/ 6 int(D 
S(/) ^ 4(/)5(/)+ ii (0 + 11 (i)7(t) for / 6 T\r 

(24) 

logether with the Dirichlel boundary conditions intioduced before 
describe the NNM (6), (7) 

l ct us assume we huve the solution ol (24) with the boundary 
conditions 7o{1) — 0 1 E 1 / and S 0 (t) — f>(t) f 6 / We get 
A it) = 0 lor all / s of interest and thus S 0 (/) is an autoregressive 
proctss (on a tiee) which can be computed recursively by performing 
the last pait of (24) This process satisfies (6) and the correct initial 
condition bui is end values are gcncially ddfeiently fiom l>i the 
Ii rial node values II V{1 s) is the Green s function the solution ol 
(6) (7) for t E int(r) is therefore given by 

S(/) - S,(/)+ Y, 1 <' -Of M 2 (5 (s)- %(•.)) (2*S) 

t 

lhc coircction lormula icquires the piecomputalion ol T (t s) / £ 7 

s 1 1 

Let us now discuss the assumption of the auloiegiessive properly 
in the final nodes as in (S) This assumption is equivalent to the 
assumption of an extended tiee I with zero valued final nodes 
Computing for the extended tree 7 the variable 7{t) t £ 7/ (oi 
1 i / £ / /) using (16) (20) and (21) gives us 

7(1) = II (t) 1 ( 5(/) - .4(f)5(f) — h (/)) 

= U„(/)5(/)- \f-(1)S(t) — <(/) = () (26) 

lesulting in using (24) 7{t) = 7oU) — 0 1 E 1 \1 Finally this 
results in S(s) = Sols) lor / E 7 That is if we assume we have 
the autoregressive property m the final nodes ol the original tree, we 
can rewrite the reciprocal process as an aulorcgicssive process on the 
tree lor all / E / Of course the rcvei se is true as well leading to 
the following 

Proposition f 2 A reciprocal process on the tree is an autoregres 
sive process on the entire tree iT and only if the reciprocal process 
has the autoiegressive property in the final nodes 


IV A Smoothing Prori pm 

We shall now apply the previous definitions and insights io 
estimation problem Consider a reciprocal process S( ) defined o 
7, having zero-valued final nodes, described by the model (6) (7i 
We are given the observations) (0 = H{t)S(i ) + i (*) when i \ ) 
is a WGN process, qncorrelatcd with (( ) having mtensit> \ ( ) Wc 
want to compute the smoothed estimate in all nodes oi the ticc i c 
S{t) = r[S(f)|l ], where 1 is the vector of all obseivations Wc 
deuve, tor t E int (7 ). following the derivation of Levs 16| using 
iterated conditioning 

MiAt)S(t) = W-(t)S{t) + U,(f)5(f-,) 

+ IC(/)6(/-)J + //(()' I (/)“'() (/) - J7(/)S(/)) (27) 

Wc have a difference equation for the smoothed estimates which 
we shall solve recursively Fquation (27) implies that we have the 
following model foi the smoothed estimates in the interior of f 

\i S(f) = ff(f)'l (/) ') (t) (28) 

with 

j\, = Woi (M/- U (1)7 1 - U,(/)7‘ - 1/ (1)7- 

\ Moi d) = !/.,(/) + U(1)‘ I (/) 'H(I) 

Then, following ihe procedure of the previous section the operator 
\/ can be repiescnted as \/ = il) II f ] il[ with 

12/ =1 - 4, (1)7 ' 

II / = 111 (f)7 HO) 

S2J =/ - 1 ,(*■)!)'7' - 1 ,d u )'^ 

whcie 1/ (/ ) = II / (/* ) M (t) 1 lot / f int (7 )(I1 1 (/ ) is an 
aibitrary symmetiic positive dchnite matrix foi / E / /) and 

11 / (/> 1 - !/,/(/) - l/,(/)11/ (/ i) 1 Mid)' 

- l/<(/)U / (/ r'MAD 1 (II) 

lor t E ml(/ ) Define now for / E 7\7 

/,(/) = 11,(0 '<2, S(0 = II/ d) '(5(f)- 4,(/)«.(/)) (42) 

Then (28) leads to 12; Z, (/) = 1 (/) 'l (/) for 1 f int(T) 
Proposition 4 J The smoothei equations (28) can be rewritten 
as (13), found at the bottom of the page These two equations 
propagate causally in the backward and forward directions (on the 
tiee) respectively, describing the structure of the smoother 
Now let Zar ( 1 ) and %{t) be the solution of (11) with boundary 
conditions Zoi (s) - 0 s £ 7/ and S u (s) = ) (s) s £ 7 The 
smoothed estimate we obtain does not take into account the boundary 
conditions in 7 and 7; Following the solution (II) wc can conclude 
that the conection foimula for the boundary conditions then becomes 

5(/)= %d) + Y (I/ (/ ’ (s '> 

Hi / 

+ !;(/ s',)V (s',))(S(s) - 1 (s)) 

+ Y T/ d s)V (s)'(S(s)- 5 ( |( s)) (44) 

-fr/, 

wheie I / is the Green s function for (28) 


'z,d)= 4, dn)'z, </-,)+ i,(/' )'/id'u) +1 (0 -, i(0 

for / E mt (7) 

5(/)= 4/ (f)5(f) + «# (t)7, (1) 
for t E 7 \7, 
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If we are able to observe 5(a) exactly at the boundary nodes, we 
have 

S(t) - So(t) + Y TeU' i)Af-<*) r iS(>) - So(*)). (35) 

In the case of an autoregressive process, or an extended tree 7 with 
zero-valued final nodes, we have At(t 7.) — = 0 

for ty t G Tj (since M,(t) = 0). Therefore, since we have chosen 
= 0 for « G TV, we get S(m) = 5o(s) = 0 for * G T f . 
Then, if we are able to observe 5(s) exactly at the boundary nodes 
in !T,, we have S(t) ~ Suit), t e T and we do not need a correction 
equation as in (34) or (35). The smoothing reduces in this case to a 
simple double-sweep algorithm. 

The Rauch-Tung-Stribel (RTS) algorithm for autoregressive pro¬ 
cesses on trees, developed by Chou et al. [2J, is quite similar to the 
double-sweep algorithm here. The difference with Chou’s algorithm 
is that the true value of S(s) for * € T, is obtained in their upward 
sweep (T, should, however, consist of just one node), and there is 
no need for a correction equation. We need to observe exactly 5(.s) 
for s G T, , or apply the correction equation (34). 

Let us study the smoothing error S(t) = S(t) - S(t). We derive, 
using (6) and (28), that A/,5(/) = A kS(t) - A rS(t) = f(f), 
with r(t) having a covariance structure similar to <(/), due to 
the orthogonality of e and Consider now the boundary process 
S(t) =. S(t) - Sit) = Y(t) - v(t) - Sit) for t € T, , 7>. Linear 
estimation theory tells us that S{t) is orthogonal to all Y(.s ), s E T. 
Therefore, the driving noise e(s) = A fS(m) for * e int(T) is 
orthogonal to all V(a). .s E T . The smoothing error Sit), 1 G T ,, T f 
can be written as a linear combination of V(a-), a g T and r{t), and 
is therefore orthogonal to r(*), s G int(7"). Thus, we have proved 
the following. 

Proposition 4.2: The smoothing error S(t) = 5(0 - 5(0 of a 
reciprocal process on a tree is itself reciprocal. 

Assume now that we have an autoregressive process, described as a 
reciprocal process on an extended tree, with zero-valued final nodes. 
Therefore, the error of the smoothing problem on the extended tree 
(without necessarily observing 5(0 in the final and root nodes of the 
original tree) is again a reciprocal process on an extended tree with 
zero-valued final nodes. We can conclude the following. 

Corollary 4.1: The smoothing error of a first-order autoregressive 
process on a tree is itself a first-order autoregressive process. 

Remark: Luettgen [9] proved this result directly for autoregressive 
processes on a tree, obtaining explicitly the model parameters. We 
derived the result as a byproduct of our study of reciprocal processes. 

V. Conclusion 

Motivated by multiresolution decompositions of signals, we have 
introduced reciprocal processes on truncated A T -ary trees; and follow¬ 
ing the results of Levy, Frezza, and Krener [8], we have described 
nearest neighbor models and recursive structures for these processes. 
We found that reciprocal processes with zero-valued final nodes 
reduce to autoregressive processes on a tree, which motivates, for 


another time, the study of these processes in, for example, II] and 
[2]. We described the smoother equations for such processes. 

Reciprocal processes could model in an efficient way many in¬ 
teresting stochastic phenomena on multiple scales. Their recursive 
structure makes them suitable for fast signal processing algorithms of 
processes. The more general study of Markov random field processes 
on trees (having arbitrary neighborhood structures, including possible 
neighbors on the same scale in the tree) is now being pursued. 


Appendix A 

In this appendix, we prove the Green’s identity used in Section 
II, and show that the Green’s function T(/, s) is symmetric, i.e., 
r(t, s) = r(.s. t )‘, for t . S e T. 

We prove the Green’s identity by means of an example which can 
be generalized for any reciprocal process on a truncated .V-ary tree. 
Let us consider the very simple binary tree as in Fig, 1, using the 
indicated numbering of the nodes. We define the matrix A as shown 
in (A.l), found at the bottom of the page. 

Using M - (/->«) 7 = M t (t), we see that A is a symmetric matrix. 
We also see that the rows of this matrix relating to the interior nodes 
of T describe the operator A in matrix form. Using the symmetry of 
A, we have for any appropriate dimensional vector f (in this case, 
7-dimensional) that the equality f 1 AS “ (A f) 1 S holds. Now we 
derive, using the definition of the inner product in Section 11, 

/ 7 AS = (/, AS(-)) 4- Y /('>' (A/«(/)S( f) - .)/,(/)S(h,) 

tfl, 

- MAns(h i ))+ Y mUmtm)- m u>s</) <a.2> 

ier t 


and 


(A f) T S — (A/. Si-)) + Y f {t)l M'(MS(f) 

n-i, 

- (/(h,)'.V_(h, ) + /(*',;)' M (t'i))S(t) 

+ Y fm r Mait)Slt)-m‘ M (t) 1 Sit). (A.3) 


Therefore, since we have f 1 \S — (A f) 1 S, we obtain, by using 
(A.2) and (A.3), 


(/, AS(-)) - (A/, 5(-() 

= Y /(0Vi/i(0S(f-,i) + .u>(nS(f'.)) 

1, 

- ]T (/(^i) , ^-(hi) + /(hi) , A/ (tl2))S{f) 
ier r 

+ ]T f(t) 1 M-(t)Sii)~ fit) 1 M (t) 1 Sit) 

iti r 

(A.4) 


as in (9). 

Now we show that the Green’s function T{t, s) is symmetric, i.e„ 
T(f, .s) = F(«, /) 7 , for t, s G T. It is easy to see that for t or 


r A/,.(0) 

—A/, (0) 

—A/j (0) 

0 

0 

0 

o - 

-A/-(l) 

Moil) 

0 


-A/ a (l) 

0 

0 

-A/_(2) 

Mo(2) 

0 

0 

0 

— A/1 (2) 

-MA'2) 

0 

-M-( 3) 

0 

A/o(3) 

0 

0 

0 

0 

—37-(4) 

0 

0 

A/ 0 (4) 

0 

0 

0 

0 

-A/-( 5) 

0 

(J 

A/n(5) 

0 

. 0 

0 

-i\/_(G) 

0 

0 

0 

A/o(C) . 


(A.l) 
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€ T t , Tf , we have T(f, #) = () = V{n* t) 1 Dehne now ihe 
vo square matrices \ and V having rows and columns associated 
i all nodes ot int(T) Each element ot the matrix \ is taken 
I om the same corresponding row and column of and therefoie 
\ is a submatnx of representing only the nodes of int (T) T 
ins on the row associated with node t and the column associated 
.vitb node s, as element T(/, *.) It can be easily verified, using 
\J (t s) — Jfi(f s) /, s 6 int(T) that \T = 7 and \ 7 = \ 
so that I = (\I ) ! = T 7 V 7 - T 7 \, and therefore I — T 1 and 
thus T[t s) -= F(s i) 1 lor t int(T) which was left to show 


Appfndix B 

In this appendix, we show that the noise covariance matrices 
II (/) f G mt(T) introduced in Section III are positive dehmte Lei 
us assume that not every noise covariance matrix 11(f) ft int (I) 
is positive dehmte We scan each scale ot the tree above If from 
bottom to top until we find a node s such that II (s) is not positive 
definite Then we stop and we denote the set of nodes scanned so 
far (including s but excluding T/) by /„ The fact that II (/) > 0 
loi t £ T \ s and the construction of II [t) through the difference 
equation (19) guarantees that II (t) 1 exists for all / £ T Denote 
with r the vector containing all ({1) 1 e T Define by (} s 1 the 
irutnx consisting ot zero s except lor the matrices 11 (/) 1 on the 
diagonal We have 

I = r\ tHt l ] = T s y Q 'T, (B l) 


where T is a lowei triangular matrix defined by (16) having all I s 
on its diagonal The matrix F is the covariance matrix of part ol 
Iht icsidual process of the rccipiocal process and since wc assumed 
H > 0 we must have f "> 0 as well otheiwisc a linear combination 
ol S (t) i £ inl(T) would be identically/eio contradicting the 1 act 
that R s 0 Iheietore Vu „ u 7 / H i/ > l) Since wc assumed that 
II (s) is not positive definite , such that < 7 II (s) 1 1 <0 

However since T is a lower triangular matrix with 1 s on its 
diagonal we can always find u such that 

T iu -[0 I) i ' 0 ()]' (B 2) 


lcsulting in 

II '/■,«, = [() 0 I 1 0 0] 

Q 1 [0 0 i ' 0 ()]' 

=. I'll (/) s 0 


(Bl) 


contradicting E l) Therefoie we must have II (/) > 0 toi all 
f G ml ( I) which is what we wanted to show 
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Comments on the Loop Transfer Recovery 

Ahmed Rachid 


Abstract —this paper deals with observer-based linear control systems. 
More precisely, it is shown that Tsui’s loop transfer recovery procedure 
can lead to poor results 


I INIROIHCIION 

It is now well understood that the CORC (combined observe! 
regulator compensator) problem is not a trivial extension ot the linear 
quadratic regulatoi (LQR) case which has been widely studied since 
eaily I960 s In particular it has been shown by Doyle 11] that lineal 
quadiatic Gaussian (LQCi) icgulators have no intrinsic robustness 
properties and can exhibit pool stability maigins contrary to the l QR 
case which ensures the well known [1/2 v | gam margin and phase 
margins of at least 60 dcgiees at the plant input [2| 

An attempt to recover these stability margins loi the CORC cast 
is loop transfer recovery (LTR) method which lortes the ictuin 
difference F,( s ) at the bieak point \ (Fig I) to be identical (in the 
limit) to the return ratio Tj j (s) at the break point \ \ concsponding 
to the full slate feedback 

Doyle and Stein [3] have proposed an Relative piocedute which 
asymptotically achieves the LTR by adding a fictitious noise at 
the input plant hcfoie designing the observer using Kalman filler 
formulas This procedure drives some observer poles toward stable 
plant zeros and the rest toward infinity 

More recently, an alternative to Doyle and Stein s appioaih has 
been given by Tsui [4], |5] [6J and aims to guaianlec some stability 
margins at the breakpoint \ \ Unfoitunately the method proposed 
by Tsui can lead to quite poor results in the sequel his technique is 
discussed and its limitations are highlighted 

II PROBITM SrAlFMLNl 
Consider the linear continuous Ml MO plant 

i — 41 + Bu 
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Fig. 1 CORC scheme. 

P = C.T (1) 

under the state feedback 

</ = — A', (2) 

The full observer equations for the considered system arc given by 
i = Fz + Lp -f TBit (3) 

under the constraints 

TA - FT = LC (4) 

A',. = NT. (5) 

Equivalently, this observer can be described by 
X = ,4.r + B u + A ( . (p ~ Cj‘ ). 

w and ^ are related by * = Tx from which one can deduce the 
following relations 

K„=T-'L (6) 

A„ = .4 — A'„C = T~' FT. (7) 

Fig. 1 summarizes the CORC scheme: At the breakpoint AW, the 
transfer function can be expressed as 

Trj(s) ~ KA*I - A 0 )~'(B A I\vG(h)) 

= K, (sI - A)~' B (8) 

At the breakpoint A’, the transfer function is 

T r (n) = K r {fil - A 0 -I- DK, )~ 1 h\,G{*) (9) 

where 

G(x) = C(sI-A)-'B. (10) 

Doyle and Stein (3| have proved that if 

K4l + C(sl - A)~' A’" J = D[C(*I-Ar'Dr' (11) 

then 

TA*) = T Jr (s). 

Tsui [4] has proposed the following necessary and sufficient condition 
allowing LTR when using either Kalman type observer or Lunberger 
type state or function observers 

77 (s) — T rje (.s) if and only if TB = Oor 
X(sl'-Fr i TB = (l 

Based on this result, he has proposed a robust observer design 
procedure which consists in selecting the observer poles so that ||rJ3|| 
is minimum. An explicit solution to this optimization problem has 
also been obtained when the canonical observable form is used for 
(A. ZAC). 

Ill. Discussion 

First, we notice that Tsui's procedure does not apply to systems 
with no zeros, i.e.. with a constant transfer function numerator; to 
illustrate this fact, let us consider the following example. 



_1L----=----------1 

-0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1 1.2 


Fig. 2. Tj j (.s) for rhc procedure of Tsui. 


(3) 

-10 


15 

(4) 

-20 

(5) 

-25 


30 



•30*—*--—■— --— 

-10 0 10 20 

Fig. 3. Full state 7V(.s) for Example 2. 


Example 1; The considered system is described by 

G(s) —■ ----— 

(*+!)(*+ 3). 

The associated observable canonical form (needed in the procedure) is 


{A.B.C 




.[0 1 ] 


Let Aj, and X> be the observer poles to be selected so that TB is 
small. Then we have two possible cases: 
a) A i ^ \j : 


b) A, = A,: 


We see that both cases yield to a TB which is independant of the 
observer poles, so that there is no way to select them using Tsui's 
procedure. This simple example shows the first gap in the method 
proposed by Tsui for LTR. 

A second major inconvenience in Tsui's technique is that even 
when it is possible to select the observer poles, LTR is not approxi¬ 
mated in many cases. In this context, the following example shows 
that one can encounter quite unacceptable results. 

Example 2; Consider the simple system 

[j.».n = ([? -JlM-io n}. 


(A D .n = ([; ;]•[ , p .1). 

The feedback state controller has been chosen so that A( A - BK r ) = 
{*“7, -7} which gives 

Ik, = [-57.78 17.78]. 

Applying Tsui’s procedure, the observer poles Ai, X* can be chosen 
as 

A, = -0.1 and A_. = -0.11. 

Fig. 2 shows the corresponding Nyquist of 77 ( s ) which is to be 
compared to the Nyquist of the full state feedback transfer function 
77-..U) (Fig. 3). 
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Rematk In terms of pole placement Tsui’s procedure consists in 
luffing some ot the observer poles at the stable plant zeros and the 
t sl at their neighborhood This is in contradiction with the Doyle 
nd Stein technique As a consequence Tsui s technique cannot be 
ipphcd to nonminimum phase systems contranly to the claims in [6] 
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Sensitivity Properties of Multirate Feedback Control 
Systems, Based on Eigenstructure Assignment 

R J Patton Ci P L m and V Paid 


\b\trait —.Some sampled-duta si slims, e.g , fly-hy-wire control sehemes, 
have a necessarily multimli shuiture, various input and/or outputs 
sampled at different rales When considering a multirale system which 
has parameter uncertainty, it is impoitanl to examine ways m which 
the lull freedom of the multlvai table design can be utilized to minimize 
the sensitivity to parameter variations, given the accompanying problems 
induced by mlersample ripple disturbance This note examines the design 
capabilities ol a class of mulfirate systems with multiple input and 
hxtd state sampling rates (MIFS), based on eigenstructure assignment 
Although the use of eigenstructure assignment for continuous and single 
rate discrete systems is well understood, the eigenstructure assignment 
Tor the design of multirafc feedback systems is an open topic ot research 
Accepting that the problems of intersample ripple are often magnified 
through multirate control, there are advantages in terms or increased free¬ 
dom for minimizing sensitivity and optimizing robustness to parameter 
variations A special feature of the MIFS class of multirate systems is the 
ibility to introduce extra design freedom m the eigenproblem bv a suitable 
ihoice of eigenstructure assignment and sample rates The criteria for the 
selection ot minimum sample rates to produce this extra freedom, and the 
implication that this has on the eigenstructure assignment problem, are 
outlined I he improved insensitivity properties are demonstrated using 
m example comparing the performance or multirate and corresponding 
single rate designs 

I In troduc i ion 

Foi a contiollable continuous-lime system described by 

»(/) = 4i(/) + Du{t) (1) 

Manuscnpl riieiveil March 8 1991 revised August 21 1993 and luly 
7 1994 This work was supported by the UK Engineering and Physical 
Research Council under Grants GR/H43953 
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where » € ft 1X1 and u 6 IV' xl arc the state anil contnd vet 
respectively, l E R'*'/?€ R x and the assigmhiluv , 
desired self-conjugate set of poles to the equivalent discrete \ t 
using multiratc feedback control has been well established i|l| | 
and references therein) For sampled data systems with fast multiple 
input rates and a slow fixed state rate there arc two mam problem 
associated with the application of state feedback control desm 
methods [6| [7] The first is the excessive and highly oscilhkuv 
nature of state and control effort that is produced 1 his often makes 
the implementation of multirate control lmpiactical The second 
problem is the increased sensitivity of feedback design to intusimplc 
behavior and disturbance elfccts The lattei is a common diawback ol 
periodic control structures as highlighted in the work ot Fiancis and 
Gorgious [41 The problem is particularly prevalent in MITS contiol 
schemes due to fast input control signal updates within a slow fixed 
state rate Clearly theie is a greater risk of undesirable mlei sample 
effects propagating through the system duung a slower stale period 
and causing an adverse response before they can be monitored The 
two problems are linked the intersample eflects (which cause the 
system to deviate from its desired behuvioi) demand a greater control 
effort to piovide the necessary corrective action 

Eigenstructure assignment can be used to alleviate both these 
problems I he specification ol the closed loop system eigenstructure 
(which compnses the eigenvalues and right cigenvtciois) is an 
important design consideration for the mullnaie feedback ptoblem 
The eigenvectors determine the modal inteiaction present in ihe 
closed loop system transient responses A multirate feedback design 
which incorporates a reduction in modal interaction will coniine the 
eflects ol undesircd intei sample behavior and distui bailees to specific 
subsystems oi individual modes lhis minimizes the occurrence of 
adverse responses due to crossfeed signal which in turn reduces the 
demanded control effect Hence a suitable choice ol the multnatt 
system eigenvectors will piovide an insensitive or well conditioned 
closed loop peifomiance Thus Ihe specific ition ol system eigen 
structure is seen to address the sensitivity and disturbance issuis 
raised in [4| 

The pnmaiy motivation for the use of eigensiructuit assignment 
methods to design a multirale system is ihe ability ol the techniques to 
directly address the problems associated with MIF S feedback control 
However a much slionger case foi its use is the peilcctly decoupled 
solutions that can be obtained with MIES sampling This special 
characteristic arises Irom the ability of Ihe MIES system to genciate 
maximum design Ireedom for the cigenpioblcm by an appiopnatc 
combination of input sample rales 

IT MIES MiniRvu SYsrrM Mooli 

The MIFS system of Etg I is classed as an m, -input n state 
pcnodic system The integer in, defines the sum of contiol input 
updates during a main sample period within which all periodic 
transitions oi the system are described 

lo descnbe the MIFS multiratc system where the sampleis opciate 
peuochcally but with different period a complete set of transition 
equations must be determined for an interval equal to the least 
common multiple of all sample penods (/ m Iig I) This interval 
is referred to as the main interval of sampling Since the sampling 
is periodic with / the transition equations will codify the invariant 
nature ol the multiratc system 

The MITS system equations state parameter are represented only 
at the main interval sample instants while the control matrix accounts 
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^ States 



Fig. 1. MJFS sampled multirate system. 


for all input changes within the period T. The system sample periods 
are defined by the following set of relationships: 

Tu ~ —, T t = —, m,, = Yu, (2) 

for / = 1,2, ■ ■ ■. w, where Ti is the base sample period, no is the 
least common multiple of the positive integers >i ( , and T, is the nh 
input sample period. 

These sampling interrelationships describe an M1FS sampled sys¬ 
tem. The time-invariant behavior of the multirate system states of 
Fig. 1 is described by 

..•<U' + l)7-) = r A7 .r(AT) 

+ Y'Y t t' ir ' jT '- T) D,dTii,ikT + jT,) 

,= l j-0 J » 

(3) 

where i/, is the it h control input, D t denotes the ith column of the 
original system control matrix B of (1). Therefore, the sampled data 
multirate model (3) can be rewritten as 

.r{k ■+■ 1) - *\r(k)j'(k) + ru(k)itu(k) (4) 

where 

» r u,{kT) 

u\rj(k) u,[kT -f T,) 

n\t[k) = . iiM t {k)~ . (5) 

jiMt»{k)_ u,{kT+ {n, - 1 )T,) _ 

for i “ 1,2,- ■ •, m, $i\i{k) 6 R"*" and T\j{k) € R" y,,> can be 
obtained from (3), and .r{k) 6 R"* 1 and u\i(k) € x 1 are the 
states and the inputs of the MIFS system, respectively. 

Fig. I illustrates the system structure with which the MIFS system 
is fonned by keeping one fixed state sampling rate. The diagram 
relates a state-space system. 

III. Eigenstructure Assignment 

For the system of (4), the eigenvalue assignment problem is to 
design a gain matrix F such that the state feedback control 

uM{k) = Fr{h), reR’"**" ( 6) 

produces a closed-loop system matrix ($a; + 1 u F) which has a 
set of desired self-conjugate eigenvalues {Ai, A*, - ■ ■, A,,}. Assume 
F m ) are completely controllable, a range of feedback gain 
matrices will provide the solution [8]-|l 11- 
Eigenstructure assignment uses the extra degrees of freedom in 
the underdetermined solution to specify the closed-loop right eigen¬ 
vectors, V =■ [r t i '2 ■■■ »»«], corresponding to the desired self- 
conjugate set {A,}. For an insensitive solution, the eigenvectors are 
chosen to be as mutually orthogonal as possible or to have a specific 


modal structure, A measure of the sensitivity of a solution is the 
conditioning of the closed-loop system right eigenvectors given by 
[12], [13] 

k(V)=II viMir- 1 || a . (7) 

The eigenstructure assignment problem is to select u set of the desired 
eigenvalues {A ( } and a set of linearly independent right eigenvector 
matrix V which minimizes k{V) and, furthermore, satisfies the 
following relation: ($u 4TuF)r, = p,A,, for #‘ = 1, ■ ■ • , », that is, 

- A,/ r«] =0. (8J 

r i\ 

To find a solution, define 

5.x, = [$m - A,/ r] (9) 

and a compatibly dimensioned matrix 



whose columns form the basis for the nullspace of S\ } . Thus, 



for any w,, x 1 vector, tv,. 

It can be deduced from (8) and (11) that 

r, = (12) 

and if the {<\ } are linearly indcpendcnl set, u real F can be expressed 
by 

F = [A/a, u'i (13) 

For the single rate eigenproblcm of (12), an achievable set of 
eigenvectors is limited by the nullity of the solution space, n\ r In 
general, m ,, — in < n indicates an underdetermined problem. In 
this case, a maximum of m elements of the desired eigenvectors 
can be achieved exactly, and the achievable set of eigenvectors will 
only approximate the desired set. For the MIFS multirale system, the 
dimension of Ih, can be extended beyond that produced by a fixed 
single rate system, by virtue of the periodic description used for the 
MIFS sampling scheme. 

IV, Feedback Design oi MIFS Sampled Systems 

It is well established that the controllability and observability 
conditions which must be satisfied by the multirate system for the 
assignability of an arbitrary sel of eigenvalues are determined by 
the choice of inpui, output, and state sample rates [1H7]. These 
sample parameters have, however, a much more significant role in 
the eigenstructure assignment procedure; a suitable choice of sample 
parameters can be used to produce maximum dimension solution 
space. The extra degrees of freedom generated in this way can be 
used to ensure a perfectly conditioned solution [14], [15]. 

The multirale feedback control problem and sensitivity measures 
relate to the assignment of n eigenvalues {A|, A-j, ■ ■ •. A„ [ designed 
by a transition matrix which represents the closed-loop system at 
the main interval sample instants. The set {A,} corresponds to an 
equivalent set {A lf(i }, representing the faster eigenvalues of the 
closed-loop system sampled at the based rale Tt,. The two sets are 
related by A, ri| = (A,) n o. The sensitivity measures relate to the n 
eigenvalues of the MIFS system representation and, therefore, only 
provide an assessment of system performance at the main sample 
instants. 
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lie criteria for the selection of the MIFS multnate sample rates 
hicve a perfectly conditioned feedback solution are based on the 
li i lulation of the extended controllability matrix 

= [r, $r, i„ *r (11 *» -T„] <i4) 

i T, is the ith column of the single late system control matrix 
m led at T, and <P is the corresponding single rate state transition 
mx le, $ = And define 

in 

i> = Yj 1 (15) 

/_ i 

I lu columns of 1 are selected such that starting from I i further 
columns are added until all vectors associated with Ti arc lincaily 
independent The process is continued for all control inputs lequired 
lor the achievement of ( 4 B ) controllability When the condition 

i 

I s m is icached then all subsequent input sample rales have unity 
input multiplicity i c // - 1 lor / = y + 1 m 
f or a contiollable system itm, ^ n eigenvalues and eigenvector 
can he assigned arbitrarily if w, * ii the n — m t entnes ot each 
ueenvettor cannot be assigned arbitianl) [ 161 Thus the indices 
(i i = 1 rn detennine the dimension ol the maximum solution 
subspact that the /til input is capable ot generating with an MITS 
samples system [171 A choice of input sample rate {I } = 
ic 11 -11 will produce lull design freedom for the eigcnstructure 

issignment procedure of Section 111 if // > n This choice of 
input rates is cfeired to as flu ideal sarnpk s l\ (Note also that a 
rtanangemtnl ol columns m the original contiol matux may produce 
a dillerent set {p } ) 

II // ^ n then /?\ 6 ] 1 foi the cigenproblem of (10) 

inus providing full design ficcdom lor the eigcnstructure assignment 
procedure This effectively allows tht piecisc assignment ol any set 
ot finite magnitude desned eigenvector of the closed loop niulliiate 
system For // n the design vector u is determined uniquely 
u suiting in a simpler lormula foi the multirate gain matrix 

1 _ [ l/\, ( \ v, ) ‘.I \ 1 \ ( S X ) 1 . ]1 06) 

loi // < 11 one method ol solving the state leedback cigenproblem 
is bv a least squares minimization ot the error between the desired 
md achieved eigenvectors le the minimization ol || 1 / — 1 ||2 
The least squares pto|ection of the desired eigenvector 1 / onto the 
solution space 1 \ \ detenmnes the design vcctoi to be 

11 = Ox >> ) ’ Sx > 1 U 7 ) 

1 ie least squaies projection method is termed the ‘direct’ assignment 
Ilk to its nomtciative assignment of the desired eigenvectors The 
'uect method is well suited for comparison purposes due to its single 
iss calculation This leaves no opportunity lor varying levels of 
tcntion in the design ol the control laws (as for example with 
pimu/ed assignment methods) 

A choice ot the ‘nomdear set ot sampling rates (// < a) 
111 also be implemented to design multirate feedback control that 
'luvides superior insensitivity performance than that achievable by 
I’c corresponding single rate design This flexibility in the choice of 
uput sample rate design multiplicity allows insensitivity properties 
^ the closed-loop MIFS multirate system to be determined by the 
lesigner An advantage ol a “nomdear set of sample rates is the 
mproved intersample behavior that can be obtained by effectively 
^Taxing the etgenproblem solvability conditions as demonstrated in 
Section V 




Fig 2 (a) State lesponsts ol the closed loop systems formed by the feedback 

gum mail ices A 1 A, (and h O indicates A 1 system — indicates Aj 
and A \ systems (b) C onlrol input responses ol tht closed loop system with 
A 4 (multirate) (c) Control input responses of the closed loop system with 
Ai (single rale) 

For cases where the sample rate selection process gives p > 
a characteristic ol the resulting feedback gain matrix produced is 
the generation of // - n null rows This is a direct consequence 
ot the overdetermined solution produced by this choice ot sample 
rates This effectively limits the number ot nonzero gam elements 
ol a feedback control matrix of any multirate scheme satisfying 
the necessary conditions for multirate eigenvalue assignability to 
< 17 2 The extra design freedom is illustrated in Fig 2 which shows 
the additional effective inputs which arise as a consequence of the 
multisampling 

Eigcnstructure assignment techniques are also noted tor their ability 
to generate closed-loop systems which are insensitive to variations or 
perturbations in the nominal system dynamics flT| This insensitivity 
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Fig. 2 (b) Control input responses of the closed-loop system with A' > 

(mullirate), (c) Control input responses of the closed-loop system with K\ 
(single rate). 


is achieved by decoupling the modal interactions presented in the 
closed-loop system. The benefits of closed-loop system insensitivity 
can be observed very clearly in the lime domain. 

V. Exampi b 

A simple example is used to demonstrate the effects of applying 
the direct eigenstructure assignment procedure and the sample rate 
selection criteria outlined in Sections III and IV. Multirate gain 
matrices are designed for ideal and nonideal sample rates for the 
example considered to produce perfectly conditioned solutions to the 
multirate eigenstructure assignment problem. The performance of the 
multirate feedback designs is compared to the corresponding single 
rate design to demonstrate the improved insensitivity of the multirate 
designs. 

Let the matrices of a system be 



1 

0 5 

/l 

1 \ 


0 

0 

i 

D= 0 

1 

( 18 ) 

V-c 

-n 

-6 

\1 

1 



TABLE 1 

k(\ ) AND II h lb OF l\ 1, l\ 2 . AND h j QjOSUD-LoOP SYSTEMS 


Performance Measure 

Feedback Gain Matrix 

A'i 


A 

A(V) 

172.14 

1 

1 

II A II* 

7.8253 

67.156 

75.622 


The desired closed-loop eigenvalues {-0.5, -0.G, -0.7} are as 
signed using a least squares solution such that the corresponding right 
eigenvectors are maximally orthogonal (ideally { r ,} =■ I„). The main 
interval of sampling is selected to be T = 0.1. This does not rendei 
any of the modes uncontrollable, thus ensuring the assignability of 
the desired eigenvalues. Formulating matrix gives two choices of 
input multiplicities: {/u = 2, /ej = 1} and {//i ~ 3, //* = 1). The 
multirate feedback design produced by these (wo choices of sample 
rates are given below together with the corresponding single rate 
design. 

1) The input sampling intervals T\ — T. T± = T and the state 
sampling interval is T (single rate): this gives the single rate 
gain matrix A'i 

_ /2.9832 5.0881 3.7070 \ 

A ‘ V—0.2931 -1.0622 -1.1307 ) 

II A . || 2 = 7.8253. 


2) The input sampling intervals T } = 7/2. 75 ~ 1\ the state 
sampling interval is T (multiratc); this gives the “nonideur 
multirate feedback gain matrix A'j 


/22.1843 -10.213 I -10.5515 \ 
A’>= 20.1005 37.5100 17.0313 

\0.525 0.3829 0.5400 ) 


07.1500. 


3) The inpul sampling intervals T\ = 7/3. 7/ - 7, the state 
sampling interval is T (multirale); ihis gives the ideal multnate 
feedback gain matrix A > 


A'i 


/25.3070 15.8104 

0.0 0.0 

—22.3352 -41.7045 
0.5243 -0.3821 


18.3884 \ 

0.0 

-20.0087 
0.5380 / 


which has one null row since I. 

Table 1 shows the eigenvectoi conditioning k (V) and || A' ||.» 
figures for the above designs. The || A ||i figure gives an indication 
of the magnitude of the control effort demanded by each design. 

Fig. 2 shows the responses of the A’i and Ab closed-loop systems 
to initial state perturbation of .?•(()) = [0.1.0.3,0.15] ; . The mam 
sample responses of the Kj and K \ design are identical. Fig. 2 shows 
that both multirate systems exhibit the perfect modal decoupling that 
all designs are attempting to achieve. 

Both multirate feedback designs have a very high|| K ||i compared 
to the equivalent single rale design. Thus, an increased demand on 
the control inputs is predicted (as verified by the responses of Fig. 2). 
The only difference in the control input responses of the two multirale 
systems (in addition to the different update rates) Is a slight increase 
in the magnitude of demanded by feedback design A‘i. The 
iij control signal for all designs is, however, smooth and has an 
acceptable magnitude. Thus, the faster o i sampling required for the 
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* 3. Intersamplc state behavior of closed-loop systems formed by the 

multirate gain matrix A'a. 

/v -« closed-loop system appears to offer no advantage over the A’* 
vsiem. Both achieve the required decoupled responses at the main 
sample instants. 

Fig. 2 shows the system responses at a rate 1 /T (which is the 
orrect state sample rate of the MIFS system). The state responses 
during the interval T are shown in Fig. 3. These responses show that 
'he improved decoupling of the multirate designs is obtained at the 
L ’°sl of large magnitude, switched intersample state behavior (rj of 
both multirate design does not suffer from any intersample switching). 

The most important point to note from the tabulated performance 
measure and the time responses is the achievement of perfect modal 
decoupling by multirate feedback gains A 2 and A 3 at the main 
sample instants. In comparison, the modal decoupling achievable by 
ihe single rate design A j is relatively poor. 

The insensitivity of the multirate and single rate closed-loop 
systems to perturbations in the nominal open loop systems dynamics 


TABLE U 

The Closed-Loop Nominal and Perturbed Eigenvalues 


Non 1 i n a 1 Eigen values 

Perturbed Eigenvalues 

A. 

A; and A ;i 

().!)- ! 'i 2 

1.015:1 

0".9530+j0.0070 

0.9324 

0.9136 

0.9530-j0.0070 

6.9418 

0.0504 

0.9:104 


is tested. A comparison of the shift in position of the A 1 (single rate), 
A* 2 and A a closed-loop system poles caused by this perturbation will 
establish the sensitivity of each system. 

The sensitivities of the closed-loop systems produced by A',. A j. 
and A :i are tested by perturbing the open-loop system transition 
matrix .4. A proportional perturbation in the system dynamics of 
the form A = (1-1- f>)A is considered. A perturbation h = - 0.1 
(which corresponds to a 10 % perturbation) just nukes the single 
rale system unstable. The closed-loop eigenvalues produced by each 
feedback gain matrix in response to this perturbation in the nominal 
system dynamics are given in Table 11 (together with the nominal 
eigenvalues). 

Thus, the multirale closed-loop system maintains stable perfor¬ 
mance, while the single rale system becomes unstable. The muUiratc 
system can, in fact, tolerate an extreme proportional perturbation 
in the open-loop system dynamics. A value of b ~ -0.75 (i.e., a 
75% perturbation) will cause the above A‘ 2 . A i systems to become 
unstable. 


VI. Conclusion 

Many systems have a sampled-data structure which is necessarily 
multirate to til in with the use of different data buses and digital 
sensors and actuators. The criteria for the selection of input sample 
rate multiplicities to achieve a perfectly conditioned closed-loop 
system have been outlined. This note has demonstrated the application 
of eigenstructure assignment to multirale systems. The criteria for 
the selection of input sample rate multiplicities to achieve a perfect 
conditioned closed-loop system have been outlined. An example has 
demonstrated the achievement under certain conditions of perfectly 
decoupled solutions at the main sample instants by an MIFS system, 
whose input sample rales are chosen using the proposed criterion. 
For the example system, two input rate multiplicities (determined 
using the general selection criterion) were examined. The nominal 
responses of both multirate designs have been compared to the 
corresponding single rate designs. The multirate solutions are seen to 
provide much more insensitive feedback control than that produced 
by the equivalent single rate design. The results demonstrate the 
applicability of the eigenstructure assignment technique for MIFS 
feedback control design. 

A direct eigenstructure assignment method, where the desired 
eigenvectors are approximated using a least squares projection, has 
been used to solve the multirale eigenstructure assignment problem. 
This serves to demonstrate the principles of multirate feedback design 
using eigenstructure assignment. The direct approach is, by no means, 
the ideal method of utilizing the design freedom offered by MIFS 
system models. A more useful approach is to optimize the solution to 
the MIFS eigenproblem such that intersample switching is alleviated 
while maintaining a near-perlectly conditioned modal structure. An 
MIFS eigenstructure assignment technique of this type is presented 
in [14]. The optimization procedure [15] makes full use of the extra 
MIFS system design freedom to balance the different closed-loop 
system qualities. 
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Abstract — We present optimality conditions for the approximation 
of an S1SO system by a truncated Kaut/ network with two repeating 
complex conjugate poles. Roth the continuoos and the discrete time cases 
are discussed. We approach the problem in a system approximation 
framework, and we do not assume that the input signal is white (or an 
impulse) The results we obtain generalize the results already known for 
truncated Laguerre networks. 


I In i RODUC I ION 

The usage ol orthonorrnal series of exponential functions in the 
representation oi signals pioneered by the work ot Lee and Wiener 
in the 1970’s [1] is an important topic of research in the aieas ot 
automatic control and digital signal processing for the continuous 
time (CT) case these orthonorrnal junctions can be obtained quite 
easily it we express them by their Laplace transforms [J| |4] 


Cn (•») — \/pk + p[ 


n A 


n' 


, ( s + 1 > ) 


k hr N 


(I) 


The poles ot these functions which correspond to the exponents ol 
the exponential functions are arbitiaiy except that they must have 
negative icul parts i e p, k > 0 1 For the discrete time (Df) cast 
the orthonorrnal sequences can also be obtained quite easily if wc 
express them by their Z transforms [5] 

__1”T k I , I v 

<•*()= y/T- |l —' ; 16 N (2) 

n . < i -« '> 


The poles ot these functions arc arbitrary except that ihey must he 
inside the unit circle |<//1 < 1 Oi all the possible selections ol 
the poles ol these (unctions the special case where all the poles are 
equal (and real) corresponding to the so called L aguenc functions 
(CF pk = p) d nd sequences (D1 in — r/) has attracted the interest 
of seveial researches [6|-(11] Because in piacficc we art foiced 
to tiuncate the l agucire series expansion of a given oi estimated 
function (sequence) after a finite number ol terms an important 
problem related to this appioach is the choice of a good value lor p (oi 
n) [121 [15J A directly related problem is the choice ol the optimal 
value of p (or a) for a truncated Laguerre senes with a specified 
number ot terms [16)-|I8| 

Unfortunately the approximation of a function whose Hailstorm 
has one or more pairs of complex conjugate dominant poles by a 
truncated L aguenc series requires a laige numbci ol terms |9] A 
simple way to ameliorate this problem is to use the oithonormal 
1 unctions given by (1) or (2) To avoid too many degrees of freedom 
one usually assumes that the poles ot the expansion are fust two 
complex conjugate periodically repeating poles i e p k \ — p 
P k = p' I £ N (and a similai definition foi the in s) In this 
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'Wc will denote the leal part of a variable by the subscript re and us 
imaginary part by im For Laplace/7 transforms a subscript re will denote 
the transform of the real part ol the lunciion/sequente not Ihc real part ol the 
transform liktwise for im The square loot of —1 will bs denoted by y 
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e, we can apply an unitary transformation to (1) to obtain [2], [3] 

[(a-pHa-p *)]*- 1 

[(* + 1>)(k ~ 


C»afc—l («) — '//' + P’( f < ~ 


(3) 


i- d 


ChkU) = s/p+v'i* + n~ 

[{* + p)(x + p*)\ k 


(4) 


t or the DT case, (2) can also be replaced by [5] 

-<r 


Gu-1 (~) = (H-« 


\J\ — (Id* 


V2 


-1; 


and 

GuA~) = |1 - " 


n/1 - a a* 

~7T~ 


-HD 


[(1 — )(1 — 1 )] k 

(5) 

[U~ 1 -«)U 1 

( 6 ) 

We will call these functions (and their inverse transforms) gcnerically 
by the name of Kautz functions with two periodically repeating 
complex conjugate poles, or Kautz(cc) for short. 

Like the Laguerre functions, the Kaulz(cc) functions can approx¬ 
imate arbitrarily well, in the integrated squared error (ISE) sense, 
any impulse response with finite energy. This property of Laguerre 
functions was used in the past to model plants with unknown or 
unmodeled dynamics |6|, |7|, [19], [20]. Unfortunately, the Laguerre 
functions are not well suited to approximate functions with strong 
oscillatory behavior—a task that can be done more efficiently with 
Kaulz(cc) functions. For this reason, some researchers have recently 
turned their attention to this kind of functions [211, [22], 

The problem of deducing the optimality conditions for (or n) in 
the approximation, in the ISE sense, of the impulse response of an 
vSISO plant by a truncated Laguerre series was solved in [16] and 
[ 171. Recently, these results have been extended to the more general 
case where the plant to be identified is excited by a signal with a 
nonconslant power spectrum und where the plant is approximated by 
a truncated Laguerre network [18]. We will extend here the results 
of [18] to the case of a truncated Kautz(cc) network. To this end, we 
will first deduce the optimality conditions for a truncated Laguerre 
network with one complex pole. We then proceed to show that the 
real part of the output of this network is the output of a Kautz(cc) 
network. Finally, using some results of the Laguerre case, wc deduce 
the optimality conditions for Kautz(cc) networks. We conclude the 
note with a simple numerical example. 


A. CT versus DT Signals and Deterministic 
versus Stationary Stochastic Signals ■ 


It is interesting to note that (5) and (6) are quite similar to (3) and 
(4). This suggests that the results for DT are similar to those for CT, 
which in fact is the case. We will therefore discuss in detail only the 
CT case. The DT case will be dealt with in a couple of remarks. 

Another point worth mentioning is the similarity of the results 
lor deterministic and for stationary stochastic signals. In the rest of 
■his note, we will need to evaluate inner products of the form (CT, 
* = 


(F Y\ GY) = 




where F, G\ A', and 1' are the Laplace transforms of deterministic 
signals with finite energy. When .r[t) and y(t) are stationary slochas- 
'ic signals with finite power, we can also evaluate an inner product of 
this form by replacing in the above expression X(jui)Y*(jjj) by the 
generalized Fourier transform of the cross-correlation between ,r(t) 
and Therefore, by a suitable modification of the inner product 
definition, we can cover the cases of deterministic and stochastic 
signals simultaneously. 



Fig. 1. Network for generating the continuous-lime Laguerre functions will) 
one complex pole. (The variable u is the real part of /»,) A similar network 
exists to generate tlic* discrete-lime Laguerre sequences 


11. Optimality Conditions for Truncated 
Laguerre Networks with One Complex Pole 

We will basically follow the approach of [18], adapted to our 
present needs and to the CT case. For convenience, we will assume 
that p = u + jr. The CT Laguerre functions are obtained easily from 
(I), with the result 

/ * \ k - ■ i 

/.*.(«. p) = k 6 N. 

(s + pr 

These functions can be easily generated by the block diagram of 
Fig. L By applying to this network a general input signal, and by 
forming a linear combination of its various outputs to approximate a 
given desired signal, we obtain a Laguerre network. 

Let A'(.s) be the Laplace transform of the input signal of the 
Laguerre network, and let D(s) be that of the desired signal. Both 
signals are assumed to have finite energy and to be real. Lei n be the 
number of sections of the Laguerre network, and let V„(*. p) he the 
Laplace transform of the network’s output. We then have 

r„(.s. p) = ^ ic„'kLk{«- p)X(s), 

k-] 

Note the dependence of the weights it,,.* on the number of sections. 
This is a consequence of the fact that the LY s are not otlhonormal 
when the weighting function, the power spectrum of the input 
signal, is not the constant function 1. With the notation p) ~ 

Lk{s, p)X(s), the error signal is given by 

n 

E„(S, p) = ^2 U’n.kXk(s. p) - D[S) 
k -1 

and. by denoting the usual inner product of complex signals by 
(•. ■)♦ we obtain the ISE of the Laguerre network with n sections 
as £,i = ( E „, E n ). (Due to Parseval’s theorem, we have chosen to 
represent the ISE as an inner product in the transform domain.) 

The optimal values for the weights u' u ,k can be easily found by 
applying the orthogonality property of Hilbert spaces, the result being 
the following set of so-called normal equations: 

(En , A k ) = (l k — L ■ ■ •. n. (7) 

Furthermore, we have for the partial derivative of with respect to 
a generic real parameter (a prime denotes here differentiation with 
respect to that parameter): 

C = £ [w„MA' k . E„ ) + w:. k (E n . A [.)]. (8) 

k—l 

To actually compute the partial derivatives with respect to a and 
we will need the following remarkable formulas: 

DLk {»> p) __ (1 — k)Lk-i (*. p) , t L + l{s.pJ 
8 a — 2 u + 2 a 


dL k (s, p) _ . (A- - l)L k -i(s. p) + (1 - 2k)Lk{#- p) 
o.. 7 


2 ir 


+ jk 


Lk +](*' p) 
2 u 


( 10 ) 
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Because we have A' k — V k A, we conclude that 2 

s= h ■ — — t- terms in Ak and 4^ — 1 ( 11 ) 

On 2 u 

and that 

+ terms in Ak and Ah -i ( 12 ) 

On 2n 

Applying ( 11 ) and ( 12 ) to ( 8 ) and simplifying the result with (7), 
we obtain for the stationary points of with respect to p (i e , with 
respect to u and i ) the following system of equations 

f M ij ( A n 4-1 , En ) 4" ll || /, ( E n , „ +1 ) — 0 

M n n ( 4 r» -h I » E 1t ) II „ n (E„i 4| t +J ) = 0 

It is a simple matter to verify that the only solutions of this system are 

a „ „ = () or ( E n , 4,^ i) = 0 

We will now show that the second condition is equivalent to 
u „41 »+l = 0 To this end, we apply the Gram-Schmidt orthonoi 
malization procedure to the set {4 a |, obtaining the orthonormal set 
{Bh } This can always be done because there are no lineai depen¬ 
dencies between the 4* ’s 1 We can express the orthonormalization 
pioccdure by 

B u =T»A n 

where T n is a lowei triangular non-singular matrix 4, is a column 
matrix defined by 

A, =(4 1 4 ,] 1 

and B 1t is defined in a similar way To simplify the notation in the resi 
of this section, we will denote the element of the last row and column 
of the matrix 7 1 ,, by / „ Note that r „ ^0 and that for \ ( s ) = 1 we 
have i „ = 1 tor all n It is also interesting to note that the element 
of the last row and column of the matrix T„ 1 is equal to r, 1 
Due to the orthonormality of the Bh \ we have 

E I = E„ + ( ,+ }B,, + l 

where < „ h = (D B, +. j) is the /»+1th Founer coefficient associated 
with the expansion of D( s) with respect to the orthonormal set {£,} 
Therefore, we have 

{£ n 4„ + l ) = ( En 4„+l ) — ( „ f | ( B„ + \ , 4,, 4 ]) 

One of the normal equations for a network with n + 1 sections gives 
us immediately {E„\ i. 4, + t ) = 0 We also have ( B n H A„ 4 1 ) — 
r'J, ^ 0 From these facts wc conclude that the condition 
( E,i . 4, 4-1 ) = 0 is equivalent to < „fi = 0 We finish the pioot by 
noting that « M+l ll+l = i 4 i < +i 
From the above results, we conclude that the stationary points of 
the truncated Laguerre network satisfy 

1 / „ nH if4- 1 1.4-1 = 0 (H) 

as well as the normal equations (7) for orders n and n + 1 
Equation (13) is exactly the optimality condition obtained in [16] and 
[I7J under much more restrictive assumptions Because the normal 
equations define completely the weights n „ k (and the weights 
u „4 i a ), the solutions of (13) are the values of p for which u „ „ = 0 
or a ,,-fi „ 4 -i = 0 One of these solutions will correspond to the global 
minimum of (other solutions may correspond to local minima or 
maxima, or even to saddle points) 

2 Thc term in Ah in (11) is missing lor the CT case but is prcscnl for the 
DT case |cf (14)) 

l ln fact bv factoring out the common nonzero term A(*) an operation 
that cun always be done because \ (s) is an analytic function in the right-half 
plane, the resultant set is orthonomial (Obviously, wc are not considering the 
uninteresting tune A (■>) = 0 ) 


Remark I For the DT ca&e, we have for the Z-transforms of the 
Laguerre sequences 

t /. - \ _ _ P T75 ( ■* ~ n ) i r - is i 


Lh(z. n) - V 1 - u 2 


(1 - a •- 


where n = u + j i, and the remarkable formulas 

0L k (~. a) = (1 - A)Ia-iU'/)+./(1 - 2AULaU a) 

On 1-i/ 2 -i 2 

Lh m(-. n) 

_ (/ Z _ |2 


0/aU a) _ (A - 1)/a-iU, n) +(2A - l)i/£*U a) 

0, 

+ (O) 

1 - II 2 — / 2 

Note that the last term of (14) is almost equal to that of (9) the 
same happening between (15) and (10) For this reason, (11) and (12) 
arc also valid lor the DT case (with 2ii replaced by 1 — a 2 - i 2 ) 
Therefore, the proof for the CT case can be adapted quite easily to 
the DT case 


III Opi imali vy Conditions hor Truncatf d Skond-Ordlr 
Kal n/ Networks with Two Compifx Conjouaii Polfs 

In this section we will show chat the real part ol the output of the 
Laguerre network of the previous section is the output of a Kaut/ 
network of second order with iwo complex conjugate poles and with 
the same number of sections as the Laguerre network 4 We then 
proceed to compute the optimality conditions for trunealed Kautz 
netwoiks To simplify the exposition we will discuss here only the 
Cl case The DT case is similar 

The Laplace transform of the real part of the output of a Laguerre 
network with a complex pole is given by s 

3,, , (s p) = ^ y [“ ' a A a (s p) + a * a La ( s p *)] 

A I 

= v/T ^ t ^ t<s 

^ A I 



It is cleai that this last expression can be put in the foim 


E n A ( S, p ) — ^ ^ ((3, A tm s 4" * n A fii 


. [(s + -+ ^)(s -;>)]" 

[(•> +■ p)( s -F !>*)]" 


where all coefficients involved in this expression arc real h It is then 
obvious that 3 r < „(s, p)/\{s) can be expressed in a unique way as 
a linear combination of equations (3) and (4), i e , 3 ,,,(*, />) is the 
output ol a second-order Kaut/ network More specifically, we have 


3 1 , > K P) = ^ aG_a i ( f* p) + b n hG 2 h (s, p)] \ (s) 

A — 1 

where the «„ a \ and the b u a \ can be obtained from the u „ a’ s (oi 
the o„ a „,N and i „ a ’s) by an invertible transformation Because 

4 Nole that for p (or n) real i e » = (1, ihe Kaut/ network collapses into 
a Laguerre network (with twice the number of sections) Wc will not discuss 
this special case here 
s Notc that the input signal is real 

b It can be shown that the conditions o tl a a = 0 and i „ a a = 0 arc 
equivalent lo the condition i<„ a = U when i 96 () 
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is 


and />, .t are related to & n » t and i, , „ by a invertible linear 
sformation when * ^ 0 , we conclude that 



Vvt now turn our attention to the problem ol deducing the op 
jiiiy conditions for truncated second order Kautz networks The 
jiting point is the equation 

t 

I = 1 ,, - D = ^[»r , t 1, t - i/„ rl„ i]-D 

A I 


\ litre we have explicitly separated the real from the imaginary parts 
ol all complex variables and functions The 1SE to be minimized is 
now =(F i Fv ) The subscript re in is just to distinguish 
the Kautz case fiom the Laguerre case Note that we have 


mil 


4, 


, « 4a(s p) + 4a(s />*) 

A ( P) = - 




i>) = 


4* (s p) - 4 a (s ) 


From rhese equations and fiom (9) and (10) it is possible to show that 


the set { 4 rr a 4 in *} obtaining the set { R, k J ], k | ]| us , m 
be expressed bv 

D , = T, A, 


where again we have used the subscript re to denote the Kautz c isi ' 
Again 7\, t is a lower triangular nonsingular matrix which in this 
case can be partitioned in 2 x 2 blocks We will denote b\ R the 
2 x 2 block whose elements are the elements ol the last two rows and 
columns ol T, , Note that 7? is itself a lower triangulai nonsingulai 
matrix Fhe 2 x 2 block similarly positioned in the matnx 7 , 1 is 
equal to R 1 

Using the orthonormal set wc have H 

i i +1 = r , -f t , 4 1 4 1 + <, -h n, i 


Using this expression and after somt manipulations similar to the 
l aguerre case we obtain 1 ' 


(i, 

(7, 


4 

4 , 



= -« 


+ i 
i i 


0 

0 


Because \R , 1 1 ^ 0 the only solution of this system is f i — 0 
which implies ii 41 M = 0 As in the l aguerre case wc conclude 
that the slationaiy points for the Kautz case satisfy 


cM^ 

On 

0 4 a 
On 

0 1 A 
0i 

M A 
0t 


= / ^ * - fl - 4 - other terms 

2u 

= A - - — 4- other tenns 
2 u 

= -/ — k f + l 4 * otliei tenns 
2 u 

= A —— 1 4 other tenns 
2u 


(17) 

( 18 ) 

(19) 

( 20 ) 


where the other terms are terms in 1 ; 1 a 4 a 1 and 

1 a 1 (some ma> be missing) 

In the present case the weights ti f have to satisfy the following 
set of normal equations 


{ 


(1 

(r 


1. A)=0 

I A ) = 0 


/ - 1 




( 21 ) 


Note that these noimal equations aie difjaint Irom those in (7) In 
lad in the Kautz ease we have 2n equations with 2n rial unknowns 
while in the l agueric case we have // equations in n tomph\ 
unknowns Iheietorc the set of weights lor the optimal Laguerre 
1 isc when tiansfonncd to the Kautz case usually do not produce an 
optimal solution 

The partial derivatives of t, with respect to a real parameter 
a now given by 

s —2(i r[ ) 


- 2 £[». ,(f, 4 

A l 


» a (r, 4(, a)J 


s »ng (17) to (20) in this expression and simplifying the result with 
1 ) we obtain the following stationary conditions loi , 

f 11 (T 1 , f 1) — u ,, (F l,t 4.1) = 0 

\u, (F, 4 „ 41)+ (ini 1 (F, r 1 , 41) = 0 


ht only solutions ol this system of equations are 


a 


= 0 or 


{ 


(F , 
(F, 


4, * 1 ) = 0 

4n 1 -h ) = 0 


As in the Laguerre case we will now show that the second con 
inion is equivalent to n, +1 4 1 = 0 Wc stall by orthonormaltzing 


n a f 1 1 “ 0 


as well as the noimal equations (21) lor orders n and //4- I According 
to (16) Ihese conditions can be translated to weights of the Kautz 
network as 


(I 0 ((I 4 1 i =0 

nr 9 

h =n \h M 4,-0 


( 22 ) 


as long as f / 0 

Rinunk 2 The concsponding proof for the D1 case is m every 
lespecl stmilaT to that of the CT case In particular wc have (foi 
< # 0) 


4 , ( II ) — } ^ [ 1 A f».A l ( II ) *4 h A A ( ^ )] ^ ( ) 

A I 

whcie a — h ~ 0 if and onl> if n = 0 Lqualions (17) (20) 
remain valid in the DT case with hi replaced by 1 - u - 1 


IV Fxamf’I b 

To illustrate the results ol the previous section wc will approximate 
the rational system 


""- E.-Vt 


hy a truncated Kauiz network using as input the signal 

1 


\( ) - 


J 4-Ob 


(24) 


(24) 


The values of all parameters of this system are presented in I able I 
I he normalized ISL surface of the approximation of the output ol 
(24) by the outpul of a Kautz network with tour sections (n — 1) is 


7 The vcdoi A, , is now dtlined as 

A, ■= [ 4, 1 4 m 1 4, In, ] 1 


K Jhor convenience wc will represeni the two leal I ourier codhuents as ihc 
real and imaginary parts ol a complex numhtr 
l) Note that 

n I __ (Ry 4 1 4 1 c f | ) (Ryu i -| I 4 , 41) 

4 1 4 11( h) (Ryy +1 4 lfll 4I ) 
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TABLE 1 

Parameters or the System (21) Thfsf Particular Valufs 

CORRFSPONL) TO A FcRRTH-OrDER ELLIPTIC BANDPASS FllTCR 


* 

Zk 

Tk 

1 

0 

0 1074 

2 

0 5819 + 0 7487j 

-0 003028 - 0 05312] 

3 

0 5819 - 0 7487] 

-0 003028 + 0 05312] 

4 

0 7640 + 0 5814] 

-0 006167 + 0 04034] 

5 

0 7640 - 0 5814] 

-0 006167 - 0 04034] 



Fig 2 Contour plot of the normalized ISE of the approximation of the outpul 
of (21) by the output of a Kautz network with foui sections as a function of 
the pole position (only first quadrant shown) Hie input of both systems was 
(24) Superimposed in this figure are the curves m 4 = 0 and 7>i j = (J For 
each value of p the weights are computed trom the normal equations 



Fig 3 Same as Fig 2 but with the curves *u t = 0 and />-, =■ () instead 
of u\ 1 =0 and />i 1 = 0 Note that the global minimum is located as a 
point (ji « 0 G19 + /() r >80) where the two curves cross 

displayed in Figs 2 and 3 using contour lines separated by 1 dB 10 
Hie input of both systems was (24) We have superimposed the curves 
01 4 = 0 and b* * = 0 in Fig 2, and the curves 0 ^ ^ = 0 and h r, r„ = 0 
m Fig 3 Note that, in this example, the global minimum is located 
at a point where u 1 * = 0 (p w 0 G19 + j 0.580) However, our 
conditions do not exclude the possibility that 1 / „ „ = 0 (and not 
u 1141 »i +1 = 0) may hold at a global minimum This is known to be 
possible in Laguerre networks 

10 For each value of p, we have computed the weights from (21) For this 
reason in these figures the normalized ISE is only a function of p 


v Conclusions 

Wc have deduced the optimality conditions tor truncated K«iut/ 
networks with two complex conjugate periodically repeating poles 1 
is shown that these conditions are a simple generalization of those o( 
the Laguerre case The system approximation case discussed here is 
more general than the function approximation case usually considered 
in the literature Surprisingly, the optimality conditions are the sanu 
in the two cases 

Another proof of these results which is not based on a Laguem 
network with a complex pole and is therefore easier to geneialize 
is presented in [21] (only tor the special case of an impulsive input) 
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Stable Adaptive Control of a Class of 
First-Order Nonlinearly Parameterized Plants 

Jovdn D BoSkoviC 


\frstract —Stable adaptive control algorithms are designed Tor a class 
pi in st-order nonlinearly parameterized plants containing nonlinearities 
similar to those arising in fermentation process models When the plant 
nonlinearity is conveniently parameterized and when the controller pa- 
i iineters are adjusted adaptively, the output error is shown to converge 
in zero Stability of the overall system Is proved using the Lyapunov 
function with a suitably chosen cubic term. 


I In production 

[he pasl seveial years have witnessed substantial progress in the 
tit Id ot adaptive feedback linearizing control While most ot the 
existing results aie derived in the context of linear parametcn/ations 
in |6] the control problem is also solved tor a class of nonlinearly 
liaiameterized plants both in nonadaptivc and adaptive cases In this 
note we suggest a method for solving the adaptive control problem 
toi a class of nonlinearly parameienzed models similar to those 
arising in fei mentation piocesses Unlike the adaplive method from 
16] which is based on high gain adaptation in this note we suggest the 
method based on certainty equivalence principle and a novel ionn ol 
lyapunov 1 unctions 

The class ot plants consideied m this paper contains a nonlinearity 
which is ol the ionn ol a ratio ol polynomials m i with thicc 
unknown parameters the two ot which do not entei linearly Such 
i model captuies all the important lealuies ot a large class ol models 
ol fermentation piocesses It is shown that loi a constant desued 
response ol the plant the appropriate plant parameterization in 
conjunction with the conesponding adaptive algonthms and a suitably 
chosen lyapunov function enables us to prove without neglecting 
my lunis in the derivative ot the Lyapunov function that the output 
eiior of the plant converges to /em ic that lino x • \t) 0 

The results presented in this note die based on the results from 
Ml and aie aimed to be an important step towards designing stable 
idaplive algorithms loi a class of uncertain nonlinear plants arising 
in continuous flow and fed batch fermentation processes 

The paper is organized as follows In Section II the motivation is 
givtn and the tracking pioblcm is stated loi a special class ot nonlin 
car plants hi Section III two plant parametui/ations aie presented 
ind the coiresponding error models arc derived Section IV contains 
the stability analysis lor both cases of plant paiameten/alions while 
ihc conclusions are given m Section V 


II Mohvaiion and Probifm Staiimini 


To lllusnatc the difficulties encountered when attempting to control 
i nonlmeai lermenlation process we will consider a class ol termen 
1 nion process models in which the growth ot microorganisms and 
iibsiiate consumption aie described by the Monod s growth model 
f| i this case, using the notation adopted m biochemical enginecimg 
he piotess model can be described b> 


\ = \ 


7v„ + 9 



S 


] 

T 


/i ^ 
7c, + S 


+ (£j “ 


( 1 ) 

( 2 ) 
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1 ngincenng Yule University New Haven CT 06S20 USA 
IFtb l og Number 9407216 


where \ and S denote respectively rmciooiganisms and snh u ik 
concentrations and D denotes the dilution ute l) - / / \ VN | UU 
r denotes the substrate feed and \ denotes the volume o tin 
fermentation broth (\ = const for continuous flow processes and 
\ = b for fed-batch processes) In the above equal ions p denotes 
the maximum value of the specific growth late k denotes the 
Monod’s cocfhcient ) denotes the yield coefficient, and S, denotes 
the concentration of the substiate in the feed 

In the above equations the functional foim ol the growth model 
may be known while the coefficients p u h and I may var) in 
an unknown fashion within known bounds If we wanted to legulatc 
S(U around some desued value S* we would use a control liw of 
the Ionn 


P(t) - 


1 I*- 

S/ - (t) 


It (i)b(l)\[l) 

A (/) + S (t) 


- - S 


} 


wheie \ (I In the ease wc wanted to legulatc \(fl atound V* 
the tontrol law would take the lorm 


D(t 


1 I /i, (OS(f)MM 
'(01 A,|/)+S(M 


+ \[\(0- 


\ 


} 


The quesimn naturally anscs as to how should the puiameter 
estimates be adjusted in each case to pmvidt that the output assumes 
values close to the desired one over an mteival of interest Due to 
the fact that some of the unknown parameters entei the stale equation 
nonlinearly the way in which the estimates should be adjusted so 
that the contiol objective is achieved is nor immediately obvious 
Thus a question arises whether the piocess nonlineaniics can he mote 
conveniently paiamctcii/ed in a lashion which would simplify the 
analysis and the design of adaptive algonthms In this context we 
will addicss two pioblems 

1) how to conveniently parameterize the nonlineantics from (1) 
(2) which would simplify the design of adaptive laws and 

2) how to piove the stability ol the overall system 

To solve these pioblems we adopt a pmtotype plant whose 
nonlinearity captures mam featuies of the nonlineantics hum (I) 
(2) The prototype plant is chosen in the lorm 


/ = 1 7 -1- (/( i )u ((f)) - ( O) 

where i( ) represents a physical quantity in total cooidinales such 
that it cannot assume values less than zero The input and the output 
of the plant are respectively denoted by k( ) and i/( ) and u F 6 
whetc is a suitably chosen set Also wc assume that / 6 1R * 
and that |y( i )| > 0 Vi 6 (U +x ) It is furthei assumed that the 
value ol can be either +1 or I We now intioduce the paiamcter 
vector p -= [/ i /. I i] 1 , and the set such that p F _ {p p ~ 

[/, * a,]' o - (/,),„ < i, i a,), , +m ml n is 

also assumed that only (A,), m and (A ,),,, * (/ - 1 2 3)are known 
as well as lhal ( F *9 - ( i 0 < i l 1 1 /, F (() -+ x ) ) 

Comment The state and paiametcis of the plant (1) can assume 
only nonnegdtive values which substantially restricts Iht class ol 
plants under consideratton On the other hand, nonlinear models of 
fermentation piocesses as well as a large number of nonlinear models 
arising in process industry are given in total cooidinales In such 
a case the total values ol the process states (eg pressures lev 
els, tcmpciatures flows conccntiations and other variables) assume 
strictly nonnegative values Such models are also chaiacten/ed by 
nonnegativc values of parameters These lacls may justify the analysis 
of the restricted class of plants (T) 
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We next consider the control objective for the plant (3). We will 
assume that the desired response of the plant is defined by 

,i *(f) = —Aa *(f) 4 .r*(0) = r (4) 

where A = b w £ (0,4x), 0 < r < r 1IldX < +x>, and r is constant. 

We further introduce the output error c (■) = .r(■) - r. Since 
f(0) = e t) - r, we have that r<, € S tv = {cv-r im|1( < 

t' v < n 4 r lnHX }. Now we are ready to state the control objective 
for the plant (3). 

Objective Design a control law for the plant (3) such that, for 
p £ S v and for e„ £ S f(J , all signals in the system are bounded, and 
limr-^< j (f) = 0. 

To solve this problem, we will first consider different possible 
parameterizations of the plant nonlinearity. 

III. Plant Parametrizations and Error Models 


A Left Parameterization 

When p = 1, we use the left parameterization of the plant (3) [3]. 
Such a parameterization is of the form 

aj ,r J 

r =--- \-gU)" (5) 

02 4 -r 


where <m = k\/k r \ and 02 = A-j/A-j, We further introduce n = 
[oi oj] y , and (\ £ S« = {(\:n = [cm n 2 ] f . 0 < (n < n , < 
(Hj) iua J — f* 2}, where (n,),,,,,, = (bj )nim /(*•.) mux < (j = 1.2), 
and (<» I )max — ( A’j )uirix /(A'j)„„„, (j = 1. 2). In this case the control 
law is chosen in the form 


«(*)« 


1 

flir(t)) 


Oi(tUUt) 

V 2 (D + .r(t) 


- \.r{t) 4 b u ,i 


( 6 ) 


where 0| and ti 2 denote the adjustable controller parameters. To avoid 
division by zero in the above expression (and to assure the stability 
of the overall system, as will be shown in Section IV), the adaptive 
algorithms should be designed to ensure that 

m € $0 = {H : » = [IMjI'.O < -t, 

<«, < + (,.j = 1 . 2 }.vi e m + (7) 


where fj — min{(<i/)mm*0*o[(o^) iu,ik (^/)ihiu]}i ,/ — 1.2 [1], 
[31. After substituting expression (6) into (3) and subtracting (4), we 
obtain 


r (/) = “Ac (/) - 


Q|(l).l" i (l) 
h (t) + .r(t) 


['W + r(t)][ff i (f) + -r(n] 


( 8 ) 


where 0 , (■) = H } (•) - a,. (j = 1,2), denote the controller parameter 
errors. 

In this case the adaptive algorithms are suggested in the form 


Oj(t) = Hj{t) = (-l )' 4-1 — 

■ |l - sign|f (f)] 


/((i)-» ,j (i) 

[^(1 ) + .r(l)]' 

(jv) mux 2g ; (l) + (n,) 

( C> j ) mux + 2fj - («i, 


imn 

nun 




_I_ \ 

O j ) max — @j ( t ) — f j ][0j (t) 4 (<1 1 )min “ (j ] 4 1 / 


j = l/2 (9) 

where *), > 0, (j = 1,2) denote the adaptive gains, and where 
boj > 1, (j = 1,2). The system (8), (9) represents the error model 
for the case of the left parameterization. 


The adaptive algorithms of the form (9) represent “smooth” adap 
tive algorithms with projection [2, 1]. Such algorithms assure that 
(7) holds. Also, it can be shown [2, 1] that (for r = const.) tht. 
following holds 

V«U, J) 6 (5,1 x 5„ x iC ).j = 1.2. (10) 


B Rif>ht Parameterization 

In the case when p = — 1, we will use the right parameterization 
of the plant (3) [3], which takes the form 

J = -•“ //(f - r)u (ID 

1 + nj. i 


where c>i = k\/h 2 and n 2 = Am/A-j. In this case n = [n icij] 1 , and 
ci £ = {cun = [cM^i] • fl ^ (cij)miii ^ Oj K. (cij )„l,vx, j = 

1,2), where (Oj)nnn = (A 2j — 1 ) mi ii /(*j) lllrl \ J (j = 1.2). and 
(»,)„„, = (k 2 , i )„.„,/< ki)„,n,Aj = 1.2). Also, in this case the 
control law is chosen as 


m(0 = 


1 

fl( r (f)) 


" fli (f)-r 2 (0 
.l+^(f)i(f) 


A./ (f ) 4- /■ 


( 12 ) 


As in the previous case, we want to assure thal (7) holds, for the 
quantities as defined above. 

After substituting expression (12) into (3) and subtracting (4) we 
obtain 


«■(*) = 




(131 


where, as m the previous case, Oj(-) — (),(■) ~o,, (j — 1,2), denote 
the controller parameter errors 

In this case the adaptive algorithms are suggested in the form 


Oj(t) = i,{t) = (-!)' 

• |l - Mgn[f (/).»•(/)] ■ 


y(^ ; +l (Q 
[l + #j(7>, (/)]j 

(ci i )iuti\ 2Hi (t) -\- (n I )mm 
(Oy)||M\ 4 2f i — ( Cl^ ) III III 


1 


AhlirtX — ) + (Oy )mm — ^] 4 




J = 1,2 


(14) 


where > 1. {j = 1,2). Equations (13), (14) constitute the erroi 
model in the case of the right parameterization. 

As in the previous case it can be also be shown that the following 
holds 


„/+i 


V(d.r».j-) e (5„ x 5,. x 1R + ).J = 1.2. 


(15) 


IV. Stability Analysis 

From the linear relationship 4>j = - a ,,{j = 1,2), and from 

the definitions of the sets $ n and A, it follows thal 

0(t) € A = |p : 0 = • |<3>y | < (Oj )„IHX 

—(ct; )miu 4 f / . j = 1,2}, Vf € IR+ (16) 

We also define sets S, and which respectively represent open 
connected neighborhoods of points < = 0 and o — 0, with 5* D A- 
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\ s it is well known [5], on the set S x S* there always exists a 
iction \ (f . 0 ) satisfying \ € x S w ), and the conditions 

0 <6nf 2 (< <0 < fM.r' + MH| 2 

Vlr.uleS, X.S. (17) 

left,/ G (0,+"x.), (/ = 1 2 j — 1 2) The conditions (17) 
, uie that M ) is positive dehnite and decrescent on S x5 , as 
\ 11 as that it is a radially unbounded function 

Jow we are icady to state the tracking conditions foi the plant O) 

I he otcm If the following conditions are simultaneously satisfied 
11 rht conditions (16) and (17) hold, and n) along the motions of the 
, si responding ercdr model the following holds 

1 (< ") < -\' 1 < 0 V(( n) g (5 X 5 ) V, ^ (1 (18) 

ihcn linii-x < (t) = 0 

Ptoof The prool follows dnectly from [7| 

The next step is to test the stability conditions in the two cases of 
plant parametenzations Since in both cases of plant parametenzations 
il can be readily shown that a quadratic tentative Lyapunov function 
is not well suited for the stability analysis of the plant 0) in this 
paper we will piesent special tentative Lyapunov functions with cubic 
terms onginally suggested in P] 


A l tjt Pat umetei izution 

In this case the tentative Lyapunov function is of the form 


1 (■ 


<>) - 




() 

+ Ml 



oi_ 

<\ + i 


(id) 


lhc hist question that arises is whether \ (< n) as defined above 
satisfies conditions (17) Expression (Id) can be lewnllcn as 


(< u) 


1 ( »| ni(n + 1/ ) \ 

-3V + T + — i ■(., +o " j 


( 20 ) 


The above expiession can be icwntten as 

- V “I ^ >( 1 f M 1 ' 


and since (7) holds, it follows that (17) holds as well Also (7) implies 
that (16) holds so thu condition i) of the theorem is satisfied We 
also have that 


\ (r o) = a + 


i , o i / 1 + 0 i \ 
* 1 * \ 1 + M I / 


We now evaluate the hrst derivative of \ (» o) along the motions of 
the system (I 1) (14) Keeping in mind (IS) \xe obtain 


\ (( o) < + 

f t>i i 
1 + 0 


< O | I * l ) < ( # 

TTT7 (i f fTon + 0 n 

n 11 (» i 1 1 -1- 0 i 

i (1+01) 1 + (i / 


< ~\< 


- -V 


- > i 

n I r it » 

(H n 0(1 + tit i )*{ 1 H n O 
1 0 V(f it) f {S x 5 ) V/ >0 


It follows that for /t = —1 when the contiol law (12) is used and 
when its parameter are adjusted according to (14) the output error 
converges to zero 

Cormmnt In both cases of plant pararnetcn/alions it can be 
leadily shown that the closed loop system is robust to parameter tune 
variations and bounded external disturbances The proof of robustness 
is based on the tact that the adaptive algorithms assure that the 
controller parameter errors are bounded for all time 

( omment One of the questions that arise is whethei it is possible 
to use left paiameten/ation in the case when p = -1 In such a case 
the plant is of the toim 


M | i 

I = - -1- c/( I )u 

M + 1 


and since the adaptive algorithms (d) assuie that (7) holds if can 
bt icadily shown that (17) holds as well Also since (7) holds this 
implies that (16) holds as well so that the condition i) of the Iheoiem 
is satisfied Further since 


\ [* 


o) = if 


"I* 1 ! 


n i 



o >{> 


and with (10) in mind, the lirsi derivative of the function \ (r <>) 
dong the motions of (8) (d) )iclds 


T (f tt) < -\t 


{ V„ L 
0"+ I + (n 


< ()] i " rl|f 1/ 

07+7 “ d> + t) 


M j ( (> I 
+ 0(0 + 0 
0 +* 

M _ + / 


^ _ _ <>|( ^ [ _ _ 

(m + r )(0 + ,) 2 (n + O 

< - \0 < 0 V (f n) G (5 X 5 ) Vf > 0 


and the control input is chosen as 

!/(/) 1 

<l(t i(D) 

which leads to the following erroi model 


-— \ i (f + h / 

0 (0+ r(f) 


n, .,(/)! (t) 


' (t> X,(, > + H (,) 4 ,(/) ' [„ + i(/)|[# (()+ .(,)] 

In this case ihe adaplive algorithms arc ot the lonn 


>,(1) = H l (t) = (- 1)' 


[» (0+ i(0]' 


[i- sign].(/»] -.-"r " fflV -i'" 

[ (O f )n ,si + i (M j )| 

1 


*«,[(<>,)„ , - H,(t) -.,][#,(/)+(->,) , - .,] -1 I 
/ = 12 


which implies lhai condition n) of the theorem is satished It can be 
concluded that when ft -= 1 when the control law (6) is used and 
vlien Us parameters arc adjusted according to (d) the output enoi 
on verges to zcio 

H Ri\>ht Patametet nation 

In this case the tentative Lyapunov function is chosen in the form 


for which the lollowing holds 


,jf,j - ( l) '^' 

V(0 n i) e US> X cS, x IR 4 ) 


1 2 


II we choose the tentative Lyapunov function of the form (Id), its 
derivative being 


Mr <>) = + — +rn—) + -L —^— <>', (21) 

2 \ i '■/ 3'a l+o.i 


1 ( , ,,) = ,, + + 


o i ( M*. + i \ 

Ao.+tJ' 
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we obtain 


iV..)S-V+'* ,Jr 




02 + .1* (<>2 + ’ i ')( 9'2 + «**) 


02 + 
< -Ar a + 


rtj c0a.r 


• 2 t >2 + r 


(^2 + .i'j 2 a-2 + T 

_ aif^^aJ 2 _ 

(f »2 + J -)(#2 + + r). 


It is seen that the proof of stability goes through if 


A > 


Vmiu((^l )m«ix 

27(o 2 + r) 


where = (fV 2 ) r „« x + fa - (oa which may be a restrictive 
requirement. On the other hand, when the right parameterization is 
applied, the system is stable for any A > 0. 


Multiproduct Production/Inventory 
Control Under Random Demands 

Jin Qiu and Richard Loulou 


Abstract — In this note we study the optimal production/lnventor> 
control policy for a single machine multiproduct production system. The 
machine produces to fill the end-product Inventory stock and the demand 
is satisfied from the inventory when available; unsatisfied demand is back- 
logged until the product becomes available as the result of production. 
For each product, the demand follows a Poisson process and the unil 
processing time is known. When the machine switches production from 
one product to another, it incurs a set-up time and a set-up cost. The 
relevant costs include the set-up cost, a cost per unit time while the 
machine is running, and linear costs for inventory and backlogging. 

This problem is modeled as a semi-Markov decision process using 
the criterion of minimizing expected total cost with discounting over an 
infinite horizon. Procedures for computing near-optimal policies and their 
error bounds are developed. The error bound given by our procedure 
is shown to be much tighter than the one given by the “Norm-based" 
approach. Computational test results are presented to show the structure 
of the near-optimal policy and how its accuracy is affected by the system 
characteristics such as capacity utilization and set-up time. 


V. Conclusion 

In this paper the stability conditions are proved for a class 
of first-order plants, which contain nonlinearities similar to those 
arising in fermentation process growth models. Such nonlinearities 
are nonnegative functions of the state of the system as well as of the 
possibly unknown process parameters. 

It was shown that different signs in front of the plant nonlinearity 
lead to different plant parameterizations. Such parameterizations are 
crucial in the choice of the control laws and adaptive algorithms for 
controller parameter adjustment. These parameterizations also serve 
as a guideline for the design of appropriate Lyapunov functions used 
to prove the stability conditions. The Lyapunov functions V( ■) used in 
the stability analysis contain suitably chosen cubic terms which assure 
positive definiteness of V( ■) as well as that the stability conditions are 
satisfied. Steps involved in the choice of the plant parameterization, 
the control laws, adaptive algorithms and the Lyapunov function, form 
a basis of a method for designing robust adaptive control algorithms 
for a class of fermentation processes. Further development of such a 
method is currently in progress [4]. 
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I. Introduction 

There has been a growing interest in design of multiproduct produc¬ 
tion/inventory control policies for capacitated manufacturing systems. 
A good control policy leads to low inventory and backlogging costs. 
Policies studied frequently in the literature of operations management 
include base stock policies (Karmarkar |8], Graves [7|). lot-sizing 
policy (Karmarkar et aI [9|, Zipkin 117]), and linear control rules 
(Graves [5J, Denardo, and Tang [2|). These policies are heuristic in 
nature since they are restricted to a predetermined policy structure 
(e.g., constant production lot size in lot-sizing policies). 

In this note we explore the optimal production/inventory control 
policy for a single machine multiproduct production system. The ma¬ 
chine produces to fill the end-product inventory stock, and the demand 
is satisfied from the inventory when available; unsatisfied demand 
is backlogged until the product becomes available as the result of 
production. There is limited stock space For each product. When the 
machine switches production from one product to another, it incurs 
a set-up time and a set-up cost, both of which are deterministic. Foi 
each product, the demand follows a Poisson process and the unit 
processing time is known. The relevant costs include the set-up cost, 
a cost per unit time while the machine is running, and linear costs 
lor inventory and backlogging. The research objective is to find the 
produclion/inventory control policy that minimizes the expected total 
discounted cost over an infinite horizon. 

Optimal control policies have been studied for decades for various 
single product production systems (see Sobel [12], Gavish and Graves 
[4], Srinivasan and Lee [13J). Chauny et al. |1] study the control 
problem for a single machine center multiproducl FMS (Flexible 
Manufacturing System) where no end-product inventory is allowed. 
They formulate the problem as a discrete semi-Markov decision 
process, which is solved by the Successive Approximation method in 
conjunction with interpolation techniques. Seidmann and Schweitzer 
[11] study the interaction of the flexible machine center and dedicated 
assembly lines that immediately follow. The linear cost charged 
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the time during which the assembly lines are idle is used as 
arrogate measure of the throughput loss. This measure may, 
iu ever, be difficult to justify when the goal of the production 
s\ cm is to produce to match the demand at the lowest cost rather 
t h; i producing at the maximum rate. Graves [6] proposes a state- 
jcpeiident heuristic solution procedure for the discrete-time, single 
in. -.chine multiproduct cycling problem. The simulation results show 
1 ( 1,1 the heuristic policies perform better than (*, S) or lot-sizing 
Prides in most cases. 

In this research, we allow both end-product inventory and backo- 
itl-rs and develop a near-optimal production/inventory control policy 
tor the production control problem. In Section II, we formulate the 
problem as a discrete semi-Markov decision process. In Section 111, 
we develop a solution procedure which leads to a near-optimal policy 
and a procedure for generating its error bound. Computational test 
results are also presented to show the structure of the near-optimal 
policy and how its accuracy is affected by the system characteristics 
such as capacity utilization and set-up time. In Section IV, we 
conclude with a discussion on the future extension of this research. 


11. Development of the Model 

Notations: 


\n 

<in(t) 


tin 


Uhu 

IUn) 


index for product, 1 < n < N. 

demand rate of product n. 

cumulative demand of product n during time 

interval [0, /]; its distribution is Poisson with mean 

A J. 


unit processing time of product n. 

set-up time and cost when the machine switches 

production from product / to product j. 

P„. Wn n = 0 Vo. 


production cost rate ($/unil lime). 

inventory holding cost rate of product n ($/unit 

time/unit). 

backlogging cost rale of product n ($/unil 
time/unit). 

inventory/backlogging 

if .i*f ( > 0 
if< 0. 


f tju I' m 

cost— { 

l-06.. .In 


Q,, limit on the end-product inventory of product n. 

i continuous discount rate, n > 0. 

Prob(i/ = i/o) probability that </ is equal to i/o. 

E[y] expectation of random variable //. 

oxp (ty) exponential function e y . 

The production/inventory control problem is formulated as a semi- 
Markov decision process (SM.DP) (Denardo |3J) with the following 
characteristics. 

State Space: The stale of the system is comprised of the inventory 
tale and machine set-up status. The inventory state space is defined 
is X = {.r (E Z s \ j’a < Qn , 1 < n < A} with .r„ representing 
die inventory (backlog if .r„ < 0) level for product n. Let index A- 
denote the machine status such that k = n means that the machine 
s set up for product n. The state space of the system is thus defined 
is S = \s € Z* + l : s = f.r. A*)}. 

Action Space: We assume that the production process is reviewed 
it those points in time when either the machine is idle and some 
demand arrives or when an item has just been processed and the 
machine becomes free. In other words, we assume that preemption 
of set-up or processing) is not allowed. At any review point, one of 
V+l actions is taken: let the machine stay idle {a = 0) and keep the 
latest set-up, or produce one of the V products (a = n. 1 < n < X ). 


The action space is thus defined as A = {</: u — o. I. •. Y } \ 
stationary policy 7r consists of the set of actions taken depending on 
the observed state *. 

Immediate Costs: Given any state of the system s = A), if 

action a {a > 0) is taken, the next decision point is the epoch ai 
which one item of product a has just been processed. The discounted 
production cost during the rime period T ka 4 r„ (note that W\ :(l - 
Tkn = 0 if k - a\ PC{k. a), includes set-up and variable costs 
and is given by 

r t fco+fi, 

PC(k. a) = \Y kll + / exp (-a ■ t) • c ■ dt 

d/'u 

m- , ( -r \ 1 - exp(-n • ) 

= VI A m -f c ■ exp (-n • Tkn) --—-. 

a 

The expected discounted inventory cost for product n during the same 
time period 7i„ 4 r„. A\ «), is given by 

( Un(k. a) - h n ( k. a) ■ .r„ if r u < 0 

(fin + 06») ■ - i) ■ F u (/. A\ a) 

+ (A\ n ) - a) ■ .i , f if J' u > 0 


where 

i/m(A\ a) = m,n ■ A„ 


(\ - exp (—o- ■ (7 a„ 4 r „)) 

V 

_ (Tka -v r„ ) exp (- 11 ■ (7am Hh r„)) ^ 


, ,, , 1 - exp(-o ■ (Tam + Tn)) 

1>„ (A’, a) = 06,, ■ -. (I) 

n 

See Appendix A for details of F„(i. k. a) and 7? #l (.r„. A\ a). 

If the decision is to let the machine stay idle (a ~ 0) at state 
s = (,r, A■), the next decision point is the epoch at which the next 
demand arrives. The expected discounted inventory cost for any 
product n until the next demand arrival is given by 


ll„ (.r„. A’, a = 0) = Eo 
= Eo 


IL 


exp ( -<v ■ t) • / (.I’m ) ‘ (It 
1 - exp (-off) 


lUn) 


where H is the random variable representing the lime between 
demand arrivals. The probability distribution of k is exponential with 
parameter A = , A„. It follows that 

/(.I- ) f - . r „ if .!■„ >0 
/?„(.!•„. k. n = 0) = —-— = < * + “ . n (2) 

A + n l ‘ ,r " d J-.. < 0. 

Therefore, the immediate cost for any action at state s, C(.s. a), can 
be expressed as 


C'(.S. a) = PC(k. n) + k. a) 


(?) 


where PC{k , a = 0) is understood to be zero. 

The dynamic programming recursion for minimizing the expected 
total discounted cost over an infinite horizon is as follows 


1 *(*) — min 

,v 


C(.s, a —0)4- />(A\ a — 0) 


n-r-r„.0 


nmi 

(4>0 


C( -S. (I) 4- p{ A-, (i) • r Pi (> i |A‘. a) 


Jl-0 


y Py(j\ | A’, fi) ■ VU -A c„ - «] 

7N =0- 


(4) 
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where 


,,(k a) = l E * l ex P<" rt • <,) 1 = X+“ 

' l ' \e*p[-o-(Tu + T„)) 

e tt = unit vector along a ih coordinate, 
Pn0n|fr. a > 0) = Prob (e/„(T\ a + tv,) » j n ), 


if a = 0 
if * > 0 

1 < a < X 

1 < n < A’ 


,/ — vector [j„, 1 < ?» < A r ], 

The first set of square brackets in (4) contains the cost for letting 
the machine stay idle until the next demand arrival. The second set 
of brackets contains the cost for producing one item of product a. 

Ill. Solution Method 

Note that the model in (4) is defined on an infinite inventory 
state space which leads to unbounded costs. The general unbounded 
stochastic dynamic programming problems have been studied by 
many authors (see Whitt [15], Wessels [14), Lippman [10]). The 
major results have been summarized and extended by Van Nunen and 
Wessels 116]. In the context of our model in (4), the key results can 
be described as follows. Let ^ be a positive function on 5 satisfying 
Assumption 2.3 in Vail Nunen and Wessels [16J, (It could be shown 
that //(*) = (I]* 1 1-f| —1 or //(*) = (X^_, |) ■' satisfy the 

above assumption for our problem.) Denote by U the class of real¬ 
valued functions on 5 with the property ||i , || = sup„^ s |r(*)|■//(*) < 
oc. Define on 0 operators H for policy n and T, respectively, by 


We also discuss the reasons for the improvement. In Subsection C 
we show the computational test results of the error bounds and tht 
structure of our policies based on problems involving two products 

A, Approximation via a Finite-State Model 

We first define a function that maps the infinite inventory state 
space onto a finite one. A policy is obtained by applying the 
Successive Approximations algorithm to a model defined on the finite 
inventory state space. This policy is then extended to construct a 
near-optimal policy for the original model. 

Construct a finite inventory state space X = {7 £ Z N : L n < 
7„ < 13 ni 1 £ ii < X) through a mapping X »-* A'defined by 

fp(.r, l ) = Xn if.I 'n>Ln ? 

\pU„) - L v if .r„ < L„ 

where L = [Z„. 1 < n < A r ] can be interpreted as a truncation state 
in the original inventory state X. We restrict L u to be less than zero 
for all n in order to allow for both positive and negative (backorders) 
inventories in A*. Define 5 = {* € Z^ + 1 , s = (7. k)\. Clearly 5 is 
a subset of 5 Let U be the class of bounded functions U(■): S •-* R , 
and 7f be a policy on 5. On 15 we define operators f/V for any policy 
7f, and r, respectively, by 

¥ F r(T) = //>,.(«) (X) 

and 

Vnu) =- rc f ,(s) (9) 


= C(s. 7T(s)H- Ct(.s. tt(s). r) 


where 


G(s, rr(*). r) = 


(p{k\ a — 0) ■ V ' r (- v " • k) 

jf 7T (.s) = 0 

/'(*< tt(h)) ■ *(*))-■■ (5) 

o^ v 0\|^ ^ s )) 

•*’(■» + '*(,) - j. *(*)) if ?r(s) > 0 


and 


Ut'(s) = min | C(s. a = 0) -I- p{h\ a — 0) 


Ei ■ r < r - r "- *> 


nun 

(i 


C(h, a) -|- f){k. n) ■ ^ P, (j] |A. a) ■ 


j i=o 


■ P\ (jv|*. a) • r(.r + r„ - a) 


( 6 ) 


Wessels [14] has shown that successive applications of l r (respec¬ 
tively, ff n ) converge in norm to V, the value function following 
the optimal policy (respectively, to V», the value function following 
policy 7r). 

Due to the infinite inventory state space, however, the Successive 
Approximations scheme cannot be applied directly to our model. 
In Subsection A, a near-optimal policy is obtained by solving a 
problem defined on a truncated inventory state space. The challenge 
of our problem lies with the development of an error bound for using 
the near-optimal policy. A general approach proposed by Lippman 
[ID] and Van Nunen and Wessels 116] foT unbounded stochastic 
programming problems results in an extremely large error bound 
when applied to our policy. In Subsection B, we derive an alternative 
approach that yields a much tighter error bound on the same policy. 


where r J ,(,s) is a function in U defined as r,,(.s) — ). k) and tt 

is a policy on 5 defined as 7r(s) - t f(/»(./). k). 

Starting with an arbitrary function ro in 15, iterations based on 
(9) are performed m times. Dcnole by r ,n and W , respectively, 
the value function and ihe policy determined by the last iteration. 
Based on the value function r"', two additional operations are 
performed as follows: r '"~* 1 — /iVT'"' and »'* = ( r"\ Lei 
y ~ max_ t - (s) - r'"{ s)| and - max-^ |T *("?) - 7’"'(*)|. 
According to the contraction mapping theory (Denaido [31), both b* 
and b* converge to zero as the number of iterations m goes to infinity. 

In the next subsection, we derive a bound on the error, V(s) - 
r»(s), due lo using policy 7r (zr(.s) jf(p{.i ). A )), for the original 
production/inventory control problem defined in (4). We will also 
discuss how the error bound is affected by X and <*>♦. 

8 Enot Bound on the Neat -Optimal Policy 

Define a row vector D{ s) = [b\ (A-. tt(.s )),■■■ J>\ (k. 7r(.s))], 
where b„ (A-, a > 0) is the immediate expected backlogging (.r„ < 0) 
cost per unit defined in (I) and h t1 (A, a = 0) = (fu „/(A ■+ n) as 
indicated in (2). 

Let = max {//. 0} and (L — .r)^ represent the column 

vector[(L„ - j„) + , 1 < n < X]. Define a sequence of value 
functions u {,) on 5 as follows: 

„ il) (s) = n(s)-[L-.,f do) 

H (,+ I, (a) = G(a. 7T(M. U 01 ). '>1 (11) 

where (7(.s. 7 r(.s). /-) in (11) is defined in (5). 

Theorem 1. Let /> = max* n {p{a)} and define a function 
r' J ,, (.s) in U as r}," ( h) — r n, (p(.r). A 1 ), then 

v*<*) < i-;r(*)+ r—— +yv''(«). 

1 -fi 


( 12 ) 
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Pioof Since 7r( s) = W(p( i) A ) by the definition of policy 7 r 
immediate inventory cost /?,(*, k tt(a)) and the immediate 
i | Lost C (s tt(s)) (both defined in Section II) can be rewritten as 

/ I k 7T( s)) = I, ) k 7T( s )) 

+fc (I - / ) 4 


T(s tt(v)) = F((A 7T( S)) + (l r A r(s)) 

I 

= C ip(l) A 7T(S))+ « (,J (s) 

H n i) (s)—C(s 7t(s)) + 6(s r(s) z / ) 

= » 41 (/'(0 M + «i u) (t) 

< i, (*)4** + i/ m (s) 

Hl'f] {*•) < H n {i, 4 ** 4 w <n ) 

=■ f (s tt(s)) + G(s T(s) ,, + « in ) 

1 I, (s) + (l + /0 ^ +II (1, (S)+M '(s) 

Since AA, / / y —> 1 f we obtain 

s x 

1 7 T (s )s 1 1 ( s )+ 1 + yti * 1 (s) □ 

The mini 2 

H-) = r " i. (s) ^ '/ M - /- (H) 

i - / 

Theorem 2 is proved in a way similar to Theorem 1 Theorems 1 and 
2 lead to the following result 

" Ms) 1 U(s) ^ t (s)-f + VV »(s) 

1 - !> ' 1 - v " 

C nmputtm* an Uppu Bound on ^ v , </ ( ’( s) The exact value of 
i N*) IS difficult to compute since itsell may involve 

infinite arithmetic operations (when 7 r(s) ^ ()) In what follows we 
piovide a procedure tor finding its upper bound 
Fust a crude upper bound on f/ ( *M 77 ( (s) can be easily derived 
md is given as follows 

- ’(■,) = ,, 1 zr [d- o + + (--d n 

Vs e s , > i 

where the row vector D* = [max* {/> (A u)} 1 S n 5 A] 

tnd the column vector -T = [max { \ /\ max* {F[d (t + 
h )]}} 1 S n < \ ] Note that F \d (r, -1 l k )] — \ (r + 7/ ,) 
Given an integer M we obtain 

'(s) < ’(S)+ Yi l(s) 

i i v+ i 

= £«‘ ’(s) + /. W /T 

I 

’(7 - I) f r U (I - />) 4 /> * 

f-“> (wo 2 

Clearly when U is large the right-hand side of the above inequality 
is close to 5^ x | » ( } M 


Second define a sequence of value functions u (s) follow 
- i/ n, (s) Vh e s 

*(s) = 77 (,) ( v) if i , < T for some n t i 

+ n (s) = ( F (s 7T(s) </ M ) if, ^ ] lor all „ 

It can be shown by induction lhat u l ] (s) _ „ ) Appendix B 

shows that </ ( J (s) can be computed within finite unthmeiic I ei 

M 

b{*) = ]T ,,( , (->)4/» u ir 

i 

' a - ') H , v u ,<) + / r 

i - (i-,-) 

The bound on the error due to using near optimal policy t \ ff ( s) - 
I (s) is thus given by 

^1* 4 h 

f(s)= H fi{s) (14) 

1 - p 

Remark 1 The error bound in (14) has thret components h* 
and h where and b* measure the enoi due lo the Successive Ap 
proximations on the truncated model Given any truncated inventory 
slate space A* and b* can be made arbitrarily small by performing 
a large number of iterations based on (9) b( s) on the othei hand 
measures the erroi due to truncation and it van only be made small 
by reducing I The last point can be demonstrated by the numerical 
results repoited in the next subsection Consequently the quality of 
the near optimal solution is mainly determined by the selection of 
the truncation state L 

Re mark 2 A general approach foi deriving an error bound for 
unbounded stochastic dynamic programming problems can be found 
in Lippman 110] and Van Nuncn and Wcsscls [lb| The key clement 
to the general approach is the norm \\Thri, - i, || dehned as 
max fs { \H n <; (s) — i ’ (*.)| //(s)( We have tiled two forms of 

/<(•>) = <rT , I* I) 1 and /'(•>) = (E n i l< I) 1 hu| I 50111 

yield extremely large enor bounds (100- l(KK)/ of the lower bound 
m all instances tested) More ciideally there is no indication that 
the error bound would approach /eio when the tiuncation stale / 
decreases to -x In contrast the results based on our approach 
show (in the next subsection) that the euoi foi using policy ~ is 
bounded by a rather small valut when the initial inventory stale is 
reasonably fai above the truncation stale I he improvement in our 
view comes from the fact lhat our approach explicitly uses the Poisson 
probability distribution (foi computing n ( 1 in b) which guaianlees 
that the probability of reaching the inventoiy state below the current 
one decreases exponentially As a result the impact of the value 
function in inventoiy slates below the truncation stale which is 
not captured by the Successive Approximations procedure on the 
tiuncated state space is not significant 

C h \pen imental Rt \ull\ Based on Tw o Produi t Prohle ms 

We now consider a manufacturing system that pioduces two homo 
gencous pioducts Four experiments (shown m Table I) are conducted 
lo examine the impact of set-up time and capacity utilization denoted 
by / (» = 2 A r, for our experiments), on the error bound of the 
near-optimal policies 

For each problem a near-optimal policy is computed based on the 
iteration procedure (9) with the following stopping ciitcnon 

111,1x1=—^-} < 0 001 

S [ I ( * ) J 
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TABLE I 
Four Experiments 


Parameters 

Experiments 

Problem 1 

Problem 2 

Problem 3 

Problem 4 

Tia = r 3 i = T 

i.o 

1.0 

2.0 

2.0 

e A 

0.25 

0.40 

0.25 

0.40 

T| w r a = r 

1.0 

1.0 

1.0 

1.0 

r a 2 At 

o.so 

0.80 

0.50 

0.60 

Cj = Cj 

13000 

13000 

S 3000 

8 3000 


$50 

$50 

$50 

$50 

9\ “ 9i 

3100 

1100 

$100 

$100 

= 9*3 

S500 

.... 

$500 

$500 

5 500 

Q\ =» 0* 

50 

50 

50 

50 

L\ = L% 

-50 

-50 

-50 

-50 

a 

0.025 

0.025 

0.025 

0.025 


The quantity on the left-hand side of the above inequality represents 
the relative error bound due to the imperfect convergence of the 
Successive Approximations procedure for generating the near-optimal 
policy. 

We then compute the maximum of the relative error due to 
truncation, defined as <*>(-s)/T"*(«), over five different inventory state 
regions. The results are presented in Table II. The following two 
observations are of particular interest. 

1) In each experiment, the error due to truncation tends to increase 
as the initial inventory state moves closer to the truncation state L, 
and becomes extremely large when the initial inventory level for both 
products is below -35 [with L = (-50. —50)J. This means that the 
near-optimal policy can be a good approximation of the true optimal 
one only when the initial inventory state is reasonably far above the 
truncation state L. Therefore, for a given initial inventory stale, the 
near-optimal policy can be improved by reducing L. This obviously 
means increased computing time required for generating the near- 
optimal policy since the iterations based on (9) have to be performed 
over a larger inventory stale space. 

2) Given the same region of initial inventory states, the error due 
to truncation tends to increase as the capacity utilization and/or set¬ 
up time increase. Intuitively, this is because the probability of the 
inventory state moving beyond the truncation state in the future 
transitions increases as the machine becomes busier. This suggests 
that the truncation state L required for generating good near-optimal 
policies is determined by the machine capacity utilization and/or set¬ 
up time; the higher the machine capacity utilization and/or set-up 
time, the lower the truncation state has to be (which means more 
computing time required as a result of observation l). 

Fig. J depicts the structure of the near-optimal policy for the 
problem with r = 0.5 and T = 1.0. The two coordinates represent 
the inventory positions for products 1 and 2, respectively. The solid 
(dashed) lines delimit the actions to be taken when the machine is 
set up for product 1 (product 2). These lines (also called switching 
curves ), create four regions, and in any one region the action is 
well defined. Thus, in region 1 (respectively, region III), one should 
produce product 1 (product 2, respectively), irrespective of the current 
machine status. In region 11, do not produce anything, but keep 
current machine set-up. In region IV, continue to produce current 
product (whether 1 or 2). The shape of the policy clearly indicates the 
monotonicity property: given the machine status and inventory level 


x2 



for one product, there is a unique threshold level tor the other product 
to switch the decision, either to stop production or start producing 
the other one. It also shows that the near-optimal policy is not of the 
classical (■■», Q) or (.s, 5) types, since the points at which production 
switches depend on both invenlory levels and the quantities produced 
depend on the quantiiies ot the realized demands for both products 
during the production run. 


IV. Conclusions 

In this note, we have developed a procedure that generates 
near-optimal policies for the single machine multiproduct produc- 
rion/inventory control problem by solving a problem defined on a 
truncated inventory slate space. The experiment test results have 
shown that the error for using a near-optimal policy is bounded by 
a small value if the initial inventory state is reasonably far above 
the truncation state. Wc have also found that policies based on 
predetermined structures such as lot-sizing and (.s. .S 1 ) policies can 
be very different from the optimal policy and therefore perform 
poorly in practice. For a problem that involves many products, 
however, our procedure cannot be both efficient and accurate due 
to the “curse of dimensionality.” For a ten-product problem, for 
instance, even if we preserve only five inventory slates for each 
product in the truncated space, the total number of inventory states 
becomes 5 1(1 x 10 = 9.7G x 1() 7 . 

We are in the process of developing a decomposilion/aggregation 
solution procedure for problems with many products. For an A'- 
product problem, the procedure decomposes the problem into N 
subproblems, each involving a single product and a composite product 
that represents the remaining A T - 1 products. Based on the solutions 
of the two-product subproblems using the method developed in this 
note, the procedure constructs a separate policy for each individual 
product such that policies for different products do not conflict at 
any state of the system. Preliminary results have shown that this 
procedure leads to very good policies. 
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TABLE II 

( Relative Error Bounds of the Near-Optimal Policies Due to Truncation 


Parameters 

Inventory State Regions 

x\ > —15 

> -15 

ii > -20 

x 2 > -20 

X\ > -25 

x 2 > -25 

Ji > -35 

i a > -35 

i, > -50 

x 2 > -50 

r = 0.5 

p 

II 

1.56 x 10-’ 3 

3.37 x 10-* 

9.76 x 10-‘ 

8.89 x 10- J 

1.61 


p 

c4 

II 

£-■ 

4.56 x IQ' 13 

9.46 x 10-* 

2.61 x 10' s 

.... ' j 

0.21 

3.57 

-* 

II 

o 

bo 

T = 1.0 

4.17 x 10- s 

. - . 

4.07 x 10- 3 

9.64 x 10- 3 

0.74 

2.71 


r = 2.o 

1.38 x 10-" 

1.16 x lO' 3 

0.24 

1.65 

8.66 


R 


Appendix A 

The Inventory Cost During Unitprocessing Time 
I f .r„ < 0, 

exp (—n ■ t) • fjim ■ E[d u {t) - .r„] • dt 

’I'ka+Tu 

exp(-o ■ t) ■ (ji, u • (A„ ■ t - j-„ ) ■ dt 


„(.r„. a) = [ 
Jo 

-f 


= a n (b\ a) - h n (k, a) ■ .r„. 


If ‘i'n > 0 

R( 


Jo 
4 - 

-L 


h-n + Ta 


exp(-n ■ / ) • (r/„ - /v[(.r*„ - (f))“*"] 


+ //*„ ■ -r/ l( (/))”■]} f// 

,7 in-+ r r. 

exp(-o ■ /) 


• | r/„ ■ - i) - Prob ( H u ( t) = /) 

;-o 

:x; 

■f fjhti ■ y (* - .r„ ) ■ Prob (</,, (/) = /) 


= r M +1 


■ r it 


-L 


T ka + r a 


exp (-0 ■ /) 


E , . , l A 1 

(•*■» - 0 .— 


(A„ ■ f)' exp (- A„ ■ t) 


■ dt 


rA-..+ r i 

Jo 


■ exp (-n ■ A) ■ film • (A,, • t — .r u ) ■ <lt. 


The second term is exactly fi„(A\ o) .— /j., («) - .r ?1 . The first term 
becomes 


i ,7 1 


JA f T k'* + r « 

H- (fh„ ) ■ \ ( .Pm - 0 * / C*p ( - ( Am + ) ■ t) 

J l) 

.1 

■ dt = (//„ 4 - Uhu) ■ - ') * P..(g A\ a) 


r--0 

(Am */)' 


here F„(L A*, n) can be computed using the recursive relationship 

A'm(l k.n)= f 

Jo 


/‘ r *« +T « „ (A„ O' , 

exp (-(Am + o) • f) ■ —:r— ■ (It 


(A„ ■ (T j L 4 - r n ))' ■ exp(-(A„ -4 o) ■ ( 7 T„ 4 - n,)) 
— (A ? , 4 n)’ i! 

Am 


A m 4 " O 


■ F„(i ~ 1. A\ rr) 


with F„( 0 . A\ n) = ■ f () llfa+Tn eX p(-(A m 4 - a) - f) ■ dt = [ 1 - 
e xp(-(A, ( +«)-( 7 U + r f( ))]/(A ll +n). 


Appendix B 

Computation of the Functions f/ , ,) 

The function ii (,, (*) is easy to compute at state s whereTr(.s) = 0. 
The following is the procedure for computing f/ (,) (w) at stale * where 

tt(«) > 0: 

i7 (,+u (.s) = G'(*. 7r(.s), u (>) ) 

•' t 4*- \~t‘\ 

= f>( A - 7T(*)) ■ ^ PlU'l|A\ 7T (.s )) - - - 

:ii=o 

J \V +■' t ( h ), A' " I'N 

£ . P,v(./y|A'. 7T (-S )) 



■ H (,| (.r + <■ n{ 

•v) “ 

./. 7T(*)) 4- p( A. 7T 

K ) ) ' l>' ~ 

1 rr 


r 


V J -H+' M*). 


■ 


- 1 (* - 1)- 

1 - 

n e 

P„ (ii*'- 

7T(,S)) 


l 

- 

ii 1 y -0 


- 


• A* + D(s) 





where the column vector D(s 

is given by 



D(s) — 

Am • ( TVoj 4- , 

7T ( .1 ) 

) 




^■n+«n 

.M, .1 

— ./, n 





£ 

P„U'|A‘. 7T ( N ) ) 

■j- i < 

»i < A 



.=n 




and < *{*) 

. „ represents the 


entry in vector f, r{f 

,i, l < a 

< A'. 
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The Partial Model Matching or Partial Disturbance 
Rejection Problem: Geometric and Structural Solutions 


We shall consider linear time-invariant systems described by 

r.r(t) = Ax(t) +Bii(t), t>0 

\y(t) = cm. t>0 

with A:<V —> X, D14 —► A', C:X —► y (dim(X) = n, dim(M) = m, 
dim(y) = p). This system will be noted by (A, D , C). 

Let us denote: 


• B = Im B. 

• A.' = KerC. 

• B\ A. )| the family of (.4, i?)-invariant subspaces contained 
in K, also called output nulling controlled invariant subspaces. 

• C(C\ A; B) the family of (C, A)-invariant subspaccs containing 
B. 

• V" the supremal element of >3 {A, D\ K’). V* is equal to the limit 
of the Invariant Subspat e Algorithm (ISA) for (A, B, C) 

rv° := A' 

\V‘ := Kr\A~ l [B + V 1 -’). |>1. (2) 

• 5* the intimal element of C{C, A; B). S* is equal to the limit 
of the Conditioned Invariant Subspace Algorithm (CISA) 

(S u := 0 

\S* := B + A(KnS'~ l ), />1. () 

■ F(V*) the family of static state feedback maps such that (A + 
BF)V * C V\ 


Michel Malabre and Juan Carlos Martinez Garcia 


Abstract —The Partial Model Matching Problem (or equivalently, the 
Partial Disturbance Rejection Problem) is revisited here. It has been 
initially introduced in [2] and amounts to matching the first k Markov 
parameters of the plant with those of the model. This obviously finds 
all its interest when no solution exists to the Exact Problem. We give 
both geometric and structural solutions to these problems. The geometric 
solution Is in terms of the steps of the well known supremal output 
nulling controlled Invariant subspace algorithm. The structural conditions 
amount to comparing some list of integers, namely some orders of zeros 
at infinity. 

I. Notation And Basic Concepts 

Throughout the paper we shall essentially follow the notational 
conventions of [111. Script capital (A*, y. •■■) denote finite- 
dimensional vector spaces over the field of real numbers JB, and 
dim(X), dim{y), • ■ ■ .denote their dimensions. The notation A' ~ y 
means dim{X) = dim(y). If V C A\ then A’/V denotes the quotient 
space X modulo V. 

Italic capitals (A, B. ■ ■) denote interchangeably linear maps and 
their matrix representations in particular bases. The image of a map 
D is written as ImB and its kernel as Ker£L The identity map on a 
//-dimensional space is denoted by /„. 

It is assumed here that the reader is familiarized with the concepts 
of (A, B) and ( C , A)-invariance [11]. 
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Infinite lero Structure 

Given any system ( A , B, C) described by (1) or equivalently by 
its strictly proper Q/xn/) transfer function matrix 

T(f.)-.= C(sr„-A)-'P (4) 

its structure at infinity, since such systems have no poles at infinity, 
is described by the multiplicity orders of its zeros at infinity. From 
an algebraic point of view, this structure can be derived Irom the 
so-called Smith-McMillan Form at infinity of T{s), say A v , which 
is a canonical form under right and left biproper transformations (sec 
for instance [10]). Indeed, there exist biproper inairices, ZM s) and 
Bj(s), such that 

B\(.s)T(.s)D'2(s) = = 

where A^ = diag[s~ " 1 , s~" 2 . • • •. s "' ), r := rank(T(s)). 

The nonincreasing list of integers (#>/) is the list of the orders oi 
the zeros at infinity of the system. From a geometric point of view, 
various equivalent definitions have been given for this structure. The 
original one, due to [1|, is 

n, = card{]>, > /} V/ € {1. ■ ■ ■, /■} (5) 

where card stands for cardinal (number of elements in the set) and 
with 

1 >, := dim ^ | j ■ V) > 1. (6) 

Other geometric characterizations have been given in [5). Indeed 



The following lemma will help us in the sequel. 


A v 0 
0 0 
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Lemma /? Let V\ i > 0, denote the subspace obtained in the 
, step of (ISA). There exists a .(non unique) state feedback F„ in 
) V f ) satisfying 

v +l =\:n.4^v\ w >0 (8) 

where Af 0 := (A + DFo). 

Remark 1: Due to the correspondence (5) between list (tij | and 
{ ; ,j. it is quite obvious that the list 

{j’i -it'.)- V' e {i-2.---.il,) (9) 

jiN the list of the orders of the zeros at infinity of (.4, B , C) which 
arc strictly smaller than i. 

II. Problems Statement 


The kth-Order Partial Model Matching Problem (PMMP(k)): 

Given a linear time-invariant system described by (1) and with 
associated transfer function matrix described by (4), and given a linear 
iime-invariant prespecified model described by 

= A m .r nt (t) + B rn u n ,(t). />() 

\y,n(t) = f> 0, 1 ' 

with the state E A )u 2ft. ,,,,, , the input tin, 6 Urn ^ 2ft ,,,|W 
and the output yw E y ^ 1RJ\ and with associated transfer function 
matrix 

T,Js) = c,„(sl - .4,,, r l D„ (II) 

we can fonnulate the Ath order Partial Model Matching Problem 
(initially appeared in [2]), as follows: 

Definition !: Let k G A be given. The Ath order Partial Model 
Matching Problem has a solution if and only if there exists a proper 
rational matrix C(s) such that 

T(s)C(s)-T„,(s) = *~ a+ ' ) F(*) (12) 

for some proper P(s). The proper transfer function matrix C(s) can 
be interpreted as a dynamic precompensator cascaded with the given 
plant. In other terms, we want to match the first A Markov parameters 
of the compensated plant with those of the model. 

We give Definition I for strictly proper plant and model. This 
assumption is not restrictive: the main interest, for the exposition, 
is that the associated geometry is much simpler. Extension to the 
proper case is quite easy. 

Note that, because of the assumption of strict propemess, (12) is 
trivially satisfied for k = 0. We shall indeed consider in the sequel 
that A: > 1. 

In a similar way, we shall consider the following. 


Die kth-Order Partial Disturbance Rejection Problem (PDRP(k)) 
PDRP(A) is defined as follows: 

Definition 2: Given a perturbed linear, time-invariant, system 


=. A.v(t) + Bu(t) + Ed(t ), t > 0 

\y(t) = C.r(t ). t > 0 

>at we shall denote by (.4, [i? F], C), with the state r G A’ ~ 2ft.", 

■he input u G U the output y E y — TR P and the disturbance 

' E D 2s--2ft. rj . (of course some smoothness properties of r/( ), like 
Measurability, shall be implicitly assumed here), and an integer 
^ 1, find necessary and sufficient conditions for the existence 
1 a state feedback map F.X —yU such that 


r CE = 0 
C(A + BF)E = 0 

X'(A + BF) k ~'E = 0. 


(14) 


When the control law includes some measurements ol the disiui 
bance, we have a modified version of PDRP(fc). namely the following. 

The kth-Order Partial Modified Disturbance 
Rejection Problem (PMDRPfk)) 

Definitions: Given the perturbed linear, time-invariant, system 
described by (13) and an integer k > 1, find necessary and sufficient 
conditions for the existence of a static state feedback law u[t) = 
Fj'(t) + Gd[t ), with F:A’ —► U and G\D —+ U, such that 

C(E+BG) = [) 

C(A 4- BF)(E+ BG) = 0 
< (15) 

C'{A + BF) k ~'{E+ BG) = i). 

We shall provide in the next sections geometric and structural 
solutions to these problems. We shall first consider here PDRP(A). 

III. Geometric Solvability Conditions 

Let us now consider the geometric solvability conditions of both 
problems introduced in Definition 2 and Definition 3. 

Theorem /. Let A- > 1 be given, there exists a static slate feedback 
map F which solves the Ath order Partial Disturbance Rejection 
Problem if and only if 


fev* (16) 

where £ denotes ImE and V* denotes the subspace obtained at the 
Ath step of (ISA) for (A, ZJ, C). 

Proof of Theorem I: Let A denotes the kernel of C, If F exists, 
which satisfies (14). obviously 

t' c a: 

(.4 + BF)£ C A' 

(A + nrf-'t' c 


thus 


f c K n . 4 ;. w: n • • • n . 4 ;;“ -' ! k (i7) 

where Ap := (A + BF). Now. it follows from (17) and (2) that 

\:n Aj'k: c v 2 :=a.: n .4“ 1 (i> + a.) 

=\;n Af‘iB + K). 

jc n Ap'K’ n A} j k =k: n aj-'uc n aj-'K.) 

C A.' n AJ ‘ V' 

C K'r\ AJ-‘(V' 2 A B) 
=K'nA-'(BAV' J )=V'. 


K.n\j\riAj J k' ■ ■ ‘'k: c k: n aj‘ (v*' 1 + B) 

=A,'n.4~ l (£ + V‘'“') = v* 


then £ C V*. 

Conversely, from Lemma 1, there exists at least one map Fq such 
that 


v , f1 = A:n.4^v', v; > o 
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where Afp :* (A+ BFo). Hence 

£CV k <=> £CKnAZ'V k ~\ ' 

* O 

_ / £ c X 

tv c, 
f f c a: 

» 4=5- < Af„£ C X 

Uk^ C V*- 2 , 

f £ C IC 

J ,4f„5 c a.: 

,4r~‘f c k : 

r O 

which is equivalent to (14) with F = Fq. □ 

The following theorem can be proved in a similar quite easy way. 
Theorem 2: There exists a static state feedback law u(t) = 
F.r(f) 4* Gd(t ), which solves the A*th order Partial Modified 

Disturbance Rejection Problem if and only if 

£ C V* + B (18) 

where £ denotes Im£, B denotes ImZ? and V* denotes the subspace 
obtained at the kih step of (ISA) for (A, B, C). 

We shall now give some structural solutions in terms of structures 
at infinity. 


IV. Structural Solvability Conditions 


Proof of Theorem 3: Note that if (19) is true, then 

V; = V\ V* 6 {0,1, --.A 4 1}. (21, 

Indeed, this is easily shown by induction: 

Clearly. V? = V° and if V c ~ 1 = V' “ l . then 

V’ := K.DA-'(B + £ + Vr‘) 

= JCD A~‘(B + £ + V'" 1 )■ Since (21) holds for i-1 
= K.:nA~'(P + V'~'), since (19) holds for i-1 
=: V 


and thus V/ = V f , V/ E 1,2, ■ ■ ■, k 4- 1. 
From (9), (19) and (21), we have that 


Pi “ Pi 






- 

= Pcl-Pri, v'/e {1.2, 


’ k 4- 1}. 


We shall now prove by induction the reverse part. 

Indeed, (19) and (21) obviously hold for i = 0. Assume that (19) 
and (21) hold for i — 1, then 

v; := JCnA-'(B 4 £ A vr 1 ) 

= /LIT A '(& 4 S 4- V l ~ [ ). since (21) holds for i-1 
= K. fl A~~' (13 4- V ,_1 j, since (19) holds for i-1 
= : V' 


The following equivalent solvability condition will be expressed in 
terms of some equalities between integers, namely some orders of the 
zeros at infinity of (A, B y C) and (A, [B E], C). Its main advantage 
comes from the fact that this structural information can be obtained 
through various ways, for instance from the Smith-McMillan Form 
at infinity, and thus is less dependent on a particular approach than 
Theorem 2. 


Structural Solvability Condition for PMDRP(k) 

Theorem 3: The kih order Partial Modified Disturbance Rejection 
Problem is solvable if and only if the orders of the zeros at infinity 
of both systems (A, B, C) and (A, [BE], C), which are smaller 
than or equal to A\ are the same. 

To prove this theorem, we shall use the ith step of (ISA) for the 
combined system (A, |£f E], C), i.e., 

fV? := A' 

\v; := k:c\ a-'os at + vr 1 ), /> 1 

which limit is V£, the supremal (A, [B £])-invariant subspace 
contained in K. 

First note that (18) is obviously equivalent to 

B 4 S 4 V’ = B 4 V\ V; 6 {0. A }. (19) 


Let us introduce the integers 



dim 


f(£ + B)nv:.-'\ 

V (£ + 8)nv,‘ )' 


Vi > 1 


which characterize the orders of the zeros at infinity of (A, [B E], 
C) (recall (7)), and remember that the list {p< \ - p, ,} characterizes 
the orders of the zeros at infinity of (A, [B E] % C) which are strictly 
smaller than i (remember Remark 1). 

We shall prove that (19) is equivalent to 

PI - P, = p, 1 - Pc., V/ € {1.2, ■ ■ *, A- 4 1). (20) 


and thus V/. = V\ Vi E 0, L ■ ■, k 4 1. 

On the other hand, from (20) wc have that 

Pi “P.-+1 =Pri -Pr/fl, Vi E {0. A 1 } 

then 

^g+*).«,(g.+ <jUS). V, e ID.,.. 

Since we have shown that V,' - V', Vi E {0,1,■ ■ ■. A* 4 1). it 
follows that 

dim (13 4 V ) = dim (B 4 £ 4 V ), Vi E {0, L ■ ■ ■. A*} 

which is equivalent to (19) and ends the proof. □ 

The following corollary illustrates the obvious fact that exact 
rejection is just a particular case of partial rejection and brings back to 
some familiar structural results (see for instance [5] for the equivalent 
case (see Theorem 4.2) of model matching). 

Corollary 1; The Exact Modified Disturbance Rejection Problem 
is solvable if and only if the Ath order Partial Modified Disturbance 
Rejection Problem is solvable for the order k = n\, where nj 
denotes the supremal order of the zeros at infinity of (A, B , C). 

Proof of Corollary I: Obvious from Theorem 3, since Exact 
Rejection means Partial Rejection for any k and since the integers p; 
are equal to zero for i > /q. □ 

We shall now deduce the corresponding structural result for 
PDRP(A-). 

For that purpose, consider the following “extended" perturbed 
system 



A /„ " 


0 " 


E 

A = 

0 0 

, B = 

B 

,C = .[C ()].£’ = 

0 


The following lemma establishes some direct links between 
PMDRP(A) and PDRP(A). 
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lemma 2 The Ath order Partial Disturbance Rejection Problem 
i the system 

(i(t) = 4/ (/) + Du(t) + Ed(t) t> 0 

\v(t) = Ci{t) t>() 

i equivalent to the kih order Partial Modified Disturbance Rejection 
)olem for the extended system 

/ r (f) - 17(f) + Bu(t ) + rd[l) t> 0 

\i7(f) = Ci(t) t > o 

Pi oof of Lemma 2 Denote \ = \ 2 and let V denote the /th 
lip of (ISA) for ( \ B C) The result is then an obvious consequence 
ol the following relation 

(V +tf)n i = v 

wInch proof IS quite direct □ 

Runark now that 

r (s/ - ir'fl ^ S '( (s/ - 4) '/? (21) 


ind 


where 


».U) 


»(/> ' 
i /(f) 


€ l 


l 4/ 


« (f) = 


u(t) 
u /(f) 


€ 14 


= */ Hi 



O' 

0 


B ^ 


B 0 
0 D 


=[( 


0]and 1 



Proportion / The / th order Dynamic Partial Distmhancc Rtjec 
lion Problem is solvable il and only if the (Malic) / th ordei Paitial 
Disturbance Rejection Problem is solvable 

Proof of Proportion l The proof is quite direct when using 
geometric arguments since it rests upon the following relationships 


V - V \, 


where V is the i th step of (ISA) for ( 4 B ( ) 
Then since t = ImF C \ it is quite obvious that 


c (sj — 4 r'r = ((*,/ - i) 'r ( 24 > 


c v* •<=>• t c v‘ 


lhis directly leads to the following 

S inn final Sah ability Condition for PDRP(i) 

IliLorim 4 The k th older Partial Disturbance Rejection Problem 
is solvable if and only it the ordcis of the zeros at infinity of both 
systems (4 B ( ) and (4 [B L | ( ) which arc smullei than or equal 
to k are the same or equivalently il both transfer function matrices 

7(s) = *, 'r(sf - t) 1 b 

ind 


which ends the proof □ 

This equivalence obviously icmains Irut if the distuibanct is 
available lor the control law i e tor solutions ol the type 

a (t) - 1 i (1) -f (t d(1) (26) 

Now it has been shown in (4J that the class of compensations like 
(26) applied to (2*>) is fully equivalent to the class of pioper dynamic 
piecompensalors like 

l/( s) = C { H)(1{ s) 


r (s) - [, [ c (-/ -4 ) 'ii < (s i - \) 'r] 

h ivc the same smallest orders of their zeros at infinity up to value k 
To obtain the structural solution ol the Partial Model Matching 
Problem we shall simply use the equivalence existing between this 
problem and the Partial Disturbance Rejection Problem 

/ tom Distw bam i Rejection to Modi! Man him » 

I he natural equivalence existing between Disturbance Rejection 
md Model Matching was shown in [41 We shall heie propose some 
‘lurnativc inspned from the nice equivalence existing between state 
1 cdback laws using dynamic extensions and general proper dynamic 
piccompensators (see [4|) and from the fact that the use of a dynamic 
Mension offers no extra possibility for Partial Disturbance Rejection 
let us define ihe following 

h namic kth Older Partial Disturbance Rejection Problem 

Do there exist some integer //^ and some dynamic extension 
Le 1111) 

i /(/) - Bui ( { t ) 

iff £ \ f j u t j £ U f j dim( \ ( j) = dim(U t j) = n f j such that the 
( h order Partial Rejection has a solution // (t) = F i (f) for the 
extended system 

Mf) - 4 . (t) + B u (t) + T d{t) t > () 

V it) = c i It) t> 0 K 9 


applied to the initial system 

/(f) = 4 r(f) +/MM m) 

//(/) = ( lit) t Ml 

This equivalence immediately shows the following corollary 

Corollary The A th older Modified Partial Dislurbancc Rejection 
Problem is solvable if and only il there exists a pioper solution say 
C (s) to the following equation 

T(s)( (s) + 7, (s)= S (M "/’(-) (28) 

where P(s) is a proper rational transfer matrix and 

T( S) (si - 4) 'll (29) 

and 

77 (s) =( (si - 4) 1 F (40) 

Note that (28) is a particular case of the k th order Partial Model 
Matching Problem where 7 (s) stands for the plant and —h (•») 
toi the model 

We can then rephrase Corollary 2 as 

Proposition 2 Ihe AIh order Modified Partial Disturbance Rejec 
tion Pioblem lor (27) is equivalent to the ki h ordei Partial Model 
Matching Problem with (29) as the plant and (40) as the model 

As an immediate consequence of Lemma 2 wc also have 

Corollary 1 The kth order Partial Disturbance Rejection Problem 
is solvable if and only if there exists a strictly propei solution to (28) 
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Proof of Corollary 3; Use Lemma 2, i.e., just replace T(s) by 
This is equivalent to replacing C(s) by r’Clif), hence 
the result. □ 

To show that the equivalence between Partial Disturbance Re¬ 
jection and Partial Model Matching is complete, we shall quickly 
describe the reverse path. 

Let us again consider the plant 

= A.r(/) + fl«(f). t> 0 

\y(t) = cm* t> o 

and the model 

f jm(t) = A n jr m lt) + D m u n Atl *>V 

\ijrjt) = Cn,jr ni (tl t > 0 

with their associated transfer matrices 


T(s):= C(*/„ -A)~'D 


and 


T in ( S ) := C\n ( lri - An, )~ ] D,n 


respectively. 

Let us introduce the following combination of the plant and of the 
mode], and let us call it the combined system 


f.r,(f) = A, J'< (t) + D, it + E ( limit). t> 0 

\y.(t) = y(n-v t «{t) = C.jrAi)* f>0 


with 


jrAt) 


m 

JTin(l) 


E X*\ A,n 


and 



0 

A» 



c=ir 


— C,„ ] and E, — 



It directly follows from the previous discussion that looking lor 
a proper dynamic compensator u(s) - C(a) u,„(*), that solves 
the A’th order Partial Model Miitching Problem is equivalent to 
considering n in (a ) as a (measured) disturbance in (31) and solving 
the corresponding A-th order Modified Partial Disturbance Rejection 
Problem. 

This complete equivalence allows us to directly deduce the fol¬ 
lowing. 


Geometric and Structural Solvability Conditions for PMMP(k) 

Theorem 5. The Ath order Partial Model Matching Problem has a 
proper solution C(.s) if and only if 


° 1 
D I 


C Im 


+ v; 


where V* is the A th step of (ISA) for the combined system. 

An equivalent structural condition is the following. 

The Partial Model Matching Problem has a proper solution C(s) 

if and only if both transfer matrices T(s ) and [7’( *): - T in (a)] have 
the same orders of their zeros at infinity which are smaller than or 
equal to A\ 


V. Conclusion 

We have considered here the solution to the Partial Model Matching 
Problem or, equivalently, the Partial Disturbance Rejection Problem 
Lemma 2, Proposition 1, and Proposition 2 precisely describe the 
basic equivalences between these problems. Our geometric solution 
(Theorems 2 and 5) uses “classical" geometric tools, which are less 
involved than Generalized Dynamic Covers, as in 12]. The equivalent 
structural solution (Theorems 3 and 5) is expressed in terms of the 
orders of the zeros at infinity which are smaller than or equal to the 
desired order of the partial problem. 

As argued in [2] this partial version of the problems is a very good 
intermediate to the exact one (see Corollary 1). It has to be noted that 
the same idea has been used in the context of systems with delays [8). 

Further study have been devoted to some algebraic complementary 
study (with polynomial tools) and to stability requirements (which are 
compulsory for possible applications) [9J: 

• It has been shown that there exists a stable solution to the 
Partial Model Matching Problem if and only it partial model 
matching is possible. Unlike in the exact case, there is no further 
constraint limitation imposed by unstable zeros for the problem 
to be solvable with stability. 

■ As is well known (see for instance |7]), exact disturbance 
rejection with stability has a static state feedback solution if 
and only if there exists a dynamic state feedback solution. As 
concerns the partial case, the situation is quite different. Indeed, 
it can be shown that there exists a dynamic stable solution 
to the Partial Disturbance Rejection Problem il and only if 
Partial Disturbance Rejection is actually solvable. However the 
characterization of static solutions (if any) is still open. 

• The combination of partial disturbance rejection and optimal 
disturbance attenuation (i.e., minimization in H x -terms of the 
effect of the disturbance on the output of the system while main¬ 
taining stability of the compensated plant) has been successfully 
achieved on some illustrative examples. 


References 

111 C. Commuull and J. M. Dion, “Structure at mfmily of linear multivari¬ 
able systems. A geometric approach/’ presented at the 20th IEEE Cent 
Deus. Cnntr., San Diego, CA, 1981 

[2] E. Emre and L H. Silverman, “Partial model matching of linear 
systems,” IEEE Ti a ns Automat Contr., vol AC-25, pp. 280-281, 1980. 

[3J E. Emre and M. L. J Hautus, “A polynomial characteri/aiion of (A By 
invariant and reachability subspaces," SIAM J Conti Optim , vol. 18, 
no. 4, pp. 420-436, 1980. 

[4] V. Kuficra and M. Malabre, “On various dynamic compensations,” 
Kybemetika, vol. 19, no 5, pp. 439-442, 1983 

f5J M Malabre, Structure a I’lnfim des Triplets Invariants Application d la 
Pow suite Parfaite de Moddle (Lecture Notes in Control and information 
Sciences), vol. 44. Berlin: Springer-Verlag, 1982. 

[6] __ “A complement about almost controllability subspaces/* Svsr 

Contr. Lett no 3, pp. 119-122, 1983. 

171 M. Malabre and J. C. Martinez Garcia, “The modified disturbance 
rejection problem with stability: A structural approach” in Proc Second 
European Contr Conf (ECC"93), Groningen, The Netherlands, 1993. 
pp. 1119-1124. 

[81 M. Malabre and R. Rabah, “Structure at infinity, model matching and 
disturbance rejection lor linear systems with delays,” Kybemetika , vol 
29, no. 5, pp. 485-498, 1993. 

[9] J. C. Martinez Garcia, M. Malabre, and V. KuCera, "The partial 
model matching problem with stability: Algebraic and structural solu¬ 
tions,” lchoraloire d'Aulomatiquc de Nantes, Ecole Centrale de Nantes, 
Nantes. France, Rapport Interne No. 93.20, 1993. 

[101 A. 1. G. Vardulakis, Linear Multivariable Control Algebraic Analv w* 
and Synthesis Methods New York: Wiley, 1991. 

1111 M. W. Wonham, Linear Multivariable Control A Geometric Approach 
3rd cd. New York: Springer-Verlag, 1985. 



H TRANSACTION!* ON AUTOMATIC CONTROL VOL 40 NO 2 1TBRUARY 1995 


Absolute Stability Criteria for Multiple 
Slope-Restricted Monotonic Nonlinearities 

Wassim M Haddad and Vikram Kapila 

I bstract —Absolute stability criteria such as the classical Popov cri- 
ui ion guarantee stability for a class of sector-bounded nonlineanties 
\ llhough the sector restriction bounds the admissible class of the nonlln- 
i ji ihcs, the local slope of the nonlinearity may he arbitrarily large In 
ihi'i paper we derive absolute stability criteria for multiple slope-restricted 
mm -invariant monotonic nonlinearitics Like the Popov criterion, in the 
smgle-mput/single-output case our results provide a simple giaphual 
interpretation Involving a straight line in a modified Popov plane 

l Introduction 

Absolute stability theory guarantees stability of feedback systems 
whose forward path contains a dynamic linear time invariant system 
and whose leedback path contains a memoryless (possibly time 
v uvmg) nonlinearity These stability criteria are gcncially staled in 
Ltmis ot the linear system and apply to every element of a specified 
chss ot nonlinearilies Hence absolute stability theory provides 
sufficient conditions for tobust stability with a given class of uncertain 
elements [5] [17| 

The literaluie on absolute stability is extensive A convenient 
w ty to distinguish these results is to focus on the allowable class 
ol feedback nonlinearilies Specifically the small gain positivity 
ind circle theoicms guatanlcc stability fot aibitranly time varying 
nonlinearilies whereas tht Popov criterion does not This is not 
surprising since the I yapunov function upon which the small gam 
positivity and circle theorems arc based is a hxed quadratic l yapunov 
Junction which permits arbitiary time vunation ot the nonhncarit) 
[5] Alternatively the Popov eiitcnon is based on a Lur e Postnikov 
Lyapunov function which explicitly depends on the nonlincauty 
thcich) restricting its allowable time variation 
Iurthcr refinements ol absolute stability tnlerij developed in 
[2| 112] [14] 119] restrict considciation to scclot bounded lime 
invariant nonlinearilies that art monolomt or odd monotonic and die 
prtdic iltd on extended L ur e Postnikov Lyapunov functions [9] 114] 
To lurther restrict the allowable class ol feedback nonlineanties the 
mthors in [4] [17] [22]—[24] develop absolute stability criteria by 
constraining the local slope of the nonlinearity These classical ah 
solult stability results extend the Popov criterion lor sector bounded 
hmc invanant nonlinear Junctions to monotonic and odd monolomt 
nonlinearilies by consti lit ting stability multipliers that ellectively 
ol ice less restrictive conditions on the hneai part of the system 
Howcvei as a result of the more involved mulupliei constiuclion 
die icsulting frequency domain conditions do not piovide a simple 
• iphical lest as ip the case of the Popov criterion 
In recent research [10] |I6] [18] a new absolute stability cn 
non for locally slope restricted nonlinearilies involving a simple 
modification to the Popov multiplier was developed Specifically it 
»s shown in |18] that replacing the Popov multiplier Z(s) = 1 + 
s by the new multiplier 1 + ) s 1 and requiring the frequency 
immain condition /i 1 + (I + \ s ’Kt(h) be positive real where 

Manuscript leccivtd January 10 1991 levised April 18 1994 This 
search wjs supported in part by the National Science Foundation Research 
1 mis ECS 9109558 and LCS 9150181 

I he authors aic with the School of Aerospace Engineering Georgia Institute 
technology Atlanta GA 10H2 0150 USA 
If FT Log Number 9407219 


(7(s) represents the tiansfer function ol the Imeai d>ninu s 
and // is a bound on the local slope ot the leedback uonlim. m \ 
provides a sufhuent condition tor the absoluu stabililx tot \ \ t 
with a monotonic tune invariant nonlineai element in the lu Ibia 
path As noted in [10] [16] however the statement as will is the 
proof of the results sported m 118] wei: lai from convmunc It 
should further be noled that the authors m 110] consider extensions 
to several differentiable nonlineanties in the leedback loop usin^ 
an involved method based on integral indices along with stability 
inequalities that anse from the frequency domain condition To 
piovide connections between the proposed absolute stabihtv condition 
and robust controller analysis using the paramctcri/ed Lvapunov 
function framework developed in [5] in this papti we extend the 
icsults of 1181 to multiple slope restricted monotonic nonlinearilies 
as well as construct explicit Lyapunov functions along with pioviding 
the underlying Yakubovich Kalman Popov conditions needed to 
present a concise statement of these results Specifically an extended 
notion o! a kinetic Lyapunov function |1] is used to show asymptotic 
stability of the nonlineai feedback system given by 

i(t) = 4i(f) - Ih’(ij) //(f) ( i(t) 

where n( ) is a time invariant secioi bounded mcmotylcss nonlin 
eanly That is instead of finding tht condition for the state varubles 
i(t) to appioach the zero equilibrium point sufficient conditions fot 
/(f) and the output t/(/) to approach zero art found Obviously 
it the system is observable and i(t) and y(t) appioach zero the 
system arrives at one of its equilibrium points and the two results 
are equivalent it the equilibrium point is unique \ inally in the single 
input/single output (SISO) case we show that the icsulting Irequency 
domain condition has a simple graphical interpretation involving a 
straight line in a modified Popov plane 

II MAIHIMAIKAI PR! LIMlNARli S 
In this section we establish definitions notation and several key 
lemmas Let ft and C denote the real and complex numbets let () / 
and ()* denote tiansposc and complex conjugate tianspose let I oi 
1 denote the // x // identity matrix and let 0 denote Ihc // x /» 
/cio matiix 1 uithcrmorc 1/ ^ 0 ( M > (1) denotes the fact that the 
Hermitian matrix M is nonnegative (positive) definite I et ?/(s) and 
(I ( s ) he polynomials in s with ieal coefficients A function c/( s) ot tht 
form r/( s) = //(s)/«/(s) is called a tational function Tht function //(h) 
is called proper (respectively strictly proper) ll deg //(h) * deg <l( s) 
(respectively deg //(s) ^ deg r/( s)) where deg denolts the degree 
ol the polynomial In this paper a ieal lational matrix function is a 
matrix whose elements are rational functions with red coefhcients 
Furthermore a transfer function G(s) is called propci (respectively 
strictly pioper) if every element of O(s) is proper (respectively 
stuctly proper) Finally an asymptotically stable transfei function is 
a tiansfer function each of whose poles is in the open left half plane 
The space of asymptotically stable transfer functions is denoted by 
RH >c i e the real rational subset ol H > l ct 


■ \ 

D' 

c 

D 


denote a state space realization of a transfer function G(s) that is 
G(s) = C (hI — 4) 1 B + D I he notation is used to denote a 
minimal realization in addition the parahermitian conjugate G" (s) 
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3*2 


of G(n) has the realization 


r-.4 r 

C* 

-B 1 

D 1 


Furthermore, the Hermitian part of G is given by He G 5 = i ( G+G *). 

A square.transfer function (?(*) is called positive real [I, p. 216] 
if 1) all poles of G{s) lie in the closed left-half plane, and 2) He 
(r(a) is nonnegative definite for Re a > 0. A square transfer function 
G(s) is called strictly positive real [21 ] if 1 ) G(#) is asymptotically 
stable and 2) He G(^)is positive definite for all real u 0 . Recall that a 
minimal realization of a positive real transfer function is stable in the 
sense of Lyapunov, while a minimal realization of a strictly positive 
real transfer function is asymptotically stable. 

For notational convenience we will omit all matrix dimensions 
throughout the paper and assume that all quantities have compatible 
dimensions. Furthermore, in this paper, G(s) will denote an in x m 
transfer function with input « 6 ft m , output y € ft”', and internal 
state jc € ft”. Next, we state the strict positive real lemma used to 
characterize strict positive realness in the state-space setting. 

Lemma 2.1 (Strict Positive Real Lemma 120]): 


A 

B~ 

C 

D 


is strictly positive real if and only if there exist 
W with P positive definite such that 

U = A r r+PA +L 1 L. 

0 = D 1 P-C + W t L, 

o = n + d 1 -ir'ir 


matrices P , L, and 

( 2 . 1 ) 

( 2 . 2 ) 

(2.3) 


are satisfied, the pair (.4,1) is observable, and rank G(.K) = w, 
J G 1 ?, where 


G{h) ] 



To state our main result, the following definitions are needed. Lei 
H € ft”' xn ' be a positive definite diagonal matrix. Next, define the 
set $ of allowable nonlinearities 0 by 

$ = {<!>■. W" - :■?(//) = [<Pi (l/i). ■ ■ •. )] 7 . 

0 (■) is differentiable, 

0 < ft 1 , 1 - L y € ft"'}. (3,3) 


Note that the nonlinear functions considered, 06 $, have decoupled 
components but unlike the multivariable extensions of the Popov 
criterion [5], [11] we assume a local slope constraint on the non- 
linearities. In the scalar case, m = 1, 0 satisfies the usual local slope 
condition O<0 (y)<y, y € ft. 

For the statement of the main result define 


4 i 


Of. x m 

c 0,„ 


B„ = 


B 

0„, 


C. = [0„, x S = [f 


Theorem 3.1. Let 


G(s) 


mill 

rss 



be symptotically stable, let X = diag [,V 1 . A T j, ■■■ , X m ] be 
nonnegative-definite, assume det CA~ l B / 0 , and define 

Z(») = 1 + S*-'. (3.4) 


Then 


g(s) i ,r 1 +Z(*)c.'(*) 


(3.5) 


is strictly positive real if and only if there exist matrices P, L , and 
H’ with P positive definite satisfying 

0 = Alr + rA„ +L'L. (3.6) 

0 = D,',P- \C„ - .S’ + ir' L. (3.7) 

0 = 2/j - 1 — ir 7 ir. (3.8) 


Finally, we state a key lemma involving controllability of an 
augmented pair. 

Lemma 2.2 / 10 ]: Given a triple (.4, D , C), if (.4, B) is control¬ 
lable and det .4 / 0. then 


(\a 01 roi\ 

\[c oj- [oj/ 


is controllable if and only if det CA 1 B ^ 0 . 


111. Absolute Stability Criterion for 
Multiple Slope-Restricted Nonlinearities 

In this section we consider the absolute stability problem for a class 
$ of locally slope restricted monotonic time-invariant nonlinearities 
0 : ft”' —* ft™. Specifically, given 


A 


C 

0 


we derive conditions that guarantee global asymptotic stability of the 
negative feedback interconnection of G(h) and 0 for all 06 $. Note 
that the negative feedback interconnection of G'(a) and 0 ( ) has the 
state-space representation 


.r(f)=Ar(f)- B<t>(y), (3.1) 

yit)=Cx(t). (3.2) 


In this case 


V(i’.y) 


'mV „[>«)' 
»(t) J 



X,v, {(T)a il(T 


(3.9) 


where y(t) = C\r{t ), is a Lyapunov function that guarantees that 
the negative feedback interconnection of G(.s) and p(-) is globally 
asymptotically stable for all 0 £ 

Proof ■ First, define z(t) = .r(t) so that 


i(t)=Ai(t)~B0(y)y, 

y(t)=C~(t) 


and 

■Lj (t) ~ A n j‘u( 1) — B„ f{y) 


where 


rjt) = 


■iit) 
llit) 


f(y) = <? (v)ii(t). 


(3.10) 

(3.11) 


(3.12) 


Furthermore, note that in this case j At) = C„.r„(t ) and i/(/) = S.r„(t). 
Now, since (.4, D, C) is minimal, (.4, D ) is controllable. Hence, it 
follows from Lemma 2.2 that if det CA~' B £ 0 then (.4,,, B„) is 
also controllable. 
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Next, we show that (3.6M3.8) imply that t7(») is strictly positive 
,il. To do this, add and subtract jwP to and from (3.6) to obtain 

0 = (-jajl - A„ ) T r + Pijxl - A „) - L r L. (3.13) 

Now. forming (-jjjI— A a )~ / (3.13) (j^I—A,, ) -1 B„ and using 
! ■. 7 ) vvc obtain 

[Kc\, + s-w r m J *i-A l .r'n, l 

+ dU-J^I - .4,,)'" 7 [AT,, + 5- \Y r T.] 1 
= DU-*'! - An)~ T L' Uju>I-A„) (3.14) 


and subtracting 2 y‘ 0 (y){I - 1 o'(.v))j> to and from (3.16. >Ute f 

grouping terms yields 

v [aI r + rA„].r, t 

- f r Uj)\B!,P - AT',, - 5].r„ 

- .rllDl P - A’C, - S \' /(;/) 

-2/' (j/)/i “ i f(y)-2y l 0 ( 11 ) 

■ (I - it~'o (//))(> (3.17) 

or, using (3.6H3.8), yields 


Adding and subtracting IF 7 IF to and from (3.14), using (3.8), and 
grouping terms yields 

[.VC„ + S](.m7-.4„)-'C„ 

+ - A *)' 7 [AT„ + S} 1 + 2/,- 1 

= [w + u^i-a,, 

- [)r + I(. M T- (3.15) 

Next, using the identities 


A) -1 0 

(,K)-'C(>'/- .4) 1 (>.■)“'/ 


S(ju.’l - A,, 
C„(M'I-A„) ' 


B„ =(V(. K ). 

J3„ = 


il follows from (3.15) and the rank condition in ii) of Lemma 2.1 that 
Me C/( )jl,' ) > 0. Hence C,{s ) is strictly positive real. 

Conversely, assuming that t7(.s) is strictly positive real, spectral 
factorization theory guarantees the existence of a spectral factor 
Xls) such that c7(.s) + i7~(w) = .V^(s).\"(.s), where A ^ 1 (x) € 
RW.x. The existence of 1\ L, and IF with P positive definite 
satisfying (3.6)-(3.8) now follows from standard algebraic stale-space 
realization manipulations. 

Alternatively, the result follows from a direct consequence of 
Lemma 2.1 by noting that c7(.s) has a minimal realization given by 


•4„ 

Da 

A’C, 4- S 



Next, for o £ <f> consider the Lyapunov function candidate (3.9). 
fust note that using integration by parts the integral term in (3.9) 
i' equivalent to : 



,V, o, ((T )<T fl(T 


E A)<// 



N,0, (<7 )(l(T 


) 


'°w, since o (//) > 0, for all o € $, it follows that 

ill {A’,o, (//; )</, - J n yi y,o,{rr) do } > 0. Furthermore, since 
' is positive definite, it follows that F(.r„) is positive definite. The 
^responding Lyapunov derivative is given by 

1 (•!'„) = .I'!, \A! I P + PAa]j‘ii 

-f r (!l)[DlP- AT„]r„ 
-.^[Dlr-NC,,]' f(y). ( 3 . 16 ) 

'Ml. it follows from (3.3) that c> (y)(I - ft~ 1 0 (y)) >0. for all 
' € $. Hence, 2 y 1 oiyHl - /<“' <i> [y))y > 0. Now, adding 


f (,r„) = — [Lx „ - H7(t,>] 7 [Lf„ - \Yf(y)} 

-2j/'p (//)(f-/<''(./(i/))i/. (3.18) 

Since 2y 7 o (,(/)(/ - /i " 1 o (tf))// > 0, it follows that l'(.r„) < 0, 
which proves stability in the sense of Lyapunov. 

To show global asymptotic stability we need to show that F( t r„) 
= 0 implies that .r = 0. Note that F(.r„) = 0 implies that ij(t) = 0, 

1 > 0, and hence f(y) = 0 and Lr„(f) = 0. Furthermore, in this 
case j„(t) = A a j\,(t) - D„f(y) - ,L,.r (1 (t). Thus, using .r„(t) » 
A„.r (1 (f), L.r„ ( t) = 0, and the observability of (A„, //), it follows 
from the PBH lest that .»■„(/) = 0, / > 0, which further implies, since 
( L C) is observable, that ,r(/) = 0, / > 0. Thus, the only solution 
satisfying F(.r, y) = 0 is the ./ (/) = 0, 1 > 0, solution and hence 
it follows from the LaSalle's theorem 120J that global asymptotic 
stability holds. □ 

Theorem 3.1 presents a generalization of Theorem I of [18] to 
the case of multivariable plants containing an arbitrary number of 
memoryless time-invariant slope-restricted monotonic nonlinearities. 

The form of c7(.s) given by (35) is standard in the classical 
absolute stability theory [14) in which Z(s) is a stability multiplier 
that distinguishes the class of the allowable feedback nonlinearities. 
As mentioned in the Introduction specific cases include memoryless 
time-invariant nonlinearities [15), monotonic and odd monotonic 
nonlinearities [9], [12] -[ 14], and locally slope restricted nonlinearities 
[4j, [22]—[24]. A key difference between the results in Theorem 
3.1 and the classical theory on monotonic and odd monotonic 
nonlinearities [9], [ 121—| 14J is that the multiplier Z(s) in (3.4) 
involves a simple twist on the Popov multiplier in contrast to the 
more involved positive real multipliers involving partial fraction 
expansions of driving point impedances of resistor-inductor (RC) and 
resistor-capacitor (RC) combinations which exhibit interlacing pole- 
zero patterns on the negative real axis [91, [ 12|, [ I3| and non-positive 
real plant-dependent multipliers [4], [7|, [23]. 

In the SI SO case, the frequency domain condition in Theorem 3.1 
has an interesting geometric interpretation. Specifically, setting 
= + ,)?/, He S(jjj) > 0 is equivalent to 

- v>0. (3.19) 

Ii 

Condition (3.19) is a frequency domain stability criterion with a 
graphical interpretation in a modified Popov plane, involving Re C> 
and u,'~‘ 1m G, in terms of a straight line with a real axis-intercept 
— 1 /// and slope — 1/A r . 

Since the condition presented in Theorem 3.1 is only sufficient 
for absolute stability a natural question that arises is for what class 
of systems will this criterion give less conservative predictions over 
the classical Popov criterion. In order to address this question first 
recall that the effect that the Popov multiplier 1 + A*.s has on the 
Nyquisl plot is to rotate each point on the Nyquist plot in the counter 
clockwise direction, Hence, if the Nyquist plot of the plant transfer 
function G(s) enters the second quadrant then it is clear that there 
does not exist an N such that (1 + Ns) G(s) is positive real. Thus, in 
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Fig 2. Popov analysis 


this case, the Popov criterion does not provide any improvement over 
the positive real test (see the example in Section IV). Alternatively, 
since the effect of the proposed multiplier 1 + .Vn" 1 is to rotate each 
point on the Nyquist plot in the clockwise direction, the criterion in 
Theorem 3.1 will always give less conservative predictions over the 
Popov criterion when the Nyquist plot of G{s) resides in the first and 
second quadrants. For example, since the Nyquist plot of the class of 
third-order lransfei functions given by 


no- 4 * 1 + ms 2 + + o) 

where an > 0, ai > 0, nj > 0, a j > 0, and a_>«i > ao«i Will 
always enter the second quadrant, the proposed criterion would give 
less conservative predictions over the Popov critenon Of course, 
using similar arguments as above, if the Nyquist plot of G(*) resides 
in the third and fourth quadrants then the proposed criterion would 
not give any improvement over the positivity entenon while the 
Popov critenon would give less conservative predictions. Hence, the 
utility ol the proposed criterion is when the Popov critenon fails. 
Finally, it should be noted that the more general class of multipliers 
consisting of the RC and the RC class [9], [12], [13] place less 
restrictive conditions on G(s) and hence allow the Nyquist plot to 
reside in all four quadrants. As a result of the more involved multiplier 
construction, howevei, the resulting frequency domain conditions 
provide a complex graphical interpretation involving frequency- 
dependent off-axis circles in the Nyquist plane [9]. Alternatively, if 
the classical off-axis circle criterion is used where a single bounding 
circle in the Nyquist plane is employed [14] as opposed to a family 
of frequency dependent circles then conservatism will be introduced 
in the stability predictions. 

Remark 3 l Note that the class $ of nonlinearities becomes larger 
as /< increases. In fact, as ft increases the strict positive real condition 
(3.5) becomes more difficult to satisfy, as expected. Furthermore, 


even though the frequency domain condition in Theorem 3.1 does not 
involve an explicit sector constraint on the nonlineanties c>(//), the 
requirement that $ 6 <I> implies that 0 <<‘>,(y,)y, <fi l yf, y, 6 R, 
/ = 1 , ■ ■ ■ , w. 

Next, we partially relax the assumption det C'A~ { B ^ 0 and 
provide an alternative Lyapunov function construction for the absolute 
stability criterion given in Theorem 3.1. The following result does not 
require a system augmentation of the form (3.12), however, in this 
case we assume that every element ol the stable transfer function 
G(s) has at least one zero at the origin, i e., (7(0) = 0 

Theorem 3 2 Let 


'A 

B 

C 

0 


be asymptotically stable, let X = diag [*V|. \\>, * , Y,„] be 

nonnegative-defimte, assume that (7(0) = 0, and define 

Z(s) = l+ Xs '. (1.20) 


Then 

£?(*) = /«' ' +Z[s)G{„) 0.21) 

is strictly positive leal, ll and only if there exists matrices 1\ L, and 
H' with P positive definite satisfying 

0=a'F+ PA + L j L. (3.22) 

U = D r P- \CA- ] -C + W'L. (3.23) 

0 = 2//"' - II' 7 ir. (3.24) 


In this case 


vo) = j'r.i +2 


±r 

i=i Ju 


X, (]<T 


(3.25) 


where y(1) = C.i(t ), is a Lyapunov lunction that guarantees that 
the negative feedback interconnection of (7(s) and o is globally 
asymptotically stable lor all q E 

Proof The proof is similar to the proof of Theorem 3 1 □ 

Remark 3 2 Note that since G(*) is asymptotically stable .4 -1 
exists. Furthermore, in order lo construct the proof of Theorem 3.2, 
it is helpful to note that since every element of the stable transfer 
function G[s) has at least one zero at the origin, CA~ 1 B = 0 and 
CA-'G*! - A) ] B = 


IV. Illustrative Numerical Example 


For illustrative purposes we consider the stable linear system with 
transfer function 


G(«) 


- 0 . 2.1 - 0.1 

, 1 + 2 s 2 + s + 1 


(4.1) 
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he closed-loop system (1 1) with linear uncertainty &(y) = Fy 
asymptotically stable for 0 < F < 4 6 The Nyquist plot for the 
near system is shown in Fig 1 Hence it follows from the positivity 
icorem that the linear system is asymptotically stable for all time 
nananl monotone nonlineanties in the sector [0 1 24) Fig 2 shows 
ie i orresponding Popov plot which in this case gives a Popov sector 
I [') 1 24) Hence since the Nyquist plot of G(s) enters the second 
(iiadrant the Popov criterion does not provide any improvement over 
lit positive real test Finally using Theorem T 1 we construct a 
modified Popov plot shown in Fig 1 Using the modified Popov 
exclusionary half plane graphical test given by (T 19) the absolute 
stability sector is now found to be [0 4 6) which is a significant 
improvement over the positivity and Popov sectors lor time invariant 
monotone feedback nonlineanties 

V Conc 11 SION 

In this paper we extended the SISO absolute stability criterion tor 
locally slope icstnctcd monotonic nonlineanties developed in [ i 8] to 
multivjnable systems containing an arbitrary nurnbei ol monolonic 
slope bounded nonlineanties Specifically explicit Lyapunov func 
lions along with extended Yakubovich Kalman-Popov conditions 
ire eiven These results can be used to synthesize robust feedback 
controllers in the spirit of |6| |8| |9| 
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Simultaneous Disturbance Rejection and Regular Row by 
Row Decoupling With Stability: A Geometric Approach 

Juan Cailos Maitmez Garcia and Michel Malabrc 


Abstract —rhe simultaneous disturbance rejection problem and regular 
row bv row decoupling problem with stability Is solved here through a 
geometric approach It is shown in this paper that the combined problem 
has a solution if and only if each problem, separately, has a solution 


1 Introduction 

As lar as its geometiic setting is concerned the combined problem 
of disturbance rejection and input-output decoupling for lineai time 
invariant systems by static state feedback has been hist discussed 
in |1] and [3] almost 20 yeais ago This problem has been recently 
revisited in [9] while the structuial approach has been used in |2] in 
order to obtain easy lo verify solvability conditions 

As the main result ot the reseaich referenced above a nice property 
has been found Indeed when no stability constiamt is imposed it 
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has been established that the combined problem is solvahle if and 
only if each problem, separately, has a solution. Intuitively, when the 
internal stability of the closed-loop system is required, this separation 
solvability should still hold. To our knowledge, this result has never 
been established. The aim of this paper is to bridge that gap through 
a geometric approach. 

First of all, we shall present some basic tools and the geometric 
solvability conditions of both the regular row by row decoupling 
problem with stability and the disturbance rejection problem with 
stability, where the notion of supremal internally stabilizable (.4, 
Z?)-invariant subspace will play an important role. 

In the second part of the paper we shall present the solution of 
the combined problem, proving that regular row by row decoupling 
with stability can be obtained using a static state feedback control 
law chosen in a particular family (built up from a particular set 
of internally stabilizable (A, i?)-invarianl subspaces), which also 
rejects automatically the disturbance, if this latter problem is solvable. 
This “separation property” is proved under the assumption that the 
System without disturbance is controllable. This is indeed the usual 
assumption when studying decoupling. 

II. Basic Concepts 

Let us consider a linear time-invariant system described by 

Ht) = A.v(t) + Bu(t). t >0, 
y(t) = Cr(f), t> 0 {i} 

with A: A' —► A', B\U —♦ .V, C:.V —* y (dim(.Y) = /?, 
dim (U) = //>, dimO’) = p) and denoted (A, B , C). 

In what follows we shall use the following notation: for a given 
map, say /?, we shall denote B its image. For a given map, say C\ 
the kernel is noted as KerC. 

A subspace V C X is said to be (A. I?)-invariant if and only 
if AV C V A B . We shall denote A(A, B\B) the family of all 
the (A. /?)-invariant subspaces included in the subspace 5 C A’. 
If V € A(A, B,B), there exists at least a map F : A —* li (called 
a friend of V) such that (A + BF)V C V C 5. The family of all 
friends of V is written as F(VL 

Let V € A, B\ S) and let us denote Rv the supremal ( A,B )- 
controllability subspace contained in V. Then, the following state¬ 
ments are equivalent: 

• V € A(A, B;S) is an internally stabilizable (A. Z?)-invanant 
subspace. 

• There exists a map F 6 F( V) such that rr{ 4 + BF | V) C CL, 
where A A- BF | V denotes the double restriction of A + BF 
to V and C_ is the open left-half complex plane. 

• rry := v(A -f 13F) satisfies <ry c CL, where A + J3F is the 
map induced in V/'Rv by A + BF (<r v is independent of the 
choice of F 6 F( V), see Property A.l m the Appendix). 

Assuming that (A, B) is stabilizable and being V € A( A. B; S) an 
internally stabilizable (A, U)-invariant subspace, there exists at least 
a map F ; X —► U belonging to F( V) such that rr(A A BF) C C_ 
(see for instance [10]). In that case, we shall say that F belongs to 
the family F*(V). 

A set of (A,U)-invariant subspaces {Vi, V 2 ,■ • •, V*} is called 
compatible if there exists at Least one common friend for all of 
them, i.e., f]Li F( V,) ^ 0, If the family {Vi. V 2 , ■, V;) is 

compatible, it is not difficult to show that their radical, say V defined 
as V := n?=i 51/*. V/* * s (-4, ZJ)-invariani and flLiFfV,) C 
it*, F(+ V) (see for instance [II, Chapter 10]). 

If flLj F(V,) = F(V)F(V, + V), then {V,,V,,-■■ ,V 4 J 
is called strongly compatible [8J. 


III. Problems Statements 

A, Decoupling 

Some Definitions: First of all we shall define the Block Decou¬ 
pling Problem via regular static state feedback (BDP). 

Definition 1 (BDP): Given a controllable system (A, B , C) and 
according to a given block partition \C \, C 2 , • ■ ■, C k } of C, the 
block decoupling problem is then defined as follows: 

“Find conditions for the existence of static state feedback control 
laws 1 #(f) = Fd‘(t) -I- Yl)-. 1 C7, i', (f) such that each block input 
(•,(/) completely controls Lhe block output ja(0 = C,.r{t) without 
affecting the k - 1 other block-outputs yj{t), j ^ 1 , under the 
constraint G = [G i G 2 ■ • ■ Gk] regular, i.e., G invertible.” 

The number of columns in each block G , is not fixed a priori. If F 
and G , regular, exist for the block partition (Ci. C 2 , • • ■, Ck [, then 
we shall say that (A, B . C) is a regularly block decouplablc system. 

Definition 2 (RBRDP )■ When k = p , BDP becomes the so-called 
(regular) Row By Row Decoupling Problem. Let us recall that in 
that case right invertibility of the system is an obvious necessary 
condition for RBRDP to be solvable. 

Definition 3 {RBRDPS): When we add the constraint rr(A + 
BF) C C_ to RBRDP we have the so-called (regular) row by row 
decoupling problem with stability. 

To give the solvability condition of RBRDPS in geometric terms, 
let us first introduce some notation. Let r, be the Ah row of C and 
lei us denote: 

• V*, respectively, V7, respectively, W7. the supremal (4. B)~ 
invariant subspace contained in KerC, respectively. Kerr,, re¬ 
spectively, fl,*, Kerr-,. 

• Kitti., respectively, V7st.ii.. respectively, W7„ ltlll , the supremal 
internally stabilizable (A. Zf)-invarianl subspace contained in 
KerC, respectively, Ken,, respectively, p| Kerr,. 

• 7v\ respectively, R*. respectively, 77, the supremal (A. B)- 
controllability subspacc contained in KerC', respectively, Ken 
respectively, K err V 

Then we have the following theorem. 

Theorem 1 [6. Theorem 3j. Assuming that (A, 71) is controllable, 
RBRDPS is solvable if and only if 

f i) v* = n v; 

|ii) n k: c v; tah 

Remark 1 It has to be noted that if RBRDP is solvable, R * 

is equal to the radical of { T* ,77. ■ ■ ■. T* [ (see for instance [4, 
Corollary V.l]). 

In an equivalent way, we have the following theorem. 

Theorem 2 [6, Theorem 4 / Assuming that (A, B) is controllable, 
RBRDPS is solvable if and only if 

'i> v* = fiv; 

< ,= l 
ii> v;„„, = nv*„ 1( , 

IS. I 

If RBRDPS is solvable we shall say that (A, B.C) is a regularly 
row by row decouplable system with stability. 

B. Disturbance Rejection 

Let us now consider the following linear time-invanant disturbed 
system 

- A.r(f) + Bu(t) 4- Ed{t), t > 0, 

?/(/) = C.r(f), t > 0 

where A, U, and C are as defined in (1) and E : V —► X 
(dim(P) = q). The disturbance input, r/(M, may or may not be 
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measured. We shall denote this system by ( A< B.C\ E). 

The disturbance rejection problem with stability (DRPS) is then 
defined as follows. 

Definition 4 (DRP-DRPS): Given a system {A. B.C\ E), find 
conditions for the existence of a static state feedback control law 
u(t) = Fj'(t) 4 Hd(t) such that y{t) does not depend on the 
disturbance (DRP) and the closed-loop system is internally stable 
(DRPS). 

In what follows we shall denote £ the image of E. The known 
classic geometric solvability condition of DRPS is given by the 
following theorem. 

Theorem 3 [II, Theorem 5.tip Assuming that [ A. B) is control¬ 
lable, DRPS is solvable if and only if £ C V* hlh 4 B. 

Note that Theorem 3 is still valid if (A.B) is just stabilizable. If 
the disturbance is not measured (H = 0), the well-known geometric 
solution is £ C V* iBh . 

Finally, as concerns the set where the solutions (feedbacks F) 
can be found, (if any), it is quite well known that, in both cases 
(disturbance measured or not), it is given by F.,(V* tll |,). 

C. Simultaneous Disturbance Rejection and Regular Row by 
Row Decoupling Problem With Stability (SDRDPS) 

Let us consider again the disturbed linear-time invariant system 
(A.D.C.E) 

.<'■(/) = A.r(t) -f Bu(t) + Ed(t). t > 0 
y[t) = C.r(1). t> 0. 

Because of decoupling requirements, we shall assume that the sys¬ 
tem without disturbance (A, B. C ) is controllable and right invertible. 

Definition . 1 ) (SDRDPS): Find conditions for the existence of a 
sialic slate feedback control law u(t) = F.r(t)+Gc(t) + Hd{t). with 
(7 := |TV| Cl* ■ ■ ■ C7 /( ] regular and r{1) = \rj (lj v? (t) ■ ■ ■ r,! (t)] 1 , 
where l',(f) are block input vectors (since m > p) and such that 
!j(t) does not depend on the disturbance, the closed-loop system is 
decoupled from r(t) to //(/) and is internally stable. 

IV. Solution or SDRDPS 

For that solution, we shall characterize a family of control laws 
which solve SDRDPS. Before that, and in order to introduce our 
extension, we shall quickly recall the formal statement of the decou¬ 
pling problem and the way the combined problem is solved, when 
no stability constraint is imposed. 

To express the objective of decoupling formally, let {7i, 
7, • • •, T t , ( be the set of subspaces which /',(/)./ E {1. 2, ■ ./>}, 

can control. Thus 7, is the (A. Z?)-controllabilily subspace given by 
l - (A + BF | B\mG,).i e {1.2, - . 7 ^- 

Now, the objective of decoupling can be established as follows 
for instance [11, Chapter 9]): 

“Find conditions for the existence of compatible {A.B)- 
controllability subspaces 7, included in f] j ^ i Kerr, and satisfying 
I 4 Kerr, = .V, for all / € [1. 2, •./>}.” 

A set of controllability subspaces {7i, 7i. ■ ■ ■. T v } satisfying these 
1 onditions is called a solution to the row by row decoupling problem. 
Let us recall the following theorem. 

Theorem 4 [8, Theorem 7J: A regular solution to the Row By 
Row Decoupling Problem exists if and only if B = £n7]\ If 

{7|*, 7/. ■ - ■, T *} is a strongly compatible solution. 

A decoupling solution (if any) F : V —♦ U can be picked 

U P in the family f)!’. 1 F(7j*). G is usually directly deduced from 

im BGi := B n T* 

The first important result for the solvability of the simultaneous 
disturbance rejection and regular row by row decoupling problem 


when no stability constraint is imposed (SDRDP), is the following 
separation property. 

Theorem 5 (1 , Theorem 3]: SDRDP is solvable if and only ij 
both RBRDP and DRP are solvable separately. 

We shall here add the stability requirement. In this case, ask¬ 
ing for the solvability condition of RBRDPS is strictly equiva¬ 
lent to asking for conditions of existence of a nonempty fam¬ 
ily nLiT^T,*) C IX _1 F(7 r *). By Theorem 1. we know that 
nr=i F*(7,*) is nonempty if and only if f|'' , 77* C V*, . 

To solve the simultaneous disturbance 'rejection and row by row' 
decoupling problem with stability, we shall proceed as in the case 
when no stability constraint is imposed, i.e., we shall characterize a 
subfamily of F* (7'*), in terms of internally stabilizable (A. B )- 
invariant subspaces, in such a way that any map F : .V — 4 
14 belonging to that subfamily will also automatically reject the 
disturbance, provided that this latter problem is solvable. 

The result established by the following theorem will play a key role 
in the solution of the simultaneous disturbance rejection and regular 
row' by row decoupling problem with stability. Let VV* l(l |, denote the 
radical of {W’ 1 s ini>. VVJ Mtllll . W* sl , li( }. 

Theorem 6: Let (A.B.C) be a regularly row by row decou- 
plable system with stability. 'There then ajwavs exists a common 
stabilizing friend^ for the family { W*, nh 4 VV* slah . 4 

The proof of this theorem will rest upon the following results, 
which proofs are in the Appendix. 

Theorem 7: Lei both Vi, V 2 be internally stabilizable (A. B)- 
invariant subspaces such that Vi C VV Then, for any F\ € F(V| ) 
such that <r( A 4 BF\ | Vi) C C_ , there exists a map Fj £ F(Va) 
such that F 2 | Vi = F\ | Vi and ^(A 4 BF> | Vo) C C. , i.e., 
F> is such that (A 4 BF 2 )V 1 C Vi, (A 4 BF>)V> C V 2 and 
rr(AA BF 2 I Vl>) C. C_ . 

Lemma l: Let (A, B, C) be a regularly row by row decouplablc 
system with stability, then VV* /fl/l = f)!_i W7- V*,,^. 

Property /. I-el (A, B. C) be a regularly row by row decouplable 
system with stability, then A’ = . 

Proof of Theorem 6: From Property 1 we have that the state 
space X can be partitioned as 



Note that ( W* Ull , 4 £/- j VVJ Malj ). for all / € {1.2, ■ ■ ■, f, is an 
internally stabilizable (A, B) -invariant subspace. Since VV* lall is an 
internally stabilizable (A, B )-invarianl subspace (indeed, we have 
from Lemma I that = V* la i,L we £an always find a map 

fo E F(VV*. lh ) such that rr( A 4 BFn | VV* (1I ,J£C C_. 

By Theorem 7, there exists a map F\ € F(VV^ lili 4 W7 K ,.,h) such 
that 

Fi I >C,., = Fo | Ch 

and ( r(A 4 BF\ | (VV* ulj 4 VV**■»,.)) C C . 

Applying again Theorem 7, we can find a map Fi G F(H7i i( i. 4 
Wr. s , Hl , 4 W 2 % lltl) ) such that 

Fi I (VC., 4 wr wlRb ) = Ft I oC,., 4 w: 

and 

<r(A 4 BF 2 I (VC.,. + C C.. 

Following the same procedure we obtain a sequence of maps 

Fi .€ F| w; uil) 4 W 7hui. 

\ 


V/ € {1.2, ■ * ./>} 



m 
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such that 


F< 


(wu + £ w;,„ h j = F ,-, | (w;^, 



and <r(A + BF, \ W; lllb + £; =l w; <ub ) C C_. 

Finally, F = F p is the desired map. 

From Lemma 1 we note that W* uh C for allj E 

{L2, ■ ■ ■,/>}. Consequently PL, F(VV;„ fab ) = (7,U F(W; itlb 4 

Thus, the result established by Theorem 6 can be rewritten as 
follows. 

Theorem 8: Let (.4, D % C) be a regularly row by row decouplable 
system with stability, then the family F,(VV; ifl jnHL] FA W** f(l6 ) 
is nonempty. 

Before considering the Simultaneous Regular Row By Row Decou¬ 
pling and Disturbance Rejection Problem with Stability, let us first 
clarify the connection between the family F fl (W* Mtttb ) and the 
family f|f=i F*(37) (proved in the Appendix). 

Lemma 2: For any system (A<B,C), the following inclusion 
always holds 

F (W r \„J C F(7;*), V?€ {1,2,•••,?}. 


An obvious corollary is the following. 

Corollary l: FAWt^CFAV). for all / € {1,2,■ ■ ■ ,ji}. 
We can now present our main result. 


A. SDR DPS. The Final Result 

Theorem 9; The simultaneous regular row by row decoupling 
and disturbance rejection problem with stability has a solution if 
and only if both the regular row by row decoupling problem with 
stability and the disturbance rejection problem with stability are 
solvable separately. 

Proof of Theorem 9 ■ Necessity is obvious. For sufficiency 
suppose that system (-4,21,0 is regularly row by row decouplable 
with stability, then by Theorem 8, the family F a (W* tah ) n 
(X»i F «w* slab ) is nonempty. By Corollary 1, the inclusion 
F,(W,\ uh ) C FAV), for all / 6 {1.2,- always holds 
and consequently 


p 

Oov;.. 

i — 1 


D F '(T). 

l-l 


Since 

MChin nr.(w;, llb ) c f)P.(w ( U) c f|p.(D 

,zil i = 1 

any map F 6 F„(W* rtl) ) 0 f]!-, F,(W** (rtb ), with a suitable map 
G : U —* 14 1 will decouple, while insuring internal stability of the 
closed-loop system. 

Now, if the disturbance rejection problem with stability is solvable, 
any map F E F,(VV; ub ) n (T /-1 with (eventually, 

depending on the fact that d(t) is measured or not) a suitable map 
H : D —► 14 , also rejects the disturbance. Indeed, from Lemma I 


we have F(W; uh ) = F(V; iall ) and then 

V 

F -(>v; ub )nf] F ,(>v;. tBb ) 

»=i 

p 

= P.(K.b)nf|P.(w:„. h ) c f.(v; UI) ) 

i=l 

which ends the proof. 

V. Concluding Remarks 

We have considered here the simultaneous disturbance rejection 
and regular row by row decoupling problem with stability (SDRDPS). 
We have proved that the separation property (which states that the 
simultaneous problem is solvable if and only if each problem is 
solvable separately) is still true when internal stability is required. 
This has been done under the non restrictive assumption that the 
system without disturbance is controllable. The weaker assumption 
of stabiltzability should be sufficient, but most of the results related 
to decoupling which have been used here had been previously stated 
within this controllability assumption. Adaptation to this broader 
situation should be quite easy, just leading to heavier development. 

Thanks to this result, it will be possible to exploit the recent 
geometric and structural solutions of DRPS [5, Theorem 7] and 
RBRDPS [6, Theorem 6| to propose a synthetic solution for SDRDPS 
in terms of infinite and unstable zeros. 

This will be detailed in a future work. 

Appendix 


A. Proof of Theorem 7 

It will rest upon the following property 

Property A l [11, Theorem 5 7 and Corollai v 5 2] Let V E 
MA.B;X) be given. For any F E F(V) write .4 4 BF foi 
the map induced in V/Rv by -4 + DF. Then -4 4 BT is 
independent of F E F(V) and rr(A + BF | V) = oi Uir V \ where 
(t i := (t(A 4 BF | Tvy) is freely assignable by suitable choice 
of F E F(V) and rr v ;= rr(A 4 BF) is fixed for all F 6 F(V) 
A subspace V E A(A,B:X) is thus internally stabilizable if and 
only if <ry C CL. 

Property A. 2 [7, Le mma A 2]; Let V ]t V> € A( A, B\ X) be 
given. Let us denote A 4 BF 2 the map induced by A 4 BF 2 in 
V 2 /V|. If V\ C V, then for any map F, E F(Vi ), there exists 
a map F 2 E F(V 2 ) su ch that F\ | Vj = F\ | Moreover, if 
V 2 = V, 4 Rv ,. then cr(A -f BF 2 ) is freely assignable by suitable 
choice of F 2 E F(V 2 ) and ov, C cr Vl . 

Proof of Theorem 7 ■ Let Fi E F(Vj) be a map such that 
(r(A 4 BFi | Vi) C C_. By Property A.2, we can choose a 
map Fc E F(C := Vi + 7 ly 2 ) such tha t Fc | Vi = Fj | Vi 
and o(A 4 BFe) C CL, where A 4 BFc is the map induced by 
-4 4 BFc in C/V i. Applying again Property A.2, wc can choose a 
map Fi E F(V 2 ) such that F 2 | C = Fc \ C. 

Let us now denote -4 + BF 2 the map induced by .4 4 BF 2 
in V 2 /C. Note that <t(/4 4 BF 2 | V 2 ) = (r(A 4 BF\ \ 
V, )U <t(A 4 BFc )Ufr(A + BF 2 ). 

Since the internal stabilizability of the subspace V 2 means that 
(tv 2 C CL (see Property A.l). and since <rL4 4 BF 2 ) C <tvj, we 
obtain (r{ A 4- DF 2 | V 2 ) C CL , which ends the proof. 

B . Proof of Lemma 1 

To prove the result established in Lemma 1 we shall use some nice 
properties of regularly row by row decouplable systems with stability. 
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Property BA: Let (A, B,C) be a regularly row by row decou- 
table system with stability, then W; rt . ah , for all 

e (1.2.-4 

Now, to prove this, we shall need some properties of control- 
able regularly block decouplable systems. Let us first complete our 
previous notation. 

Let C be the C matrix without the ith block, noted C t , and let 
us denote: 

■ F^KerC,). respectively, T^KerC',), the supremal (4*1?)- 
controllability subspace contained in KerCfe respectively, 
KcrF,. 

Property B.2 [4, Properties V.l and V.3): Let (.4, B, C) be a 
controllable regularly block decouplable system , then 

k 

]P T’(KerC,) = X and Tl^KcrC,) = ^ 7'*(KcrC i J ), 

i— 1 J*< 

Vie{l,2,■■■,*}. 

Property B.2 let us write the following. 

Property B.3: Let {A. B, C) be a controllable regularly row by 
row decouplable system. For any j G I and any :6r, where 1 and r 
are any sets contained in {1,2.--,p) such that I D r = 0, we have 

n v ; n v * ■«<•*=- v - 

>ei .er 

Proof of Property B.3: Built a block, say C'h , , formed with the 
rows of C which index belongs to 1 and built a block, say Ch z , 
formed with the rows of C which index belongs to r. Let us now 
denote the supremal (A, 7J)-controllabiliiy subspaces contained in 
Ker Ch, and KcrCfe by (KerC« v ) and R* Bz {KcrC Bz ), 
respectively. From Property B.2 we have R* H (KerCfe) + 
'R-h, (KerCw.) = .V. Since K* Hj (KcrC Hj ) C Vj 

and R' h (KcrCn;) C fl^er v * we get fl >e i V*+ 

a fr v: MUl) , = .v. 

Proof of Property BA: Note that Property B.l obviously holds 
lor p = 2. Let us give the proof for p = 3. 

Since (.4. B , C) is regularly row by row decouplable with stability, 
the same obviously holds for any selection of outputs within C, 
namely for (4,1?,C\), for all i G (1.2, •••,/)}. Theorem 2 thus 
implies that VVfe,*,. = f| Mj K for any j = 1.2. p. Then 

Y w; Hi * u — vvj hi ah + vvjnnu, 

/- > 

= vf Mlab n vj *, llb + vr 8 ,.hi, n v; s i* b . 

From Property B.3 we have that VJ* ulj + V7 Ml;ib H VJ Hll „ b = -V 
consequently V7 slab IT (Vfe.*,, + IT V^ub) = V,\ lah D 
^Uiab-bVjV^.nV^,,^ = V7 Hlllli nA’ = V7 Hlftll . From this follows 
,hai iij?i = vr. u ,b- 

In a similar way we can show that W* ala \ t = VI sub anc ^ 

Wj Nlrti. ~ Vi Hi till’ 

A similar treatment can be applied for the case p > 3. □ 

We can now achieve the proof of Lemma 1 

Since (A, B, C) is regularly row by row decouplable with stability 
1 is easy to see from Theorem 2 that VVfe,. ab = ffe/, Vfe lab , 
0r all i 6 {1,2, •■■,;!}, and consequently Dlfei W.**ub = 
D j^, V r*thb ~ ni' = , =: V; tHb . From this. Property 

‘3 1 and Theorem 2 we have 

:= few;.. h = V ’>«b = f| □ 

J?< 1 I = J 


'C. Proof of Property 1 

Direct from Property B.2. Indeed 

'■ l ' = £ T -* c !>'<*»< »•■ 
7^1 V-l 


D. Proof of Lemma 2 

Lemma 2 is direct from the following results. 

Theorem DA: Let V* illh {S) be the supremal internally stabi - 
lizable (4, B)-invariant subspace contained in S and let f R*(S) 
be the supremal [A, B)-controllability subspace contained in S. 
For any F t F(V*, flfl (S)) n<? have 

n'(S) = (A + BF\BnV: lt JS)). 

Proof of Theorem DA: Let V*(S) be the supremal (4,13)- 
invariant subspace contained in *!>. Note that 

b n v; Uxh (S) = ts n v*(5) = b n 

Indeed, C ^nv; tllh (5) C BOV*IS) = BMV{S) (see 

[J 1, Theorem 5.5J). With F G F(V; iab (S)), write 7? := (4 + BF \ 
snv; tnh (S)). 

Property A.2 stales the existence of a map F* G F(V*(5)) such 
that F I v; iab (^)) = F* I V; iixh (S)). Then R = (4 + DF | 
B n V;, hb (5)) = (4 + BF’ \ BflV’lS)) =: 7?*(S), since 
R, m (S) = (4 + BF' | BnV(S)). for any F G F(V*(S)) [11, 
Theorem 5.5], which ends the proof. 

Corollary DA: 

F(V; uih (S))C.F(K*(S)). 

References 

11] M. Chang and I. B. Rhodes, “Disturbance localization in linear systems 
with simultaneous decoupling, pole assignment, or stabilization," IEEE 
Trans. Automat. Contr.. vol. AC-20, pp. 518-523, Aug. 1975. 

[2] J. M. Dion, C. Commault, and J. Montoya, “Simultaneous decoupling 
and disturbance rejection: A structural approach,” Int.J. Contr., vol. 59, 
no. 5, pp. 1325-1344, 1994. 

131 E. Fabian and W. M. Wonham, “Decoupling and disturbance rejection,” 
IEEE Trans. Automat. Contr,. vol. AC-20, pp. 399-401. Mar. 1975. 

141 S. Icart, J. F. Lafay and M. Malabre, “A unified study of the fixed modes 
of systems decoupled via regular static state feedback,” in Proc. Joint 
Ctmf. New Trends in System Theory, Genova, Italy, 1990, pp. 425-432. 

|5] M. Malabre and J. C. Martinez Garcia, “The modified disturbance 
rejection problem with stability: a structural approach,” in Proc. Second 
European Contr. Conf. (ECC '93). Groningen, The Netherlands, 1993, 
pp. 1119-1124. 

|6] J. C. Martinez Garcia and M. Malabre, “The row by row decoupling 
problem with stability: A structural approach,” IEEE Trans. Automat. 
Control, to appear, 1995. 

[7] A. S. Morse, “Structure and design of linear model following systems.” 
IEEE Trans. Automat. Contr., vol. AC-18, pp. 346-354, Aug. 1973. 

[81 A. S. Morse and W. M. Wonham, “StaLus of noninteracting control,” 
IEEE Trans. Automat. Contr., vol. AC-16, pp. 568-581, Dec. 1971. 

[91 P. N. Paraskevopoulos, F. N. Koumboulis, and K. G. Tzierakis, “Dis¬ 
turbance rejection with simultaneous decoupling of linear time invariant 
systems,*’ in Proc. First European Contr. Conf ( ECC ‘91), Grenoble, 
France, 1991, pp. 1784-1788. 

[101 I- M. Schumacher, “A complement on pole placement," IEEE Trans. 

Automat. Contr., vol AC-25, pp. 281-282, Apr. 1980. 

[111 W. Wonham, Linear Multivariable Control; A Geometric Approach. 
New York: Springer-Verlag, 1985. 




m IEEE 

Robust Controller Design for 
Delay Systems in the Gap-Metric 

Akira Kojima and Shintaro Ishijima 

Abstract —^ robust stabilization problem in the gap-metric is discussed 
for a system ‘with delays in control. We derive a design procedure of 
robust controllers based on finite-dimensional Ricattl equations and it 
is shown that the resulting controller has a structure of observer-based 
predictive action. By employing completing the square argument, a game 
theoretic Interpretation Is also provided on the trade-off between the 
initial-uncertainties and attenuating the disturbance caused by plant 
uncertainties. 

1. Introduction 

In recent years, a robust stabilization problem in the gap-metric has 
received much attention [7], [17] and, for finite-dimensional systems, 
it is recognized that the design procedure enables us to take a trade-off 
between the robustness and the performance of closed-loop system 
[18]. Moreover, this argument has a merit to obtain a control law in a 
closed formula, which has a well-known feature of LQG-controllers. 

For a class of infinite-dimensional systems, the robust stabilization 
problem in the gap-metric has been studied by various authors 
[3], [8], [19|. In [31, the maximum robustness margin and the 
admissible controllers are derived based on an inpul/output argument; 
alternatively an operator theory derivation is applied in [8] and 
provides a number of results on the gap and graph metrics. Especially 
for a system with delays in control: E p(s) = I?(ft) • a closed 
formula is given for the evaluation of maximum robustness margin 
119). 

Even for the time delay systems, however, the design procedure 
of robust controller is not solved in the general setting. In [81, the 
procedure is derived for single-input/single-output (SISO) first-order 
input delay systems and, in [19], a sequence of finite-dimensional 
controllers, which approach optimality, is given for SISO retarded 
delay systems. Very recently in the preparation of this article, 
alternative design procedure is reported for SISO input delay systems 
[6] associated with [8|, It should be also noted that, even if we employ 
optimization method for the infinite-dimensional systems [111, 
[24], the computational work needed to check the solvability involves 
quite enormous calculations. 

in this paper, we focus on the following class of input delays 
systems 

Er: r(t)=Ar(t) + Bti(t-h) 
y(t) =C.r(t). 

u(t)eir\.v(t)eir\y(t)eB r 

and consider the robust stabilization problem in the gap-metric. Our 
objective in this paper is to derive a design procedure of robust 
controllers based on finite-dimensional Riccati equations. The derived 
control law has a structure of observer-based predictive action 

u(t) =s Kox{t) -1- I h i ('j)u(t + 
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and the procedure does not require the restrictions on the degree 
of plant and the number of input/output channels. A game theo¬ 
retic interpretation is also provided on the trade-off between the 
initial-uncertainties and attenuating the disturbance caused by plant 
uncertainties. 

In the following, we first reformulate the robust stabilization 
problem as an H 00 output feedback problem for time delay sys¬ 
tems. Then applying completing the square argument of particular 
quadratic forms [22], [I3J, [2], we directly derive a robust control 
law based on auxiliary operator type Riccati equations. The approach 
employed here also enables us to clarify the corresponding worst- 
case disturbance and, further, provides game-theoretic interpretations 
on the trade-off between attenuating the disturbance and the initial 
uncertainties. Finally, we show that the solutions to the operator-type 
Riccati equations are constructively given based on finite-dimensional 
Riccati equations and summarize the design procedure. 

11. Formulation and Preliminaries 

For a system with delays in control 

E/•: = A.r(1) T Bu(t-h) 

y(t)=C.r(t ), (!) 

nit) er,.r(f) € n\i/(t) e ir 

let := m*) ■ e-’ h = At 1 (A' ■ <•"“*) and pj. .YJ be the 

normalized left coprime factorization of the rational pari /?(.,). Then it 
is clarified by [19] that the maximum robustness margin is analytically 
given in the following way. 

Fact 1 (Partington et al. //9/) Let [A\ M] := [0, I] + Cn (si -- 
An P 1 [li//, Hh ] and be the largest solution to the equation 



where Pn and Qu satisfy the Lyapunov equations respectively 
A„P,i + PnA/t + Bn Oh 4- Hulli, *0. 

AiiQn + Qn An + C(,('a = (I, 

Then the maximum robustness margin in the gap-metric is given by 



The analytic result stated above provides deep insight on the effect 
of time delay h and, further, promises the first stage to attack the 
robust controller design problem in the gap-metric. Our objective in 
this paper is to derive a design procedure of robust controllers in the 
framework of finite-dimensional operations. 

Robust Controller Design Problem 

For the time delay system Ep, derive a design procedure of robust 
controllers such that the closed-loop system is stabilized against the 
perturbations {A ;= .A = [Aa/, Av]} 

n^(-s) = (M + Am F j (-V ■ r~ ,k + A*) 

{A = [A W .A,V]:A€ ff* ||A|U<0. 

f < < 1 (2) 

where [A/, JV] are the normalized left coprime factors of the rational 
part /?(*) and {Ep} indicates the plant set with the radius t in the 
II gap-metric. 


0018-9286/95$04.00 © 1995 IEEE 
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On the realization of nominal plant Up (1), we make a following 
^sumption: 

Al) (C* A, B) is observable and controllable. 

The robust stabilization problem stated here is well posed as an 
7^ optimization problem to find a stabilizing law K(*) which 
ali sties the following condition 

7 


(7 -2hK)- 1 M-' 


<e 


( 3 ) 


Moreover condition (3) is equivalently reformulated as follows 

7 

K 

j 

i 

h~ 


(1 - T, P K)~ l Sr 1 [M. N • f -’ h ] 

(i -Up 


<€ 


(4) 


where the delay term c” fl/l in the left hand side is artificially 
introduced in order to finally reduce the design procedure into finite¬ 
dimensional operations. 

In the sequel, we define an augmented time delay system associated 
with (4) 

,v(t) = A.r(t) + J3ir 2 (f - h) + Bu(t - h) 

zi(t) =C.r(/) + ir,(/) 

-a(f) ~ u[t) 

y{t) = C.r(t) + fr,(/) (5) 

and discuss an output feedback problem for the system E. 
Namely, the problem is to find an admissible output feedback law 
such that: 

Cl) the closed-loop system is internally stable, and 
C2) the closed-loop transfer function from the disturbance w := 
[ii'i , to the regulated output r := [-J , ] 7 

satisfies ||D-m-|U. <7 for a given constant 7 := C \ 

t <>,.„/ < 1- 

By employing a state-space setup investigated in [10], 120J, [21], 
we next describe the lime delay system I! on appropriately defined 
Hilbert space. On a Hilbert space .V := /?" x LA —K 0 ; R'") endowed 
with the inner product 


(</'. 0 ) := 


/■o 

, or 0 , / 

” 0 + L 



k°l 


0° 

— 

y 

6 -V, o — 

[o'J 




€ -V 


( 6 ) 


the augmented system E can be written in a form of an evolution 
equation [JO], [20], |21] 

E: .r{1) ~A.v(t) + Si/'a(0 4* Bu(t) 

7(M =i/(0 

y(t) =c’.f(0+ u'i(f). (7) 

“ he operator A is an infinitesimal generator defined as follows 

r.v + Bo x {~h)' 


A 0 = 




PM) ={o € .V : r/> J E U t1,2 (-/i,0: R 1U ). o' (0) = 0} (8) 

here J denotes the Sobolev space of i? 1 ' 1 -valued, absolutely 
untinuous functions with square integrable derivatives on |—/#, 0). 
he adjoint operator A* is given by 


= 


-v 




and, extending the state space A' to V := D(A*)\ it is verified i\-, 
PvM) = X and the separate Hilbert space V*. .V and V arc ahi 
continuous, dense injections satisfying 

VcA'cv (A = .r>. 

The input/outpul operators B and C are defined as follows 

S: 77"' -+ V, BV = *'•' (0) (*• g V* = PU*)) 

,V -» 7?'. C’<? = Co 0 (</> 6 ,V). (|0) 

III. Robust Controller Design 

In this section, we first construct an admissible control law based on 
auxiliary operator type Riccati equations. By employing completing 
the square argument of particular quadratic forms [22], [131. [2], the 
control law is directly derived together with the corresponding worst- 
case disturbance. Then it is clarified that the solution to the operator 
Riccati equations are constructively given based on finite-dimensional 
Riccati equations. 

The admissible control law for the posed H x output feedback 
problem is formally characterized as follows. 

Theorem 2: Let V > 0 (V € C(X)) and 5 > 0 (5 € £( V.V*)) 
he the stabilizing solutions to the operator Riccati equations 

PA*0 4- APt - TCCVy 4* BB* p = n. </• 6 V* (II) 

S ( A+ ^i vc ' c )° 

+ ^ • peej ■ so - sets’so 

+ -L_. spc'cpso + -f—re = o. 

o 6 -V (12) 

then an admissible control law satisfying (Cl) and (C2) is given by 

t/(f) = -B*Sx(t) 

x(t) = Ax(t) 4- Bu(t) 4- VC*(y{t) - Cx{t))< 

i(0) — 0. (13) 

In saying the stabilizing solutions V and we mean that the 
operators A - VC*C % A - BB*S 4- (1/(7 2 - 1)) ■ PC*C(I + PS) 
generate exponentially stable semigroups. 

Remark 1: For the linear operator E £(V\ V 4 ), the positive 
definiteness 5 > 0 requires that S = S* £ C{V , V 4 ) and, for given 
6^0 (o € V), (o, Sc>)vy> >0 hold. The properties of Hilbert 
adjoint operations between V and V* are found in [24, Section 2.5], 

Before describing the complete proof, we give here some pre¬ 
liminaries and motivations. The Riccati equation (11) is with the 
unbounded operator BB* and the structure of solution is not generally 
clarified [20], However, in case we deal with the augmented system 
E, the solution P > 0 (P E C(X)) is constructively given based on 
finite-dimensional Riccati equation. 

Lemma 3: Let P > 0 be the solution to the finite-dimensional 
Riccati equation 


PA r + AP - PC r CP+ DD r = 0 


( 14 ) 


then the stabilizing solution P > 0 (P E C(X)) to (11) is given by 


I’M*) = {<• € .V : V' 1 G W l ' 2 (-h. 0; 77"'). 

= 77 T V’ 0 } (9) 


V = 


P 0 
0 X 


€ C{X) 


and there exists a bounded inverse on .V. 


(15) 
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V2 


PtooJ Direct verified by substituting the solution (15) to 
(11) QED 

The above lemma guarantees the existence of bounded inverse 
P 1 6 C{ l) and enables us to directly apply completing the square 
argument associated with the operators 5 and P 1 (22] (23| 

Proof of 1 heoiem 2 Let t = i - p and describe the closed loop 
system with respect to the state x , ( 


lU) 


A - M'S 

rex ‘ 

Z(t) 

(it) 


0 

A - PCC 

r(t ) 


PC 

-PC*B 


o(D 


(t) = 


n(t) = 


c 

c 

*(f) 

-P'S 

0 

/(ft. 


1 
0 

o i (t) 
02 (f) 


0(f) 


At) = 


At) 

2(f) 


(16) 


In the following we will show that the closed loop system L 
satisfies the conditions Cl) and C2) 

Internal Stability CD Since «S > 0 and P > 0 are the stabilizing 
solutions it is easily verified that the operators A - SP*S and 
A - PC*C generate exponentially stable semigroups While the 
operator PC*C is bounded Hence it follows that the closed loop 
system is internally stable [12] 

Disturbance At'ttmotion C2) To verify the inequality 
||£ ll-x < m le II 11/ < IMli. 0 0 we hist locus 

on the restricted class ot disturbances 


H fc c '([o x) n 4 in/ (ii\ i( + ) (17) 


which is continuously differentiable and denote the disturbances as 
11 F ( 1 1 > ([0 x ) 7? + ) The space ( '/ ([() x) /? 4 ) is dense 
in f (0 \ R f ) and it is known that il x(0) r (0) f P(*4) 
and 0 6 C’HIO x) 77 f ) the solution [x(t) r (T)] (7 >0) is 
everywhere differentiable [4] Consider the functional 

l (ft =(x(t) Sx(ft) + * (ftft P '/(ft) 


on \ then differentiating boih sides with respect lo t inserting (II) 
and (12) we have 


1 (f) = -|| 
- ( 


+" 


- 1) 


INI ^ II" 'f|| 

0 1 q (\-, ^ C{ 1 + PS)t 


Integrating both sides with lespect to t over the interval (0 x ) we 
finally obtain 


(x( -v) Sx{ x)) + * * (f (x ) P 1 c ( x )) 

-<*((» Sx( 0» - - M0) P 1 (0)> 

= -|| 117. + II" 11/ - * 

||« V ||i - P - 1) 

u ] + - - - C(I + PS)X 

^ -1 


(IS) 


Since D is shown to be internally stable r( x) = < ( x) = 0 
and without loss of generality in the input output analysis wc can 
asume the initial state as x(0) = ( (0) = 0 Hence it follows from 
the equality (18) that 

IMk <" ll"ll/ . « 09) 

holds tor the continuously differentiable disturbances 
n € C 1 7 j ([0 x), R +r ) 11 f 0 


For given u € L 2 (0 x Ft ), there exists a sequence {// 

« 6 C 1 f j([0 x), 77 ,+ ' 1 )} which converges to u e L 2 (0 x 

R ft ) Hence it then follows from the fact that x(t) t(f) (f>0) 
continuously depends on {// } that (IV) holds foi all the disturbances 
u € I A 0 x 77 +,w ) 11 £ 0 QED 

Remark 2 Fquality (18) clarifies the worst-case disturbance in the 
/P output feedback setting The worst-case disturbance 1 / = o° is 
given by 

H 0 = [“ l 1 u V = -Cl + — l — C(I + PS)T 
II 2 J - 1 

ft *'*'* T*l I 
U = P ( 

and with the control (H) say 0 = u° the strategies (m“ m°) form 
a saddle point as follows [22J |H] 

/(" ") =11 Hi — II" 11/ 

7(// n u ) < 7( 1 # 0 0 ) < 7(u ti {) ) 


Remark 7 The completing the square argument employed here 
provide direct interpretations on the tiade off between attenuating 
the disturbance and the initial uncertainties 
Let i(0) = 1 a o(r) = iii(r) (-1/ £ r < 0) be the initial 
uncertainties of plant and apply completing the square argumtnt in 
like fashion Then we have lilt following inequality on the dislurbancc 
and Ihe initial uncertainty 

II 11/ £ II" 11/ + fl//’ ' ' +■ j II' (T)«( (7)//' 

1 e 


_ 11 il/ 

ll» 11/ + >! p 1 " + / 


- <r 

o f (t)i/ (t)(It 


( 20 ) 


In inequality (20) the solution P ' 0 to the filtering Riccati equation 
(14) plays a part ot weighting malnx between the uneeirainty caused 
by disturbance u and the initial plant uncertainty i A laigci 
P allows greater initial uneertamty » and the resulting closed loop 
system has favorable tune domain perlormancc 122J |ll| 

Howevei tor the time delay systems it should he nokd that the 
effeet of initial uncertainties in the input delay segment i/j( ) is 
independent to the choice of P and causes definite damage to the 
time domain performance 

The operator Riccati equation (12) has similar structure to those 
anse in LQ pioblem for delay systems [5] In |5| it is shown that 
the solution 5 > 0 (5 € £(V V*)) has a form of an integral kernel 
repiescalation {So Si S } 

(5o)° =S ( o+ j S,( <)«*( /)<// 

(5< )’ ( 1 ) — S,' (,.)< ° + j S (n /)»'(/)// 

-Il !.«<<) 


With the integral kernel representation of operator 5 € C(\ V*) 
the control law (H) is described by 

//(/) = h[)x(f) + j Aj(f)t/(/+ Dfll 

x(f)= Ax(t) + Du(t-h) + rC 1 (u(t) - C x(t)) 

7\o =-5/(0) A,( ) =--5.(() ) (21) 

and has a feature of observer-based predictive controller The feed 
back gams Ao and A 1 ( ) are the parameters defined by the integral 
kernel representation {S 0 Si Sj] 
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Next we will show that the stabilizing solution S > 0 (S G 
t(V, V*)) is constructively given based on finite-dimensional Ric- 
ati equation. The following result enables us to avoid the direct 
alculation of (12) (see also [23]). 

Theorem 4: Let M>{) (M G £(V, V*)) be the stabilizing 
solution to the operator Riccati equation 

MAo 4- A*M6 - MSB*Md> 4 C*C6 = 0 , 6 G X ( 22 ) 

then, for given constant 7>(1 + A m *x{MV)) l/2 , the stabilizing 
solution S >0 to ( 12 ) is described by 

S:=(l-i-(I + MP)) M. 

Furthermore, based on a solution M >0 to the finite-dimensional 
Riccati equation 

M A A A r M - MBB r M A C T C = 0 (23) 

the stabilizing solution M > 0 (A4 G £( V. V*)) to ( 22 ) is construc¬ 
tively given as follows 

r° 

(MO) 0 := Mo* 0 + / (24) 

(Mo)'(o) := A// (n) 0 U A / 

-h < n < 0 (25) 


Now, we will verify that the parameters {A/ 0 , A/, . M, \ mec , ; 
integral kernel representation of operator M. Using the represe.ii. 
tions (24) and (25), (22) is written as partial differential equation' 

Mo A + A t M 0 - A/, (0)3// ( 0 ) + C'c = 0 , 

Mu — A/n (26) 

A A r - A/,<0)A/ a (0.d) = 0. (27) 

-(^ + - A/,(a,0)A/,(0.,i) = 0. 

M 2 {n.ii) ~ M-] '(.l.o). (28) 

And further, since Mo € V* = V(A'), the following boundary 
conditions are required 

Mi (-/)) = Mo B. M 2 (~h.i)) = B r Mi (,1) 

(-/|<;3<0). (29) 

Hence, substituting directly, it is verified that the parameters 
{A/o, Mi . Mi } meet the integral kernel representation of operator 
M G £( V. V"). 

Next, wc will show that the solution {A/o. A/i. \I 2 } is positive 
definite and the stabilizing solution. Since the inequality 

/ o 

m r (£)C r Ctu(£)d£ >0 
w(i):=r AU+l ' ) 0 n A e AU ~ l3) Do'(,■))<!,) 


A/o =e Alh Mc Ah + f f Ar <t+ h >C r Cr AU+h) dZ 

J h 

A/,(il) =c ATh Mf- A1 DA I <■ * I{C+I,} C T 

■Cr A «-»DdZ 

M -2 (f i, ,1) = B r c~ AT "Mr- Ai, B + f B r e A T U-«) 

./mux (r.v./if) 

.C"c<- AU ~ :1) D,li. 


Proof: Based on (11) and (22), it is verified that the equality 
M (a A 77 ^ • PC'C^j (i - ~ ■ (I + PM ^6 

a(i-^-(I + MP ) j ^ + rr 4r ■ Pt ’* c ) Mo 


- MBB’Mo A — 


- 1 


.VtPC*tTM<> + — 


7 2 -l 


i • (I + ,VCP)jc’*C - ~ ■ (I + PM) j 


■6 = 0 


holds for q G X. Hence, interpreting PM and MP on .V, the 
’Iterator S := (I - (I/ 7 2 ) ■ (I 4 - MP))~'M with the stabilizing 
olutions M > 0. V > 0 is shown to be positive definite for given 
> (1 4 A mH x(A / TP)) 1 / 2 . It follows from the equality 




SB* 4 


7 2 — 1 


'PC*C(I + VS) 



^■(lAVM^jo 

^■{lAPM) S j(A~B m BM)6 


holds for all o ^ 0 (<> G V) (note that (C, 4, B) is observable 
and controllable), the operator M is shown to be positive definite. 
While the system 

p(t) =Ap(t) + Dv(t-h) 

lit) = —B T M |f" U, p(f) + I r~ Ali lhit A i) (I d 

associated with the infinitesimal generator A - SB* M has stable 
poles A(4 - BB r A/), hence the operator M is shown to be the 
stabilizing solution to (22). Q.E.D. 

Remark 4: The value (1 + A nmx (-VI P )) _1/ ‘ 2 stated in Theorem 
4 qualifies the alternative analytic representation of the maximum 
robustness margin f up i (see [3]). Since the operator MP is compact, 
it is also possible to approach the value f„ vi based on the finite-rank 
approximation of operator MP. 

By Theorem 4, it is shown that the operator 5 is constructively 
given based on finite-dimensional Riccati equations. However it 
should be noted that, in return for avoiding the direct calculation 
of ( 12 ), it arises infinite-dimensional manipulation concerning with 
the compact operator MP. Based on the previous results stated 
in Theorems 2 and 4, the design procedure is summarized in the 
following way. 

Theorem 5 (Main Result): For a prescribed margin of the required 
robust stability e (f < e OIt , < 1, see Fact 1), the robust controller 

u ( t ) = h\)x( t ) 4 f /v 1 (3 )u(f 4 3) (13 

J-h 

x(t) = Ax(t) -F Bu(t - h) 4- Unit) - Cx{t)) 

is obtained based on the following procedure. 

Step 1: Calculate the solutions A/ > 0, P > I) to the finite- 
dimensional Riccati equations 

MA A A r M - MBD t M a C t C = 0. 

PA T + AP - PC T CP + BD r =0 


'at the operator 5 > 0 is a stabilizing solution. 


and let L := PC 1 . 
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m 


Step 2 Define the operators U € r(V,V") (Theorem 4) and 
P € C{ \ ) (I^emma 3) calculate the following operator 

S = (J-^ (1 + HP))' 1 H 

= z — —j(I — T) 'M 

1 — t 4 

f 2 

T = -- VIP (30) 

1 — r* 

Since the operator 7 € 7'( \ ) is compact, wc can construct a 
sequence of finite-rank operators {T } which converge to T in the 
sense of the uniform operator topology ||7 — 7"|| —► 0 U follows 
from the fact ||(7 - 7, ) J - (J - 7 ) _1 1| —♦ 0 that the sequence 
{5 T ) 5 — (1/(1 — (I - 7 )“' M also converges to 5 in 
the sense of the uniform operator topology 

On the finite lank appioximalion of operator 7 the averaging 
method 11), [9] is applicable In this case the manipulation of operatoi 
(1 - 7 ) is reduced to the calculation of inverse matrix 

Step 3 Based on the integral kernel representation of 5 € 
CiX V*) determine the feedback gams An Ai( ) 

A =-*>/(»> A i ( ) = — S (0 ) 

In case we approximate the operator by averaging method, say 5, 
the parameters s/(0) 5_>(0 ) ait determined bhy partitioning the 
corresponding matux form representation ol £ 

IV Conc 1 l SION 

A robust stabilization problem in the gap metric is discussed lor a 
system with delays in control Wc derive a design procedure of lobust 
controllers based on hmte-dimensional Riccati equations and it is 
shown that the resulting controller has a structure ol observer based 
predictive actions The completing the square aigumcnt employed in 
the derivation enables us to provide game theoretic interpretations 
on the trade olf between the initial uncertainties and attenuating ihe 
disturbance 

The approach discussed here is applicable foi the output 
feedback pioblem in more general setting and it is also shown that 
the admissible control law is constructively given in the tramcwoik 
ol finite dimensional operations 
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A New Balanced Canonical Form 
for Stable Multivariable Systems 

Bernard Hdnzon 


l Introduction 

A new balanced canonical form is presented for stable multivari 
able linear systems In |3| overlapping continuous block balanced 
canonical forms were introduced for the stable single input/single 
output (SISO) case as a generalization of the balanced canonical 
form of [9] lor the SISO case In the search tor a generalization ol 
these results to the multivariable case a new multivariable balanced 
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anonical form was discovered, which is of interest in its own 
,ght and which will be presented here. The new canonical form 
■as a number of nice properties. The integer invariants that appear 
ii the canonical form are the multiplicities of the Hankel singular 
ulues and a number of new invariants, which are in one-to-one 
i ijective correspondence with the Kronecker indexes of subsystems. 
Iruncation of the state vector leads to stable minimal models in 
anonical form, just as in the case of [9J. In the SISO case the 
anonical form coincides with Ober’s balanced canonical form (up 
id a different signs convention). The reachability matrix of a system 
in canonical form with identical singular values is positive upper 
triangular. For the subclass of stable multivariable all-pass systems a 
tlcluiled treatment of the canonical form is presented in [4|, Section 
IV 

II. Balancing, Canonicai Forms, and Kronecker Indexes 
Let us consider continuous-time multivariable systems of the form 
J m i — - 4 . 1 ’/ + But. ( 2 . 1 ) 

lli = C.r, + Du, (2.2) 

with / G R.u, G R'\.n G R\y t G R".A G R ,,X \D G 
R"*'\C G R wy \D G R"' x/ ’. 

Let for each n G {1,2.3. - - -} the set C u be the set of all 

quadruples (.4. J?,C. D) G R nX " x R" X1> x R mXn x R" 1 **' with 

the properties: a) (.4, B.C.D) is a minimal realization and b) the 
spectrum of .1 is contained in the open left-half plane (i.e., .4 is 
asymptotically stable). A quadruple (A. B. C.D) or triple (A.B.C) 
will be called asymptotically stable iff the spectrum of .4 lies in the 
open left-halt plane (irrespective of minimality or nonminimality). 

As is well known, two minimal system representations 
( 4 1 , Z?i. C'i. D\) and (,4_>, B 2 . C 2 .D 2 ) have the same transfer 
(unction G'(,s) = C\(sl—A]) 'B\4~D\ = Cj(.s/—. 4 * 2 ) ] B'>A~D 2 . 
and therefore describe the same input-output behavior, iff 
there exists an 11 x 11 matrix T G Gl n [R) such that 
4, = TA>T \lh = 77?,.C 1 = CjT-'.Di = D>. In 
that case we say that (.4 1 . B\ . C \, D\) and (A 2 .B- 2 .Cj.D 2 ) 
are i/o-equivalent. This is clearly an equivalence relation; write 
( A 1 . B\ . C\ . /)i) ~ (A 2 .B 2 .C 2 .D 2 ). A unique representation of a 
linear system can be obtained by deriving a canonical form. 

Definition 2.1: A canonical form for an equivalence relation 
on a set A* is a map 

T:A - X 

which satisfies for all .r. y G .Y : 

i) IV) ~ .r 

ii) .r ~ y <=> T(.r) = V(y) 

Equivalently a canonical form can be given by the image set T(X ); 

‘i subset B C .Y describes a canonical form if for each .r G A“ there 
1 - precisely one element h G B such that b ~ j\ The mapping 
' —> B C A', .»* •-+ h then describes a canonical fonn. 

Let ( A.B.C.D ) G C„. The controllability Grammian IT, is the 
>sitive definite symmetric matrix that is given by the integral 

W. = / vx\>(At)BB I exp(.4 / t)dt. 

,/n 

vs is well known, W, can be obtained as the unique solution of the 
"Mowing Lyapunov equation 

AW, +\Y,A r = -I3I3 J . (2.3) 

1 a dual fashion, the observability Grammian \\\, is the positive 
Infinite symmetric matrix that is given by the integral 

U t> = f ex\)(A J t)C r C exp(Af) dt. 

J 0 


1995 ^ 

This matrix is the unique solution of the following Lyapunov equal n 

a j ir„ 4 - ir„.4 = —c 1 c. • 

Definition 2.2: Let (A.B.C.D) G C„. then (A.B.C.D) p, 
called balanced if the corresponding observability and controllahilitx 
Grammians are equal and diagonal, i.e., there exist positive nuntbeis 
(Ti.fr>. - - -. (t„ such that 

\V» = It; = <li(iy(fT \. • • -, cr„ ) = ; (2 5) 

The numbers are called the (Hankel) singular values 

of the system. It will be convenient to call an arbitrary quadruple 
(A.B.C.D) G R nXn xR" Xv x R"' *" x JT * v balanced if the pair 
of Lyapunov equations 4S + SA 7 = -BB 1 . A 1 ^+E4 = -( (/ C 
has a positive definite solution of the form E = diag(o-|, ■ • ■. a,,) 
(assuming neither asymptotic stability nor minimality). 

By a suitable permutation of the components of the state of a 
balanced system, a representation of the system is obtained for which 
the singular values are ordered with respect to size: rr\ > rr> > 

■ * ■ > (t„ > 0. The singular values, ordered in this way, arc known 
to be uniquely determined by the inpul-output behaviour of the 
system. 

Definition 2.3: A balanced canonical fonn (on C r( ) is a canonical 
form TrC",, —► C„. such that Y( A. B.C.D) is balanced for each 
quadruple (A.B.C.D) G C„. 

Definition 2.4: Consider a pair (A.ll) of matrices ,4 G R 1)XV 
and B G R" xv . Let /?„ = B„(A,B) = [B.AB. ■ ■, A" D] denote 
the corresponding reachability matrix. Suppose the pair (A, B) is 
reachable, i.e., the reachability matrix has rank n. The selection of the 
first n linearly independent columns is called the Kronecker selection. 
It has the property that it is a so-called nice selection, which means 
that if the j th column of /?„ is in the selection, then either j < p 
or otherwise the (j - ;;)th column is also in the selection. For each 
i G {1.2. ■ • - .p} let (J , denote the smallest nonnegalive value of j 
such that the (jp + Oth column is not in the selection. Then we will 
call (r/|,r/j.■ ■ ■. (I,,) the dynamical indexes (also called successor 
indexes) corresponding to the selection. By ordering these according 
to magnitude, one obtains a nondecreasing sequence of p indexes 
h 1 < Hi < • ■ ■ < /«;, which arc called the Kronecker reachability 
(or controllability) indexes. 

With any nice selection corresponds a sequence of integers p = 
so > s\ > s -2 > > si > s =: 0 which add up to n + V 

and a sequence of sets of indexes {{/,(l),/,(2),- ■ ■. / y (-s^)} C 
{1. 2. ■ ■ ■. 1 }. j = 1. 2. ■ ■ •. /} with the property that of the .s , -1 
columns that can be chosen from A J 1 B in the nice selection the 
/ y (1 )-th, the /,(2)-th etc until the ij(sj)lh are chosen. Because the 
Kronecker selection is also a nice selection these quantities are also 
defined for the Kronecker selection. It is clear that the sequence 
of sets of indexes determines the Kronecker selection completely 
and is in bijeclive correspondence with the sequence of dynamical 
indexes {<U , d>. ■ ■ ■, <7,,} that describes the Kronecker selection. It is 
well-known and can easily be derived from the foregoing that the 
Kronecker indexes are in one-to-one bijective correspondence with 
the sequence {*„(cf. e.g., [2]). 

Remark: A similar definition holds for the Kronecker selection of 
rows from the observability matrix of a pair of matrices (A.C) G 
R ux " x R t,,Xn . the corresponding Kronecker observability indexes 
etc. 

The following lemma is basic for our considerations (see e.g., [9]): 

Lemma 2.5. Let M G R 1,xl .nmk(M) = n < l. There exists 
an orthogonal matrix Qu G R" Xn and natural numbers 1 < 
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ii <n< <in < I such that 

Afo := QoM 


/° ■ 

■ til 1 4 ^ 

♦ - 

■ * 

... ... 

0 

0 

0 ■ 

• ni2ij 

* ... * 

\o •• 

• 0 

0 ■■ 

0 

rilnin * / 


with ni j t = 0 for all i < i; and m JtJ > 0 for all j € 1,2, •■ ■. n; Mq 
is unique and Qo is unique. Such a matrix will be called positive 
upper triangular with independency indexes ii. * 2 „ • • ■, *». 

A matrix will be called full rank upper triangular if it is positive 
upper triangular up to multiplication of some (or possibly all or none ) 
of its rows by - 1 . 

HI. A Balanced Canonical Form for 
Systems with Identical Singular Values 

The following theorem of [1] is basic for the relation between 
systems with all singular values equal and stable all-pass systems. 

Theorem 3./: Let m = p. 

i) If a balanced triple (A,U.C) has identical Hankel singular 
values <t = 1 then there exists an orthogonal matrix D such 
that C = -DD r . 

ii) (A, ZL C, D) is a balanced asymptotically stable realization of 
a stable all-pass system iff (A, U, C) is a balanced asymptot¬ 
ically stable triple with identical singular values and D is an 
orthogonal matrix such that C = — DD r . 

It will be useful to extend the usual definition of orthogonal square 
matrices to rectangular matrices. 

Definition 32: An w x //matrix V will be called orthogonal if 
V 1 U = I p (if p < m) or UU 1 — I UI (if m < p). 

Using this, a balanced realization (A, D , C) of a stable system with 
identical singular values (with possibly m ^ p) can be characterized 
as follows: 

Corollary 3.3: The following three statements are equivalent: 

i) A triple (A, B , C) is a balanced asymptotically stable realiza¬ 
tion of a stable systetn with identical singular values a > 0 

ii) The pair (A,U) is reachable and A 4- A 7 = -(1 /rr)BB J = 

)C T C.a > 0 

iii) The pair (.4, D) is reachable, .4 + A r = -( l/ir)DD‘, and 
there exists a (possibly rectangular) orthogonal matrix D such 
that 0 = -DU 7 

Proof: Without loss of generality one can assume that m — p , 
because if m ^ p than one can add a sufficient number of zero 
rows to C or zero columns to B to obtain a system with the same 
number of in- and outputs and clearly if the result holds for the square 
system obtained in this way, it also holds for the original system. If 
m = //the theorem follows from the previous theorem together with a 
theorem of [10], (here applied to the special case where the Lyapunov 
equation involved has the identity matrix as a solution) which says 
that if (A. D) satisfies the equation A 4 * A 7 = -UU 7 then: A is 
asymptotically stable iff (A, B) is reachable. □ 

The following canonical form for the set of stable multivariable 
all-pass systems of fixed McMillan degree will be important for our 
constructions. 

Theorem 3.4 ([4], Section IV): The following two statements 
are equivalent: i) A system IT is a stable all-pass system with 
McMillan degree n. ii) There exists a unique balanced realization 
(A.U, C\ D) 6 Cn of II of the following form: There are integers 
p — mq > mi > S‘2 >-•■>*/> m/. h = 0 which add up to n 4- />. 
such that 

-(”) 


where Ui is an si x positive upper triangular matrix; 

/An An 0 0 \ 

A’!! A n A'ill *• ; 

A = 0 .4.1 i 0 

: ’• "■ '■ At-i.i 

0 ••• 0 .4/.,-, Au / 

a block tridiagonal matrix with .4„.,. an x a x matrix, v.v 6 
{1.2, •••./}, .4. = 0 if |ji -v\ >1; 

An =A U - ±lh B[ 

An an otherwise arbitrary skew symmetric mi x matrix; .4 „„ = 
Am, an otherwise arbitrary skew symmetric «„ x matrix for each 
1 / 6 {2,3, ••■,/};Au+i.h a positive upper triangular m„+i x s„ 
matrix for each v € {1,2. ■ ■ ■ J - 1}; A 1 = -Aj+ l(l for each 
u 6 {1,2, • ■ ■ J - 1}; D an otherwise arbitrary orthogonal p x p 
matrix and 

C = —DD t . 

The indexes s „, u = 1. ■ J are in bijective correspondence with 
the Kronecker indexes. The canonical form is balanced and its 
reachability matrix is positively upper triangular. 

For triples (A, B.C) with all Hankel singular values equal to unity, 
possibly with hi ± />. one obtains the same balanced canonical form 
with the only exception that, while in the case of all-pass systems 
the matrix C can be determined from B and I), here instead C is 
an arbitrary solution of the equation 



(Note that because DiB{ is an *1 x s { positive definite matrix, 
one has M] < m. s\ < p.) It follows that the matrix C can be 
partitioned as [Ci.O], C\ an 111 x mi matrix which is a solution of 
C{ Ci = B\B]. Let B\ = [{B\ U( ) i/j .()]. where the zero matrix 
is si x (p - mj) and (B)B\ ) l ^~ is the mi x mi positive definite 
symmetric square root of B\B{. Then clearly C{ C 1 — B\B\ 
and C 1 C = BB l with B = () an m x s\ matrix. From the 
corollary above it follows that there exists an m x mi orthogonal 
matrix D such that C = -DB 1 and so Ci = -DBl. Partition 
D = [— V. Y] where l is an w x mj orthogonal matrix and 1 is an 
in x [p - mi ) orthogonal matrix. It follows that C\ — f (Ui B[ ) l/7 . 
Because (UiU/ ) ] ' 2 is positive definite, the relation between C f i 
and V is bijective: V = Ci{B\B( )~ (l T J) and therefore V can be 
used to parametrize Cj. In this way one obtains a parametrization 
of the canonical form for systems with one unit singular value: The 
matrices B and A together have mi + m 2 + ■ ■ ■ 4 m/ = n positive 
parameters due to the requirement that Ui, A 21 , Am 2 . ■ • ■, A/ /-1 
are all positive upper triangular, while all other entries in these 
positive upper triangular matrices are either prescribed to be zero 
by the structural indexes, or are free to vary over the real numbers; 
furthermore the skew symmetric matrices Ai 1 . A 22 , ■ • ■, A/,/ have 
Yj\i- 1 ( 1/2 )m„ (*„ - I) parameters that are free to vary over the 
real numbers and finally Cj = U(BiB{ ) I/J is parametrized by the 
rn x mi orthogonal matrix U\ the set of all such orthogonal matrices 
has dimension - (1/2 )a 1 (.s 1 4 - 1). Of course one could add 
a feedthrough matrix to the system, which would have mp freely 
varying real parameters. 

If (A,U,C) has all singular values equal to rr > 0 , then 
(A,7T.P) = (A, BJy/ffX'/y/o) has all its singular values equal to 
unity. Requiring (A, 7T, V) to be in the canonical form just described 
leads to a canonical form for (A, U,C) by taking (A. U,C) - 



USKE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. 2, FEBRUARY 1995 


/A, y/aT5* s/fC*)- This canonical form will be used in the sequel. 
Clearly 'BB’ = BD T /<r,CF T = CC r /*. etc. 

IV. A New Balanced Canonical Form 
for Stable Multivariable Systems 

Consider a system with balanced state space representation 
i A,B'CiD). The Grammians are equal and of the form 

£ — diag( O ’i / r ,( i) , (T'2 fn{2)'' ‘ ’ m ^kln(k) ) 

where <r\ > rr 2 > • • > 0 are the Hankcl singular values of 

the system and /#(1), n(2), ■ ■ ■, »(fe) the corresponding multiplicities. 
Partition A, IS, C , according to n (1). n (2), ■ ■ •. n ( k) to obtain 

/-4(1.1) A(l. 2) ■■■ .4(1.A) \ 

A(2,1) .4(2.2) .4(2, A) 


and 


n = 


a = 


B( 1) 


B(k) 


\A(kA) A(A\ 2) 


c = [c(D a 2) 


(4.6) 


A(k.k)) 


C(k) 


(4.7) 


The following result is very important for the constniction of balanced 
canonical forms. 

Theorem 4.1 ([JO], [7]): Let ( A,BX\D) be partitioned as 
above, balanced in the sense of definition 2.2, but not necessarily 
asymptotically stable and minimal. Then (A.7LC. D) G C„ iff for 
each / G {1.2. ■ ■ ■, A-}, (A(/\/), B(i),C(i),D) G C Ij(l) . 

The balancing (Lyapunov) equations for (4. B . C.D) are given in 
lemis of the A (/, j),B(i ). C(j).i - 1, ■ ■ ■. k. j = 1, ■ ■ •, k by 


A(i.i) + A(i.i)' = - -D(i)B(i) r 

(T, 

= - -CU)'C(i) 

<T, 

A(hj )<Tj + A(j. i) 1 (t, = - B(i)B(j)‘ 


A(i.j)cT, + A(j.i) 


-C'(i)'CU) 


(4.8) 

(4.9) 
(4.10) 


where i £ l}.j £ {1. ■ ■ -. A-}. / ^ j. Note that the last 

lwo equations can be solved in terms of A{i,j) and A(j.i) for 
given pair (/.j),/ ^ j and given B(i). B{j)X'{i),C(j)< because 
^ / (Tj. Therefore, and because of the theorem 4.1, the construction 
of a balanced canonical form can be reduced to constructing a 
balanced canonical form for the subsystems (A( /',/), B (/), C( /)), i = 
L 2, ■ ■ •, k. For each i G {1. • ■ ■, A } such a system is a system with 
identical singular values, or perhaps one should say with one singular 
value (t,. For such systems the canonical form presented in Section 
HI can be used. This leads to the following result. 

Theorem 4.2: 

a) The following statements are equivalent: 

i) E is a stable input-output-system with p inputs and rn 
outputs and McMillan degree ». 


ii) E has a realization {A.BX\D) e C 
of the following form: There are numty i . 
<r\ > tri > • ■ ■ > (tk > 0, the Hankel singula- 
values,, and integers n(l),»i(2). ■ ■ •. n(k), ilk 
corresponding multiplicities of the singular values, 
and for each i 6 {1.2.-- ,kj there are integers 
P = "<>(/') > All#) > ■■■ > /) > (, ) = () 

which add up to n{i) + p, such that 


.4 = 


•4(1.1) 

•4(1.2) ■ 

4(1. A) \ 

•4(2.1) 

.4(2.2) • 

.4(2. k) 

\-4(A. 1) 

•4(1,2) • 

■■ .4(1, A-) / 


(4.11) 


B = 


and 

D( 1 ) 

B(k) 


C = \C( 1) C{ 2) 


C(k)] (4.12) 


where A(i,j) is an n(/) x n(j) matrix, B[i) an 
ti(».) x p matrix and C{j) an m x »(j) matrix, 
i — 1,2, • ■ ■. k.j = 1. 2, ■ ■ ■, k and the triples 

(A (/./’). B(i). C (/)) have the following form 

iMOl 

0 


B(i) = 


0 J 


(4.13) 


where Bid) is an .sj(/) x p positive upper triangular 
matrix and see also (4.14), found at the bottom of 
the page, a block tridiagonal marrix where for each 
u.r G {1.2, ■•■./(/')}, A u . v (i-i) is an b u (i) x «,,(/) 
matrix, A UmV {iJ) = 0 if |?/ - r | > 1; A\i(iJ.) — 
Auii.i) - {l/2)D\{i)B](i) 1 /(t,. where Au(i.i) is 
an otherwise arbitrary skew symmetric jsj (/') x s\(i) 
matrix; A„ u {i.i) = A ilu (/',/) an otherwise arbitrary 
skew symmetric n„(i) x s„(i.) matrix for each a £ 


{2.3.- 

•■./(<)}; 

A ,. u+ 1 (tl ( i. i ) a positive upper triangular 

«..+i(i 

) x «u(i! 

) matrix lor each u G {1.2, 

■••.((<)- 

1); -4, 

i.,*+i(». i 

) = -Am+i.hIi./) 7 for 

each 11 £ 

{!• 2’ ■ 

■■.id) - 

- 1}, and furthemiorc 



cm 

1 = [C\(i)Jl -■■,()] 

(4.15) 

where 





Ci(i) = 

L-(i)(D 1 (i)D l (i) T )' / ' 2 

(4.16) 


in which U{i) is an m x s { (i) orthogonal matrix, 
i.e., U[i) r V{i) = l s , (: furthermore the matrices 
A(i,j ), i ^ j: /, j £ {1, ■ • •. k} are determined as the 
unique solution of the linear equations (4.9), (4.10). 

b) For each system E the the realization {A, B, C\ D) G C„ 
described in ii) above is unique and balanced. This determines 
a balanced canonical form for i/o- equivalence on O,. 


-4(L/) = 


fAuiUJ) A ] .AU) 0 
Aa.iJi.i) A‘ 2 ,‘2(1.1) /) 

0 A,,:,(/./) 


V 0 


0 


0 

A/(*)-U(i)(l i) 

Ai(i)JU)- i('A') /(,)(/, / ) / 


(4.14) 
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Proof: From the introduction to this theorem it is clear that 
each stable multivariable linear system has a unique, balanced, 
representation of this form and that Vo-equivalent systems have 
exactly the same realization. Therefore this determines a balanced 
canonical form. It remains to be shown that each system of this form 
is indeed minimal and stable. This follows from theorem 4.1 together 
with the fact? that the canonical form that is used for the systems with 
identical singular values has the same property: for each choice of 
the parameters that is allowed the resulting system is minimal and 
stable (cf. [4], Section IV.). □ 

Remarks: 

i) If si = p then one can just as well parametrize C\ (t) by 
Gf/') = (Hi)B(i):U(i) a (possibly rectangular) orthogonal 
matrix. 

ii) Truncation of the last n - k components of the state vector 
corresponds to the truncation mapping 


Boundary Fractional Derivative 
Control of the Wave Equation 

Brahima Mbodje and Gdrard Montseny 

Abstract —The wave equation, with fractional derivative feedback at 
the boundary, is studied. The existence and uniqueness, as well as the 
asymptotic decay of the solution towards zero is proved. The method 
used is motivated by the fact that the input-output relationship of 
generalized diffusion equations, defined on the Infinite spatial domain R 
with collocated sensor and actuator control, can be expressed In terms of 
fractional integrals. Compared to other methods, the payoff Is as follows: 
1) The proofs are simpler. 2) The method used can easily be adapted 
to a wide class of problems involving fractional derivative or Integral 
operators of the time variable. 

I. INTRODUCTION 


(A,B,C.D) ~ Uh.0)A(h,0) r Ah,0)B.C{h.Q) r .D) 

(4.17) 

The canonical form presented has the property that if truncation 
is applied, the result is again in canonical form. Therefore the 
resulting lower order system is again minimal and stable. 
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Over the last decade, there has been a considerable amount of 
research on the modeling and control of beams and waves, spurred 
on by the increasing use of flexible structures in space technology. 
In particular, the issue of boundary control has received a special 
attention. In the case of boundary velocity control, and when the 
sensor is a deflection sensor, the control law involves lirsl-order time 
differentiation of signals proportional to the deflection and/or the 
slope of the deflection at the boundary. Besides it is well known 
[3], [8] that uniform exponential decay of energy for the wave 
and the beam equations can be achieved through boundary velocity 
control. However, when it conics to implementing such controllers, 
the noise level in the control law could become intolerable due to 
the differentiation process to be performed. Therefore, to reduce the 
level of noise, one is naturally led to consider control laws based on 
lower order derivatives [ I ], namely fractional order derivative control 
laws. Another strong motivation for considering fractional derivative 
controls may be found in |4]. There it is shown, for a Bernouilli-Eulcr 
beam, that the H optimal wave absorbing problem at the boundary 
is solved by a fractional derivative compensator of order one-half. 

Only a few other authors [10], [13] have employed controls 
involving fractional derivatives to stabilize distributed parameter 
systems. However, in their investigation, they almost exclusively 
resort to finite dimensional approximations and techniques pertaining 
to the Laplace domain. 

Our objective is to study via a novel method based on an auxiliary 
diffusion equation this more general type of feedback control law. As 
will be noted, the method used here is applicable to other mathemat¬ 
ical models such as the Bernouilli-Euler and the Timoshenko beam 
equations. But in order to keep computational complexities low, we 
choose to examine the wave equation. 


Fractional Calculus 

As in 12.|, we define the fractional derivative operator of order 
a . 0 < n < 1, by 


sw 


(t-T)-” 'If, ,, 
TTi- r -r (T),,T 

HI - a) dr 


( 1 ) 


where F denotes the well known Gamma function. 
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As used in (1), the fractional derivative operator is the inverse 
perator of the fractional integration defined as follows [9] [12] 

ir/](f)= / ( '~r(n) f(T),h (2) 

Note that in contrast to the classical derivative operator, the fractional 
lcnvafive of f(t) does take into account the whole history of the 
ignal f(t) from time 0 up to time t 
From (1) and (2) it is clear that 


hor extension of the above definitions to the case where n e R + the 
reader is referred to |9], [12] 

In the Laplace transform domain, the fractional derivative operator 
has the property 


£ 


U f 1 

= s- (1 - V’ 

Af 

dt 


dt 


£[/]-•>• 'f( 0) (4) 


Notice how this equation reduces to the well known formula in the 
classical case of n = 1 

Also for 0 < f\ < 1 and 0 < i < 1 it can be shown by direct 
calculation that 


<1 f 


dt 


(0 = /(') -/( 0 ) 


<1_ 

dt 


»/7 


if/ 1 


+ * 


/ 


i/f 4 4 




(*5) 

( 6 ) 


for a more thorough discussion on the properties of fractional 
calculus sec [9] [12| 

Modi l Definition 

We consider the wave plant described by the following equations 
(without loss ol generality the wave velocity can be set to unity) 


Oi ft(i t) - 0jft( i 

o = o o <v i < i 

t > 0 

(7) 

0(0 

t) = 0 0 (9(1 t) -- 

= -hdt 0(1 t ) 



0 < <\ 1 

h> 0 

(8) 

<n i 

0) = iM() O,0(i 

0) = (9i( i ) 

(9) 


where h is the constant involved in the boundary control 
Remat ks i) Equations (7) (9) ate clearly a closed loop system 
with feedback 7 1 d t ti{ 1 t) n)Also the noise amplification caused 
1 v the differentiation process at high frequencies is attenuated by the 
ti actional integration 

Our objectives are to prove existence and uniqueness of the solution 
< I system (7) (9) as well as the asymptotic decay of its energy to 7ero 
* >ur analysis falls into three parts 
First of all we shall show that the above system can be replaced 
v an augmented system obtained by coupling the wave equation 
ith a higher order diffusion equation As will be seen, the main 
lvantage of tackling the problem in this way (rather than in the usual 
amework of Volterra partial intcgrodifferential equations) lies in that 
ie analysis of the augmented model will require only a reasonable 
mount of functional analysis In fact the augmented model may be 
framed into the operator theoretical form Z(t) — AZ(t) in an 
ppropnate Hilbert space This very convenient lorm will then allow 
te use of results from semigroup theory More, the appearance of the 
actional derivative operator in the second boundary condition in (8) 
ould make the direct analysis of the original set of equations (7)-(9) 


much more complicated, as in that case the search tor a L\ \\u\ 
functional could be more difficult 

The second part of our investigation is devoted to proving cm n n 
and uniqueness of the solution of (7) (9) The third pait is tonumc i 
with the asymptotic decay of the solution to zero 

II AUGM1 NTbD SYSTI M 

We now demonstiate how a diffusion equation can fit used 
to reformulate system (7) (9) Consider the higher order diffusion 
defined on the spatial domain R with input Ut 

0>Ay t) = (— 1) , " + V(v t) 

+ Wi(/) D k) („) / >0 9 e \ ie \ do) 

U) = v?0((/) (II) 


where t)^ k) denotes the / th derivative of the Dirac delta function 
We define the output of the system as follows 

y,(t) = (t>' l ' ) r ( t)) = (-nV„«> o (i2) 


Let /(c) = f-* f(v)r € 'rfi/ define the Foimer transform of J{q) 
The following assertion is important in that it constiiules the idea 
upon which the remainder of the article is based 
Pioposition / Let 0 < A < ij - 1 and ^(i/ ()) = () then setting 
n — (2k + I)/2 q and 0 = 2q sm M7r the input-output relationship 
of system (10)—(12) is given by 

idt)-* HAD (H) 


Pi oof It can readily be shown that the self adjoint operator 
l =. (-1) ,+1 0 y 1 on the Hilbert space 1 1 (R) generates a contraction 
semigroup Also from the Founer transform of (10) we have 

D = -t, i 'y(i. n + (-i)VM/U) (14) 


Hence 

Ai D = e~ i 'V0(0 


+ (-/)* f £*< " t) Ui(t)<It (IS) 

/ 0 

Now setting y0 ( <) = 0 (zero initial condition) and using (12) we 
arrive at the following 




1 




n.») 

2q 7T 


r j— 

L It-T)- 


Ui(t)iIt (16) 

Ui[t)(It wherea - - (17) 

2q 


which is the result looked for since T(1 - t\)l{r\) = 7 r[sinnT] 1 □ 

Remat k& i) We realize that by assigning to k and q var>ing values 
a will run through all rationals between 0 and 1 Besides nonrational 
values of o and the limiting case o — 1 can be achieved by merely 
allowing A to vary continuously within the interval 0 < A < q - 1 /2 
which amounts to considering in (10) fractional derivatives of the 
Dirac distribution [12] as well n) Implementation of a control law 
based on equation (1) is very difficult in the convolution form 
On the contrary system (10M12) may be approximated in a veiy 
straightforward way by using standard numerical techniques such as 
finite differences or elements, which lead to very efficient algouthms 
[7] These numerical aspect are not developed here but will be the 
subject of a future paper 

In view of the foregoing proposition, we can model the fractional 
derivative term in the second boundary condition in (8) by means of 
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m 


The diffusion system (10)—(12) System (7M9) is therefore completely 
described by 

= 0 (18) 

<M V- f) + (-i)'^'V(v. *) 

-0t),0(l /)^ (A, (i/) = 0 (19) 

0 ( 0 . M = 0 , 0,9(1. t) -f) 

( 20 ) 

0 (i, 0) = 0 o (i), <->,(9( * 0) 

= #,(■) »■-(//. 0 ) = 0 ( 21 ) 

Remaik As will be seen system (18)—(21) possesses an infinites 
imal generator of a ( u semigroup In other woids, wc have managed 
to replace a system with an heiedirai\ boundary condition by a 
nonhereditaty system 


Proposition 2 Let £ t i(t) =|| tp ||^ (R) and £„ (t) =|| 9 ||^ <0 
+ || 0i9 ||J. (() 1} denote the diffusion and the wave energies, 
respectively Then the total energy £(t) = £ u (1) + £ i(t) is 
nonincreasing along the classical solutions ol system (24) 

ProoJ Differentiate f, (t) with respect to time, use integration 
by parts and conditions ( 20 ) 

f (f) = -ft<),0( 1 f)(A ,M p( ,()> (27) 

Differentiate £i(t) with respect to time, use integrations by parts and 
the fact that {9, d t 9 p) r , being a classical solution ol system (24), 
p must belong to H‘( R) 

i'i(t) = -2 II 0‘ Y +hd, 0 (l, v-( , t)) (28) 

Now, by adding memberwise (27) and (28), we obtain the desired 
result 

t'(1) = -2 ||«/J y -||; (R) □ (29) 


III EXISTfcNO AND UNIQLItNESJ> 

To analyze (18H2I) we introduce, as in [ 8 ] the following nota 
non 

#;/(() i) = {# e i \o i) o 0 ()> o h e z/(o i) 

0 ( 0 ) = 0 } 

We consider (he Hilbert spaces // ( J(1 0 ) L~{{). 1 ) and L'(R) 
equipped with the inner products 

[u\u )H[\ (0 1 ) = - f 0 at) udi (i |< )/ M0 \) -\ j * *<li 
^ In ~ In 

(V'lv 1 )/-fH» = ^ /rv'rf* (22) 

Our state space will be the Hilbert space H = /f,j (0 1 ) x / * (0 1 ) x 
L"{ R) wilh inner product) 

(((/. <, p) 1 |(4i. i, p) r )h = 

( f/ 1 w )//*(o i) 4- (< | # )/-(n i) + Mv)/*(Ri (21) 

Setting ■= {9 0i9 y ) 7 £ H (18)—(21) can be pul in the abstract 
form 

L = V (0) € H (24) 

where \ H -* H is the linear unbounded operator defined by 


Next we prove the following assertion foi anothei useful estimate 
Proposition 1 Let E/(t) =|| ihp ||J J(H) and E (t) =|| 
<>i0||f,i ( „ n + II Or" llf .,0 n . then the sum E(t) - E (/) + E,(/) 
is increasing along the solutions of (24) with initial conditions in 

D(V‘) 

Piool II rjfcD(\“) then =T(/)\ 0 is a classical solution ol 
(24) Hence we may proceed as m the proof of Pioposition 2 We have 

E (/) = -/»), 0(1 O<0“’ /)) (10) 

and 

Ei(t) = -2 || 0',0i Y ' ||, (R) +/««>, 0(1 0(0“’ J, y”( /)) (II) 

Hence 

E(0 = -2|| ( );c^ ||, , HI n d2) 

We are now able to consider the following 

theorem I The operator \ defined hy (2S) (26) generates hTo 
semigioup {7(f) / > ()} in H Tor ( 0 ) € D(\) the unique 
classical solution of (24) is given by (/) - I {t) (0) £ J)(\) 

/ 2 0 

Proof By the l umer Phillips theorem 1111 il suffices to show that 
\ is maximal dissipative Since Pioposition 2 already establishes the 
dissipalivily of \ on the space D{ \), it lemains to prove that the 
operator \7 — \ is onto lor some A 0 Let A - 1 Hence our 
problem becomes the following 

foi any n i * )’ £ H, find {u i r ) J £ D{ \) such that 
these three equalities hold 


\ = 


“0 

Of 

0 


I 

0 

d(fi\i i) )* {k) 


0 

0 


(2S) 


with I s ! (i) dehned by (iS (i), /( i )) = /( 1 ) The domain D( \) of 
the operator \ is given hy 

7?(\)= {(« I r)' p H II 6 7/,-f . fc Hi 

Mi)0'‘' + <-i) ,H ^V e L-( R). 

0 1/(1 > + / i ( 0 a) ,) = <> } ( 26 ) 


Note that (// / ) 1 € D( \ ) implies Y € //'(R) since, by 

assumption 0 < k < 7-1 

We now prove the following proposition about the dissipativuy of 
system (24) 


m( / ) = ll( I ) - 1 ( 1 ) (13) 

l(i)—~ 1 ( 1 ) 0* 11 ( 1 ) (14) 

(V) - ,’(V) - iMl)0“'(v) + (-1 )'<?;V('/) (Vi) 

From (11) and (14) and using the fact that n{ 0) ~ 1 (0) — 0 it 
is fairly easily shown that 

1/(1 /?) = /? sinh ( 1 )— / {n(/r )-f f(^)} sinh( 1 - (t)iI(t (16) 
In 

i{i D) = fi sinh (/) — n( 1 ) — f {n(<r) 

la 

+ l(tr )} sinh ( 1 — (r)d(r, (17) 

where D is a constant to be determined soon 
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Now to show the existence of p,- we first note that 


\W = 


T(g> -I- (-»C)*.?f( 1 , D ) 
1 + 


(38) 


solves the Fourier transform of (35) when D is known. Therefore 
the inverse Fourier transform of \ is the unique solution of (35) and 
belongs to the space H q ( R). Substituting (36) and the inverse Fourier 
transform of (38) into the last boundary condition in (26) yields B: 


Proof: First, if ;<> = (d 0 . *o) r € D{ A 2 ), then cleatl\: 
(d,did) T € Wo(0, 1) x -ff(J (0. 1). Now using the differetnia! 
equation (18) and the fact that £(t ) and £{t ) are bounded, we see that 

- f\d r e) 2 dr 

+ [ [0rd]~fl.i- -)- I [dj 0/d] 2 d.r < C, for some C > II. 


2I?7rcosh(l) - f 2n {a( rr ) + ,i(<r)} coshfl - (r)dcr 

J o 

l f 2 * 

•ii 


+ Bh\) sinh (1) 


- hOa(l) 


L r 


+ £2« 


, 1 + f 2 " 


e k 


- I {fi( (T ) -f /J(<r)} sinh ( 1 - rr) r/rr ^ 

fJ U - 

7r1- 


- + e' 


+ 27r(— 1 ) k h f -) (-y)dyf>{y)dy = 0 (39) 

J H 


where p(y) is the inverse Fourier transform of [1 + £ 2f/ ]“ l . This ends 
the proof. □ 

IV. ASYMPTOTIC STABILITY 
Now consider the subspace of H defined by 


5 = {- G H: £(t) = 0, V/ > ()}. 


Hence we conclude by the compact embedding of 7/^(0. 1) x 
Hi (0,1) into Hi (0. 1) x L 2 (0. 1). □ 

Proposition 4: If z 0 = (0o. 0j, y?n) 7 G D( A 2 ), then the trajec¬ 
tory of the third component of the solution of (24), is precompact 
in 7 2 (R). 

Proof: Since for G D{ A 2 ). r ?(t) is a continuous mapping 
from [0. + x.) into L~( R), it is therefore sufficient to show that 
II y( f ) ||fJ(R)—» 0 as t —► oo whenever G 7)(A 2 ). 

From relations (29) and (32), wc have 

[ (It f [dtdlrfdy < so. f < <x.. (46) 

Jo J r 7o 7 h 

Using the above relations together with the Cauchy-Schwartz in¬ 
equality, we can easily show that 




v?( </. f )] J r//y exists. 


(47) 


Now, from the second equality in (46) and from (47), we have 


1 i in 

t —♦ OG 


/[a;; y My. 1 )]% = t _^ m x 0| 2 rft = <)• (48) 


(40) 


Therefore, from (48), it is clear that || >p(t) ||/^ (H ) will tend to 0, 
as t approaches x, if 


Lemma /. The only classical solution of (24) in the subspace 5 
is the zero solution. 

Proof: Let (0, d ( H. r ") 7 G be the classical solution of (24); 
ihen from (29) we have 

W > 0. I\d^(u.t)\ 2 dy = 0 (41) 

I R 


/ |;’U. = o. 

7 ^ 7|£|<J 


(49) 


We shall now prove the above equality (49) by the dominated 
convergence theorem. 

1) Applying both Parseval’s equality and Fubini's theorem to both 
inequalities in (46), we have 


which, together with the fact that y?(-, t) € 7 2 (R), yields 

Vf > 0, Y :(-. t) = 0 a.e. inR. (42) 

From (42) it is clear that system (18)—(20) reduces to the set of 
equations: 


f |cV?iVU* 7)|“r/f < oc. 

Jo 

r iawe. < x-, 

Jo 

a.e. in £ G [-1. 1]. (50) 


OfH.v, t) — Oj fl(.r. t) = 0, (43) 


0(0. /) = (), djJH f) = 0, /.) = 0. (44) 

Using (5) and the last equality in (44), we have 0(1. t) = 0(1, 0). 
■ombining this with (43H44) leads to the identity: 0(.r, t) = 0 (In 
act, this identity may be obtained by separation of variables). □ 
Therefore, according to LaSalle’s invariance principle [14], a 
>olution will tend to zero provided its trajectory is precompact in H . 
We shall prove the precompactness in Lemma 2 and Proposition 4. 

Lemma 2: If -o = (0n, 0i, <p o ) J G D(A 2 ) then the trajectory of 
<he pair (0, <9/0) 7 is compact in 77d(0, 1) x /, 2 (0, 1). 


So that, by the same arguments that led to (49), we may 
conclude that 

, 0j&<£. Ml 2 = , *)l 2 = «• 

a.e. in ^ 6 [-L 1]. (51) 


Hence 

/ ^ m oc ,l^(£-^| 2 =0. a.e.infe[-l. 1]. (52) 

2) Now, taking the Fourier transform in (19), solving for ^ and 
then using integration by parts with respect to time and the 



m 
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fact that 0(1, t) is bounded (Proposition 2), it is not difficult 
to see that 

MC O| J <(U(Or + y>e/'(0 1) for some Q > 0 

(S3) 

FmaPy from (52) (53) and the dominated convergence theo¬ 
rem, we obtain ^ II V?(0 ||/J(R)= 0 □ 

Corolla ?> For -0 G H || 7 (O~o ||x—0 as t —* x 
hoof For *0 € D( ) it is a direct consequence 0 1 proposition 
4, Lemmata 1 and 2 and the LaSalle’s invariance principle f 14| Since 
D( \ z ) is dense in H the result still holds lor 0 G H □ 


V CONCLUSION 

The existence, uniqueness and asymptotic decay towards zero of 
the solution of the coupled wave-diffusion system (24) has been 
proved Moreover, since the original system (7)-(9) is embedded 
in the ‘augmented” system (24), all Ihe results obtained tor the latter 
carry over to the original system Also the approach presented here 
proves to be both simpler and very efficient 
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Observer-Based Parameter Identifiers for 
Nonlinear Systems with Parameter Dependencies 

Shahab Sheikholeslam 


Abstract —The class of nonlinear dynamical systems whose dynamics 
depends linearly on the unknown parameter vector is considered After re 
viewing a standard observer-based identifier for estimating the unknown 
parameters, we propose a family of new identifiers which exploit the 
a priori known parameter dependencies. Then we establish that, under 
mild assumptions on the dynamical system, 1) the proposed Identifiers 
are stable, 2) the weighted norm of state-parameter errors using Ihe 
proposed identifiers are less than the corresponding errors using the 
standard Identifier for a length of time after t — 0 The main contribution 
of this paper is that it introduces a family of observer-based identifiers 
which exhibit better transient performance than the standard identifier 

I Introduction 

The subject ol adaptive identification and control has been studied 
extensively in the literature (for tht basic theory reler to [8| [5| 
[11J [9|) Adaptive parameter identifiers studied in the litciatuie 
fall into two classes 1) identification algorithms which use Ihe in 
put-output relations to continuously estimate the unknown paiametcrs 
(sec [81 |5|) and 2) model relerence identification algorithms (11 
P ^0| [9] [7] which use an observer lo continuously estimate 
the unknown parameters The algorithms in class 2) are useful lor 
adaptive control The identifiers in this paper belong lo class 2) 

In most engineering applications the unknown paiameters in the 
system equations aie known functions of the physical parameters of 
the model In many such cases the unknown parameters are known 
functions of a strictly proper subset oi all the unknown panmelcrs 
Our approach in designing new parameter identifiers toi a class ol 
nonlinear systems is to exploit such parametei dependencies whilt 
maintaining the stability ol the identifier Intuitively an identihu 
which exploits the a pi ton known parametei dependencies should 
yield smaller errors than a similar identifier which does not make use 
of such dependencies In [3] and [ 11 the authois propose identification 
algorithms in class 1) that make use of paiameter dependencies 
The mam contribution of this papei is that it introduces i family 
of identifiers in class 2) fFiat exhibit belter transicnl pcrlormance lhan 
a standard identifier The method of proof used lo show the supciior 
transient performance can be used lo prove that the parametei criors 
resulting from using the identifier in [3| in class 1) arc smallu lhan 
the corresponding errors using a standard least squares identification 
algorithm 

Recently some authors have studied the pioblem ol iobusi adaptive 
control of linear time invariant systems [6] [41 As pointed out in 
[6] one of the main limitations to the theory is the lack of a more 
convenient quantification ol the transient performance We believe 
thal the results presented in this paper are preliminary attempts at 
such a quantification 

The paper is organized as follows Section II proposes a new 
observer based identifiei for a class of nonlincai dynamical systems 
and states theorems regarding its stability and transient peitormancc 
to compare the transient performance of the proposed identihei with 
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t hat of the standard identifier, Section III shows simulation results 
for adaptive identification of the longitudinal dynamics of a vehicle; 
Section IV outlines the main results and provides future directions 
lor research; Section V contains the proofs of the theorems. 

II. Parameter Identifiers 

Throughout this paper, we use | • | to denote the Euclidean norm 
of a vector and || • || to denote the induced norm of a matrix. 

A. Background 

Consider the nonlinear time-invariant dynamical system 
r = W' t (.!\ a)0' 

.,-( 0 ) = .fa ( 2 . 1 ) 

where .r E R n denote the state of the system, u E R‘“ denotes the 
control input to the system, 0* E R v is a constant vector of unknown 
parameters, and is a p x n matrix of piecewise continuous 

functions. H r denotes the transpose of the matrix If. 

Parameter Structure: In (2.1), components of the unknown pa¬ 
rameter vector 0* are known functions of the physical parameters 
(eg., engine time-lag (r) and drag coefficient (K,i) in the vehicle 
dynamics (2.10) with 0* defined in (2.13)). Previous papers [ 13J, [14J, 
116]. [ 2 ] have proposed observer-based identifiers, for ( 2 . 1 ), which 
do not exploit these parameter dependencies. 

Identifier Structure (Not Using Parameter Dependencies) [13], 
{141,1/61,1101: Let A E /?" x " be a Hurwitz matrix and Q E R n * n 


be a given symmetric positive definite matrix; let P E /?"*" denote 
the unique symmetric positive-definite solution of the Lyapunov 
equation 

A l P + PA = -Q. (2.2) 

For the dynamical system in (2.1) the identifier is chosen to be 

i" = -.r) + H'V. u)f>" (2.3) 

= —U (.r. n)P(-v" -.!') (2.4) 

.*■” (0) = .?•(, (2.5) 

#"(()) = #u. (2.6) 


The superscript n in r ". 0" indicates that the Lyapunov function, 
used to prove the stability of the above identifier, is unconstrained 
with respect to parameter dependencies in ( 2 . 1 ). 

Assumption 1 (Boundedness of Regressor): We assume that If : 
/r x R" 1 —♦ R llX " is bounded. 

Stability of the Identifier: Define the stale error and the parameter 
error by 

- (2.7) 

0 ":= 0 " - 0 *. ( 2 . 8 ) 

Theorem 1 (Stability of Identifier (2.3h(2.4)): Consider the dy¬ 
namical system (2.1) with the identifier (2.3)-(2.4). Let Assumption 
! hold; then t " E L > D I *. r" E L * . and 6 n E L . x .. 

Proof of Theorem I : See [16 (and references therein)], [11, sec. 
2.4). The proof of Theorem 1 uses a standard unconstrained Lyapunov 
function candidate 

rv\ 0 n ): = yT Pc" + (2.9) 

■T New Identifier 

In many engineering applications, the components of 0* E R v are 
mown functions of a strictly proper subset of 0* for / = 1 . 2 , ■ ■ ■ .p. 
0* denotes the rth component of 0*.) This is illustrated in the 
allowing example. 


Example: Consider the following nonlinear differential cquaium 
representing the longitudinal dynamics of a vehicle [15], [ 12 ]: 


.r 



.. , Kd .2 . 
■*' -F —■*' + 
m 



a 

niT 


( 2 . 10 . 


where r is the position of the vehicle with respect to a fixed reference 
point O on the road; I\ ,i denotes the vehicle’s aerodynamic drag 
coefficient; rn denotes the mass of the vehicle; r denotes the vehicle s 
engine time-constant; d w denotes the mechanical drag of the vehicle; 
and u denotes the throttle-command input to the vehicle’s engine. 

Equation (2.10) can be put into the form 


where 


«'(.[•, j\ i/): = 


and 


.r = w 1 (i 

.V\ u)0* 

( 2 . 11 ) 

2 

d ni ii ,r ~' 

■1 

— — j.i', — .r 


( 2 . 12 ) 

rn 

rn m w 

f/u. 

I. *-] T 

(2.13) 


For the vehicle dynamics (2.11), = 0*0J. 

Our approach in designing a new parameter identifier tor (2.1) is to 
exploit such parameter dependencies while maintaining the stability 
of the identifier. 

Parameter Dependencies: 

Definition (Parameter Constraint Set): Consider the dynamical 
system (2.1) with 0* E R p - The parameter constraint set is an .V- 
dimensional C 1 submanifold of 1V } parameterized by 0*. ■ ■ ■ , 0 ^. 
Hence, the parameter constraint manifold is described by 


(?({“)«< * = -V + f ■■*/'}) 

: = {0 E R 1 ’ | 0, =-m(0i. 0.v ) for/= + ],■•■,/»} 

(2.14) 


where for /' = iV + 1 , • - ■ 7 , : ,/? A —<■ R is a known C 1 map (i.e., 

7 , is at least twice continuously differentiable). 

Penalty function: Consider the parameter constraint set in (2.14). 
We define a penalty function P: R v — R with 

P(#):= (2.15) 

where A,- > 0 for all i — N + 1 . ■ • - .y >. 

For the vehicle dynamics (2.11). the parameter constraint set is 
C( ( 7 ;,}) - [0 E R a I 0:\ = 71 (0i, 0*_»): = 0|0 2 J- The penally 
function (2.15) with respect to the parameter constraint set C( {':•}) 
is 

P(0) = ±A:,[ 0 :l -M 2] 2 (2.16) 


where > 0 . 

Identifier Structure (Using Parameter Dependencies): Let A E 
R u *\ Q E /?"*", and P E R" *" be the same matrices used in 
(2.2) and (2.3)—(2.4). Let P be the penalty function in (2.15) for the 
parameter constraint set in (2.14). 

The new identifier for the dynamical system (2.1) is 

■r = ,4(.r' - .r) -F VI T (.r, i/)[ 0 r -F DP(0 r )} (2.17) 

0 = u)P(r -,r) (2.18) 

i‘ r (0) = .ro (2.19) 

0* (0) = 0n (2.20) 


where DP (•) denotes the differential of the penalty function. 

We have used the superscript c in .r‘ , 0‘ to indicate that the 
Lyapunov function candidate, used to prove the stability of the above 
identifier, is constrained with respect to parameter dependencies in 
(2.1). (See (2.23) below.) 
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Assumption 2 (Existence and Uniqueness): Throughout this pa¬ 
per, we assume that the solutions of the differential equations 
(2.17H2.18) with initial conditions (2.19H2.20) exist and are 
unique on TO, oO- 
Stability of the identifier: Denote 

• r - .1- (2.21) 

0 ':=( 9 ' ( 2 . 22 ) 


Theorem 2 (Stability of identifier (2.I7)-(2.1R)): Consider the dy¬ 
namical system (2.1) with the identifier (2.17H2.I8). Let t'. o' 
denote the identifier’s state-error and parameter error vectors, re¬ 
spectively, Suppose Assumption I (Boundedness of Regressor) and 
Assumption 2 (Existence and Uniqueness) hold, then <' 6 L 2 H 
P € L , and o' € L ^ . 

Proof (Theorem 2) See the Appendix. 

Comments: a) Proof of Theorem 2 uses a Lyapunov function 
candidate of the form 

V (<', o' ): = i< ,T /V + iflk'V + PW +#*). (2.23) 


Comparing (2.9) with (2.23), we note that the penalty term P(<5>' +0*) 
is the new addition to the standard Lyapunov function candidate. The 
design of the new identifier (2.17) -<2.18) is motivated by using the 
Lyapunov function candidate (2.23). 

b) In [3, (2,14)], the authors propose a similar Lyapunov function 
for improving the robustness of a least squares type identification 
algorithm. The main differences between the identification algorithms 
presented in (3] and the observer-based identifier in (2.17H2.18) 
are: 1) the identifier in (2.17M2.18) belongs to the so-called model 
reference identification family of algorithms [11, p. 50]. The algo¬ 
rithms in this family are useful for adaptive conlrol. The identification 
algorithms in [31 use the input -output relations to estimate the 
unknown parameters; 2) the dynamical systems in [3] are S1SO. 
linear, time-invariant, whereas, we considet MIMO, nonlinear, time- 
invariant dynamical systems (see (2.1)); and 3) in |3|. the parameters 
are multilinear functions of the unknowns (i.e., the sec (2.14), are 
multilinear functions); whereas, we allow more general parameter 
dependencies in the definition of the parameter constraint set (2.14). 

Transient Performance We now compare the transient perfor¬ 
mance of the new identifier (2.17)-(2.18) with that of the standard 
identifier (2.3M2.4). 

Consider the dynamic system represented by (2.1) with the pa¬ 
rameter constraint set (2.14). I^et t >-> (H\'(1 ),«■ ■, (1 ), 

+1 (f ),-■■. 0JJU)) 1 £ R 1 ' denote the solution of (2.4) with initial 
condition 0"(O). 

Assumption 3 (Deviation from Const) amt Set) We assume that 
there exists a j 6 {.V + l,- ■ ■ ,/>) such that 


dt 


K(0)*(0). • ■.(0))] 


*0. 


(2.24) 


{2 I) 


f r : i?+ —> /?", o' : R+ —> R v be defined as in (2.21), (2.22), 
respectively. Under these conditions, there exists an f > 0 such that 
for all f 6 |0. r), 


1 (e)PC (0 + 1 (*)0* (O < [*-)P< 'V) + o ul if)o“{f). 

(2.25) 


Proof (Theorem 3) See the Appendix. 

Comments 1) Theorem 3 is the main result of this paper. It shows 
that initially the proposed identifier (2.17M2.18) results in smaller 
state and parameter errors than the standard identifier (2.3H2.4). In 
[3], the authors present simulation results to verify the improvement 
in the overall robustness characteristics of their proposed identifiers; 
they do not prove lhai their identifiers result in smaller parameter 
errors. 

2) Theorem 3 does not make any assertions comparing the mag¬ 
nitude of the parameter errors between the two identifiers. Using 
the same steps in the proof of Theorem 3 for Ihe least squares type 
identification algorithm in [3, eq. (2.18)], one can show that 


there exists an f > 0 such that for all r £ [0. r), 

«/ I (f)rt'(f)<»" , (f)rt ,l (f). (2.26) 


Whether (2.26) remains valid for the identifier (2.17)—(2.18) remains 
an open question. 

3) In general the largest value of tor which (2.25) holds, depends 
on the choice of the penalty function P( ■). the exogenous input vector 
u . and the dynamics of the system. 

4) At the present time, there are no systematic procedures for 
selecting the values of A, in (2 15). Intuitively, the choice of the A, 
reflects a tradeoff between the weighted norm ol the identifier’s state- 
paramctei erroi vectors and the deviation of the parameter estimates 
from the parameter constraint set. 


Ill Simulation 

To compare the transient performance of the proposed identifier 
(2.17H2 18) with that of the standard identifier (2.3)—(2.4), we ran 
simulations using the nonlinear differential equation (2 10) represent¬ 
ing the longitudinal dynamics of a vehicle. 

The standard identifier lor (2.11) is 


d . II 

dt J 

.. a 

= -<T,{ * - 

.r ) -t- a 

1 [ (.i. 

.V . n )H n 

(3.1) 

r»" 

dt 

= —(T f ,ie(.i , a 

\ v)Cr 

— .r 

) 

(3.2) 

where .V (0) = .in 

. 0“(O) - iv 

.. 11 

1 £ 

n. r 

€ P'. a, 

> 0, and 


(To > 0. 

Using the penalty term P(-) in (2.16), the proposed identifier for 
(2.11) is 


Remark. Intuitively, Assumption 3 indicates that the direction of 
the velocity vector of the parameter estimates at t = 0. using the 
update law (2.4), lies outside the tangent space of the parameter 
constraint set (2.14) at 0"(O). Hcncc, aftet 1 — 0. the parameter 
estimates may not lie on the parameter constraint set (2.14). 

Theorem 3 (Transient Performance): Consider the dynamical sys¬ 
tem represented by (2.1). Let the proposed identifier (2.17M2.1K) and 
the standard identifier (2.3M2.4) have the same initial conditions 
(2.5H2.6). Suppose the parameter estimates initially lie on the 
parameter constraint set (2,14). Suppose Assumption l (Boundedness 
of Regressor), Assumption 2 (Existence and Uniqueness), and As¬ 
sumption 3 (Deviation from Constraint Set) hold. Let <": R + —> /?". 
o n : 7?+ —* R 1 ' be defined as in (2.7). (2.8), respectively. Let 


d,, 

r) + ir ' (.i. 

!•. u)[H' + DP(H')] 

(3.3) 

d /l( . 

— ft = —(T0 A 

dt 

if )(.•■' - .(•) 


(3 4) 


where i' (0) = j* 0 . 6 1 (0) - 0 O , .r e /?, and O' € R\ 

Vehicle Parameters In all the simulations conducted, the vehicle 
parameters were as follows: m = 1130 kg, d,„ = 150.71 N, 
K ( i ~ 0.300 kg/m, and r = 0.18 s. 

identifier Parameters Wc chose (Tj — 1 and rr^ = 2. For the 
proposed identifier (3.3H3.4), At = 2 was used for the penalty term 
P(ft) in (2* 16). The initial state and parameter estimates were ro — 1. 
0n = [0.44. 3. 2_2] 1 , respectively. 
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-3 L-i-—... .. 1 

0 2 4 6 B 10 

I ts) 

Hj, I o (t) t (t) (/ = J 2 3) i (/) and t (/) versus f 

Simulation S (tup Initially the vehicle was traveling at a constant 
speed ot 17 9 m/s (i c i (0) =■ 17 9 m/s * (0) - 0 m/s‘ i (0) - 0 
m/s*) Starling at time f - 1 s, the throttle command input increased 
linearly lrom its initial steady stale value of 181 N al a rate of I H6 
N/s until it reached its final value ol 1S17 N al / — 2 s 
Simulation Ri suits \ ig 1 shows the state and the parameter er 
mis lor the standaul identihei HI) O 2) and the proposed identifier 
0^) (14) Note that the magnitude ot the parameter errors using 
the proposed identihei (1 1)-(1 4) arc smaller than the corresponding 
utois using the standard identifier (1 l)-(l 2) for a length of time 
after t - 0 s Also note that in eithei case the paiametcr enors do 
not converge to zero 

Fig 2 shows the weighted norm of the state parameter error \ ectors 
for the standard identihei (1 l)-0 2) (i e l I 1 (e '(/) o 4 (f))) 

tnd the proposed identihei (11)-(14) (i e, f m | (< (t) q (/))) 
Note that for all 1 €[0, 10] I * (c (t) p (/)) < I (* (t) o (/)) 
These simulation results show that the proposed identifier 
(1 has belter transient performance than the standard one 

il l)-0 2) Thus, we expect that using the parameter estimates from 
he identifier (11H1 4) together with a certainty-equivalence control 
law will result in superior closed-loop adaptive-control performance 
of a vehicle’s longitudinal dynamics Simulations o! indirect adaptive 
control laws for a platoon of vehicles support this conclusion 

IV Conclusion 

We have proposed a family ol observer based identifiers for a 
class ot nonlinear dynamical systems which use the a prion known 
parameter dependencies Under mild assumptions on the dynamical 
system we have shown that 1) the proposed identifiers are stable (see 
Theorem 2), and 2) the weighted norm ol state-parameter errors using 
the new identifier are less than the corresponding errors using the 
standard identifier, for a length of time after t = 0 (see Theorem ^) 
There remain a number of open questions 




1) How can we select the coefficients A tor / — A + I p m 
the definition of the penally function P{ ) (see (2 IS)) so as to attain 
the ‘best’ transient peifonnance* 

2) Is the proposed identifier more robust with respect to pai.imeter 
variations and exogenous measurement noise } 

1) Can we design stable identifiers which have the least number ot 
states (eg the update law only updates the basic paramcteis other 
parameter estimates are then computed using estimates of the basic 
parameters ) } 

We believe that satisfactory answers to the above questions will 
lead to systematic stiategics lot designing robust nonlincai adaptive 
control laws 

Appendix 

Hi oof of Jheoiem 2 

Consider the following lyapunov function candidate 

1 U „) = if ' P< +V' 'u +r(n + #*) (SI) 

Differentiating both sides ol (SI) with respect to lime, and using 
(2 2), we get 

1 ( f (/) * (f)) = -±r l {t)Q( (t)< 0 (*>2) 

Hence by the standard reasoning r £ and n £ I ^ Since 
\\ is bounded (le. Assumption 1) DP{ ) is bounded (DP{ ) is 
continuous and n +0* £ I < £ L ■*. and o £ l -x we 

get ( E lx 

Integrating both sides ol (S 2) from zero to t and noting that 
Q = Q ] > A^/„x„ >0 (A^ is the smallest eigenvalue of Q) 
we get 

1 (e'(f) <*(/))-1 (r (0) <>'({))) < 1 {T)< (r)rtr 

(S D 

Dividing both sides of (S 1) by - jA^, we get i £ L> □ 
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Fig 2 \ l [( (/) o (0) \ 11 (* (1) f»'(0) versus / 


Proof of Theotem 1 

Consider the Lyapunov function candidates (2 9) and (5 I) toi the 
erroi dynamics (2 3 y (2 4) and (2 17) (2 18) respectively Since the 
initial paiamctcr estimate Ho lies on the parameter constraint set (2 14) 
(i e , m) = 0) and both identihers have the same initial conditions 
from (2 9) and (5 I) we get 

\ (< (0) O (II)) = 1 "(r (0) r> (0)) (S 4) 

Diflerentiating both sides of (2 9) with respect to time we get 

!(• 1 ) 1 ' U) « (t)) = -r' (t) (5 S) 

Thus from 2), (S 5) and given < (0) = < (0) we gel 

'uni !s,(< 10) " (0))-- l ( _ , )U ,,(< (0) <>(())) (5 6) 

Since ( (0) = i (0) rt(0) = c» (0) DP(a (0) + «') = 0 

« (0) = < (0) (*> 7) 

Dilleieniiatmg both sides of S) and (“i 2) with icspect to lime 
(S7) and given < (0) — > (0) we gel 

'in-nji»i(* (0) o (())) = Vania t) (r (0) o‘ {(») <S8) 

Writing the favlur expansion ot 1 (r (f) o (0) - 1 ' (<•) 

o '(*)) about / “ 0 and noting (5 4) (5 6) and (S 8) wc get 

1 (' (0. » (*)>-! o 1 !,))-(){*') (59) 

where ' •-* (7(r l ) denotes a (unction such that lim _o|0(* ’)/' ( | < 

X 

Foi i = \ + 1 p let 

7(0 — - „(0,U) (S 10) 

Since H (0) lies on the parameter constraint set (2 14), for / ■= 
\ + 1, p we have 


Since r (0) = r (0) n (0) = (0) (lor i = \ + 1 p) 

ql0)= Ji^ (,)) ~ ( ^ l(0) H12) 

Successively difleientialmg both sides ol (2 IS) with respect to 
time along the solution trajectories of (2 17) (2 18) wc get 

i 

1 \ I ). IK)(# ) = 53 \ </<1 (*> M) 

\ -i i 
i 

P(- i*)( M)(^ ) — M7 +77) (^ 14) 

- v 1 1 

Hence from (S 11) (2 IS) (S M) and (S 14) we gel 

P(V «)))-() (SIS) 

Pi a. 1 )U IK)(# (0)) tr () (S 16) 

/ 

Pi> i->u .ml# (0)) = £ (0) (5 17) 


From (S 12) and Assumption 3 (deviation fiom constraint set) wc 
note that for some j £ { \ + 1 p} 7,(0) ^ () Hence from 
(S 17) we note that 

Pu (°)) > V/7< 0) ^ 0 (S 18) 

Writing the Taylor expansion of P{H (r)) about / - 0 and noting 
(SIS) and (5 16), we get 

P(0 < (f)) = f 2 P u ,- )w (0)) + O(e*) (S 19) 

From the definition of W in (S 1) and I " in (2 9) we note that 
\ (<(0 MO) = 1 "(0(0, u (O)+P(MO + #*) (S 20) 


7 (CD = 0 


(S ID 
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Substituting the expression for V‘(r‘‘(f). <j> c {()) from (5.20) into 

\9). we get > 

I’VR), V(f)) + P(» r ( f ))-r , (c"(f), «>») = 0 (A (5.21) 

Substituting the expression for P(9'(f)) from (5.19) into (5.21), 
after algebraic manipulations, we get 

r'V (0. <tS (€))-V u (e"(f). *“(*)) 

( 2 ■■ 

= -J p U 1T)(2.|K) («"(())) + 0(f’). (5.22) 

From (5.18) and (5.22) we get 
r"(r ,r ff). v'(f)) - r'W). d>"(e)) 

+ (5.23) 

By definition, lim«_o|0(f‘‘ )/f A \ < oc; hence, there exists an 
( > 0 such that for all f € [0, f ), the right hand side of (5.23) 
is negative, i.e., 

-y\i«i(O) + 0('*><O. (5.24) 

Thus, from (5.23) and (5.24) we get, for all r £ [0. f), 

V"U' (€l </>») - r>”(r). <!>"(€)) < 0. (5.25) 

Noting the definition of V" in (2.9), from (5.25) we get, for all 

f € [0, f), 

f fJ (f) Pc (O + o' 1 (f)o' (f) < e ul (e)A*'(f) + 0 u1 (f)o"(f). □ 

(5.26) 
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Computation of Approximate Null Vectors 
of Sylvester and Lyapunov Operators 

Ali R. Ghavimi and Alan J. Laub 


Abstract —This paper describes an effective algorithm for computing 
approximate null vectors of certain matrix operators associated with 
Sylvester or Lyapunov equations. The singular value decomposition and 
rank-revealing QR methods are two widely used stable algorithms for 
numerical determination of the rank and nullity of a matrix .4. These 
methods, however, are not readily applicable to Sylvester and Lyapunov 
operators since they require on the order of //' arithmetic operations on 
order n 2 data. For these problems, a variant or inverse power iteration is 
employed to compute orthonormal bases for singular subspaces associated 
with the small singular values. The method is practical since it relies only 
on the ability to solve a Sylvester or Lyapunov equation. Certain practical 
aspects are considered, and a direct refinement technique is proposed to 
enhance the convergence of the algorithm. 

1. Introduction 

The purpose of this paper is to describe an algorithm for com¬ 
puting approximate numerical null vectors of a matrix operator 
associated with the Sylvester equation AX + XD ~ C or the 
symmetric Lyapunov equation AY + l\4 y = Q ~ Q 1 , where 
X £ R nX,, W — Y 1 £ R' ,x ", and the coefficient matrices are 
compatibly dimensioned. These equations arise in many applications 
in applied mathematics and control theory. For example, the solution 
of linear elliptic boundary value problems on rectangular domains 
can be written in the form of a Sylvester equation [22], 127]. 
The Sylvester equation also arises in connection with multi-inpul 
pole assignment problems [6], [16] and the design of reduced- 
order observers that achieve precise loop transfer recovery [3], 
[26]. The Sylvester equation also plays an essential role in many 
eigenproblems. Examples of such algorithms that require solution 
of a related Sylvester equation include invariant subspace problems, 
block diagonalization of a matrix [10], reordering the eigenvalues 
of a quasi-triangular matrix [1], and computing real square roots of 
a real matrix [13]. Important applications of the Lyapunov equation 
include stability analysis of linear systems [18], 119], model reduction 
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(5J, f 14 ], [21], and determination of covariance matrices in filtering 
and estimation. 

In floating-point arithmetic with finite precision, it is quite possible 
that these equations are nearly singular. Equivalently, some of the 
small singular values of the matrix operator 


5=7,,, ,4 + B r •. 

J g R-.-.X...,, 

0) 

fo t the Sylvester equation, or 

L = /„ .4 4* A k : 

/„ e R" 3 *" 2 

(2) 


for the Lyapunov equation, arc of the order of the relative ma¬ 
chine precision (here denotes the Kronecker product [12]). In 
this situation, computed solutions of the matrix equations using 
general-purpose algorithms are usually of large norm and generally 
inaccurate. 

Examples of ill-conditioned Sylvester and Lyapunov equations can 
be found in a variety of problems in control theory, especially in 
connection with the control of large-scale dynamical systems and 
problems subject to /f-* constraints. For example, consider ihe 
deterministic optimal control problem whose quadratic performance 
index is of the form 

min / {> 7 (t)G.v(t.) -f v 1 (t)Rv(t)]dt 
a Jo 

subject to x(t) = A.r(t) + Bn(t) with a*(0) = .ro. where R = 
R 1 > 0 and G = G J > 0 are symmetric matrices, B is an 
ii x w input matrix, and n(t) is the control law. Under the usual 
stabilizability and detectability assumptions for (.4, B) and (-4,(7), 
respectively, the optimal feedback control for this cost function is 
given by = -./?”* B ] Px{t) and the associated minimum 

cost is ,7|,iiu = ,» o Pro, where P is the unique symmetric nonnegative 
definite stabilizing solution of the algebraic Riccati equation 

PA + A 1 P - PDR- 1 B 1 P + G = (). ( 3 ) 


A reliable algorithm [20] for solving (3) involves a Schur decom¬ 
position of the related Hamiltonian matrix 


.4 -BR l B r 
-G -A 1 


= QSQ 7 : S = 5 ( ’’ 



where Q is orthogonal and 5 is quasi-upper triangular. The eigenval¬ 
ues of 5 are ordered via additional orthogonal similarities if necessary 
so that 5u and S 22 are quasi-upper triangular and 5 11 contains the 
stable eigenvalues. Reordering of a Schur form requires solutions of 
certain Sylvester equations where the coefficient matrices are related 
to the diagonal blocks of S. For the sake of illustration, let 



where An € R 1 '* 1 ' and A 22 € with p.q = I, or 2. The 

problem is to find an orthogonal matrix .V € *(?+</) suc j| that 


and .4n and Art have the same eigenvalues. In other words, .4 is 
similar to .4 and the eigenvalues of the (1, l) and (2, 2) subblocks 
have been interchanged. It can be shown that .V is obtained from 
the QR factorization 



where A 6 R /,x// solves the Sylvester equation A 1 j A - A\4 22 = 
A\ j. This Sylvester equation can be nearly singular for a number of 
reasons. A case of particular interest is when in and .4 22 contain 


eigenvalues that are reflections of each other with respect to the 
imaginary axis and are close to the imaginary axis. That is, if a ± iij 
are the eigenvalues of A vi , then -a ± iij are the eigenvalues of .4 22 
and 0 < o <£ 1. In this case, the associated Sylvester equation is ill 
conditioned, N can be computed only approximately, and the zero 
block in .4 is actually nonzero. As a result, the computed solution of 
(3) may not have sufficient accuracy and, consequently, may induce 
instability in the computed closed-loop matrix. 

Computed solutions of (3) may be refined via Newton’s method 
[17]. This involves rewriting (3) as P(A - FP) + (-4 - FP ) J P = 
-G - PPP, where F = BR~' B J . and suggests the iteration 

Pfc+iM - FPk) + (.4 - FPu ) 1 A+i = ~G - PaPPa (4) 

which can be shown to be equivalent to Newton’s method for solving 
(3). It can further be shown that if -4 — FPu is stable, the sequence 
generated by this iteration ensures the stability of the “closed-loop” 
matrices .4 - FPa at each step and lim*— x Pa = P. Potential 
difficulty may arise, however, when the Lyapunov equations in (4) 
are ill conditioned. This occurs if, for example, A — F P a has an 
eigenvalue very close to the imaginary axis. In this case, the procedure 
can give inaccurate solutions which may cause the process to leave 
the region of convergence for Newton's method. 

An alternative approach for solving ill-conditioned Sylvester and 
Lyapunov equations is to consider least-squares-lype solutions. This 
then allows a computed solution to be expressed more accurately in 
terms of a part that is purged of the numerical or approximate null 
vectors (in a sense to be defined below) and a part thal is spanned by 
the numerical null vectors. As in the case of linear sysiems, however, 
a knowledge of approximate null vectors is needed as a basic tool 
lor any least-squares solver. 

The numerical rank and nullity of a matrix are best understood in 
terms of its singular value decomposition (SVD). Many theorems and 
results in linear algebra depend explicitly or implicitly on knowing 
the rank of a matrix. Computational difficulties may arise, however, 
when working in finite precision. For example, a matrix that is known 
to be nonsingular may be nearly singular. It is therefore appropriate 
to define numerical f-rank and a -nullity for some small f. In (8| and 
110], these concepts are defined in terms of the SVD. Specifically, 
it is shown that the t -nullity of a matrix is equal to the number 
of singular values that are less than a prespecified t in magnitude. 
Moreover, the numerical null vectors are best characterized in terms 
of the associated singular vectors. In this respect, the remainder of 
the paper adopts this convention and we refer to f-null vectors of a 
nearly singular matrix as the approximate or numerical null veclors 
of that matrix. 

The organization of the paper is as follows. Section II describes 
a variant of the simultaneous iteration method for the dominant 
eigenvectors of a Hermitian matrix [23] to implement an algorithm for 
computing approximate null veclors of a matrix. A simple refinement 
technique is then proposed to enhance convergence. Section III 
specializes the method to nearly singular Sylvester and Lyapunov 
equations. Certain practical aspects also arc considered to exploit 
efficiency. Section IV presents a numerical example to demonstrate 
further the effectiveness of the algorithm and concluding remarks are 
given in Section V. 

II. Simultaneous Inverse Iteration Method 

This section considers the subspace version of an inverse iteration 
algorithm for finding the smallest singular value of a matrix [28]. 
The method is analogous to the orthogonal iteration technique (a 
generalization of the power method) for finding a prescribed set of 
the dominant eigenvalues of a Hermitian matrix [23]. The differences 
can be resolved by first viewing the singular values of a matrix A in 
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ins of the eigenvalues of the Hermitian matrix 1 1 ,4, and secondly 
plying the orthogonal iteration to V 1 rather than 4 Note that 
mlar ideas can also be found in [151, [28] and other scattered work 
joughout the literature An expository outline of the main ideas is 
lesented here for the sake of completeness 
Suppose that 4 e R" x " is nearly rank m-deficient That is, 4 has 
smaW singular values that are of the order of the relative machine 
piecision Define an SVD of 4 by 


ti¬ 

ll 

M 

*** 

= [Y, 

Xi] 

‘Si, 

0 

0 

t»l 

1j] t 

0) 

where <r 1 > a 2 > 

> (T, 

— it 

» (r, 

-t +1 

> 

> <T 

> 0 


and the partitions are conformal It then follows that the orthonormal 
columns of \> and are bases for the approximate left and right 
null vectors of 4, respectively [10] When the SVD of 4 is not 
available explicitly, the near-singular subspaccs \j and can be 
jppioximated implicitly by L and \ , respectively as lollows 

Simultaneous Insersc Iteration Algorithm 

■ Choose [ n G R * with orthonormal columns (almost arhi 
trary) 

• Loop until convergence 

1) Solve If 11 = C 

2) 1 +i — 1 + 1 'T + i (Q1 factorization) 

4) Solve 1 1 1 fi =4+j 

4) l ,m = l +ii?+i (QL lauon/ation) 

• Fuel loop 

• l -— converged I ^ i and T *— conveiged \ 41 

The QL factorization in the above algorithm is a natural extension 
ul the well known QR lacton/ation This decomposition is ol the 
form \ — (jl where Q £ R y has orthonormal columns and 
l 6 R y is lower tnangulai It is useful however to focus 
mention further on the unique choice ol the QL factorization of 
l f H y (or possibly squaic) by icquinng Z £ R x to be 
lower triangular with positive diagonal elements I he uniqueness of 
such a decomposition follows by similar analysis lor the skinny 
QR factorization [10] 

The convergence ol the simultaneous inverse iteiation algorithm 
paiallels similai aigumenls for the convergence of the QR method in 
[28] or orthogonal iteration in [24] A detailed convergence proof 
of the algorithm is also given in [9] Seveial ol the key points 
die summaii 7 ed as follows The above algorithm conveiges to a 
set of m singulai vectors of t almost independent of the choice 
ol l u The convergence might be unstable however, in the sense 
that some of the converged singular vectors do not correspond to 
lliose associated with the small singular values This happens when 
v ( l 0 ) is orthogonal to at least one of the singular vectors in \ 2 In 
practice, this situation is somewhat unlikely to occui since rounding 
rrors usually turn unstable convergence into a delayed stable one 
n any case, a countermeasure should be taken to overcome this 
ndesirable effect It is easy to verify that the last column of l +1 
1 e, Step 4 of the above loop) is most accuiate in the sense that 
1 has the most contributions from the singular vectors associated 
vith the small singular values On the othci hand, the first column of 

+1 is the least accurate since it contains the components of singular 
ectors corresponding to singular values larger than a , +1 This 
uggesls that the first column of U,+ j can be replaced by a random 
ector that is orthogonal to the last m - 1 columns This process 
an be term mated once the convergence of the first column vector 
s ascertained 

If the singular values satisfy the condition stated after (5), the 
olumns of l\ and 1 k converge to vectors that are linear combina¬ 


tions of the left and right singular vectors, respectively 1 e tncs sp n 
the same subspaces Furthermore, when the small singulai valui ir 
all distinct and well separated from each other, then the linimm 
values of l\ and h converge to the individual singular vector 
denoted by the columns of V 2 and 1 2 , respectively The convergence 
rale in this case is quadratic with a ratio of max j —Li ~ j 
with 1 > ) Even if the small singulai values are pooily sepamlul 
Ik and I k converge to a linear span of the singular vectois in \ 
and 1 1 at a rate of (n- M _ „+ i/rr„ ,, ) J . a rapid convcigence rate by 
assumption Note that tor singular values of neaily equal magnitude 
additional delations for convergence to the individual singulai vectors 
is generally slow and computationally not tractable In othei words 
while the individual singular vectors associated with a sei of clustered 
singular values are poorly determined the singular subspaces spanned 
by them are well determined Moreover a direct refinement procedure 
may also be utilized to both enhance the conveigencc and leduce the 
computational cost 

The following refinement scheme is somewhat similai to a pro 
cedure used b> Stewart for accelerating the convergence of the 
approximate dominant eigenvectors of a matrix via the orthogonal 
iteration method [24] Similai techniques have ulso been used in 
connection with finding complex conjugate eigenvalues of a real 
matnx [28], diagonalization of a real synimetnc matrix 110 ], and 
calcuiation of the dominant invariant subspau ol a nonsymmetric 
matrix [2] These methods aie generally known as Ritz accclciation 
techniques 

Suppose that the orthonormal columns of l and I arc approxuna 
lions to the left and light singular vectors respectively conesponding 
to the set of small singular values of \ whose SVD is given by (S) 
Define P Q £ R x and B € R' y ' by 


and 


\\U 


Ti 

\'l 


r 




p/11 


'(it' 

J 


Q _ 


n = 1 1 w - r' \ei r \ 


( 6 ) 


^ r‘\Q = r!'.uQi + q> <7) 


Moreover denote the SVD of D by B — MW 1 where M and 
A are orthogonal and V ~ diag( A 1 \ \ ) whose entries 

satisfy \| > > > A When Pi — (J 1 = 0 Pi and (}i 

are orthogonal Phus D 22 = \ Furthermore if the* singular values 
are distinct then corresponding singulai vectors can be recovered 
by choosing U = P/ and \ = Q[ since IM — \P\J = 
\*P*P] = \> Similarly I \ = I The above suggests that 
when the column spaces of f and I die good approximations of 
the spans of the singular subspaces associated with the small singular 
values, Pi and Q\ are nearly zero, P. and Q> are nearly orthogonal 
and a \ In addition, l M and I 4 are better approximations 
to and 1 2 than l and \ , respectively The lollowmg states the 
algorithm in a formal manner 


A Refinement Method for Approximate Singular Vittois 

• Start with I and ^ as appioximations to \i and 1 2 in (5) 
respectively 

■ Form B — l 1 41 

• Compute the SVD of B = \I \ > 1 

• l <— l M and 1 <— I 

Details of the above procedure parallel similai results in [24] and 
has been examined closely in [9] Thus, only certain highlights 
of the method are summarized here Let = \ - £ 2 * denote 
the deviation of the refined singular values from the actual ones 
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Then |AE| = |A - S 22 | < M^IIIKM! + (ItPill 1 2 + 

|| Pi || 2 ||Qi || a + \\Qi \\ 2 )« where the absolute value of a matrix is in the 
componentwise sense. This hound implies that for small values of P\ 
and Qi , the refined singular values are therefore good approximations 
of their actual values. As far as the accuracy of the refined singular 
vectors is concerned, it is seen that V(V) and UM(VX) span the 
same space. Therefore, the refined singular vectors are just as good 
approximations of the singular subspaces of interest as C and \' are. 
Furthermore, CM and VX are better enhancements of the individual 
singular vectors than V and V, respectively. This can be seen from 
(UM) r MVS) = M'U'AVS = M l B Y = A as S,.,. whereas 
B — U 1 AY is generally a dense matrix. Moreover, the refined 
singular vectors tend to converge at a faster rate than the original 
ones. This suggests that the refinement algorithm can be employed 
after a few steps of simultaneous iteration to further enhance the 
convergence rate of the iterates. The next section incorporates the 
above results to construct bases for the approximate null vectors of 
nearly singular Sylvester and Lyapunov equations. 

III. Inverse Iteration for Syf.vester and Lyapunov Equations 

This section extends the simultaneous inverse iteration method to 
compute orthonormal bases for approximate null spaces of matrix 
operators associated with nearly singular Sylvester and Lyapunov 
equations. The extension follows by viewing the Sylvester or Lya¬ 
punov equation as the linear system S'.r = r or It/ = (/, respectively, 
where S and L are as specified in ( 1 ) and (2), and .r = vcc( X ).<* -- 
vec(f'). y sr vcc(V), and q = vec (Q). Note that vec of an m x n 
matrix M is the mo -vector formed by successively stacking the n 
columns of M on top of each other. Since the Lyapunov equation is 
a special case of the Sylvester equation, the results are presented for 
the Sylvester equation. The method applies to the Lyapunov equation 
with only a slight and obvious modification. 

Suppose that A £ IR" X " and B € R M,> " n are given and that the 
associated matrix operator 5 in (1) is nearly singular. Algorithms for 
deflated solutions of nearly singular linear systems are given in [7], 
[251. Note, however, thi(l while the extension of the simultaneous 
inverse iteration method to a nearly singular Sylvester equation 
seems natural, certain steps are crucial. Potential difficulties in using 
the method derive from the fact that numerical nullities are not 
known a priori and that the matrix S in ( 1 ) generally should not 
be formed explicitly. It is therefore logical to construct hases for the 
numerical null spaces one vector at a lime. Successive formation of 
the approximate null vectors involves normalizations with respect to 
the Frobenius norm, and orthogonalizations with respect to the inner 
product (»’.-) = tr 1 i ■= trdF'Z), where w = vec(lF) and 
: “ vec(Z). 

Let the orthonormal columns of P € and Q E R"" ,x,# 

denote the left and right singular vectors of 5, respectively, associated 
with the v small singular values. Die following algorithm constructs 
approximations to P and Q by T and V, respectively. 

Simultaneous Inverse Iteration for Nearly 
Singular Sylvester Equations 

• For nullity v > I, choose To € with orthonormal 

columns. 

• Loop until convergence 

1) Solve SI' = r„ for V 

2 ) F = IT (QL factorization) 

3) Solve S‘t = V for l' 

4) V = VB (QL factorization) 

5) GzzC'SV 

6 ) G = M\X‘ (SVD) 


7) r *— UM and V <— VK 

8) If A, > tol, a user-specified tolerance, for 1 < j < p, 

a) r <— U{\J+l : i/) (truncatethe firsts columns) 

b) V <— F(:,,/+l ; u) (truncate the first j columns) 

0 v = j 

d) Stop 

Else 

e) v =: V + 1 

f) fo = [rr] such that t\i = I 

g) Return to Innerstep 1). 

The following provides certain key details of the actual imple¬ 
mentation of the algorithm. As was mentioned above, the large 
structured matrix S is generally not available explicitly. Therefore, 
appropriate manipulations are necessary for operations involving S 
Let .i*a and in be the corresponding left and right singular vectors 
of S associated with the singular value rr^. Thai is, Sin = 
and S ] .i'k = ffk in . It then follows that AYk + 1 \ B = <tkXi 
and A 1 Xk *f A \B ] ” otVa, where A \ ~ unvec(.ii) £ 
= unvec(//A) E R" y,, \ and unvec(-) is the operator that 
“undoes" the vec operator, i.e., forms a matrix from a column vector 
stack. Furthermore. Innerstep 1 ) is a linear system with multiple 
right-hand sides. The equation is of the form 5TF = Z, where 
IV - (ir,. ■■■,«•,.) and Z = € R'"" v 1 . The hi h 

column of IT’ is found by solving Sun = -a. Phis can be written 
as .411 ’a 4- \\\B = Za, where IT* - unvec(in) 6 R M> "' and 
Zk — unvec() E R" Thus, the columns ot l V are obtained b> 
successively solving the associated Sylvester equations and setting 
u k = vec(U\ ). Moreover, Innerstep 3) is also a linear system ol 
the form S 1 TV = Z. Hence, IT can he solved by rewriting the 
Sylvester equation of the above loop as A 1 IT* 4 - I \\B l — Z*. It 
the Banels-Stewart algorithm [4] or the Hessenberg-Schur method 
1111 is used, these equations can he solved fairly inexpensively with 
the aid of previously computed factorizations ol .1 and B. Innerstep 
5) is of the form (I = (</,j) = IT 1 SZ £ R' **. Let < / denote the 
A th elementary unit vector. Then the entries ol (V are given by 

(/,, - r/ W 1 SZ* j = «’/ Si, = (veeir, ) y vee( 1Z, 4 -Z,B) 

- tr(ir/(.lZ, +Z,B)). 

Innerstcps 8 a) and 8 b) arc MATLAB notations for overwriting l 
and V by deleting their first j columns. Innerstep 8 c) indicates that 
the numerical rank of S is u. Innerstep 8 f) is realized hy letting 
//’ = : — VV 1 5 :, where ■ E R'"" is a random vector. Then 7 T - p-| ( 
and V = unvec(F). 

IV. Numericai. Results 

The following example is presented to illustrate the effectiveness 
of the results. All of the compulations were performed using MAT- 
LAB implemented on a Sparc 2 workstation. The relative machine 
precision reported by MATLAB is about 2.22 x 10 1,1 . 

Let A and B be of the form A = M'Air / and B = ZI Z 7 , 
where TF £ R M *" and Z € R"' y "' are random orthogonal matrices 
Moreover. A — diag(n. u — 1. ■■ ■ ,3, n.»,m ) and T = diag(m. in — 1 

■ .3. L. ), where oi.n^, and ij, are between 3.0 x 10 “ 11 
and 1.0 x 10 -,r \ Note that IF and Z in this example arc formed via 
the QR factorization of random matrices. A thorough discussion ol 
computing random orthogonal matrices can be found in [24]. Since 
IF and Z are orthogonal, the eigenvalues of A and B are given by the 
diagonal entries of A and F, respectively. Furthermore, 5 = M DM 1 
where M = (Z TF) is orthogonal and I) = A 4 - F I„ 
is diagonal. Hence, the eigenvalues of S are given by the sum of 
any two diagonal elements of A and V. Moreover, the singular 
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dues of S are the absolute values of its eigenvaues, i.e., S is 
^rmal. This implies that 5 has four small singular values, namely, 
11 Aih ), (oH -^2 h (<> 2 +/*i), and (a^Aih ) for any m, ti > 3. Thus 
,e Sylvester equation AX + XB =* C is nearly singular. For the 
:l ke of illustration, the dimension m was kept constant at 10 while 
l( - was varied from 3 to 100 in increments of one, and vice versa. 
Approximate null vectors of 5 were found directly by computing an 
•;VD of 5 and selecting the singular vectors associated with the small 
singular values. Next, the proposed inverse iteration algorithm was 
used to compute the singular subspaces of interest, In each case, only 
one or two iterations at most was necessary to achieve convergence. 
The results were compared numerically by measuring the angles 
between subspaces [10] obtained directly (via SVD) and those by 
the inverse iteration algorithm. It was observed that null vectors 
obtained by the two methods were essentially the same and were 
good approximations of each other. The computational advantage of 
the inverse iteration algorithm over the SVD method was apparent 
as the dimensions m and n increased. This results from the tact that 
the direct method requires the SVD of an urn x nm matrix, whereas 
the inverse iteration algorithm requires solving only a few n x m 
Sylvester equations. In terms of floating-point operations, the SVD 
method and the inverse iteration algorithm therefore require on the 
order of mn 3 n A flops and 1 iJ(ni 3 +n 3 )+7(wi z n’fmn 2 ), respectively, 
where n. J, and 7 are some modest constants. For example, if m = 
n = 10, the number of flops needed by the SVD method is on the 
order of 10*' compared to 10 1 for the inverse iteration algorithm. Even 
greater efficiencies can be realized when the computed factorizations 
of A and B in the first step are used in succeeding steps. 

V. Concluding Remarks 

An analysis has been given of computation of approximate nu¬ 
merical null vectors for matrix operators associated with Sylvester 
equations. The method is reliable, efficient, and requires nothing but 
the ability to solve linear matrix equations. Computational complexity 
can be reduced by saving computed decompositions of the coefficient 
matrices. The method becomes relatively more expensive, however, 
as the number of small singular values increases. 

One advantage of the inverse iteration algorithm lies in its modu¬ 
larity. Note that the linear systems in the innersteps 1) and 3) of the 
algorithm are independent processes. This means that the method can 
be easily extended to the Lyapunov equation AX A- XA 1 = Q or the 
discrete-time Sylvester equation A XB -I- X = C and the discrete- 
time Lyapunov equation AX A 7 - X = Q. In fact, it can be applied 
lo any matrix equation that can be fopnulatcd as a standard linear 
system. Another important factor is that the choice of linear solvers 
is also an independent step in the algorithm. This is significant since 
specialized solvers can be employed in place of conventional ones 
when appropriate. 
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Observer Design for Nonlinear Systems with 
Discrete-Time Measurements, Moraal and Grizzle . 

Observer Design for Nonlinear Systems has been a hoi topic in non 
linear conti ol theory since the early 1980 s when Krcner and Isidon 
published their seminal paper of linearization of nonlinear systems 
bv output miecuon and the relevance ol this concept to nonlinear 
observer design Much effort has been devoted to finding a nonlinear 
analog of the now standard observer construction procedures in linear 
systems theory Predominantly the method of attack has been to try 
out some nonlinear generalizations of linear techniques Even though 
there have been sporadic successes, these attempts have not lead to 
a rich theory 

Ihe work of Moraal and Grizzle is aimed at moving away Irom 
this approach and focussing more on techniques that have to do with 
inverting nonlinear maps The notion of uniform \ observability 
essentially says that there are a fixed number of inputs and a system 
of equations the solution of which yields the initial state ol the 
system uniquely Now the problem of solving the equation can 
be approached as any numerical analyst would do by resorting to 
numerical map inveision techniques such as the Newton Raphson 
method A suitable generalization leads to a system which the authors 
call a Newton Observer This procedure is desenbed in gieat detail in 
the paper and mild conditions that guaiantec convergence die given 
It is shown that Newton Observers are closely related to Kalman 
fillers An example to illustrate the concept is given in the form ol 
an application to a bioreuctor system 

A Probabilistic Approach to Multivariable Robust 
Filtering and Open-Loop Control, Ohm Ahlen andSternad. 

A new approach to robust filtering prediction and smoothing of 
discrete lime signal output ol modeling errors is piesented A simple 
design based on the minimization of the squared estimation error 
averaged both with respect to model errors and noise is obtained An 
optimal robust filter is calculated by means of an averaged spectral 
factouzdtion and an unilateral diophantine equation 

The robust estimator is relerred to as a cautious Wiener filter It 
turns out to be only slightly moie complicated to design than an 
ordinary Wiener filter The methodology can be applied to any open 
loop filtering or control problem This paper illustrates this tor the 
design of robust multivariable feedforward regulatois, decoupling, 
nd model matching filters 

A New Model for Control of Systems with Friction, 

anudas de Wit , Olsson Astiom, and Lischinsky, 

The modeling of friction is an important concern in the design 
ol control systems for high quality servo mechanisms Friction can 
lead to tracking errors, limn cycles and undesired stick-slip motion 
A high gain control loop can sometimes be used to compensate tor 
friction, but in other circumstances it is necessary to compensate for 
'he effects of friction based on a suitable friction model Friction 
models based in static maps, such as Coulomb and viscous faction, 
may not be satisfactory fpr applications such as high precision 
pointing or low velocity tracking Dynamic friction models are 

"This section is written by the Transactions Editorial Board 


generally belter fot such applications and a number of such models 
have been developed In this paper the authors introduce a new 
dynamic model for friction which is consistent with experimentally 
observed effects 1 he model properties relevant to control systems 
are investigated and discussed The authors show that the model 
predicts the limn cycle behavior sometimes observed in servos with 
PID controls They also use the model m the design of a friction 
observer to compensate for faction in position and velocity tracking 
control 


Adaptive Nonlinear Design with Controller-Identifier 
Separation and Swapping, Krstu and KoLorom, 

This paper introduces a modular approach to adaptive control 
design tor a class of nonlinear systems in parametric-strict feedback 
iorm By achieving a separation between the designs of the adaptive 
controller and the parameter identifier it is shown that the stability 
properties of the certainty equivalence laws can be enhanced thereby 
enabling a wide variety ol adaptive update laws to tune the control 
The former is designed such that the states ol the nonadaptive system 
are bounded if the paiametei errors and their derivatives are bounded, 
the latter guaiantees the boundedness of the parameter errors and their 
denvativcs A nonlinear veision ol the swapping lemma is used to 
facilitate the proofs 


Approximate Decoupling and Asymptotic Tracking for 
MIMO Systems, Godbole and Sasti). 

This paper proposes an algorithm for approximate input-output 
decoupling of nonlinear MIMO systems where the standard decou¬ 
pling algorithm produces poor results The systems considered are 
regular, but are numerically ill posed in that application of the 
exact decoupling algorithm requires inversion of an ill conditioned 
matrix The algorithm is numerically robust and also avoids pole 
zero cancellations thus providing controllers with reasonable gain 
even when the system has far off zeros The general algorithm here 
is motivated by the work ol vaaous authors on physical examples 
such as flight control reaction kinetics, and electric motors 

A Generalized Orthonormal Basis for Linear Dynamical 

Systems, Heuheiqa Van den Hof , and Bosgra . 

In many areas ol signal, system, and contiol theory, orthogonal 
functions play an important role in analysis and design In this paper, 
it is shown that theie exist orthogonal functions that are generated by 
stable linear dynamical systems and that constitute an orthonormal ba¬ 
sis for the signal space l 2 The orthogonal functions can be considered 
as generalizations of, c g the pulse functions, Laguerre functions, 
and Kautz functions, and give nse to an alternative senes expansion 
of rational transfer functions It is shown how these generalized basis 
functions can be used to increase the speed of convergence in a 
senes expansion, 1 e , to obtain a good approximation by retaining 
only a finite number of expansion coefficients Consequences for 
identification of expansion coefficients are analyzed, and a bound 
is formulated on the error that is made when approximating a system 
by a finite number of expansion coefficients 
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tioo Control Via Measurement Feedback for General 
Nonlinear Systems, Isidori and Kang . 

The Hog control problem is a well-motivated problem for a 
wide variety of engineering applications. The problem is very well 
Mudied for linear systems. A comprehensive and elegant solution is 
available for linear systems and is well documented in various recent 
pgpefr. The generalization of these results to nonlinear systems is an 


important problem. The authors provide a solution to the nonlinear 
version of the Hog control problem with measurement feedback 
(i.e. T the partial observations case), by investigating solutions of 
Hamilton-Jacobi inequalities. In their analysis, the authors follow 
the lines of the H optimal control results for linear systems. A 
certainty equivalence argument is applied to treat a nonlinear version 
of the Hoc control problem with imperfect observations. The results 
of this paper extend a number of recent achievements in this area. 
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Observer Design for Nonlinear Systems 
with Discrete-Time Measurements 

P. E, Moraal and J. W. Grizzle, Senior Member, IEEE 


Abstract —This paper focuses on the development of asymp¬ 
totic observers fpr nonlinear discrete-time systems. It is argued 
that instead of trying to imitate the linear observer theory, 
the problem of constructing a nonlinear observer can be more 
fruitfully studied in the context of solving simultaneous nonlinear 
equations. In particular, it is shown that the discrete Newton 
method, properly interpreted, yields an asymptotic observer for a 
large class of discrete-time systems, while the continuous Newton 
method may be employed to obtain a global observer. Further¬ 
more, it is analyzed how the use of Broyden’s method in the 
observer structure affects the observer’s performance and Its 
computational complexity. An example illustrates some aspects 
of the proposed methods; moreover, it serves to show that 
these methods apply equally well to discrete-time systems and 
to continuous-time systems with sampled outputs. 


I. Introduction 


A General 

T HE need to study state estimators (observers) for dy¬ 
namical systems is, from a control point of view, well 
understood by now. For the class of finite-dimensional, time- 
invariant linear systems, a solution to the observer problem has 
been known since the mid 1960’s: the observer incorporates 
a copy of the system and uses output injection to achieve 
an exponentially decaying error dynamics. For the class of 
continuous-time nonlinear systems, the reader is referred to 
138], [39] and the references therein for a summary of the 
theory up to 1986. More recent developments include the 
work of Krencr et al. [23] on higher-order approximations for 
achieving a linearizable error dynamics. Tsinias in [37] has 
proposed (nonconstructive) existence theorems on nonlinear 
observers via Lyapunov techniques. Gauthier et al. [12] and 
Deza et al. [9] show how to construct high-gain, extended 
Lucnberger- and Kalman-type observers for a class of non¬ 
linear continuous-time systems. Tomambd [36] and Nicosia et 
al. [31] have proposed a continuous-time version of Newton’s 
algorithm as a method for computing the inverse kinematics 
of robots; moreover, the latter paper also presents a symbiotic 
relationship in general between asymptotic observers and 


nonlinear map inversion. Finally, Michalska and Mayne 127] 
have used a dual form of moving horizon control to construct 
observers for nonlinear systems. 

Less attention has been focused on the observer problem for 
discrete-time systems. It was shown in [6] that certain prop¬ 
erties, like observer error linearizability [22], are not inherited 
from the underlying continuous-time system. Moreover, the 
class of continuous-time systems that admit approximate solu¬ 
tions to the observer error linearization problem for their exact 
discretizations with sampling time T in an open interval is 
limited to the class of nonlinear systems that are approximately 
state-equivalent to a linear system and hence is very restricted 
[5]. These results motivated the search for a structurally 
more robust approach to the observer problem. In Section 
II, it is argued that instead of trying to imitate the linear 
observer theory, the nonlinear observer problem should be 
studied in the context of solving sets of simultaneous nonlinear 
equations. This viewpoint is supported by showing in Section 
III that the discrete Newton method, properly interpreted, 
yields an asymptotic observer for a large class of discrete-time 
systems. In Section IV, a relationship between this observer 
and the well-known extended Kalman filter is established. 
As an extension to the result from Section III, it is shown 
in Section V, that the continuous Newton method may be 
used to obtain a global exponential observer. Section VI 
addresses an alternative to using Newton’s method in the 
observer design—namely, Broyden’s method—in the case that 
computational efficiency is an important issue. Finally, an 
example will illustrate the theory presented herein. 

Some of the results reported here have previously appeared 
in [16], [17], and [28], Extensions to the case of singularly per¬ 
turbed discrete-time systems have been presented by Shouse 
and Taylor [33]. 


B. Notation and Terminology 

Consider a continuous-time system 


y X = /(*,«) 

c ' y = h(x,u) 


( 1 ) 
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where x € JR n , u € JR m , and y € BF. Its sampled-data 
representation, obtained by holding the input constant over 
half open intervals [kT, (k -I- 1)T] and measuring the output 
at times kT, will be denoted 

E (T ): Ifc+1 = ^r(xk,uk) (2 ) 

Vk — h^XlcjUk) 


0018-9286/95S04.00 © 1995 IEEE 
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whore Xk ■— x(kT), yk •— y(kT), and u* := u(kT). The 
symbol means that the object on the left is defined to be 
equal to the object on the right; the reverse holds for 

It is worth noting that if (A,B,C,D) are the matrices 
describing the Jacobian linearization of (1) around a given 
equilibrium point, then (cxp( AT), fj' exp(Ar) Bdr,C,D) 
are the corresponding matrices for (2) about the same equi¬ 
librium point [14], Consequently, if the linearization of (1) is 
controllable and/or observable, the same will be true of the 
linearization of (2) for “almost all” T 135]. 

A discrete-time system will be denoted as 

Z: Xk+1 = (3) 

Vk = h(x k ,u k ) 

where x e IV',u G J? m , and y £ M p . It is convenient to 
let F u (x) := F(x,u) and h u (x) := h(x,u) so that things 
like F(F(x,ui) } U 2 ) and h(F(x,ui),V 2 ) can be written as 
F U2 oF Ux (r) and h u2 o F Ul ( x ) respectively, where “o” denotes 
composition. 

In the sequel, we will often be dealing with a set of N 
consecutive measurements or controls; these will be denoted 
as 


arises from a continuous-time system (1), (7) can be obtained 
directly from (1) by sampling the inputs and outputs AT-times 
faster than the state; in other words, x 3 := x{jNT),u l 3 = 
u((j - 1)NT + iT ), etc. More generally, one could sample 
the inputs and outputs at different rates, or even some input 
components at faster rates than others, but we will not pursue 
this here. Throughout this paper, the notation || ■ || will be used 
to denote both a vector norm and the corresponding induced 
operator norm. Finally, we recall that if g\ JR ri —► 1tt r * is 
at least once continuously differentiable, its rank at a point 
xq £ lR ri is the rank of its Jacobian matrix at x 0 , [4] that 
is, rank [^(j-q)]. 


11. Observers for Smooth 
Discrete-Time Nonlinear Systems 


A. General 

Consider a discrete-time system on ]R n 

£.■£*.+1 = F(x k ,u k ) 
Uk = h(x k ,u k ) ' 


( 8 ) 



’Uk-N+l ' 


+ 

£ 

1 

Jc 

Yffc-jv+i,fc] ■’= 1 


, £/[fc-JV+l,A] ■ = 



. yk . 


. VL - 


A second system 


Zjt+i = V(zk,iJk,Uk) 
jk = v{zk, yk, nk) 


(9) 


If N is fixed and clearly understood, then the abbreviations 
Y k and U k will be employed, so that certain formulas will be 
easier to read. 

To a discrete-time system £, we associate an N-lifted 
system (see [11]), T , N , by block processing the measure¬ 
ments and controls over a window of N sampling instances. 
Specifically, fix N and let Y, := Y Nj = i.jvj], 

Uj := U Nj = U[ N0 _ and Sj ::= x Nj . Wnte out Uj 
in terms of its vector components as Uj = col(fi], • ■ •, fi^) 
where <2* := Let 


with z k € some l > 0, is an asymptotic observer |25) 
for (8) if it satisfies: A) V.rj £ JR n ,Vu k £ 5? m ,3zi € ]R l 
such that x k = x k for all k > 2, and B) Vi*! £ JR n ,\/ u k £ 
lR m ,Zi G JR\ liiiiA.-.oo || s k - x k \\ = 0. If the read-out map r/ 
in (9) is the identity, i' k = z k , then (9) is called an identity 
observer [25]; if the convergence of x to s is exponential, then 
(9) is called an exponential observer. 

For later use, the observer (9) will be said to be dead-beat 
of order d , if, upon writing r(z k ,jj K ,u k ) =: r v *- ,u (zfc) and 
V(zk,yk,uk) =: n yk ' Uk (zk ), then 


*(x,0):=F* N o --o F {l \x) (5) 


0 ... 0 r yi ' ui {z 1 ) = x d (io) 


and 


H{x<V) 


V‘(f) 

h*\ F* N ~ l O-O^fi-) 


The TV-lifted system is defined to be 


(6) 


E*: . (7) 

Yj = 

Note that its dynamics is nothing more than the dynamics of 
(3) iterated AMimes. The state of (7) is the state of (3) at 
the beginning of each “window” of length AT, and $ simply 
describes how the state evolves from window to window. The 
representation (7) can be termed “multirate” because, if (3) 


independently of the particular observer initial condition z \, 
where x d is the state of (8) at time d. It is remarked that 
dead-beat observers are of interest for stabilization problems, 
because, if u k = a(x k ) is a stabilizing feedback for (8), 
then u, k - a(x k ) will always result in an internally stable 
closed-loop system whenever the observer (9) has the dead¬ 
beat property. This is one of the rare instances of a nonlinear 
separation principle. 

All the above has been stated in a global fashion. Let us note 
that there are at least two ways of localizing the concept of an 
observer. The first is essentially infinitesimal: one guarantees 
the existence of open neighborhoods O x and O z of the origin 
of (8) and (9), respectively, and an open neighborhood of 
controls O u such that A) and B) hold as long as z e O z 
and V/c > 1, u k £ O u and x k G 0 T . The work on observers 
with linearizable error dynamics [6], [20H24], for instance, 
falls into this category. A second way to localize the concept 
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ould be called ( S , V)-quasilocal: one is given subsets S and 
V, of the state space of (8) and of its controls, respectively, 
naving the property that, for every initial point x\ G 5, there 
exists an open subset O z (x i) of the state space of (9), such 
that A) and B) hold as long as z\ e O z (x\) and Vfc > 1, 
r k g S, and Uk € V. In other words, for the case of identity 
observers, instead of guaranteeing the existence of an open set 
about the origin of the product state space JR n x R n where 
everything works, one is assuring the existence of an open set 
about the diagonal of R n x R r \ whose projection onto the 
r-coordinate contains S . 

In the following, an approach to the construction of ob¬ 
servers for discrete-time systems is developed. The authors’ 
perspective was influenced by the work of Aeyels [1], [2], 
Fitts flOJ, Glad [13], and the multi-rate time sampling results 
of 114]. 


B Dead-Beat Observers 

Consider once again the system E (8) and let Yji,^] denote 
a vector of TV consecutive measurements 


y [l.Nl 


/ h^(x) \ 

h Ul o F' 1 ' (j-) 

\h UN o F UN 1 0-0 F Ul (.r)/ 


H(s , t/[j,jv]) 

GO 


E is said to be TV-observable 1 |l], [32], [35] at a point 
j 6 R n ,N > 1, if there exists an TVtuple of controls 
t/[i 7 7v] = col(ui, ■ • ■ ,7/ £ (R T ") N such that x is the unique 
solution of the set of equations 


Y[UN] =H(x,U\i ,jv]) (12) 


where 


This constitutes an order TV dead-beat observer foi y, |» , 
Conversely, suppose that (9) is a dead-beat observer oi otcIci 
TV. Then 

rf yN,UN 0 0 ,..oP^( 2 ) = r ^ ( 1 7) 

for all z € R l \ thus the left-hand side of (17) does no! 
depend on 2 and is a solution to (15)—(16). This shows that 
constructing a dead-beat observer of order TV is equivalent to 
left-inverting (15) and composing the result with the right-hand 
side of (16). In a similar vein, an asymptotic (nondead-beat) 
observer can be thought of as constructing a solution to 
(15M16) as TV —► 00 . Clearly, for nonlinear systems, insisting 
that this can be done in closed-form is very restrictive. It 
is therefore natural to formulate an extended concept of an 
observer as a possibly implicitly defined dynamical system, 
involving successive approximation routines, logical variables 
and/or lookup tables to dynamically “estimate” the state of a 
deterministic nonlinear system 116]. This perspective will be 
further pursued in the next section where Newton’s algorithm 
is interpreted as a nonlinear observer (9). 

Before doing so, however, let us first tic in the notion 
of a dead-beat observer with the observer error linearization 
approach [6], [20]—[24J. For simplicity of exposition, suppose 
that (8) does not have any inputs. One seeks a (locally defined) 
coordinate transformation .r = T(.r) in which (8) takes the 
form 


= Ax k 4- 0(|/a) 

UK=Cf k (18) 


where the pair (A,(’) is observable. This gives a family of 
infinitesimally-local observers 


y ZfrH = (A - KC)z k + 0(/a) + Ky k 

H = r-'(z k ). 


(19) 


Y lhN] =H(s,U [hN] ). (13) 

The system is uniformly TV-observable if the mapping 

H*: R n x (^ m ) N - ( R P ) N x {R m ) N (14) 

by Or,£/[],N]) (//(x,£/[ 1,AT]), U[ 1 ,N]) is injective; it is 

locally uniformly TV-observable with respect to O C JR TI and 
U C (R m ) N if H* restricted to O x U is injective. 

Whenever E is uniformly TV-observable, the system of 
equations 

Yj/t_Ar+i,fc] = H{Xk-N+i,U[ k _ N +i'k]) (15) 

can be, for each TV applied inputs U[ k -N+ i,fc], uniquely solved 
for xjt-jv+i, and the current state x k obtained by 

x k = $ u i k - N ' k -v(x k -N+i)’ (16) 

1 TV refers to the minimum number of measurements needed to recover the 
state. In [2], Aeyels shows that, “generically," TV can be taken to be 2n + 1. 


Letting a := x k - z k yields 

a +1 = Kcy k . (20) 

Choosing K to place the eigenvalues of (A - KC) at zero 
makes E into a dead-beat observer of order n. In other words, 
the ability to achieve a linear error dynamics (20) implies the 
explicit knowledge of a left-inverse to (12). 

III. Newton s Algorithm as an Observer 

Consider again the system E, (8). It is said to satisfy the TV- 
observability rank condition with respect to O C R n and U C 
(R rn ) N if H'\ O x U (R P ) N x {R m ) N is an immersion 
[35]; that is, it has rank n H-TVrn at each point of O x U (recall 
that H* was defined in (14)). Note that E is TV-observable 
and satisfies the TV-observability rank condition with respect 
to O and U if, and only if, H*:OxU^ (R P ) N x ( Si m ) N 
is an injective immersion; 2 this is in turn equivalent to: for 
each U[i t N] € U , O (R P ) N is an injective 

immersion. 

2 That is, an embedding [4], 
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Newton’s algorithm for 

fy-N+1,*] - #(*fc-N+l, ffy-N+l,*]) = 0 (21) 


is 


r+'-f+ 




-1-1 


1 (yi*-jv+i,k) - H(C, U[h-N+i,k])) (22) 


where, for simplicity, it has been assumed that the set of (21) 
is square; in the case that there are more equations than states, 
the inverse in (22) should be replaced by a pseudo-inverse 
[26, p, 309], [8, pp. 222-224]. The standard convergence 
theorem for this algorithm can be found in [26], For the 
moment, assume that is hxed, and let H(x) = 

H(x, C^fc-jv+i,*]) for this fixed value of U. 

Theorem 3.1 [26]: Suppose that H is twice differentiable 
and that || ^£r(x)|| < K for x e IF?"; suppose there is 
a point € JR" such that P 0 := ^(£°) is invertible 
with IIP- 1 !! < 0 O and IIP-Mn - H(£o))|| < r lo . Under 
these conditions, if the constant h 0 = 0 o r) o K < 1/2, then 
the sequence £' generated by (22) exists for all ? > 0 and 
converges to a solution of (21). If instead ||§^r(x)|| < K 
only in a neighborhood B of ( 0 with radius 

r> ^(l- v^l-2/ic)^ (23) 

then the successive approximations generated by Newton’s 
algorithm remain within this neighborhood and converge to 
a solution of (21). 

The most interesting point is that Theorem 3.1 gives an 
estimate of how good the initial estimate of x* should be 
before a few iterations of (22) will generate better estimates. In 
this regard, the quantity ||^r(z)||, which measures the degree 
of nonlinearity of (21), is seen to be of central importance. For 
a linear system* ||f^r(a;)|| = 0, and the initial estimate can be 
artritrarily poor; when ||^^(t)|| is large, the initial estimate 
should, in general, be better. 

Newton's algorithm is now interpreted as a quasilocal 
exponential observer. Suppose that N has been fixed; for 
notational ease, let VJt = Y[k-s+ 1,*] be the vector of the 
last N measurements and similarly let Uk = ^ifc-zv+i,*-) = 
col(u*_jv+i> ■ ■ ■, Hit) be the vector of the last N controls. 
Define 


e v *- i; ‘(o = c + 




(Y k -H(C,U k )) 


(24) 


and let (G yt ' Vk )^(£) represent (£) composed with 

itself d-times. 

Let O be a subset of 2R'\ V a subset of R m ,N > 1 a given 
integer and e > 0 a positive constant. Denote the complement 
of 0 by ~0 and define dist(x, ~ 0) = mf{||ar—y||: y €~ 0}, 
and O e /2 = {x € 0:dist(x, ~ 0) > e/2}. Finally, define 


constants a, 0, 7, L, and C by 


dH t m 


a = sup| 

= sup | 

= sup | 

= sup | 

1 f||3 2 0 

= 2 sup ilN 


\dH, 

-1 

IfeH 



dx 2 


(x,U) 


dF ( ^ 
d 2 e v ' u 

dx 2 


:*€0 e/2 ,r/€ V*J 

:xe0«/ 2 ,t/6V N J 

:xe0 </2 ,ue V N J 

: x € 0 f /2)« € v| 

: Y = H(x, U). 


(x) 




Theorem 3.2: Suppose that the following conditions hold: 

1) F and h in (8) are at least three times differentiable 
with respect to x\ 

2) there exist a bounded subset O C JR n and a compact 
subset V C FT 1 such that for each x £ O there exists 
u £ V such that F(x,v) £ O (i.e., O is controlled- 
invariant with respect to V); moreover, the controls are 
always applied so that F(x , u) £ 0\ 

3) there exists an integer 1 < N < n such that the set of 
equations (11) is 

a) square, 

b) uniformly N-observable with respect to O and V N , 

c) satisfies the AT-observability rank condition with 
respect to O and V N . 

Then, for every r > 0, the constants a, (3, 7, L, and C are 
finite; moreover, whenever 

4 - rain {4L’4iKl’8ifaI’2rl} (25) 

and 

d > max{l,log 2 log 2 4L}, d £ IN (26) 

then 

*k+i = (e y ‘- t/ ‘) (d) (F(x fc ,u fe _N)) (27) 

x h = F xlk ~ i or- 2 (28) 


is a quasilocal, exponential observer for (8) in the sense that. 
A) if si £ O and z/v+1 = xi, then f* = Jk for all k > N + 1 
and B), if xi £ 6 ,||zjv+i - .ri|| < t> and for all k > 0, 
dist(xA,~ 0) > f, then ||x fc+ i - xj. +1 || < £||x fc - x*||. 

The proof may be found in (17]; the basic idea is to view the 
observer problem as one of solving a sequence of nonlinear 
inversion problems, each described by (12). Since the set O 
is relatively compact and controlled invariant with a compact 
set of controls, Newton's algorithm can be shown to have a 
uniform rate of convergence over the entire sequence of prob¬ 
lems. The idea then is to iterate long enough on each problem 
(the parameter d) so that F applied to the solution of the fcth 
problem is a very good initial guess for the (fc + l)st problem. 

The set O is assumed to be bounded, but not necessarily 
small; if it is not controlled-invariant, then only finite time 
estimates are possible; the same is true of the observer error 
linearization approach of [20]-[24] (for discrete-time systems, 
see [6]). 
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The observer (27M28) is coordinate dependent. It is inter- 

ting to note that the coordinate transformation approaches, in 

neral, would only favor convergence of (21) if they reduce 
U)\l I n particular, eliminating low-order polynomial 

rms in favor of high-order terms will not always accomplish 
, ns task. 

Remark 33 : 

a) In Theorem 3.2, one may take d = 1 if, in (25), ^ is 
replaced by 2 cU- 

b) Once again, assumption 3-a), that (11) is square, is 
NOT essential. One could try eliminating certain rows 
of (11) while still preserving the rank condition 3-c), 
but this would, more-than-likely, invalidate 3-b). The 
better alternative is to replace the inverse in (24) with a 
pseudo-inverse, as in [26, p. 309] or [8, pp. 222-224]. 

c) The observer (27)-(28) bears some resemblance to the 
iterated extended Kalman filter of [7]. This will formally 
be established in the next section. 

d) By modifying the step-size in Newton’s algorithm, 
“globally convergent” versions of the algorithm can 
be shown to exist. Chapter 6 of [8] presents this very 
nicely from a numerical analytic viewpoint. A different 
way of “globalizing” the algorithm is to systematically 
produce a good point at which to initialize it. This is 
discussed in [15], [16]. In Section V, the continuous 
Newton method will be shown to yield a global version 
of the above observer. 

e) It is often pointed out, and in [29] shown to be a 
valid practical concern, that the evaluation of the Ja¬ 
cobian ^r, be it explicitly or using finite difference 
approximations, may be computationally very expensive 
or even prohibitive. Modified Newton methods have 
been proposed, in which the Jacobian is not explicitly 
evaluated at every step, but updated iteratively without 
requiring additional function evaluations. Section VI 
explores the consequences of using Broyden’s method 
instead of Newton’s in the observer (27)-(28). 

IV. Relation Between Kalman 
Filters and Newton Observers 

To show how the Newton observer is related to the extended 
Kalman filter, we will consider an invertible, autonomous 
discrete-time system 


Xk+i = F(x k ) ) Xk 6 JR n 

y k = h(zfc), y k € BP (29) 


m which we replace the output map h by the extended output 
map H, defined in the following manner 



/Vk-N+l \ 



Vk -1 



/hoF-i K -»{x k )\ 

hoF il (x k ) 

\ h(x k ) / 


H(x k ). 


(30) 


Assume that the above system satisfies the ^-observability 
rank condition and is ^-observable; furthermore, assume that 
H is a square map, i.e., H: R n -* R n . A common way 


to construct an observer for system (29H30) is to appl\ i]i> 
extended Kalman filter to the associated noisy system, i.c . the 
system with added artificial noise processes 

Zk+1 = F(*k) + Nw k 
(k = H{zk) + Rvk (31) 


where v k and w k are assumed to be jointly Gaussian and 
mutually independent random processes with zero mean and 
unit variance. The extended Kalman filter for this system is 
given by the following equations: 
measurement update 

x k = x k -h Kk{£k - H{x k )), 

Q ~ k 1 = {Q;r 1 + H][(RR T r i H k 


time update 

£ k +\ = 

Q~ k+l = A k Q k Al + NN T 

where 


?r\-1 


K k = Q~H[{H k Q- k H[ + RR t ) 

A dF <~ 1 

A* = 

u dH <~\ 

Hk = ~dI (Tk) 


and TV, R , and Q 0 are the design parameters. Let us choose 
TV = pi and f? = e1 , and consider the equations for the error 
covariance Q * and the observer gain K k 


Q; +1 =A k ^Q- k )- l + ^HlH k ^ + 

= e*A k (e\Q~ k r l + Hi H^ 1 A T k + p 2 l 
Kk = Q k Hl(H k Q k Hl + e 2 iy\ 


Given any positive definite Q 0 , the update equation for Q k 
in the limit as e —♦ 0 is given by 

Qfc+i = ^ 1 - 


Substituting this in the equation for K k and letting e —► 0 gives 
K k =,i 2 Hl(H k p 2 Hl )- 1 
= Hl(H k Hlr' 

= h;' 

which is valid since, given the observability conditions, H k 
is invertible. The extended Kalman filter equations are then 
given by 

x k = T- + H' l (Y k -H(x;)) (32) 

= F(x k ) (33) 


which is exactly the Newton observer for system (29) with 
one Newton iteration per time step. 

In [3], it was recently shown that the measurement update 
equations for x k in the iterated extended Kalman filter are 
exactly those arising from the minimization problem 


mini 


K 


<tt - H(x;)) T R-H(Y k - H(x;))+\ 
(xk-x;) T (Q k )~ 1 (x k -x k ) ) 


(34) 



m 
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when a Gauss-Newton method (an approximate Newton 
method) is used with rjjT as initial guess. This shows that 
the covariance matrices R and may be interpreted as 
weights on the norms in the output space and state space, 
respectively. It remains presently unclear, however, how the 
update equations for the covariance matrix Q~ k in the Kalman 
filter can be given a meaningful interpretation in terms of 
updating the weighting matrix in the above minimization 
problem after every iteration. 

It must be pointed out that the extended Kalman filter, 
although commonly used as a nonlinear observer, had 
not been actually proven to be a convergent asymptotic 
nonlinear observer until recently. In [34], it is shown that, 
under suitable observability conditions, if the state evolves 
in a compact set and the Kalman filter is initialized close 
enough to the true state, then the error covariance matrices 
Qk and remain bounded, and the observer error 

goes to zero exponentially. A proof of convergence for the 
continuous Kalman filler with a special choice for the initial 
error covariance matrix is given in [9]. 


V. Continuous Newton Method as a Global Observer 

In the remaining sections, we will, for notational simplicity 
and without loss of generality, restrict ourselves to invertible 
and autonomous systems. The results that are obtained can 
without any difficulty be extended to noninvcrtible systems 
and/or systems with inputs (see example section). Consider 
once again the discrete-time system 

ifc+i = F{x k ), Jk 6 K n 

Vk = h(x k ) 9 Vk € W (35) 


with the extended output map if, defined in the following 
manner 



/Vk-N+l \ 


n* 

yk-\ 

\ Vk / 

~ 


/hoF-l N -V(.r k )\ 

lioF-'W 
h(x k ) / 




(36) 


Assume that H is a square map. 3 The discrete Newton method 
with step size h % for solving - H(.r) = 0 is 


*1 +I = 4 + M(4) _1 (n-//(*!)) (37) 

where J(z) ■.= ^( 2 ). If we consider this equation with 
an infinitesimally small step size, we obtain the following 
differential equation 


4 = J{zk)~\Y k - H(z k )), z k ( 0) = zl (38) 


which is referred to as the continuous Newton method; the- 
right hand side of (38) is commonly referred to as (gradient) 
Newton flow [19]. The stability of Newton flows has been 
studied extensively (see [19], [40] and references therein). 

We now construct a global asymptotic high-gain hybrid ob¬ 
server for the system (35), by interpreting (38) as an observer 

'This seems to be a crucial assumption 


for that system. Assume that file time interval between Xk an< 
.Tfc+i is T, i.e., x h = jr(kT). We thus obtain the followim 
hybrid system-observer 


a-fe+i = F(x k ) (39 

n = H(x k ) 

i* = KJ{z k )~'[Y k - H(z k )];z k (kT) = F{z k ^(kT)) 
?k = z k ((k + 1)T) (40) 

where K is a positive scalar, to be determined later. 

Theorem 5 / 1 Assume that (35) is uniformly /V-observable 
and satisfies the N< -observability rank condition with respect 
to ]R n . Suppose that 



c)F, . 


L : — sup 
r6l? n 

o7 {r) 

< x>; 

P ■ = sup 

of i 
-r) 

< oo; 

Tl 

3 

dr 


1 

Wi)H 

\ "Ml 

§•* 

II 

W 3] ) 1 


(41) 


If K > ^ log(2/,r*/^), then (39) is a global asymptotic 
observer for (35). 

The proof for this and the next result can be found in [28] 
In general, the quantities ||^~||. and ||^|| will not 

be uniformly bounded on JW n , nor will the system (35) be 
globally observable. For these cases, we can still obtain a 
nonlocal convergence result for the observer (40). 

Proposition 5 2 m Suppose the following conditions hold: 

1) F and h are at least once continuously differentiable; 

2) 3 compact O C JR" such that: 

a) (35) is -observable with respect to 0\ 

b) (35) satisfies the N-observability rank condition 
with respect to O; 

3) 3p > 0 such that O ^ := {./* € O | ||.r — /y || > 

p) is F-invariant and nonempty. 

Then, if K > ^ log(2 Lafl), if.ro € Op and if ||co(0)-.ro|| < 
5 ^, it follows that lim*-,™ \\x k - j*|| = 0. 

Remark 5 3 If no a priori information is known about 
the initial state of the system, one may, for lack of a better 
alternative, initialize the observer at the origin. Suppose O 
contains the closed ball centered at the origin, with radius R 
B( 0, R). Then the previous analysis showed that convergence 
can be guaranteed if \\i'o -^oll = ||®o|| < or, what ma> 
provide a more practical estimate, if \\H(0) - Yw_i|| < 

Most global modifications of the discrete Newton method art 
based on choosing a proper stepsize (e.g., Armijo stepsm 
procedures) and/or search direction (e.g., trust region updates) 
[8], [30], such as to assure a decrease in the term WYk-Hiz^W 
in the zth step of the algonthm. Obviously, in terms of 
a connected region of convergence, one cannot do better 
than allowing an infinite number of iterations, each taking 
infinitesimally small steps in a guaranteed descent direction, 
i.e., the continuous Newton method. 
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In the previous section, it was seen that a large region of 
onveigence of the Newton-observer could be guaranteed if 
>ne used the hybrid form presented in (39). To implement the 
hybrid Newtort-observer, however, a closed-form expression 
*or the inverse of the Jacobian matrix would be necessary; 
'his does not seem very realistic. Moreover—as mentioned 
<n Section III—even in the discrete Newton algorithm, the 
computational complexity of repeated Jacobian evaluations 
might prove prohibitive in practice. This provides sufficient 
motivation to pursue methods for approximating the Jaco¬ 
bian and the investigation of the associated convergence 
properties. Here, we will look at Broyden’s method, as it 
is among the most popular of such approximation schemes 
[ 8 ], [30]. 

Broyden's method for solving a system of nonlinear equa¬ 
tions P(j) = 0 is a modified Newton method in which the 
Jacobian is not calculated exactly at each step, but rather 
iteratively approximated using secant updates. In contrast to, 
ior example, finite difference approximations, no additional 
function evaluations are required. Define J(x) = 
Broyden’s method for solving P(x) = 0 is [ 8 ]: Given x°, 
the initial guess for J', where P(x) = 0 , and A °, the initial 
approximation for the Jacobian of P at .r° 


solve A' h' = -P(r') for s 1 
./■' fl .= + 

i/ .= P(P+ l ) - P(x') 

(v 1 - ,4V)(* l ) T 


A 1 * 1 ■= A' + 


(s ') 1 s' 


(42a) 

(42b) 

(42c) 

(42d) 


The update A‘ for the Jacobian has the property of bounded 
deterioration: even though it may not converge to J(s), it 
deteriorates slowly enough for one to still be able to prove 
convergence of {.r 4 } to 

Conditions for convergence are the same as those of 
Newton’s method, with the additional requirement that, with 
P(x) = (), not only the initial guess x° be sufficiently close 
to j\ but that also A 0 be sufficiently close to J(x). For 
Broyden’s method, local superlinear convergence can be 
proven. Newton’s method, on the other hand, exibits local 
quadratic convergence. 

In the following we will show that, in general, Broyden’s 
method alone cannot successfully be used in the Newton- 
observer; occasional recalculation of the exact Jacobian seems 
to remain necessary, basically because the property of bounded 
deterioration no longer holds when Broyden’s method is 
applied to a sequence of problems: P k (x) = 0 . For the class 
of slow-varying or weakly nonlinear systems, however, this 
approach can substantially reduce the computational complex¬ 
ity. 

In the observer problem, at each step k we want to solve 
P k (x k ):=H(x k )-Y k = 0 . (43) 


Note that this sequence of equations has a special structure: 
^ 4 a +A s ^ and their solutions are related by = F(x k ), 
Assume for simplicity of exposition that H\ R n —> JZ n . Let 


O be a subset of R n such that the following quantities „ < 
finite 


L := sup 

»F II 

a? (x >| 

< OO 

/3 := sup 

t$Q 

dH , ,1 

< (X 

a := sup 
reo 

(£<'>)" <» 

7 := sup 
xeo 

d 2 H. . 

dx 2 (j) 

< 00 . 


Given the dth iterate in the kih problem, xf and i4 1 /,. being 
approximations for x k and J(x *), respectively, the initial 
guess for the (k + l)st problem is taken as .rj +1 = F(xf). In 
terms of (42a)-(42d), this defines .s* and i)*. and hence .4^, 
the initial approximation for the Jacobian at T as 

(jfr - 


A 0 — A d 


+ 




(44a) 


where 


h = F(4) ~ 4 (44b) 

h = Pk+\ (a-fr+j) - /U i (4 ) (44c) 

4+) = F (4)- (44d) 


Note that this update is the same as m Broyden’s method, 
(42d), except that a t is determined by (44b), which is a 
consequence of switching from the fcth to the(fc +1 )-st problem 
in the sequence {P k (.r) = ()}. 

To develop bounds on the approximation error, we will need 
the following lemma from [ 8 ]. 

Lemma 6.1 (Bounded Deterioration): Let D C lR n be 
open and convex; x ', x 1 * 1 6 D,x' ^ r. Let A 1 € ]R nyn 
and let A l+l be defined by (42a)-(42d). Assume that J is 
such that 37 < oo verifying 


\\.J(x) - J(x)\\ < 7 ||.r - x\\ Vr E D. (45) 


Then, for either the Frobenius or the / 2 -mainx norm 

im ,+1 - m\\ < p* - m\\ 

+ ^7(l|x ,+1 -^|| 2 + lk'-r|| 2 ). (46) 

To begin with, let x k and x k +\ be such that Pk(*h) = 
H(x k ) - Y k = 0 and P k +i(x k +x) = H(xk+i) ~ T* n = 0. 
Let Af and x d k be given and determine A J +1 and x k + x by 
(44a)-(44d). Furthermore, define E{ := A\ - ./(a) and 
e k x \ “ x k’ A bound on the error || can then be 
derived to be 

ll^fc+il! < P*ll + 7lk* - xfc + i|| 

+ ^7(lkfcll + Ik2+j|| + lk*+i - • r *.ll) 

< ll^ll + 57(1 + L)H\\ + §7lkfc - tm\\. (47) 

From Lemma 6.1 we get the following 

ll^ll < M + 57(l|e2ll + 2^ IkHlV (48) 
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From the superiinear convergence of Broyden's method, it 
follows dial, with ||.EjJ|| sufficiently small, for any 0 < c < 1, 
the following holds: provided dun ||e£|| is sufficientiy small 

K + 1 II<<« Vi>0. (49) 

Now we can write (48) as 

M < ra+57(11411+2( i frr- - r ) ii p "n) 

<11^11 + 5711411(^4^7—) (50) 

and (47) becomes 

ll^fc+lll < ll^fcll + |7ll*fc - *fc+i|| 

- H r( 1 + i-f +1 +(1+L)cJ ) ||e211 - (51) 

The above inequality actually provides an upper bound 
on the worst-case deterioration of the approximation to the 
Jacobian after d iterates and a switch from the fcth to the 
(k + l)st problem. There are two special cases of interest: 

1 ) H is linear, hence 7 = 0 , and we then obtain uniform 
superiinear convergence of the sequence of problems 
{Pjt(T)t) = 0 }, which is essentially reduced to one 
single problem to which Broyden’s method is applied. 
This yields an asymptotic observer different from the 
classical Luenberger observer. 

2) The system is operated near an equilibrium point, in 
which case the term ||xfe+i - x*|| is small, or H is 
weakly nonlinear, in which case 7 is small. In either 
case, \\E° k \\ will remain sufficiently small over a number 
of problems. 

In general, due to the last two terms in (51), the sequence 
{\\A° k - J(x k )\\) not be uniformly bounded from above. 
Hence, for some fc, Ajj! will no longer be sufficiently close to 
J(x k ), which, in practice, may be indicated by slow decrease, 
or even increase, of the term ||Ffc - J7(x’ fc )||, or by ill- 
conditioning of the matrix A\. In an actual implementation 
of Broyden’s method, one will, for reasons of computational 
complexity, typically update the Q/?-factorization of ^ 4 ^, 
rather than A\ itself, and ill-conditioning will be checked 
for to avoid numerical instabilities [ 8 ]. For the iterates to 
converge throughout the sequence of problems, J{x k ) will 
have to be recalculated occasionally. Although one might be 
able to obtain a tighter bound than (51), the term \\xk+i -Xfr|| 
will remain. This term represents in a sense the “distance” 
between two subsequent problems, which is prescribed by the 
system’s dynamics and, in general, cannot be made smaller 
such as to coerce uniform convergence of {xj|}, unless the 
observer is being used in a closed-loop situation and the state 
is being regulated to a fixed value, for example. 

Suppose the discrete-time system (35) is the exact dis¬ 
cretization with sampling time T of an underlying continuous¬ 
time system: i — /(x). Then the term ||xfc + i - Xk\\ in (51) 
can be estimated by 

11**+! - **ll = N(* + 1)3T) - x(leT")H < fi/T (52) 


where /3/ is the Lipschitz constant for / on some given set. 
Slower sampling means more time in between samples for 
calculating Xk given V*. However, (52) indicates also that 
the error in ||j 4* +1 ~ ./(xfc+i)|| grows faster as T increases, 
hence the Jacobian has to be recalculated more often too. 
On the other hand, as T -> 0, the approximation to the 
Jacobian will deteriorate more and more slowly. It should 
be pointed out though, that for very small T, the problem 
of solving Yk - H{Xk) = 0 becomes ill-conditioned, since 
consecutive measurements will differ only slightly from each 
other. From a numerical analytic point of view, the system 
becomes practically unobservable. Thus, as is usual, there are 
trade-offs to be made. 

VII. Example: Mixed-Culture Bio Reactor 

with Competition and External Inhibition 

In this section, we will illustrate the Newton observer and 
the continuous Newton observer by means of an example 
concerning a mixed-culture bioreactor. An application of the 
Broyden observer to an automotive problem can be found in 
in [29]. 

The system under consideration describes the growth of two 
species in a continuously stirred bioreactor, which compete for 
a single rate-limiting substrate. In addition, an external agent 
is added which inhibits the growth of one species, while being 
deactivated by the second species. The measured quantity 
is the total cell mass of the two species. The example is 
taken from [18], where the system was shown to be globally 
feedback linearizable, provided that full state information is 
available. 

Here, it will be shown first that, assuming noise-free mea¬ 
surements and dynamics, a discrete Newton observer may fail 
to converge if not properly initialized. The (global) continuous 
Newton observer is then implemented and shown to converge 
for a wide range of operating conditions irrespective of the 
observer initialization, i.e., even when large initial observer 
errors are present. 

The system dynamics are given by 

0AS(x \, £ 2 ) 

*1 — n ns 1 ct I ^1*1 

0.05 + S{xi,J' 2 ) 

. _ 0.015(ti, J 2 ) 

X2 (0.05 + S(./*i, x 2 ))(0.02 + xj) X2 Ul,r2 

X 3 = -0. 5 x 1 X 3 - U 1 X 3 + U 1 U 2 (53) 

where S(.r 1 , 3 * 2 ) = 2 - 5xi - 6.667x2, time is expressed in 
hours. 

x \: cell density of inhibitor resistant species 

X 2 : cell density of inhibitor sensitive species 

X 3 : inhibitor concentration in fermentation medium 

u\\ dilution rate 

1 / 2 : inlet concentration of the inhibitor. 

Following the notation of the previous section, let x* := (x*. 
x^,xJ) T , Uk := (uJ,,u|) T , and yk denote the states, inputs, 
and outputs, respectively, evaluated at time kT , with T being 
the sampling time with which the system (53) is discretized. 
Furthermore, let Y k := (yk-2,yk-uVk) T and U k := 
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Fig. 1. Discrete Newton observer: relative observer error e = (e 1 .e 2 .p 3 ). 
where e . = for x = (0.2,0.02,0.005) and i = (0.02,0.2. 0.015) 

when 111 (f) = 0.3 and 112 ( 1 ) = 0.0067. 

A discretization of the above system with sampling time T 
may then be expressed as 

zjb+i = F^ k (x k ) 

Vk = h{x k ) (54) 

and the state-to-measurement map is given by 

/ K x k- 2 ) \ 

H(x k - 2 , U k . 1) := h o 2) = n. 

(55) 

Computing the rank of ^ at a number of different points in 
the state space showed that the iV-observability rank condition 
is indeed satisfied for N = 3. It was also found, however, that 
system (54) is poorly observable, indicated by ill-conditioning 
of ratio of its largest to smallest singular value for a 
sampling time of T = 1 (i.e., one hour) is on the order of 
500. A typical response for the Newton observer is shown in 
Fig. 1, simulations showed however, that the Newton observer 
fails to converge if it is not initialized closely enough to the 
actual states, e.g., when 

/0.2 \ /0.02 \ 

x = I 0.02 1 and x = 0.2 . 

\0.005 ) \0.015 ) 

Given the time scale—sampling times in the order of 
minutes or evert hours—there is virtually no restriction to 
available CPU time in the observer design. We therefore 
simulated the continuous Newton observer as well. It is given 
by 

** = D* ~ H ( z k .tffc-OL' 

z k — 2fc-l 

** = F^-'iz^kT)) 

As expected, this observer did converge for all physically 
feasible initial values. Responses are shown in Fig. 2 for four 
values of the observer gain K. The plot confirms our finding 
that the observer may fail to converge if K is too small. 
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Fig. 2. Continuous Newlon observer with different observer gains I\ \ shown 
is the relative observer error for 12 , when 111 (f) = 0.3 and 112 (f) = 0.00G7. 


VIII. Concluding Remarks 

In this paper, we have provided a new observer design 
method for nonlinear systems with discrete measurements. 
The method relies on asymptotically inverting the state-to- 
measurement map, which is constructed by relating the sys¬ 
tem’s state at a given time to a (predetermined) number of 
consecutive measurements. By using a continuous Newton 
method for the map inversion, the observer error was shown 
to converge to zero, globally and exponentially. If, instead, 
a computationally less expensive discrete Newton method 
is used, the observer shows quasilocal exponential conver¬ 
gence. Even more computational advantage may be gained by 
employing Broyden’s method, whose effect on the observer 
performance was also investigated. The theory was illustrated 
on an example. Results on using this observer in a closed-loop 
setting have been reported in [17], 
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A Probabilistic Approach to Multivariable 
Robust Filtering and Open-Loop Control 

Kenth Ohm, Anders Ahl6n, Senior Member , IEEE, and Mikael Stemad, Senior Member IEEE 


Abstract —A new approach to robust Altering, prediction, and 
smoothing of discrete-time signal vectors is presented. Linear 
time-invariant Alters are designed to be insensitive to spectral 
uncertainty in signal models. The goal is to obtain a simple design 
method, leading to Alters which are not overly conservative. 
Modeling errors are described by sets of models, parameterized 
by random variables with known covariances. These covariances 
could either be estimated from data or be used as robustness 
tuning knobs.” A robust design is obtained by minimizing the 
Ji r norm or, equivalently, the mean square estimation error, 
averaged with respect to the assumed model errors. A polynomial 
solution, based on an averaged spectral factorization and a 
unilateral Dlophantine equation, is derived. The robust estimator 
is referred to as a cautious Wiener Alter. It turns out to be only 
slightly more complicated to design than an ordinary Wiener 
filter. The methodology can be applied to any open-loop Altering 
or control problem. In particular, we illustrate this for the design 
of robust multivariable feedforward regulators, decoupling and 
model matching Alters. 

I. Introduction 

F OR any model-based filler, modeling errors are a potential 
source of performance degradation. Here, we will pro¬ 
pose a cautious Wiener filter for the prediction, filtering, or 
smoothing of discrete-time signal vectors. As in the scalar 
case, discussed in [36J, it constitutes a generalization of the 
polynomial equations methodology pioneered by Kufiera [211. 
The design is based on a stochastic description of model errors, 
with relations to e.g., the stochastic embedding concept of 
Goodwin and coworkers [11], [12], To be more specific, our 
problem formulation is as follows: 

• A set of (true) dynamic systems is assumed to be well 
described by a set of discrete-time, stable, linear and 
time-invariant transfer function matrices 

F = ?o + A* (1.1) 

We call such a set an extended design model, in which T 0 
represents a stable nominal model, while an error model 
AT describes a set of stable transfer functions, parameter¬ 
ized by stochastic variables. The random variables enter 
linearly into AT, and they are assumed independent of 
the noise. 
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^ Associate Editor, B. Pasik-Duncan The work was supported in part by 
Swedish Research Council for Engineering Sciences (TFR) under Grant 
775 

The authors are with the Systems and Control Group, Uppsala University, 
0 Box 27, S-751 03 Uppsala, Sweden. 

IEEE Log Number 9408271 


• A single robust linear filter is to be designed for the whole 
class of possible systems. Robust performance is obtained 
by minimizing the averaged mean square estimation error 
criterion 

J = trace£E(e(fc)e(fc)*). (1.2) 

Here, e(k) is the estimation error vector, E denotes 
expectation over noise and E is an expectation over the 
stochastic variables parameterizing the error model AT. 

The averaged mean square error has been used previously 
in the literature by e.g., Chung and Bdlanger [9], Speyer 
and Gustafson [32], and by Grimble [13] These works were 
based on assumptions of small parametric uncertainties and on 
series expansions of uncertain parameters. We suggest the use 
of the criterion (1.2), together with a particular description 
of the set (1.1): transfer function elements in AT have 
stochastic numerators and fixed denominators. Such models 
can describe nonparametric uncertainty and undermodeling as 
well as parametric uncertainty. A discussion of the utility, and 
versatility, of linearly parameterized stochastic error models 
can be found in [36]. 

Most previous suggestions for obtaining robust filters have 
been based on some type of minimax approach [I0|, [24]. A 
paper [26] by Martin and Mintz takes both spectral uncertainty 
and uncertainty in the noise distribution into account. The 
resulting filter will, however, be of very high order. Minimax 
design of a filter H becomes very complex, unless there exists 
either a saddle point or a boundary point solution A crucial 
condition here is that inin^ max/’ equals max/- ming. If so, 
one can search for models whose optimal filler gives the worst 
(nominal) performance and use the corresponding filter. As 
compared to finding the worst case with respect to a set of 
models, this is a much simpler task. It can still, however, be 
computationally demanding. See [19], [28], [31], [38], and 
the survey paper by Kassam and Poor [20]. The condition 
min^ max,/- = inax/r min^ is not fulfilled in numerous 
problems, which makes them very difficult to solve. See, e.g., 
Example 5 in [36] and the example in Section IV. 

Kalman filter-like estimators have recently been developed 
for systems with structured and possibly time-varying para¬ 
metric uncertainty of the type 

x(k + 1) = (A + DA(*)E)x(fr) + w(k) 

where the matrix A(k) contains norm-bounded uncertain 
parameters. See [30], [7], and [39] for continuous-time results 
and [40] for the discrete-time one-step predictor. See also [16] 
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for a related method. For systems which are stable for all 
A(fc), an upper bound On the estimation error covariance 
matrix can be minimized by solving two coupled Riccati 
equations, combined with a one-dimensional numerical search. 
This represents a computational simplification, as compared 
to previous minimax designs. Still, the resulting estimators 
are quite conservative, partly because they rest on worst case 
design. This conservativism is illustrated and discussed in [29] 
and [37]. 

The method suggested in the present paper is computation¬ 
ally simpler than any of the minimax schemes referred to 
above. It also avoids two drawbacks of worst case designs. 
First, the stochastic variables in AF need not have compact 
support. Thus, the descriptions of model uncertainties may 
have "soft” bounds. These are more readily obtainable in a 
noisy environment than the hard bounds required for minimax 
design. Second, not only the range of the uncertainties, but also 
their likelihood is taken into account by using the expectation 
E(') of the MSE. Highly probable model errors will affect 
the estimator design more than do very rare “worst cases.” 
Therefore, the performance loss in the nominal case, the price 
paid for robustness, becomes smaller than for a minimax 
design. In other words, conservativeness is reduced. There 
do exist applications where a worst-case design is mandatory, 
e.g., for safety reasons. We believe, however, that the average 
performance of estimators is often a more appropriate measure 
of performance robustness. 

In the present paper, one of our goals will be to present 
transparent design equations and to hold their number to a min¬ 
imum without sacrificing numerical accuracy. We use matrix 
fraction descriptions with diagonal denominators and common 
denominator forms. This leads tp a solution which is, in 
fact, significantly simpler and numerically better behaved than 
the corresponding noihinal ^-designs (without uncertainty) 
presented in [1] or [14]. Somewhat surprisingly, taking model 
uncertainty into account does not require any new types of 
design equations. We end up with just two equations for robust 
estimator design: a polynomial matrix spectral factorization 
and a unilateral Diophantine equation. The solution provides 
structural insight; important properties of a robust estimator 
are evident by direct inspection of the filter expression. 

This paper is organized as follows. The filtering problem, 
model structure (1.1), and criterion (1.2) are discussed in more 
detail in Section II. Section III presents the design equa¬ 
tions and some tools for performance evaluation. The design 
procedure is illustrated by a thorough numerical example m 
Section IV. The resulting estimator reduces the impact of 
mode] uncertainty and limited signal energy by using multiple 
sensors in an efficient way. In Section V the design of robust 
feedforward regulators, servos, and model matching filters is 
discussed. 

Remarks on the Notation: Signals and polynomial coeffi¬ 
cients may, in the following, be complex valued. (This is 
required in, e.g., communications applications.) Let p* denote 
the complex conjugate (and transpose for matrices) of a 
polynomial coefficient pj , For any polynomial 

P(q~ l ) = ;><»+ pi ? -1 + • • • + Pn P q~ np 


in the backward shift operator q l , define the conjugal 
polynomial 

A(g) * pi + p\q + ■ • ■ + Pn P Q np 

where q is the forward shift operator. A polynomial P(q, g ” 1 ) 
having coefficients of both q and g” 1 will be called doubl? 
sided. Rational matrices, or transfer functions, are denoted 
by boldface calligraphic symbols, e.g., K{q~ l ). Polynomial 
matrices are denoted by boldface symbols, such as P(g“ 1 ), 
while constant matrices are denoted as P. For example, the 
identity matrix of dimension n is denoted I n . We denote 
the trace of P by trP. For polynomial or rational matrices, 
P,(g) and 72* (g) means complex conjugate, transpose, and 
substitution of q for g" 1 . When appropriate, the complex 
variable z or e is substituted for the forward shift operator 
g. Arguments of polynomials and matrices are often omitted, 
when there is no risk of misunderstanding. The degree of 
a polynomial matrix is the highest degree of any of its 
polynomial elements. Square polynomial matrices P{q~ l ) are 
called stable if all zeros of det P(z~ l ) are located m \z\ < 1 
A rational matrix is defined as stable if all its elements are 
stable. Causality is defined in the same way. 

A rational matrix G{q~ l ) may be represented by polynomial 
matrices as a matrix fraction description (MFD), either left 
5 = i4 1 " 1 Bi or right Q — *. It may also be represented 

in a common denominator form Q = B/A , where B is a 
polynomial matrix. The scalar and monic polynomial A is 
then the least common denominator of all elements in G 
Denominator matrices m MFD’s are assumed to have identity 
matrices as leading coefficients of their matrix polynomial 
representations, thus A t (0) = I above. 

II. The Robust Estimation Problem 
Consider the following extended design model 

y(k) = 0(q _1 )u(fc) + W(<7 -1 M*0 

u(k) = r(q~ l )e(k) (2.1) 

J(k) = Viq-'Mk) 

where Q , W, F, and V are stable and causal, but possibly 
uncertain, transfer functions of dimension p\», p\i\ s\n , and 
f\ s, respectively. The noise sequences (f’(fc)} and {i;(fr)} an 
mutually uncorrelated and zero mean stochastic sequences. Ti 
obtain a simple notation they are assumed to have unit covan 
ance matrices, so scaling and uncertainty of the covariance ^ 
are included in T and 7f, respectively. The signal y(k) 
assumed measurable, while f(k) is the signal to be estimated 

A. Multisignal Estimation 
From data y(k) up to time k + m, an estimator 

f(k\k + m) = K(q-')y(k + m) (2.2) 

of f(k) is sought. See Fig. 1. The estimator may be a predictor 
(m < 0), a filter (m = 0), or a fixed lag smoother (m > 0). 
Here 72, of dimension t\p, is required to be stable and causal. 
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hp I A general linear filtering problem formulation Bailed on noisy 
rnt-dsuremems i/(A* -|- in), the signal /(A)is to be estimated. Model errors 
in transfer functions are described by stochastic error models 


We have made these choices to obtain tidy and trunspjiep' 
design equations and to avoid coprime factorizations, which 
are known to be numerically sensitive. In (2.4), it is assumed 
that Q , W, and T may be uncertain. Introduction of uncertainty 
in the weighting matrix W is not motivated m filtering 
problems. Its role in open-loop control will be discussed m 
Section III-B below. It is shown in Appendix C that uncertainty 
in V does not affect the optimal filter design, provided it 
is uncorrelated to uncertainties in other blocks. Therefore, 
uncertainty in V is not introduced. 

The extended design models, cf. (1.1) and (2.1) 


The transfer function H is designed to minimize the averaged 
mean square error (MSE) criterion (1.2) 

e 

.1 = tr EE(e(k)e(k)*) = ££(e(Jfc)*e(*:)) = £££|e,(fc)| 2 

1 = 1 

(2.3) 

where 

*(*) = (*i(fc) ■ • • = Wfo- 1 )(/(*) - /(*|Jk + m)). 

Above, W is a stable and causal t\f rational weighting 
matrix, with a stable and causal inverse. It may be used by 
the designer to emphasize filtering performance in particular 
frequency bands. In filtering problems, VV is not assumed 
uncertain. 

Model (2.1) offers considerable flexibility. For example, 
when estimating a signal u(k) in colored noise, wc set Q = 

V = I*, giving f(k) = u(k). In deconvolution, or input 
estimation problems, G is a dynamic system and V = I s . In 
a state estimation problem, u(k) is the state vector, Q and 

V arc constant matrices while H v(k) represents (colored) 
measurement noise. Other special cases are discussed in [2], 
|3], and 18]. 

Example. An application where uncertain dynamics in Q 
is of interest is equalizer design for digital mobile radio 
communications [23]. A signal u(k) then propagates along 
multiple paths, with different time delays, represented by 
delays in Q. The receiving antenna may have p > 1 elements 
(diversity design). See, e.g., [4]. Thus, an appropriate model 
of Q is a column vector of FIR channels, i.e., a vector 
ol polynomials. The polynomials coefficients are estimated 
bom short and noisy training sequences, with a known input 
{ u(k)}. Estimation errors are inevitable. The task of a (robust) 
equalizer is to estimate u(fc), based on noisy measurements 
'/(fc + m), a nominal model Go , and an estimate of the amount 
of model uncertainty □ 


i Parameterization of the Model 

We choose to parameterize Q and H as left MFD’s having 
diagonal denominators, 1 while T, Z>, and W are parameterized 
in common denominator form 

g = A~ 1 B ] H = N~ l M ( 2 . 4 ) 








1 Note that this is a natural choice, if transfer functions are obtained by 
neans of identification. 


0 = {?o + Ag, H = Ho + AK F - F„ + AT 

are now expressed in polynomial matrix form. Using B 0 = 
A\B 0 , B\ = A a B\ etc. we introduce 

G=A; 1 B 0 + A^B 1 AB 
= a: 1 A:\Bo + B t AB) = A~'B 

h = n: 1 m 0 + n^m 1 am 

— N~ 1 Nj i (M 0 + M\AM ^ N~ : M (2.5) 

e = -^-c 0 + -±-c 1 ac 

JJ 0 U\ 

Above, g o = A~^B 0 represents the nominal model and AG = 
A[ X B\AB the error model. The same holds for H and T. 
The diagonal polynomial matrices A — A 0 j4i, N = N„N U 
and the polynomials D = D 0 D \, T and V are all assumed to 
be stable, with causal inverses. Denominator polynomials are 
assumed monic. In the error models, the polynomial D\, the 
diagonal matrices A\ and N\, and the matrices GY B\ and 
are fixed. They can be used to tailor the error models for 
specific needs. For example, if multiplicative error models are 
deemed appropriate, we use A\ = A 0f By = B 0 B m etc., with 
B m to be specified. 

The matrices AB , AC , and AM contain polynomials, 
with jointly distributed random variables as coefficients. These 
coefficients parameterize the class of assumed true systems. 
One particular modeling error is represented by one partic¬ 
ular realization of the random coefficients. 2 Element ij of a 
stochastic polynomial matrix AP is denoted 

AP ,J = [An, = A P? + ApiV 1 +' • • + A p\ J p q- hp (2.6) 

where bp is the degree of AP , i.e., the highest degree 
appearing in any polynomial AP tJ . All coefficients have zero 
means, so the nominal model is the average model in the set. 
Only the second-order moments of the random coefficients 
need to be specified, since the type of distribution, and higher 
order moments, will not affect the filter design. The parameter 

2 For a given system realization, the coefficients are assumed time-invariant 
and independent of the time-senes r'(fc) and v[k). This is in contrast to 
the approach of Haddad and Bernstein in [15], who represent the effect of 
uncertainties by multiplicative noises. For a given uncertainty variance, a 
noise representation would underestimate the true effect of (time-invariant) 
parameter deviations in the dynamics. 
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covariances art denoted S{Ap]?)(Ap^ k )* and are collected in 
covariance matrices F^p^; see Section Il-C. 

We now introduce the assumption 

* Ah The coefficients of all polynomial elements in AC 
are independent of those in AB. 

It is possible to exclude Assumption At, but it does simplify 
the solution, and it is also reasonable in most practical cases. 

Error models can be obtained from ordinary identifica¬ 
tion experiments, provided the model structures match. For 
SISO systems, error models can be estimated in presence 
of under-modeling, using a maximum likelihood approach 
[11]- Even if the statistics is hard to obtain, one could 
still use the elements of covariance matrices pragmatically, 
as robustness “tuning knobs.” They are then used similarly 
as when weighting matrices are adjusted in LQG controller 
design. An objective could be to obtain reasonable perfor¬ 
mance for the uncertainty set, for a prespecified acceptable 
degradation of performance in the nominal case. The error 
models may also be used to account for a slowly time-varying 
dynamics [25J. 

One way of obtaining the models (2.5M2.6) is by senes 
expansion of state-space models with parametric uncertainty 
[37]. Parameter deviations are represented by stochastic vari¬ 
ables. For small uncertainties, a first-order expansion can 
be used, which will directly lead to models of type (2.5). 
For larger uncertainties, a second-order Taylor expansion is 
usually sufficient; see [29]. Error models for nonparametric 
uncertainties can be adjusted directly to frequency domain 
data. In that context, a very useful concept is provided by the 
stochastic frequency domain theory of Goodwin and Salgado; 
see [12]. 

C. Covariance Matrices for the Stochastic Coefficients 

To represent the uncertainties of the system in a natural 
way, covariance matrices will be organized as follows. The 
*jfth element of a stochastic polynomial matrix AP can be 
expressed as 

A) = ^(g -1 )?,, (2.7) 

where 

g> T (g _1 ) = (ig -1 ■■■q ~ ip ); P,j = {Ap‘j Ap\ J -■ Ap' s 3 p ) T . 

(2.8) 


The cross covariance matrix of dimension 6p •+ 

l|tfp+1 , between coefficients of AP iy (q~*) and AP ek (q~ l ) t 
is given by 

P % tk) = Ep x ,p,' k 

■E(Ap'J)(Ap‘ 0 k r E(Art)(Ap tk )*- 

E(Atf p )(Api k r ■ E(Ap l ,' p )(Ap tk r _ 

(2.9) 

where P^p U ' ) is Hermitian and positive semidefinite, while 
p& £fe) = (p£p ,u) r. Thus 

EiAP^APi^E^^p^A q )) = SP^%l 

( 2 . 10 ) 

With autocovariances, (ij) = (ffc), we model the un¬ 
certainty within each input-output pair. Cross-dependencies 
between different transfer functions may also be known For 
example, uncertainty in one single physical parameter may 
very well enter into several transfer functions between inputs 
and outputs. Such effects are captured by cross covariances, 

m * m- 

We collect all matrices of type (2.9) into one large covari¬ 
ance matrix, organized as shown by (2.11) at the bottom ol 
the page. If AP has dimension n\m , then P A p is composed 
of nm by nm covariance matrices The structure of 

(2.11) is useful from a design point of view. It, for example, 
a multivariable moving average model, or FIR model, is to be 
identified, then (2.11) is the natural way of representing the co- 
variance matrix. If we instead prefer to use the blocks 
of (2.11) as multivariable “tuning knobs,” a given amount ol 
uncertainty can be assigned to a specific input-output pair 

III. Design Of Robust Filters 

A. An Averaged Spectral Factorization 

We define an averaged spectral factor fl(q~ l ) as the nu 
merator polynomial matrix of an averaged innovations model 
It constitutes a key element of the robust filter. The average 
over the set of models, of the spectral density matrix $ y (e ,u; ) 
of the measurement y(k) is given by 

E{*y(e' u )} = 
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r AP 
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square polynomial matrix 0(z~ l ) is given by the stable 
mon to 

00„ - E{NBCC+B*N+ + DAMM*A+D m ) (3.1) 

\ ie that JV" 1 and A” 1 are diagonal and will thus commute 
« i averaged second-order statistics of y(k) is thus described 
I the same spectral density as for a vector-ARMA model 

U {k) = ±A- 1 N-'fit{k) 0 2) 

vs here t(A) is while with a unit covariance matrix This model 
is denoted as the averaged innovations model (Note that 
q(k) ^ q(k)< but — E {$ y (< ,u '’)}) When constructing 

the right-hand side of (3 1), the lollowing results are useful 
Lemma 1 Let H(q q ~*) he an m |/n polynomial matnx 
with double-sided polynomial elements having stochastic co- 
dlicients Also, let G(qf 1 ) be an n\m polynomial matrix with 
polynomial elements having stochastic coefhcients, indepen¬ 
dent of all those of H Then 


/?(()). Us degree, rip, will be determined by the maximal d< «r 
of the two right-hand side terms of (3 6) * 

To obtain the right-hand side of (3 6), aveiagcd polvoo 
mial matnees J£( AP//AP*) have to be computed when 
H{q q~ l ) = CC* or I. It is shown in Appendix B thai ihe 
tj th element of AfAP/fAP#) is given by 


W o I 


E[APH = ti// 


P 


(•1 jD 


T>(ll /II l) 

L*ap 


0 v 1 



0 7) 


where ip r was defined in (2 8) The block covariance matrix 
in (3 7) constitutes the block-Uansposc ot the / /th block [ ] 
ofP Ar in (2 11) Average factois in (3 5) are readily obtained 
by substituting AC, A B, and AM for A P in (3 7) 


E[GHG>] - E[GE(H)G*} (3 3) 

□ 

P/oof Sec Appendix A 

Now, introduce the double-sided polynomial matrices 

CC.±L(CC t ) B c B ( , = E(BCCJJ, ), 

MM , = E(MM r ) (14) 

Invoking (2 5) and using the fact that the stochastic coefficients 
arc assumed to be zero mean, gives 

CG\ = C„C„* + C } Zs(ACAC,)C u 

= B n CC+B { „ + B x E(±BCC,<iB+)Bu 
MM* = M 0 M (J * h M J it/ 1 (AMAM*)M 1 * (3 5) 

1 acton/ations to obtain C, B c , etc need not be perloimcd 
Ihe double-sided polynomial matrices are expressed as 6 T C*, 
etc merely to simplify the notation 
Lemma 2 Let Assumption A1 hold By using (3 4), (3 5) 
and invoking Lemma 1, the aveiaged spectral factorization 
0 1) can be expressed as 


B A Set and Spec tr a I Fac to/ ization 

In the feedforward control problems discussed in Section 
V, we shall allow for W being uncertain Using a common 
denominator loim, W is parameterized in a similai way as T 
(d (2 5)) 

w = vL + avv,L = J_(v„ + apk,) £ ~v 

< a t'l I'fjt'l f/ 

(3 8) 

A stable square matrix V, with P(0) nonsingular, is inlio- 
duced as a solution ot the right spectral factorization 

V,V = E{V.V) = V,„V 0 + V U E(AV^V)V } (3 9) 

Also, introduce the following assumptions 

• A3. The coefficients of AP are independent of all other 
stochastic coefficients 

• A4. The right-hand side of (3 9) is nonsingular on the 
unit circle 

Whenever W is known, (W = V„/U n — V/t ), (3 9) need 
not be solved, and V = V 0 = V This will be the case in 
filtering problems 


00, = NBcBc.N, + DAMM+AJ)* (3 6) 

□ 

Ptoof See Appendix A 

With a given right-hand side, (3 6) is just an ordinary 
polynomial matnx left spectral facton/alion It is solvable 
inder the following mild assumption 

• A2. The averaged spectral density matrix is 

nonsingular for all u; 

This assumption is equivalent to the nght-hand side of 
13 6) being nonsmgular on \z\ = 1 Then, the solution to 
3 6) is unique, up to a right unitary factor (If HH * = I, 
hen 00, = (/?/f)(//*/?,).) Under Assumption A2, a solution 
exists, with 0 having nonsingular leading coefficient matrix 


C The Cautious Multtvai/able Wienet fiber 

Theorem I Assume an extended design model (2 1), (2 4), 
(2 5), (3 8) lobe given, with known covariance matrices (2 11) 
Assume A1-A4 to hold A realizable estimatoi of f(h) then 
minimizes the averaged MSE (2 3), among all linear time- 
invanant estimators based on q(k -I- m), if and only if it has 
the same coprime factors as 

f(k\k + m) = Hy(h + vi) = jV 'Q0- l NA n(h + m) 

(3 10) 

'When solving 0 6), we have utilized an algonlhm by JcJek and Kuicra, 
presented in (181 It provides a solution with an upper triangular full 
rank leading coefficient matnx For an overview of spectral facton/aiion 
algonthms, see |22J 



IEEE TRANSACTIONS ON AUTOMATIC CONTROL. VOL. 40, NO, 3, MARCH 10 s 


410 


Here, is obtained from (3.6), K(g- 1 ) from (3.9) while 

together with L.(q), both of dimensions l\p, is the 
unique solution to the unilateral Diophantine equation 

q- m VSC& t & at N' = Q0, + qL,UTDl p (3.11) 


with generic 4 degrees 

nQ = max(ni) -I- uh + nc -f m, nu + nt + nd - 1) 
nL * — max(nr + nb Q + mi - m, nft) - 1 

(3.12) 

where v» = deg S etc. When applying the estimator (3,10) on 
an ensemble of systems, the minimal criterion value becomes 


*VSc[l n -C.B„.N.p: i p- l NB, I (\ 

x C.S.V ,}-. (3.13) 

J z 


□ 


Proof: See Appendix C. 

Remarks: The only new type of computation, as compared 
to the nominal case described in [1]-[3], is the calculation of 
averaged polynomials using (3.7). 

Since both V and p are stable, the estimator K will be 
stable. 5 If Assumptions A2 and A4 hold, V^O) and /?(()) are 
nonsingular, so H will be causal. 

Note that the diagonal matrix NA = N 0 N\A 0 A\ appears 
explicitly in the filter (3.10). Thus, important properties of the 
robust estimator are evident by direct inspection. For example, 
assume some diagonal elements of ATf 1 or Af l in the error 
models to have resonance peaks, indicating large uncertainty 
at the corresponding frequencies. Then, the filter will have 
notches, so the filter gain from the uncertain components of 
y(k + 7?i ) will be low at the relevant frequencies. 

The nominal Wiener filter has as a component a whitening 
filter. The robust estimator has a similar structure. By mul¬ 
tiplying R by the stable common factor D/D, the filter in 
(3.10) will contain fi~ l NAD as right factor. This averaged 
counterpart of a whitening filter is the inverse of the averaged 
innovations model (3,2). 

The model structure (2.4M2.5) was selected to obtain a few 
simple design equations. Other choices are possible, but lead to 
various complications. For example, if stochastic polynomials 
had been introduced in the denominators, no exact analytical 
solution could have been obtained. Also stability would have 
been a problem. The use of general left MFD representations, 
instead of forms with diagonal denominators or common 
denominators, would have led to a solution involving seven 
coprime factorizations. Such a solution is presented in [29], 
but it provides less physical insight. It does also exhibit worse 
numerical behavior, since algorithms for coprime factorization 
are numerically sensitive. 


4 In special cases, the degrees may be lower. 

s Stable common factors may exist in (3.10). They could be detected by 
calculating invariant polynomials of the involved matrices. If such factors have 
zeros close to the unit circle, it is advisable to cancel them before the filter is 
implemented. Otherwise, slowly decaying (initial) transients may deteriorate 
the filtering performance. 


Furthermore, (3,11) is a unilateral Diophantine equatioi 
since Q ahd L+ appear on the same side of the terms in whic) 
they are involved. (When the unknowns appear on opposiu 
sides, the equation is bilateral.) This property is a consequent! 
of our choice of U, T, and D as scalar polynomials. Unilateral 
equations can easily be transformed into a system of linear 
equations, AX = B, where A is a block-Toeplitz matrix. For 
an example, see Section IV. 

Robust design also makes the solution less numerically 
sensitive. Almost common factors of del/?* and UTD with 
zeros close to \z\ = 1 would make the solution of (3.11) 
numerically sensitive. In the presence of model uncertainty, 
the risk for this is less than in the nominal case, due to the 
presence of averaged factors in (3.5). The averaged spectral 
factor P will, in general, have its zeros more distant from the 
unit circle than the nominal spectral factor, given by (3.20) 
below. This reduces the numerical difficulty of solving both 
(3.6) and (3. 11). 

For every cautious Wiener filter, there exists a system 
(without uncertainty) for which this estimator is the optimal 
Wiener filler (see 129]). It is therefore possible to represent 
model uncertainties by colored noises and then design a 
Wiener filter for the corresponding system. This correspon¬ 
dence provides a way of understanding the structure ol the 
above desing equaiions. We do not recommend such an 
equivalent noise-approach in the actual design, however, for 
two reasons: 

• It is far from trivial to obtain a noise spectrum having 
similar effect on the filter design as do uncertainties in the 
block & in Fig. I. This is true in particular if the block 
T is also uncertain, and if the problem is multivariable. 

• It is advantageous from a design poini of view to 
have separate tools which handle different aspects 
Error models should represent the effect of modeling 
uncertainty; noise models should represent disturbances, 
criterion weighting functions should reflect the priorities 
of the user. A method which does not distinguish between 
these aspects will tend to confuse ihe designer. 

The attainable performance improves monotonicully with 
an increasing smoothing lag in. The following result gives the 
lower bound of the averaged estimation error. This bound can 
be approached poinlwise in the frequency domain for in < rx 
by using a criterion filter W with a high resonance peak. 

Corollary 1: The limiting estimator for in —* oo, th( 
nonrealizable cautious Wiener filter, can be expressed as 

lini q m Tl = l-SCC.Bo'N.p-'p-'NA. (3.14 

TH—>O0 J 

Its average performance is given by (3.13) with L = 0. 1 
W = If, the spectral distribution of the lower bound of the 
estimation error f(k) - f{k\k + rri) is 

J&, */-/('■'“)= fm^ tr<se 

[i. -6.i3 m N.p;'r'NB.c]c.s.). (3.15) 
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c bound can be attained at a frequency by an estimator 
; rh finite smoothing lag, if it is designed using a weighted 
, lerion where 

.'Pfc^JwO, (3.16) 

□ 

Proof: In a similar way as in Appendix A.3 of [8], it is 
straightforward to show that L —► 0 as m —► oo in (3.11). 
Thus, (3.11) gives 


lim q m Q = VSCC t B nt N*/}~ 1 . (3.17) 

The substitution of this expression into (3.10) gives (3.14). 
The use of L = 0, V = I( and U = 1 in the integrand of 
(3.13) gives (3.15). When U(e~ tUi ) « 0, we obtain the same 
effect on the Diophantine equation (3.11) at the frequency vj 
as if L —♦ 0: the rightmost term vanishes. Thus, at the gain 
and the phase of the elements of the polynomial matrix q m Q 
are approximately equal to those of (3.17) and the estimation 
error approaches the lower bound (3.15) ■ 

Remarks: Note that for realizable estimators (m finite), the 
lower bound (3.15) is only attainable at distinct frequencies u>i 
by means of frequency weighting. For frequencies outside the 
bandwidth of W, the estimate may be severely degraded. The 
results of Corollary 1 are illustrated at the end of the example 
in Section IV. 

I). Analytical Expressions for Performance Evaluation 

Theorem 2: Let a nominal estimator 72 ri be designed based 
on a nominal model, with no uncertainties. Applying it, instead 
ol (3.10), on an ensemble of systems results in an increase, 
as compared to (3.13), of the averaged MSE. The increase is 
given by 

l.rEE(e(k)e(k)*) n - tr EE(e(k)e(ky) uun = 

|| W(7£„ - Tl)z m D~ x A~' 1 N~ l fi\\\ (3.18) 

where f) is defined by (3.1 )-(3.6) and 72 is the robust estimator 

(3.10). n 

Proof: To obtain (3.18), the nominal filter 72 n is ex¬ 
pressed as 72+ (72,,-72). The optimality of 72 implies that any 
modification gives an orthogonal contribution to the criterion. 
This, and the use of the averaged innovations model (3.2), 
gives (3.18). Mixed terms vanish, due to the orthogonality, g 
Theorem 3: Let a robust estimator 72 be designed by 
(3.6)—(3.11). When applying it on a system equal to the 
nominal model, the increased MSE, as compared to the 
minimum obtainable with a nominal estimator 72 n , is 

•r E(e(k)e(ky) - tr E{e{k)e(kT) n = 

\\W(K - Tl n )z m D; l A- 0 'N: l {i 0 1||. (3.19) 

Here, D^A^N; 1 ^ is the nominal innovations model and 
ft 0 is obtained from the nominal spectral factorization 

ftnPo* = N 0 IJ 0 C 0 Co*B 0 »N 0 + -T D 0 A 0 M 0 M 0 +AomDo*’ 

(3.20) 

□ 



Fig, 2. Mode! and Idler structure in the design cxajTipIc. 


Proof: Analogous to that of Theorem 2, by expressing 
72 as 72, , -T (72 — 72 n ). g 

Remarks: Expression (3.18) can be used for arbitrary linear 
estimators 72, M for example, minimax-designs. Expression 
(3.19) quantifies the price paid in nominal performance for 
obtaining a robust design. The averaged innovations model 
in (3.18), and the nominal innovations model in (3.19), to¬ 
gether with the filter W, can be seen as weighting func¬ 
tions. 

The largest effect of robust filtering is obtained at moderate 
and high signal-to-noise ratios. If the variance of broad-band 
measurement noise is increased, the gains of both the nominal 
and the robust filters decrease. If the noise level is high, 
performance differences between nominal and robust solutions 
tend to be small. 


IV. A Design Example 

Assume that a scalar signal.. u(k) is to be estimated. It is 
described by a first order AR-process without uncertainly 

,i(fc)= i-()U-» t;(fc) : Ec[kf = L 

Thus ,V = S/T - 1, Dj = 1 ,D- D u = 1 —0.E >q~\C 0 = 1, 
and Ci = 0. This signal is measured by two transducers 
(p = 2), with nominal models being second order FIR filters. 
The transducers are modeled by 


y(k) - (B„ + Ay 1 AB)u(k) + w(k) 


with 


(B} t l \ / 0.100 + 0 . 080 r /~ 2 \ 

~ V 1 - lA <r ] + 0.92 if 1 ) 


(4.1) 



Thus, B\ = A a — I 2 and A — A\ are used in (2.5). See 
Fig. 2. 

In the first transducer B 11 , there is only a single uncertain 
parameter. It affects the coefficients A b l 0 l and Awith 
opposite signs, so they have zero mean, variance r{ and 
cross-covariance -r\. In the second transducer, the stochastic 
coefficients are assumed mutually uncorrelated, with zero 
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means and equal variance t\. Thus, the auto-covanancc ma¬ 
trices are 

P i l B n) = r? ^ 0 0 0 j P^ 21) = ii h (4 2) 

The scale factors (standard deviations) of the uncertainties 
are set to 


The spectral factorization (3 6 ) has dimension p\p — 2\ 
Using (4 6 ), it reduces to 

P0 m = B c B c * + DAMM+A+ D* 

=r A\B 0 B 0 +Au + E(ABAB*) + 0 QlDAiA^D* 

where 


/i = 0 02 r 2 = 0 10 (4 3) 


Coefficients of AB 11 and AB 21 are assumed mutually uncor¬ 
related The complete covariance matrix (2 11 ) then becomes 


Pab 


l 0 P 


(4 4) 


The measurement noises w t (k) have variance 0 01 They are 
white and mutually uncorrelated Thus, w(k) = Mv(k), with 


+ 3 " 5 ) 

By using the Newton-based algorithm described in [18], a 
stable averaged spectral factor, with / 3 (f)) nonsingular, was 
found to be (4 9), as shown at the bottom of the page 
In the Diophantine equation (3 11 ), we use m = 0 , V = 
V = 1 , S = 1 , cc* = 1, N+ = 1 2 , U — 1 , and T = 1 
Equation (3 11) thus reduces to 


M = M 0 =-0 1 1 2 (4 5) 


Ba*A u = Qfa + L t ql)l 2 (4 10 ) 


The goal is now to design a filter (m = 0), which estimates 
u(k) based on the two measurements yi(k) and t/ 2 (/r) Fre¬ 
quency weighting is not used used here (W = 1 ), but its 
influence will be illustrated at the end of the example In 
(3 4H3 5), wc obtain 

CC\ = E(CC m ) =1 

E(MM *) =0 01I 2 (4 6 ) 

Be Be * = EiBCC.B*) - B 0 D + L(ABAB«) 


Expression (3 7) or (2 10) gives 
E(AB n Aii“) = ^P^ n Vi 

=(i 7 -v>^o i o 

-f?(-(/ 2 +2-r/- 2 ) (4 7) 

Note that F (AB^AB] 1 ) has zeros at 2 = 1 and at z — -1 
Thus, the static gain and the high-lrequency gam are assumed 
to be exactly known Furthermore 

E(An 2l ABl') = * T V = Irj (4 8 ) 


The degrees (3 12) are nQ = 0 nL * — 2 By expressing the 
polynomial matrices as matrix polynomials (4 10) becomes 

(B(* + B*f/+</ 2 )(Ii + A**?) =Q n (0,* +At? -1 fl\q 2 -+ A*'/ J ) 

+ (Lo+Lj(/+Lify 2 ) 

x (—0 5 + 7 )I 2 


Transpose this equation and note that since the coefficient 
matrices are real-valued, P* J — P, By equating the two 
sides for each power of 7 separately, a lineai system of eight 
equations, m block-Toepliz form is obtained 


/ Bo 
Bi+AiBo 

B2+A1B1 

AiB > 


/A) 

- 0 5I 2 

0 

0 ^ 

/Qo 

Pi 

h 

-0 'll2 

0 

L 0 

Pi 

0 

Ii 

-0 5Ii 

Ll 

\Pi 

0 

0 

Ii / 

\Li 


With numerical values from (4 1) and (4 9), wc obtain (x), as 
found at the bottom of the page The solution is 


Q = (0 4005 0 7710) 

L, = (0 0290 + 0 0200(7 - 0 205 3 + 0 .322 \<j - 0 1299? 2 ) 

(4 II 


P 



0 1330 - 0 01807(7- 1 + 0 01022?' 2 0 07802 - 0 01488(7 1 + 0 00905? 2 

-0 1474?" 1 + 0 2908?~ 2 - 0 1325?- 1 1 1585 - 2 0327? 1 + 1 6219?' 2 - 0 4705?' 


1 N 


1339 

07802 

- 5 

0 

0 

0 

0 

0 


(Q?\ 

l 


0 

1 1585 

0 

- 5 

0 

0 

0 

0 


Ql 2 

0 


- 01807 

- 01488 

1 

0 

- 5 

0 

0 

0 


C 

-2 


- 1474 

-2 0327 

0 

1 

0 

- 5 

0 

0 


el 2 

08 


01622 

06905 

0 

0 

1 

0 

- 5 

0 



1 76 


2908 

1 0219 

0 

0 

0 

1 

0 

- 5 


n 2 

0 


0 

0 

0 

0 

0 

0 

1 

0 


ty 

V- 552/ 


- 1325 

- 4765 

0 

0 

0 

0 

0 

1 ) 


\eyJ 


) 


(4 9 


(X) 



N ttal MULTIVARrABLE ROBUST FUTLRING AND OPEN LOOP CONTROL 



ol possible true systems Magmtudt plots for the gams from </i(A) (upper) 
jncl i/iU) (lowu) aic shown for the robust estimator (dash dolled) md lor 
the nominal Wiener tiller design (dashed) 


Finally the robust estimator (3 10) becomes 

n = Qp 'A: = Jj-(hy K]*) (4 12 ) 

where the monic denominator is //,((/"') =dct f) (i/ -1 )/dr1 
p t) We obtain 

A," = 2 9922 - 1 r »l SHr/ 1 f 2 7 30'*/ 2 - 0 

A, 1 -' 0 4()'» r ) — 0 3141 r/ 1 — 0 0011 r ><] 2 + 0 05841r/ 1 

/?, — 1 - ] S19tr/ 1 + t (>() 13</ 2 0 6 r )84i/ _1 

4 0 08 179(/ _ 4 4 0 0()9l82f/ _r ' 

A tonesponding nominal estimator (with no uncertainty as 
sumed) is given by (A, 11 A, 1 *’)//?„ wiih 

A, 1 , 1 - 0 7419 - 1 0944r/~ 1 + 0 3617 <f 2 
K) 2 - 0 8792 - 0 3707r; -1 - 0 0 314 r jq~ 2 
R„ = 1 - ] 7780 q 1 + 1 A2b l )q~ 2 - 0 39 {hi; -1 

Fig 4 shows the Bode magnitude plots tor the robust and 
nominal estimators Also shown is the nominal transducer 
model and 1S randomly chosen systems These were generated 
by using B — B t) + £B, with covanance matrix (4 4) and 

Odussian distributions The channel B l] has its uncertainty 
concentrated aiound the notch while B 21 is uncertain mainly 
it low frequencies 

The gains of the nominal estimator (dashed curves) ate 
determined exclusively hy the nominal signal to noise ratios 
The gains of the robust estimator (dash-dotted) are determined 
the balance between noise levels and model uncertainties 
n the two channels For example the robust filter “knows” 
that channel 1 is well known, as compared to channel 2 
Consequently, a higher gam is used from iji(k) as compared to 
the nominal case, and a lower gain from ijzih ) The difference, 
is compared to nominal design, is largest at low frequencies 
There, the dynamics ol channel 1 is almost perfectly known, 
while channel 2 is very uncertain The nominal filter gain in 
channel 2 is an approximate inverse of the nominal transducer 
In contrast to the nominal filter, the robust filter has hardly 



10® 


K-Q&JPSr RLTl R MA( N1 , 


0 1 2 \ 4 


Fig 4 Bode magnitude plots lor the transfer tunUion from ufA) m 
u(A|A)tor the nominal system with lohust ind nominal estimators 





Fig S MSI for lobust (dashed) and nominal lillei (solid) One ol the lour 
uneerlim parameters is varied while the others ire held at nominal values 
Also shown is the vanance of i/(A ) (uppu dotted) It eorrtsponds to tin error 
caused by the trivial estimate u(K) — 0 The lower dotted eurvt is the 
lower bound aelnevible with knowledge ol the trut punmeler values Rings 
(o) indicate the two standard deviation limits e>i eaeli parimeter 


any peak at the (uncertain) notch around uj - 0 7 It utilizes 
channel 1 more at this frequency 

Fig 4 shows Bode magnitude plots of transfer functions 
from u(k) to u(A|A) Since the noise levels are rather low 
the nominal estimator performs an almost complete inversion 
of the nominal transducers The robust estimator is somewhat 
more cautious but it also accomplishes a rafher good inver 
sion It utilizes the two measurement signals differently than 
the nominal estimator 

Fig 5 shows the mean square estimation cnor when one 
of the uncertain parameters 

Afc 1 , 1 = - Abj 1 = i A/> 2] At,’ 1 Afc? 1 

is varied while the others arc zero The foui parameters above 
span the set of assumed true systems, the extended design 
model On average, over the four uncorrelated stochastic 
coefficients, the MSE is 0 32 for the robust filler and 0 90 
for the nominal design Note, however, that when i is varied, 
the robust design (dashed) is actually slightly moie sensitive 
than the nominal design This is a price paid tor reducing the 
sensitivity in the other dimensions 6 

6 It is natural that the robust filter has a somewhat increased sensitivity to 
model errors in channel I it has higher gain in (hat channel This result is due 
lo the much larger uncertainly in channel 2 at mosi frequencies A designer 
worried about this effect could simply increase the value of the standard 
deviation ? i used in the design 
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Fig. 6, The average spectral density of the estimation error u{k) - u(k\k:). 
Shown in the plot are the lower bound, according to (3.15), (dash-dotted) 
and the average spectral densities obtained with (dashed) and without (solid) 
frequency weighting. 



Fig. 7. A block diagram which is dual to the one in Fig. 1. Arrows arc 
reversed, summation points and node points are interchanged, and transfer 
functions are transposed. The corresponding robust control problem is to 
design K 1 to minimize the average, over the class of models, of the Hi -norm 
of the transfer function from v to r = (z{ r.j ) r . 


The robust estimator, of course, does not perform as well 
as the nominal one in the nominal case. This is mainly due to 
its somewhat lower gain from u(k) to u(fc|fc); see Fig. 4. It is 
evident from Fig. 5 that this performance loss is very small, 
as compared to the improvement in nonideal situations. 7 

As an alternative, we tried to investigate minimax designs, 
i.e., worst case 7f 2 -designs, assuming rectangular parameter 
distributions. This turned out to be prohibitively difficult, since 
no point where min^nuiXAZje(fc) 2 = max a# miri 7 ? e(k) 2 
could be found. These difficulties were in marked contrast to 
the ease of designing a cautious Wiener filter based on the 
averaged Ti 2 -criterion. 

Let us finally illustrate the effect of using a frequency- 
dependent weighting function in the criterion (2.3). Assume 
that the performance at frequencies close lo lj = 0.9 is of 
particular importance. The choice 



1 

1 - 1.2184 r/" 1 + 0.9604r/~ 2 


(4.13) 


with a high resonance peak (\z\ = 0.98) at lj = 0.9 should, 
according to Corollary 1, result in a performance, at that 
frequency, close to the lower bound. Fig. 6 confirms this. 
Performance is substantially degraded at higher frequencies, 
however, where estimation accuracy is not emphasized. 


V. Robust Feedforward Control 

A class of feedforward control problems turns out to be 
dual to the filtering problems discussed in Section II. We 
include a brief separate discussion of them, since it offers 
several engineering insights. Feedforward compensation does 
not affect the classical sensitivity function. The effect of an x% 
model deviation at a particular frequency, however, on e.g., 
the step response, will very much depend on the (nominal) 
magnitude of the transfer function at that frequency. As the 
gain at a particular frequency is increased by a feedforward 
link, model errors at that frequency become more and more 
noticeable. Therefore, it is of value to take model uncertainty 
into account explicitly in the feedforward design. 

7 It can also be noted that it is of advantage to use both channels. The 
minimal MSE, for channels equal to the nominal models, is 0.07 if both 
channels are used. It is 0.59 if only channel 1 is used and 0.11 if only channel 
2 is used. The average MSE of the robust filter (0.32) is in fact lower than 
the nominal MSB for an estimator which uses only channel 1 (0.59). 


To stress the duality to filtering problems, the output of an 
uncertain but stable model will be described by 

y(k) = (Gl + A g r )u(k)+V r w(k) 

= {BlA ~ t + Aff 7 B T X A\ T )u(k) + J,s r w(k) (5. 1 ) 

where A' 7 denotes inverse and transpose. The rational and 
polynomial matrices above have properties as outlined in 
Section 11. The matrices Q 1 and V 1 may contain delays. Based 
on possibly delayed or advanced measurements of 


w(k) = 

(W 7 + A W T )v(k) 


- 

(■k v ‘+K* v ‘ v ')” W; 


E(v{kfv(k)) = 

1/ 

(5.2) 

a stable controller 



u(k) 

II 

1 

Ki 

+ 

(5.3) 


is to be designed to minimize the averaged H 2 -norm of the 
transfer function from v to ( z{ ) 7 = {{T 1 y) T (H 7 u) 1 ) 7 


./' = E 


r T (v T -g T K T 3 m )w 1 


- n T n L 


i w l 


(5.4) 


The control weighting 7 i 7 = M 7 N~ 1 and the outpul 
weighting T 1 = C 7 /D are normally specified by the designer 
and exactly known, i.e,, AH 7 — 0, AT 1 = 0. 

Problem formulation (5.1)—(5.4) may represent a distur¬ 
bance measurement feedforward design. When rn > 0, the 
disturbance w(k) can be measured before it affects the system 
via V 7 . (Such a situation could, equivalently, be described by 
a delay q~ m in V 7 .) The formulation also covers reference 
feedforward problems, feedforward decoupling, and model 
matching. Then, w(k) is a command signal and W T v(k) is 
a (possibly uncertain) stochastic model, describing its second 
order properties. A servo filter Tl 7 is to be designed, so that 
the output — G T u(k ) optimally follows the response model 
V T w(h ). 8 In decoupling problems, V T is diagonal. 

The duality of feedforward control to the previously dis¬ 
cussed filtering problem has been described in [5] for the 

8 The corresponding nominal result (without uncertainty) was discussed in 
135] and in |5]. See also 127), where common denominator forms were used. 
The robust controller for S1SO systems was presented in [36]. In (6], the 
applicability of the averaged W 2 -criterion is demonstrated for a somewhat 
more general class of open-loop type problems. 
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rninal case. The corresponding result in the presence of 
■odel errors is given below. 

Theorem 4: A feedforward filter, which solves the robust 
t ontrol problem (5.1M5.4) under assumptions A1-A4 is given 
v transposing K from (3.10), or 

: u{k) = -A T N T fi- T Q T V~ T ^w{k + m) (5.5) 

where /3, Q, and V are given by (3.6), (3.11), and (3.9), 
respectively. □ 

Proof: The averaged H 2 -norm is invariant under trans¬ 
position. Thus, it fulfills the basic requirement of [5], By 
extending the discussion in [5] to uncertain models the result 
is obtained. ■ 

The uncertainty AW 1 of the disturbance or reference model 

(5.2) enters via the spectral factorization (3.9). Transposition 
of (3.9) gives 

V T vl = VxV T 0 VZ,U u + U n Vj'E(AV r AVl)VlU 0 ,. 

(5.6) 

This is the kind of left spectral factorization encountered when 
two noise sources are described by one innovations model. In 
fact, the uncertainty AW 7 has exactly the same effect on 
Ihe controller design as would a measurement noise on w(k), 
with spectral density Vf E(AV T AV^)Vl^/lJiU u . We do 
not need to solve a right spectral factorization (3.9) in this 
problem. The left spectral factorization (5.6) can be solved 
instead. 

As in the dual filtering case, uncertainty in the direct 
feedthrough, or response model V 1 = S T /T does not affect 
the optimal solution, if it is independent of the uncertainties 
in G r . 


Vl. Conclusions 

A method for designing robust filters and feedforward con¬ 
trollers, based on imperfectly known linear models, has been 
presented. Modeling errors were described by sets of models, 
parameterized by random variables with known covariances. 
A robust design was obtained by minimizing the ?f 2 -norm, 
averaged with respect to the assumed model errors. The 
estimator minimizes this criterion by balancing model uncer¬ 
tainties against noise properties, at different frequencies and in 
different measurement channels. When using robust filtering, 
the greatest sensitivity reduction is obtained at moderate and 
high signal to noise ratios. Dually, the largest impact of robust 
control is obtained for designs with low input penalties. 

One variant of the discussed filtering problems is to ex- 
olicitly define a part of the measurement vector as being 
i noise-free signal. This signal could e.g., represent known 
mputs to the system. Such a formulation is also of use in 
the optimization of decision feedback equalizers for digital 
communications [33], [34]. 

There exist efficient numerical algorithms based on a poly¬ 
nomial equations approach. (We have implemented them as 
MATLAB .m-files, and the code is available upon request.) For 
multivariable problems of high order and high signal vector 
dimension p , Riccati-based algorithms do, however, perform 
better numerically. For dimensions of the measurement vector 


up to, say, four, algorithms based directly on polynomial 
manipulation can in general be used safely. For higher 
mensions, we recommend analytical solutions to be obtained 
by an input-output approach, to gain engineering insight, but 
algorithms e.g., for spectral factorization to be based on state 
space formulations, cf. [22] 

Appendix 


A. Proofs of Lemmas 

Proof of Lemma 1: Let G = [G 7 ' ■ ■ ■ G 7 ] 7 , where 

Gj G n represent the n polynomial row vectors of the ?i|m 
matrix. Then, with H of dimension m|m, the //th element of 
E(GHG+) can be expressed as 

[E(GHG»)]ij = E(G t HG .,*) 

= E(lrGj*GjH) = tvE{G^Gi)£!(Hl 

In the last equality, we used the fact that all elements of G y *G» 
are independent of all those of H. We also have that 

tr E{G jm Gi)£(H) = E(GiE(H)Gj+) = [E(GE(H)G*)] h 

which proves (3.3), since £?(■) operates on all elements of 
G//G* ■ 

Proof of Lemma 2: If the coefficients of the elements 
of a polynomial matrix AP(f/" 1 ) are stochastic variables, 
then so are the coefficients of the elements in APAP*. The 
coefficients of the polynomial elements in A C and A B are 
independent and so are the coefficients of the elements in 
ACAC+ and ABAB+. By defining CC* as H and A B as 
G in Lemma 1 and using (3.4), the right-hand side of (3.1) 
becomes 

NE\BCCJh]N» -F DAE(MM+)A+D* = 

= N E[BE(CC+)B+]N * + DA£{MM*)A*D* 

= NE[BCC,B+]N* + DAMM+AJJ*. 

By once more utilizing (3.4), we obtain (3.6). 


B. Calculation of Averaged Polynomial Matrices 

Consider the matrices discussed in Lemma 1. Let the 
polynomial matrices G = AP and // be of dimensions n\m 
and m|m, respectively. Denote the /-th row of AP by 


A Pi = [AP' 1 • - ■ AP'”'] 


where A P' 3 are polynomials with stochastic coefficients. If H 
is assumed deterministic, the ij th element of E(APHAP*) 
can be written as 


ElAPHAP^tvEiAPj.AP^H = 

=ivHE[APi l ■ ■' A/V m ] r [AP , ‘ 1 ■ ■ ■ AP ,m ] 
r A P i 1 A Pi 1 - ■ A P trn A Pi 1 n 


=tr HE 


LAP a AP* m ■ • ■ AP* m AP* m J 


By using (2.10), we readily obtain (3.7) . ■ 
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C. Proof of Theorem 1 

A technique for constructive derivation of polynomial de¬ 
sign equations for Wiener filters was presented in ft]. This 
method is utilized here to minimize (2.3). The estimation error 
M pven by 

e(fc') = W (/(*)- /{Jb|* + ro)). (Cl) 

All admissible alternatives to a proposed estimate /(fc|fc + ra), 
can be described by 

d(k) = Hy(k -f ra) 4- P(k); u(k) = M y(k 4- m). (C.2) 

Here, M is a rational, stable and causal, but otherwise arbitrary 
transfer function. Define the weighted variation 

v(k) = WP(k.) 

= jjVM q m ^A~ 1 B-j-Ce(k) + 

Optimality of (2.2) is obtained if no perturbation v(k) will 
improve the average estimator performance. This occurs if and 
only if the error e(k) is orthogonal to any admissible weighted 
estimator variation v{k). in other words 

tr EE{e{k)v{k)') = tiEE{e{k)*v(k)) = 0. 

Then, the perturbed criterion value becomes 

.7 = tr EE[W(f(k) - d(k))][W(f(k) - d(k))Y 
= tr EE(e(k)£(kY - <r(A:)i/(fc)* 

- e{ky,y(k) + Kfc)i'(Jfc)*) 

= tr EE(e(k)e{k)* + (C.3) 

This expression is evidently minimized by t'(Aj) = 0. 

Since all transfer functions in (2.4) are assumed stable, both 
e(k) and v(k) are stationary. Parseval’s formula may then be 
used to express trEEe:(k)i;(k)* as 

tTEEjjV^S-(rnA- l B^Ce(k)-q m nN~ l Mv(k)^ 

x | ^VMq m ^A-'B^Ce(k) + N-'Mvik)^ J 

= tr E— l - l — 

2irj ,/|,| =1 UU.DD, 

x V$yZ~ m lpSCC,B*A- 1 -KA-'N- 1 

x ( NBCC.B.N . + DAMM.A,D.)N: l A ; 1 } 

., dz 

— . (C.4) 

z 

Note that A and N commute, since they are diagonal. We 
are allowed to move the expectation E inside the integration, 
since, for any particular realization of the elements of AC , 
AB , AM, and A V, all elements of the integrand are Riemann 
integrable on the unit circle; see e.g., f 17, Theorem 3.8]. The 
use of the trace rotation. trV{- * = trV^V {■ ■ }A4*, 


the spectral factorizations (3.1) and (3.9) and the Assumptiot 
A3, leads to 

X j 2 - m £(Jscc.B.)ii ; 1 

-nA~ 1 N~ 1 i3p t N; l A; 1 }M^ (C.si 
From (C.5) it is now easy to see that uncertainties in 

D = £». + £*** = 4 f S 

(C.6) 

independent of AC, AB and AV , will not affect the filter 
design since, using Assumption A1 

E(±SCC.B.) = E(j,S)E(CC.)E(B.) = j, S 0 CC t B 0t 

(C.7) 

(The minimal criterion value will be affected, however, 
which is evident form (C.12), below.) In the sequel we thus 
use T 0 = T and S 0 = S. 

Using (C.7) in (C.5) and extracting T to the left and A" 1 
to the right now gives 

t,£&(*)-(*)• =lr liJ—' V-V 

{ 2“ m SCC.B - THA~ l N~ l pp,N : 1 } 

xA^M,^. (C.8) 

To make (C.8) zero, all poles inside \z\ = 1 are eliminated. 
This is achieved if, in every element of the integrand, all such 
poles are cancelled by zeros. We first cancel what can be 
cancelled by means of H directly. Thus 

n=^V~'Q0~ l NA (C.9) 

where Q{z ~ 1 ) is undetermined. Inserting (C.9) into (C.8) gives 
lvEEe(k)v(k)* as 

tr— /---V, 

2t T j J UU.TDD, 

{z~ m VSCC.B 0 .N, - QP,}n:'A- 1 M^. (C.10) 

Now, LJ, T, D, A, N are all stable, so they have zeros only 
inside \z\ = 1. The poles of M are inside \z\ = 1. This 
means that U~ l , D~ l , A~\ N~ l , and M* have all theii 
poles outside \z\ = 1. No poles will thus exist inside \z\ = 1 
in (C.10), if and only if 

z~ m VSCC+B 0 +N+ - QP. = zLJJTDl p (C.ll) 

for some polynomial matrix L+(z), This is (3.11), if q is sub¬ 

stituted for z. The filter (C.9) coincides with (3.10). Necessity 
follows because choices of 72. other that (C.9) correspond to 
u{k) ^ 0 in (C.3). 

Unique solvability of (3.11) is demonstrated as follows. 
The Diophantine equation will always have one or several 
solutions, since the invariant polynomials of UTD l p are all 
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ble. while those of 0, are all unstable. Thus, there exist no 
nimon invariant factors. Let (Q 0 , Lo») be one solution pair, 
i cry solution to (3.11) can then be expressed as 

(Q,L*) = (Qo - XqUTDl p , L^+Xfi.) 


where the polynomial matrix X (ry, q~ l ) is undetermined. Now, 
Q is required to be causal, so it can not have any positive 
powers of q as arguments, while L+ must contain no negative 
powers of q , to assure optimality. Thus, X(q,q~ [ ) — 0 is the 
only choice. We conclude that the solution to (3.11) is unique. 

The degrees (3.12) are determined by the requirement that 
the maximum powers of q~ l and q are covered on both sides 
of (3.11). They assure that the number of unknowns equal 
the number of equations in the corresponding linear system of 
equations. For details, see [1] or [3J. 

The minimal average estimation error, ./ mjri = tr EE (e(k) 
?(k)*),nin . is obtained as follows. First insert (3.10) into the 
criterion (2.3), use Parseval’s formula, take expectation and 
use (3.1), (3.9) and (3.5) in this order. Then we obtain 


- SCC.B ot A; } H.T,z m - z m TKA l B n CC,S, 

~7- (C.12) 

Now, the use of (C.9), - trV{ - }V* and 

completing the square gives 


•/min = If 


7 7k|=i 


vscc,s*v. 


2k j J lzl=1 UU,TT.DD. 

+ ( z- m VSCC.B 0 .N.K l - Q) 
x (0~ i NB u CC t S t V,z m ~Q*J 

- VSCC,B„,N t 0; l 0 - 1 NB n CC,S,V. j ^. 


Finally, use Diophantine equation (C. 11) in the middle term 
and rearrange the terms to obtain expression (3.13). 
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A New Model for Control of Systems with Friction 

t Canudas de Wit, Associate, IEEE , H. Olsson, Student Member, IEEE, K. J. Astrdm, Fellow. IEEE, and P. Lischinsk) 


Abstract —In this paper we propose a new dynamic model Tor 
friction. The model captures most of the friction behavior that has 
been observed experimentally. This includes the Stribeck effect, 
hysteresis, spring-like characteristics for stiction, and varying 
break-away force. Properties of the model that are relevant to 
control design are investigated by analysis and simulation. New 
control strategies, including a friction observer, are explored, and 
stability results are presented. 


I. Introduction 

F RICTION is an important aspeci of many control sys¬ 
tems both for high quality servo mechanisms and simple 
pneumatic and hydraulic systems. Friction can lead to tracking 
errors, limit cycles, and undesired stick-slip motion. Control 
strategies that attempt to compensate for the effects of friction, 
without resorting to high gain control loops, inherently require 
a suitable friction model to predict and to compensate for 
the friction. These types of schemes arc therefore named 
model-based friction compensation techniques. A good friction 
model is also necessary to analyze stability, predict limit 
cycles, find controller gains, perform simulations, etc. Most 
of the existing model-based friction compensation schemes 
use classical friction models, such as Coulomb and viscous 
friction. In applications with high precision positioning and 
with low velocity tracking, the results are not always satis¬ 
factory. A better description of the friction phenomena for 
low velocities and especially when crossing zero velocity 
is necessary. Friction is a natural phenomenon that is quite 
hard to model, and it is not yet completely understood. The 
classical friction models used are described by static maps 
between velocity and friction force. Typical examples are 
different combinations of Coulomb friction, viscous friction, 
and Stribeck effect f 1 ]. The latter is recognized to produce a 
destabilizing effect at very low velocities. The classical models 
explain neither hysteretic behavior when studying friction 
for nonstationary velocities nor variations in the break-away 
force with the experimental condition nor small displacements 
that occur at the contact interface during stiction. The latter 
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very much resembles that of a connection with a stiff spring 
with damper and is sometimes referred to as the Dahl effect. 
Later studies (see, e.g., HI. |2J) have shown that a friction 
model involving dynamics is necessary to describe the friction 
phenomena accurately. 

A dynamic model describing the spring-like behavior during 
stiction was proposed by Dahl {3]. The Dahl model is essen¬ 
tially Coulomb friction with a lag in the change of friction 
force when the direction of motion is changed. The model has 
many nice features, and it is also well understood theoretically. 
Questions such as existence and uniqueness of solutions and 
hysteresis effects were studied in an interesting paper by 
Bliman [4]. The Dahl model does not, however, include the 
Stribeck effect. An attempt to incorporate this into the Dahl 
model was done in |5] where the authors introduced a second- 
order Dahl model using linear space invariant descriptions. The 
Stribeck effect in this model is only transient, however, after a 
velocity reversal and is not present in the steady-state friction 
characteristics. The Dahl model has been used for adaptive 
friction compensation |6], |7], with improved performance as 
the result. There are also other models for dynamic friction. 
Armstrong-Hdlouvry proposed a seven parameter model in [1]. 
This model does not combine the different friction phenomena 
but is in fact one model for stiction and another for sliding 
friction. Another dynamic model suggested by Rice and Ruina 
[8] has been used in connection with control by Dupont |9]. 
This model is not defined at zero velocity. In this paper we 
will propose a new dynamic friction model that combines the 
stiction behavior, i.e., the Dahl effect, with arbitrary steady- 
state friction characteristics which can include the Stribeck 
effect. We also show that this model is useful for various 
control tasks. 

II. A New Friction Model 

The qualitative mechanisms of friction are fairly well un¬ 
derstood (see, e.g., [1]). Surfaces arc very irregular at the 
microscopic level and two surfaces therefore make contact at 
a number of asperities. We visualize this as two rigid bodies 
that make contact through clastic bristles. When a tangential 
force is applied, the bristles will deflect like springs which 
gives rise to the friction force; see Fig. 1. 

If the force is sufficiently large some of the bristles deflect 
so much that they will slip. The phenomenon is highly 
random due to the irregular forms of the surfaces. Haessig 
and Friedland [ 10] proposed a bristle model where the random 
behavior was captured and a simpler reset-integrator model 
which describes the aggregated behavior of the bristles. The 
model we propose is also based on the average behavior of 
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Fig. I. The friction interface between two surfaces is thought ol as a contact 
between bristles. For simplicity the bristles on the lower part are shown as 
being rigid. 


the bristles. The average deflection of the bristles is denoted 
by z and is modeled by 


dz Id 

di = V ~g(uj 


( 1 ) 


where v is the relative velocity between the two surfaces. 
The first term gives a deflection that is proportional to the 
integral of the relative velocity. The second term asserts that 
the deflection z approaches the value 

= y^.0(v) = ff(u)sgn(»>) (2) 


in steady state, i.e., when v is constant. The function g is 
positive and depends on many factors such as material prop¬ 
erties, lubrication, temperature. It need not be symmetrical. 
Direction dependent behavior can therefore be captured. For 
typical bearing friction, g(u) will decrease monotonically from 
g( 0) when u increases. This corresponds to the Stribeck effect. 
The friction force generated from the bending of the bristles 
is described as 

dz 

h =(7 " z+(Ti di 


where rr 0 is the stiffness and o\ a damping coefficient. A 
term proportional to the relative velocity could be added to 
the friction force to account for viscous friction so that 


F 


dz 

= F V] -r + a 2 v - 
fit 


(3) 


The model given by (1) and (3) is characterized by the 
function g and the parameters a {h o\ and rr 2 . The function 
(To g{i>) -F (t 2 v can be determined by measuring the steady- 
state friction force when the velocity is held constant. A 
parameterization of y that has been proposed to describe the 
Stribeck effect is 


ffofl(i') = Fc + (Fs - Fc)< - (4) 


where Fc is the Coulomb friction level Fs is the level of the 
stiction force, and tt„ is the Stribeck velocity; see [1]. With 
this description the model is characterized by six parameters 
(t 0 , tfi. ^ 2 * F c , F S ', and v s . It follows from (2H4) that for 
steady-state motion the relation between velocity and friction 
force is given by 

F„(v) = (Tog(v)agn(v) + a 2 i’ 

= F r sgn(u) + (F* - F c )(- {v/Vl)i sgn(«) + <r 2 »- 


Note, however, that when velocity is not constant* the dy¬ 
namics of the model will be very important and give rise 
to different types of phenomena. This will be discussed in 
Section IV. 

Relation to the Dahl Model 

The model reduces to the Dahl model if g(i>) = Fc7cr 0 , 
and (Ti = (T Z = 0. Equations (1) and (3) then give 

dF dz ( F \ 

_ = (5) 

Dahl actually suggested the more general model 



see [11]. Most references to Dahl’s work, however, do use 
the simpler model (5). Dahl’s model accounts for Coulomb 
friction but it does not describe the Stribeck effect. 

An Extension of the Dahl Model 

An attempt to extend Dahl's model to include the Stribeck 
effect was made by Bliman and Sonne [5]. They replaced the 
time variable / by a space variable s through the transformation 

S= [ |/)(r)|f/r. 

./I) 

Equation (5) then becomes 

dF F 

— = -(To — + (7 0 sgn (o) (6) 

(Lh F\ 

which is a linear first-order system if sgn(u) is regarded as an 
input. Bliman and Sorine then replaced (6) by the second-order 
model 

d 2 F .. dF 2ll 
— 7t + 2(uj— F uj h =- Ltr/vsgnOO 
ds z di s 

to imitate the Stribeck effect with an overshoot m the response 
to sign changes in the velocity. This model, however, will only 
give a spatially transient Stribeck effect after a change of the 
direction of motion. The Stribeck effect is not present in the 
steady-state relation between velocity and friction force. 


III. Model Properties 


The properties of the model given by (1) and (3) will now 
be explored. To capture the intuitive properties of the bristle 
model in Fig. 1, the deflection z should be finite. This is indeed 
the case because we have the following property. 

Property 1: Assume that U < g(t>) < a. If |z(0)| < a then 
|~(0I < a V / > 0. 

Proof: Let V = z 2 j 2, then the time derivative of V 
evaluated along the solution of (1) is 
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TABLE 1 

Parameter Values, Used in All Simulations 


Parameter 

Value 

Unit 

(Tq 

10* 

|N/m] 


s/W' 

INs/m] 

a 2 

0.4 

| Ns/ml 

Fr 

1 

IN] 

f s ■ 

1.5 

(NJ 

l\S 

0.001 

[m/s] 


The derivative ^ is negative when \z\ > g(v). Since g(v) 
is strictly positive and bounded by a, we see that the set 
H = {z : \z\ < a) is an invariant set for the solutions of 
(1), i.e., all the solutions of z(t) starting in fl remain there. 


Dissipativity 

Intuitively we may expect that friction will dissipate energy. 
Since our model given by (I) and (3) is dynamic, there may be 
phases where friction stores energy and others where it gives 
energy back. It can be proven that the map <p z is 

dissipative for our model. For more details on the concepts 
and definitions concerning dissipative systems, see 112]. 

Property 2: The map < p \ v »—► z, as defined by (1), is 
dissipative with respect to the function V(t) = ^ z 2 (t ), i.e., 

I z(t)v(t) dr > V(t) - V(0). 

Jo 


Proof: It follows from (1) that 

dz Id 
? -1—i—L ■ 

' dt g(v)' 

dz 

lir 


> z- 


Hence 


[ z(t)v(t) dr > / z(t) ^; T - d.T > V(t) — V 7 (0). 

Jo “ Jo 


Linearization in Stiction Regime 

To get some insight into the behavior of the model in the 
stiction regime we will consider a mass m in contact with a 
fixed horizontal surface. Let x be the coordinate of the mass, 
i.e., v = dx/dt. The equation of motion becomes 


(fix dz dx 

"‘iP=- h 


(7) 


where 2 is given by (1). Linearizing (1) around 2 = 0 and 
’> = 0 we get 


dz __ dx 
dt dt 


( 8 ) 


Inserting (8) into (7) gives 

d 2 x . , dx 

m ~dfi + ^ fT °' X = °‘ 

This shows that the system behaves like a damped second- 
order system. Notice that the bristle stiffness, cro, is usually 



Fig. 2. Presliding displacement as described by the model. The simulation 
was started with zero initial conditions. 


very large, and therefore it is essential to have <tj ^ 0 to have 
a sufficiently damped motion. The viscous friction coefficient, 
<r 2 , is normally not sufficiently large to provide good damping. 

IV. Dynamical Model Behavior 

As a preliminary assessment of the model we will inves¬ 
tigate its behavior in some typical cases. They correspond 
to standard experiments that have been performed. In all the 
simulations the function g has been parameterized according 
to (4) and the parameter values in Table l have been used. 
The parameter values have to some extent been based on 
experimental results [1]. The stiffness rr 0 was chosen to give 
a presliding displacement of the same magnitude as reported 
in various experiments. The value of the damping coefficient 
o\ was chosen to give a damping of ( = 0.5 for the linearized 
equation (9) with a unit mass. The Coulomb friction level Fc 
corresponds to a friction coefficient p, % 0.1 for a unit mass, 
and Fs gives a 50% higher friction for very low velocities. 
The viscous friction rr 2 and the Stribeck velocity v a are also 
of the same order of magnitude as given in [1]. 

The different behaviors shown in the following subsections 
cannot be attributed to single parameters but rather to the 
behavior of the nonlinear differential equation (I) and the 
shape of the function g . The presliding displacement and 
the varying break-away force are due to the dynamics. This 
behavior is also present in the Dahl model. The Stribeck shape 
of g together with the dynamics give rise to the type of 
hysteresis observed in the subsection on frictional lag. 

A . Presliding Displacement 

Courtncy-Pratt and Eisner have shown that friction behaves 
like a spring if the applied force is less than the break-away 
force. If a force is applied to two surfaces in contact there will 
be a displacement. A simulation was performed to investigate 
if our model captures this phenomenon. An external force was 
applied to a unit mass subjected to friction. The applied force 
was slowly ramped up to 1.425 N which is 95% of F $. The 
force was then kept constant for a while and later ramped down 
to the value -1.425 N, where it was kept constant and then 
ramped up to 1.425 N again. The results of the simulation 
are shown in Fig. 2 where the friction force is shown as a 
function of displacement. The behavior shown in Fig. 2 agrees 
qualitatively with the experimental results in [13]. 
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Fig. 3. Hysteresis in friction force with varying velocity. The velocity 
variation with the highest frequency shows the widest hysteresis loop 



Fig. 4. Relation between break-away force and rate of increase of the applied 
force. 

B. Frictional Lag 

Hess and Soom 114] studied the dynamic behavior of fric¬ 
tion when velocity is varied during unidirectional motion. They 
showed that there is hysteresis in the relation between friction 
and velocity. The friction force is lower for decreasing veloci¬ 
ties than for increasing velocities. The hysteresis loop becomes 
wider at higher rates of the velocity changes. Hess and Soom 
explained their experimental results by a pure time delay in 
the relation between velocity and friction force. Fig. 3 shows 
a simulation of the Hess-Soom experiment using our friction 
model. The input to the friction model was the velocity which 
was changed sinusoidally around an equilibrium. The resulting 
friction force is given as a function of velocity in Fig. 3. Our 
model clearly exhibits hysteresis. The width of the hysteresis 
loop also increases with frequency. Our model thus captures 
the hysterctic behavior of real friction described in [14]. 

C. Varying Breakaway Force 

The break-away force can be investigated through exper¬ 
iments with stick-slip motion. In [15] it is pointed out that 
in such experiments the dwell-time when sticking and the 
rate of increase of the applied force are always related and 
hence the effects of these factors cannot be separated. The 
experiment was therefore redesigned so that the time in stiction 
and the rate of increase of the applied force could be varied 
independently. The results showed that the break-away force 
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Fig. 5. Experimental setup for stick-slip motion. 
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Fig 6. Simulation of stick-slip motion 


did depend on the rate of increase of the force but not on the 
dwell-time; see also [16]. Simulations were performed using 
our model to determine the break-away force for different rates 
of force application. Since the model is dynamic, a varying 
break-away force can be expected. A force applied to a unit 
mass was ramped up at different rates, and the friction force 
when the mass started to slide was determined. Note that since 
the model behavior in stiction is essentially that of a spring, 
there will be microscopic motion, i.e., velocity different from 
zero, as soon as a force is applied. The break-away force was 
therefore determined at the time where a sharp increase in the 
velocity could be observed. Fig. 4 shows the force at break¬ 
away as a function of the rate of increase of the applied force. 
The results agree qualitatively with the experimental results 
in [15] and [161. 

D. Stick-Slip Motion 

Stick-slip motion is a typical behavior for systems with 
friction. It is caused by the fact that friction is larger at rest than 
during motion. A typical experiment that may give stick-slip 
motion is shown in Fig. 5. A unit mass is attached to a spring 
with stiffness k = 2 N/m. The end of the spring is pulled 
with constant velocity, i.e., dy/dt = 0.1 m/s. Fig. 6 shows 
results of a simulation of the system based on the friction 
model in Section II. The mass is originally at rest and the 
force from the spring increases linearly. The friction force 
counteracts the spring force, and there is a small displacement. 
When the applied force reaches the break-away force, in this 
case approximately 0oy{Q), the mass starts to slide and tht 
friction decreases rapidly due to the Stribeck effect. The spring 
contracts, and the spring force decreases. The mass slows 
down and the friction force increases because of the Stribeck 
effect and the motion stops. The phenomenon then repeats 
itself. In Fig. 6 we show the positions of the mass and the 
spring, the friction force and the velocity. Notice the highly 
irregular behavior of the friction force around the region where 
the mass stops. 
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j K 7 Block diagram tor the servo problem with PID controller 



Fig 8 Simulation ot the PID position contiol pioblem in Fig 7 


V Fftdback Control 

To further illustrate the properties of our friction model we 
will investigate its application to some typical servo problems 
First wc will use it to show that it predicts limit cycle 
oscillations in seivos with PID control We will then use it 
to design observer based friction compensators 

A Limit Cycles Caused hv Friction 

It has been observed experimentally that Inction may give 
rise to limit cycles in servo drives where the contioiler has 
integral action, for references see |2| This phenomenon is 
often reierred to as hunting 

Consider the linear motion of a mass m at position j . The 
equation of motion is 

- 11 ~ F CO) 

where dr/dt — v is the velocity, F the friction force given by 
(}) and u the control lorcc which is given by the PID controller 

a = -K t ii - h p (i - i- h, j(i - i lt ) ( 11 ) 

A block diagram of the system is shown in Fig 7 In Fig. 8 
wc show the results of a simulation of the system The friction 
parameters are given by Table I, m = 1 and the controller 
parameters are h\ = 0 , K p — 3, and K, — \ The reference 
position is chosen as j ( \ = 1 . The model clearly predicts limit 
cycles as have been observed experimentally in systems of 
this type. 

B Fiunon Compensation 

Only linear feedback from the position was used in the PID 
control law (II) Knowledge about friction was not used. It 
is of course more appealing to make a model-based control 
that uses the model to predict the friction to compensate for 



Fig 9 Block diagram tor the position control problem using 1 tritium 
observer 


lt Since the model is dynamic and has an unmeasurable state, 
some kind of observer is necessary This will be discussed 
next 

Position Contiol with a Friction Ohserxei Consider the 
problem of position tracking for the process (10) Assume that 
the parameters <r 0 , rr t , and ct 2 , and the function q in the friction 
model are known. The stale 2 is, however, not measurable and 
hence has to be observed to estimate the friction force. For this 
we use a nonlinear friction observer given by 


dz M . 

— = V - 7-7 Z — A < 

dt q(v) 


A > 0 


7- 


dz 

— rr 0 z 4 - a 1 —- + a>n 
dt 


( 12 ) 

(13) 


and the following control law 

12 

(14) 

where c = / - 14 / is the position error and j f i is the desired 
reference which is assumed to be twice differentiable The term 
kc in the observer is a correction term Irom the position error 
The closed-loop system is represented by the block diagram 
in Fig 9 With the observer based friction compensation, wc 
achieve position Backing as shown in the following theorem 
Theorem l Consider system (10) together with the friction 
model (l) and 0 ), friction observer ( 12 ) and ( 11 ), and control 
law (14) If 11 (s) is chosen such that 


<7(s) 


+ (Tq 

rn s 2 + H (s) 


is strictly positive real (SPR) then the obsciver error, F — F, 
and the position error, c , will asymptotically go to zero 
Proof The control law yields the following equations 


( 


dz 

dt 


m s 2 + II(s) 

*(»r + 


<-/■) 


<Ti .S 4 rr ( ) 
ms* 4 //(s) 




where F = F — F and z — z - z Now introduce 

v = en + -r 

A 


as a Lyapunov function and 


dt 

dt 


= At + B(-z) 

= ct 
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Fig. 10. The block diagram in Fig, 9 redrawn with v and z as outputs of a 
linear and a nonlinear block, respectively. 


which is a state-space representation of G(s). Since G(s) is 
SPR [17] it follows from the Kalman-Yakubovitch Lemma 
[17] that there exist matrices P = P T > 0 and Q = Q T > 0 
such that 

A t P + PA = -Q 
PB = C T . 


Now 




= S T QS - T~7~\ z 
< -fQt- 


2 _ H _-2 

fc g(v) 


The radial unboundedness of V together with the semi- 
definiteness of dV/dt implies that the states are bounded. Wc 
can now apply LaSalle’s theorem to see that £ -+ 0 and z —► 0 
which means that both c and F tends to zero and the theorem 
is proven. 

The theorem can also be understood from the following 
observations. By introducing the observer we get a dissipative 
map from f to z and by adding the friction estimate to 
the control signal, the position error will be the output of 
a linear system operating on z. This means that we have 
an interconnection of a dissipative system and a linear SPR 
system as seen in Fig. 10. Such a system is known to be 
asymptotically stable. 

Velocity Control: The same type of observer-based control 
can be used for velocity control. For this control problem the 
controller is changed to 

u — -H(s)c + F + 

^ = v ~lW) i ~ k{v ~ ,,d) ' k>0 (15) 

r , di 

F = (Ti)Z -1- — + (J2V 

at 

where v - v<i is the velocity error and v<i the desired velocity 
which is assumed to be differentiable. Velocity tracking is 
achieved as shown in the following theorem. 

Theorem 2: Consider system (10) together with friction 
model (1) and (3) and observer based control law (15). If 
H(s) is chosen such that 


G{3) 


(TjS + (*Q 

ms + H(s) 


is strictly positive real, then the observer error, F - F, and 
the velocity error will asymptotically go to zero. 

Proof: The theorem is proven in the same way as Theo¬ 
rem 1 after observing that the control law yields the following 
error equations 


V - Vd =s 


1 


ms + H («) 


(-F) = 


<T\S + <Tq 

ms + H(s) 


(-*) = ~G{s)z 




This is again an interconnection of a dissipative system with z 
as its output and a linear SPR system with v~vd as its output. 

To assume that the friction model and its parameters are 
known exactly is of course a strong assumption. Investigation 
of the sensitivity of the results to these assumptions is an 
interesting problem that is outside the scope of this paper. The 
accuracy required in the velocity measurement is a similar 
problem. 


VI. Conclusions 

A new dynamic model for friction has been presented. The 
model is simple yet captures most friction phenomena that 
are of interest for feedback control. The low velocity friction 
characteristics are particularly important for high performance 
pointing and tracking. The model can describe arbitrary steady- 
state friction characteristics. It supports hysteretic behavior due 
to frictional lag, spring-like behavior in stiction and gives a 
varying break-away force depending on the rate of change ot 
the applied force. All these phenomena are unified into a first- 
order nonlinear differential equation. The model can readily 
be used in simulations of systems with friction. 

Some relevant properties of the model have been investi¬ 
gated. The model was used to simulate position control ot a 
servo with a PID controller. The simulations predict hunting 
as has been observed in applications of position control with 
integral action. The model has also been used to construct 
a friction observer and to perform friction compensation foi 
position and velocity tracking. When the parameters are known 
the observer error and the control error will asymptotically go 
to zero. Sensitivity studies, parameter estimation and adapta¬ 
tion are natural extensions of this work. 
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Adaptive Nonlinear Design with 
Controller-Identifier Separation and Swapping 

Miroslav Krstid, Student Member , IEEE , and Petar V. Kokotovid, Fellow, IEEE 


Abstract —We present a new adaptive nonlinear control design 
which achieves a complete controller-identifier separation. This 
modularity is made possible by a strong input-to-state stability 
property of the new controller with respect to the parameter 
estimation error and its derivative as inputs. These inputs are 
independently guaranteed to be bounded by the identifier. The 
new design is more flexible than the Lyapunov-based design 
because the identifier can employ any standard update law 
gradient and least-squares, normalized and unnormalized. A key 
ingredient in the identifier design and convergence analysis is a 
nonlinear extension of the well-known linear swapping lemma. 

I, Introduction 

T HE estimation-based approach to adaptive control has 
been extremely successful in linear systems. In con¬ 
trast to the Lyapunov-based approach, which restricts the 
choice of parameter update laws and controller structures, 
the estimation-based designs are versatile. For linear systems, 
any common update law and any stabilizing controller can 
be employed as long as the boundedness properties of the 
identifier are sufficient to allow a “certainty-equivalence” 
design of the controller. This versatility is of conceptual 
and practical importance. It is due to a modularity feature: 
the identifier module achieves its boundedness properties 
independently of the controller module. 

Thanks to its versatility, the estimation-based approach 
unifies many diverse adaptive schemes. For linear systems, this 
unification, initiated by Egardt [5], was extended by Goodwin 
and Maync [6], 

Attempts to apply estimation-based designs to nonlinear 
systems have had only limited success. The nonlinearities 
were either matched [24], [21, [3] or severely restricted [27], 
135], [9], [10], [39], Otherwise the results were local, i.e., 
valid in regions which were not a priori verifiable. A cause 
for this difficulty is a fundamental difference between the 
instability phenomena in linear and nonlinear systems. The 
states of an unstable linear system remain bounded over any 
finite interval, so that there is enough time for the identifier 
to “catch up.” The situation is fundamentally different in a 
system with nonlinearities whose growth is faster than linear 
(t 2 , x\i' 2 . ( ,T , etc.). Even a small parameter estimation error 
may drive the state of such a nonlinear system to infinity in 
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finite time. This explains why estimation-based designs have 
been mostly for systems with linearly bounded nonlinearities. 
Topically, linear growth constraints had to be imposed not only 
on the plant nonlinearities, but also on those derived during 
the design. 

The only nonlinear estimation-based results which go be¬ 
yond the linear growth constraints were obtained by Praly 
et ai [29]-[331. In [32] a unified framework of control 
Lyapunov functions was used to characterize relationships 
between nonlinear growth constraints and controller stabilizing 
properties. In the absence of matching conditions, all the 
nonlinear estimation-based schemes presented in [32] involved 
some growth restrictions. 

In contrast to the difficulties experienced by the estimation- 
based designs, the new recursive Lyapunov-based designs for 
sysiems in the parametric-strict-feedback form [12]. [8], [19], 
[36] and the output-feedback form [21], [22], [14] were suc¬ 
cessful in achieving global boundedness and tracking without 
any restrictions on nonlinearities. However, these designs do 
not allow any flexibility in the choice of the parameter update 
law, excluding, for example, the least-squares update laws. 

In spite of the previous difficulties with nonlinear 
estimation-based approaches, their flexibility and modularity 
motivate us to pursue their development. Since the indepen¬ 
dence of the identifier is not sufficient for modularity, we 
place the burden of the task of boundedness on the controller. 
For parametric-strict-fcedback systems we seek (and find!) 
nonlinear controllers which guarantee boundedness in the 
presence of bounded parameter uncertainty. More precisely, 
we consider the parameter estimation error and its derivative 
as two independent disturbance inputs and design controllers 
which achieve input-to-state stability [37] (ISS) with respect 
to those inputs. In addition to such ISS-controllers, we also 
design weaker SG-controllers which only provide a small gain 
property and are presented for comparison with linear designs. 

These new controllers creaie a possibility for a complete 
identifier-controller modularity. The remaining task is to de¬ 
sign identifiers with guaranteed boundedness properties. A key 
ingredient in the identifier design and convergence analysis in 
this paper is our nonlinear extension of the well known linear 
swapping lemma [25]. Various forms of swapping were also 
used in most of the early nonlinear estimation-based results 
[27], [24], [291, [30], [35], [2], [9], [10], [39]. The identifiers 
in this paper are based on two different parametric models: the 
plant model and the error system. They allow a wide variety 
of update laws—gradient and least-squares, normalized and 
unnormalized. 
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The paper is organized as follows After the problem 
itcmcni in Section II, in Section III we design the 1SS con¬ 
fers and prove that the input-to-state stability is achieved 
ction IV presents Ihe nonlinear swapping lemma Parameter 
kntihers with gradient and least-squares update laws arc de- 
<, loped in Section V, and the stability prools foi the resulting 
1 iptive systems are given in Section VI In Section VII we 
nalyze performance of the new adaptive systems To reveal 
Ik connection with linear estimation-based designs we present 
i) Section VIII the design of a weaker SG-controller The new 
controller designs and performance are illustrated by examples 
in Section IX 


II Problpm Staipmpnt 

The problem is to adaptively contiol nonlinear systems 
uansformable into the parametric stnct-feedback form 

i, = 1 1+] + 6 1 <p,{n i,) !</<»-] 

= M)())" + 0 T ip„(i) 

Q ~ J i (2 1) 

wheie 0 (_ R 7 is the vector of unknown constant parameters 
/(, and the components of </> = [y>\ v„] are smooth 

nonlinear lunctions in R r * and Mj(j) / 0 Vi £ R' 
Necessary and sufficient conditions for a nonlinear system to 
be transformable into the form (2 1 ) are given in [ 12 ) It should 
be noted that (2 I) is feedback lineari/able tor any bounded 
0 £ R p 

The control objective is to force the output y of the syslem 
(2 1 ) lo asymptotically tiack the output //, of a known linear 
reference model while keeping all the closed-loop signals 
bounded Lhe reference model has the form 



■ 0 



■ 0 ■ 

1 m — 

0 

1 

1 m 4 

0 


-ID 0 



J* m _ 


flexibility in the update law selection we now A kvU \ 
estimation-based design which treats the controllu and 
identifier as separate modules 
Notation For vectors we use 1 1 \p = ( i T P \ )* u> dui 
the weighted Euchdean norm of / Foi matrices i \ ] r A 
(lr{A J V }) 1 / 2 — (tr{ \ \ T }) li/2 denote the Fiobcnius and 
|A"1 2 the induced 2-norm of A The £ x C> and £ norms 
for signals are denoted by || || x , || \\ 2 and || 1 ^ respeciively 
By referring to a matrix A(t) as exponential!) stable we 
mean that the corresponding LTV system i - !(/)? is 

exponentially stable The spaces of all signals which are 
globally bounded, locally bounded and square mtegiable on 
[0 If), If s 0 are denoted by £ x [0 If) £ v ,[() / A ) and 
£ 2 [0 f/) respectively By saying that a signal belongs to 
£ X [U tf) or lo £?[(), If) we mean that the corresponding 
bound is independent of If 

III ISS CONTROLl l R DFSKiN 

Our modular estimation-based adaptive design loi (2 t) 
places Lhc burden of achieving boundedness on ihe controller 
module We require that the controller guaiantee input to state 
stability (ISS) with respect to the parameter error 0 - 0 - 0 
and Us derivative 0 — 0 as disturbance inputs 

Using the backstepping proceduie, which is well known 
fiom [ 1 21 119] and |18] the adaptive nonlincai control lei 
is recursively designed as follows 

~i — ] i ~ l m i — O,-] 


(\{( 1 1 0 I ni i) — i ] 1 1 •'i 0 

< 1 

A 

— S, ( / 0 I in i ] 


1=1 X 


On, j 

* A + l + “- 1 v Ml 

Ol,n A 


(j, — bn I (2 2) 

where A/(s) = s" ■+ w ? ,_ A s M_1 + -f my s + is Hurwitz, 
f m > 0 and i(t) is bounded and piecewise continuous 
\noihcr way of stating the same objective is to asymptotically 
track a given reference signal y, (t) with us first 7 / derivatives 
known, bounded and piecewise continuous 

The above pioblem was first posed and solved in [12] 
using 7 ip estimates for p unknown parameters This number 
of estimates was subsequently reduced in halt in [ 8 ] The 
)ver-parametrization was completely removed in [19] by the 
ise of “tuning functions M In |40] the adaptive scheme of 
112 ] was extended and recast in the observer based setting 
for the case when the nonlinearilics in (2 1 ) are polynomial, 
a solution employing growth conditions was given in [13] 
Possibilities to enlace the class of systems that can be 
idaptively stabilized using the approach of [12J were explored 
in [ 1 ] and [361 

The Lyapunov-based results [12], [ 8 ], [ 19], and [36| employ 
only one type of parameter update laws To increase the 


M'l 0 I = W 

rn < )r >. 


" ,, , V L - 'fl V ■ " ■ VI / "'ll • III I 

IM>) 

ID n — \ t m n hjn l] ( ^ 1 ) 

where 7, - [f A 7 m > - [i,„ i im *] r 

(),/ = ], n and, lor notational convenience * s o = 0 

\ 

= 0 In these expressions the nonlinear damping functions 
s,(7, 0 7 m ,_i) are yet to be designed We will employ 
these functions to achieve the desired ISS properly of the 
system obtained by the recursive design procedure (3 1 ) Fhis 
nonlinear system, called the error system, is readily shown to 
be 

z = A (z,0 f)z + W{z , 0 l) 7 0 + D(z 0, t) 7 0 z £ R" 

(3 2) 



«s 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL VOL 40 NO 3 MARCH 199 


where 2 i = -M -3 m j = y-y, represents the tracking eiror, 
and A t <W,D are matrix-valued functions of z 6 and t 


A t (z, 6, t) — 


-61 - Si 1 

-l -a-tti 
0 -1 

o 


0 

l 


0 1 c ,i s„ 


W(z 6, t) 7 = 


w{ ■« 

Ul\ 


vd j 


€ R nX/ \ 


D(z, 0 t) 1 = 


The explicit dependence of w t and da^i/09 (and hence s,) 
on t is due to the reference model, for example, p\{i\) = 
+ X w 1 (0) 

Except for the term D(z 0 l) 1 9 ihe error system 
(3 2H3 3) is similar to the error system in [ 19] where the term 
D(z , 9 , t) 7 9 was accounted for by using tuning functions 

Here we let both 0 and 6 appear as disturbance inputs Their 
boundedness will later be guaranteed by parameter identifiers 
To design the nonlinear damping functions s f we will em¬ 
ploy the following lemma which evolved from [15] and [37] 
Lemma 3 1 (Nonlinear Damping) Assume that for the sys¬ 
tem 


0 

d0 

Ock 


£ R 71 


on 


■'•-/(j 0 + $(j Ob' + p(j 

a feedback control u = r, 1) guarantees 


i G R n u G R 
04) 


^j[/(rO) + (/(i t)n(j + 


ViGR’\V/>0 (3 5) 


where V , U R n x R+ —♦ R+ are positive definite and radially 
unbounded and V is decrescent and continuously differentiable 
in i uniformly in / / R 71 x R+ —> R n , q R ri x R+ -* 
R'\ p R n x R + ^ // R TI x R+ —► R are continuously 

differentiable in j and piecewise continuous and bounded m /, 
and d R+ —* R f/ is piecewise continuous Then the feedback 
control 

u = ii(i t)-\\p(c t)\*^(x, t)q(j, t) (3 6) 

where A > 0, guarantees that 
l) If d G Coo then x € £no 

n) I< d € Ci and f/(j t) > t |j |* Vj € R", V/ > 0, r > 
0 then i e Ci If, in addition, d € £<* then t g £<*, 
and x(t) —+ 0 as t -+ oo 


Proof i ) Due to (3 5), the derivative of V alonj 
(3 4M3 6) is 

„ dVf, ( \ t 9V T ,\] dV 

V = —^ + w + P-^l+P ■')J+- 5r 


<-u-x 

I 

< -u + 


0V 1 . 

p -J7"-n d 

Ml 2 




4A 

and, hence i G C x 

u) Integrating (3 7) over [0 oo), we obtain 

i<^MIj + v(o) 


c 11/I 


12 ^ 


(3 7) 


(3 8) 


which implies that j E £ 2 If, in addition, d e then by 
part i) of this lemma, r e C ^ and therefore, u 6 C ^ Hence 
j e £oo By Barbalat’s lemma, r(f) —* 0 as t —> -x, □ 

To apply this lemma to the error system (3 2)—(3 3) we 
first note that the coefficients multiplying 6 and 9 are w, and 
/69 lespectively They play the role of the function 
p in the lemma, while the part of (6\ /6t)tj in (3 6) is played 
by z, Therefore, our choice of nonlineai damping 1 unctions is 

(3 9) 


S, = A.,|w,| J + (/, 


dot t 


69 


where K t q 1 i = 1 , n arc positive scalar constants 1 The 

usefulness of the first term foi achieving boundedness was 
stressed by Kanellakopoulos [16] 

With this choice of s, we now prove input lo-stale stability 
of the error system (3 2), (3 3), (3 9) making use of the 
following constants c 0 = mini<K ?l f ( 1 /t H) = 5Z” =1 (1/*,) 
and 1 /g 0 = E" =1 (l/g,) 

Lemma 3 2 (JSS) In the error system (3 2), (3 3), (3 9) if 
9 9 e £oo[0 tf) then z , i e £^[0 t/) and 

koi < 7^(^11^111+-j-iic y /2 +i~(o)k 

^V^uV^o Qo J 

(3 10) 

Pi oof Differentiating \\z\ 2 along the solutions of (3 2) 
we compute 




1 = 1 


< -foi-i -2 - Y '/' 1 

n 


iv,z, - —9 

Zt\ f 


1 


*±L I ,. + ±, 

06 2q, 


(±^y+(±±y «"» 


1 The constant coeffiuents ry, arc not components of the vector field ry( t) 
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.id arrive at 

+ ,3,2) 

f rom Lemma A.l(/), it follows that 

!^(t)l 2 < KO)l 2 fi" 2rot 

+ \ fc-^-^lUeir)] 2 + -I^(r)| 2 ) dr 

* J o V h> o y o J 

<*U) \ * U (Jo ) 

(3.13) 


where 0 : R+ -4 W> j s differentiable, A; W x R 
R nxn ,^:R M x R+ -* R nXm , W : R n x R + - Rr - 
D:R" x R+ R pXfl , J:R" x R+ —> R rxm are U X ;,U> 
Lipschitz in z and continuous and bounded in /, uiid 
h : R" x R+ R rxn is bounded in .3 and /. Along with 
(4.1) consider the linear time-varying systems 


X T - A(z, t)x T + y{z. t)W(z , /.) T 
:V 2 = h(z, t)x T 4- /(«, /)H r (:;, /,) r 


(4.2) 


Wi = -A(«-W- 


which proves a £ £,*; and (3.10), and by (3.1), j: 6 £rx>. □ 

The quadratic form of the nonlinear damping functions is 
only one out of many possible forms. Any power greater than 
one would yield an ISS property, but the proof with quadratic 
nonlinear damping is by far the simplest. 

A consequence of Lemma 3.2 is that, even when the 
adaptation is switched off, that is, when the parameter estimate 
0 is constant (0 = 0 ) and the only disturbance input is 0 , the 
state 2 of the error system (3.2), (3.3), (3.9) remain bounded 
and converges exponentially to a positively invariant compact 

set. (Note that since 0 = 0 , the terms —gi\(dnii-i/dd) T \ 2 Zi 
are not needed.) Moreover, when the adaptation is switched 
off, this boundedness result holds even when the unknown 
parameter is time varying. 

Corollary 3. i (Boundedness Without Adaptation): If 0 : R + 
—+ R p is piecewise continuous and bounded, and 0 is consiant, 
then x € £ 00 , and 

|a:(0l < 7 T~ 7 == sup |^( T ) -9 1 + k(0)|e--'-"'. (3.14) 

Zy/<\)K,i) r >() 

Proof: Since 0(0 = 0, (3.12) holds with 0(0 = 0(0 “ 9. 

□ 

Thus, the controller module alone guarantees boundedness, 
and the task of the adaptation is to achieve tracking. 


Assume that z(t) is continuous on [0, oc) and there exists a 
continuously differentiable function V : R n x R + R f such 
that 

«i|C| 2 < V(C.<)<rt 2 |C| a (4.4) 

and for each 2 £ C° 

<)V A , _ OV , ,, 

— A(z. /;< + < -03 C 2 (4.5) 

a (, at 

V/ > 0 , VC e R w , 01, 02 , 03 > 0. Then for Vs( 0 ), 0(0) e 
R r \ V\( 0 ) e R pX '\ V/ > 0 the outputs of systems (4.IM4.3) 
are related by 

V\ = V'rf + J /3 + y. (4.6) 

where y f is bounded and exponentially decaying. 

Proof: Due to the continuity of ,?(/), we see that 
y{z(t), t), W(z(t), 0 and D(z(t ), t) arc continuous in i. 
Since qW £ £ XP and £2 is a linear time-varying system, 
then \ e £oo f . Therefore (\ + D) 1 0 6 ., which implies 

0 C. £oor because £3 is a linear time-varying system. 

Differentiating c = z + 0 - 0, we obtain 

e' = i + 0 “ X 7 0 - X 1 & = '4(3, /)( (4.7) 


IV. Nonlinear Swapping 


which together with (4.4)-(4.5) yields 


The desired boundedness property having been achieved by 
the controller module, we can now proceed to the identifier 
module design. To make this design as close to linear designs 
as possible, we derive a nonlinear counterpart of the ubiquitous 
Swapping Lemma [25]. This lemma is an analytical device 
which uses regressor filtering to account for the time-varying 
nature of the parameter estimates. It was used in the early 
nonlinear estimation-based results [27], [24], [29], [30], (35], 
12], [9], [101, [39], For a class of nonlinear systems, including 
our error system ( 3 . 2 ), we provide the following two nonlinear 
swapping lemmas. 

Lemma 4.1 (Nonlinear Swapping): Consider the nonlinear 
time-varying system 

E . z = A(z , t)z + g(z, t.)W(z, t) T e - D(z , t,) T 0 
*’ ft = h(z, t)z + l(z, t)W(z, t) T 9 


V(c, t) = ~A{z, t)t + Z< < - — V. (4.8) 

ck at (*2 

Therefore V(t) < V r (0)e’"^ a:| / a,a ^, and, hence 


M < 



-( n ,)/2 n 2 )* 


(4.9) 


Now, (4.1H4.3) imply that y f = yi - ?/20 ~ Vn - h( z i 
Since h(z , t) is bounded then y ( is bounded and decays to 
zero exponentially. 

Remark 4.1: When D(z , /) = 0, the result of Lemma 4.1 
is reminiscent of Morse’s linear Swapping Lemma [25]. To 
see this we rewrite (4.6) as 



T\W 


r ' 
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In this notation T z W T 9 *-+ Vi is the nonlinear operator 
defined by (4.1) with D(z , 1) s 0, while the system 

£ = i4(*(f), /K + ff(z( 0 ) 0 w 

v = fc(*(0,0€ + *W0.0u (4 11) 

is used to define the linear time-varying operators T u »-► 
T* u y-+ y for h = 1 and Z = 0 , T^ a ^ y tor 
q = / and Z = 0 When >4 9 , h and Z are constant, then the 
operator ^( 9 ) = T(s ) = h(sl - A)~ l q +1 is a proper stable 
rational transfer function, T fl (s) = (si - A)” 1 # 7h(s) = 
—h(sl - A)' 1 , and Lemma 4 1 reduces to Lemma 3 6 5 from 
[34] □ 

In some texts on adaptive linear control, an extended result 
which guarantees that 9 e £ 2 T Z [W 7 9] - T[W J )9 e C 2 
is also referred to as Swapping Lemma Our next lemma is a 
nonlinear time-varying generalization of this result 
Lemma 4 2 Considei systems (4 1M4 3) with the same set 
of assumptions as in Lemma 4 1 Further, assume that 2 E £ n0 

If 9 e Ci , then 

Vi -Va®er 2 (4 12) 

If 0 E £2 H £no then 

hm[vi(/)-?/ 2 (f)0(/)]=O (4 13) 

r —+00 


Changing the sequence of integration, (4 16) becomes 

fe a '\0(H)\ 2 -t- a ’d« (4 17) 

/ o a 

because l*c~ aT dT = 1 /n(e- n '-t~ nt ) < (1 /n)i~ ua Now 
the cancellation c afl e _rv * = 1 in (4 17) yields 

110Hz < -4=10(0)1 + L|| x + Ulloc || 0|| 2 < » (4 18) 

V2a « 

which proves 0 € £2 Due to the uniform boundedness of // 11 
follows that ys € £2 This proves (4 12) When 9 E £ 2 
then 0 E £2 H £00 and 0 € £^ Thus, by Barbalat’s lemma 
0(f) —+ 0 and hence 93 (f) —► 0 as / —► <x This proves (4 13) 
because \j c (t) —► () as Z —► ou 
Remark 4 2 When D(* /) == 0 we rewrite (4 12 ) as 

1 [W T 0}~ llW 1 ^ E C 2 (4 19) 

and (4 13) as 


Pi oaf Since z E C x then gW T D E Due to the 
exponential stability of A(z, t), it follows that \ € £ou By 
Lemma 4 1, z/e € C 2 We need to prove that y 3 E C 2 The 
solution of (4 3) is 

0(f) = *.(*. 0)0(0) 

+ / (f. t)[x(t) + D{z(t), t)] t 9[t) dr (4 14) 

IQ 

where (4 4)-(4 5) guarantee that the state transition matrix 
R+ x R+ —► R" Xn is such that \$ z (t r)| 2 < 

kc -°0 p >, A, a > 0 Since \ and D are bounded then 

10(01 < Ae-“‘|0(O)| + k\\x + D\\ x ft -«('-) |( 9 ( T )| rf r 

Jo 

< kt -“'10(0)1 + A \\x + C||oo ~ n(, ' T) dT 3 




|0(t )| 2 dr 


< u- a, \m\+mx+D\\ c 




|(9(r)| 2 dr 


where the second inequality is obtained using the Schwartz 
inequality By squaring (4 15) and integrating over [0, t] we 
obtain 

f Q |0(t)| 2 dr < L|0(O)| 2 + L|| x + Z)||L 

, |«(a)| 2 d* dr (4 16) 


hm {T.[W T l9](0 - (/[W f ]0)(O} = 0 (4 20 ) 

t "1C 

with 7^ and T as in Remark 4 1 For constanl A r; /* and 
Z the operator T z = 7 is a pioper stable rational transfer 
function, and Lemma 4 2 reduces to Lemma 2 11 from 128] 

□ 

V Parameter Identifiers 

We are now in the position to design the identifier module 
by applying the Nonlinear Swapping Lemma 4 1 to either z- or 
j-system Each of the two types of identifiers, with z-swapping 
and with i -swapping, can be implemented with either gradient 
or least-squares update laws These parameter identifiers are 
variants of the regressor filtering identifiers in [32] 

A z-Swappinq 

For the error system (3 2) we introduce the filters 

xo = AM 0xo + W(z 9 t) 1 9 - D(z 9,t) T 9, 

Xq e R" (5 1 ) 
X r = A z (z , 9 , l) x T + W(z 9, I) 1 , xe R ,,x " (^ 2) 

and define the estimation error as 

C = z + Xo- X T Q, ( € R" (5 3 

Along with ( we define 

<=* + *> ~X 7 9 f E R" (5 4) 

Then we obtain 

c = X 7 6 + t (5 5) 
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; nd, by differentiating (5.4) and substituting (3.2), (5.1) and 
; 5 . 2 ), recognize that e is governed by 

i = A 3 {z, 0, t)i. (5.6) 


The update laws for 6 employ the estimation error t and the 
Mitered regressor x■ The gradient update law is 


0 = r- 


X> 


i + Hxl V 

and the least-squares law is 
0 = T 


r = r T >o,v >o (5,7) 


X< 


1 +v\x\jr 


f = -r 


xx 


i + "1x13- 


r r(o) = r T (o) > o. v > o. ( 5 . 8 ) 


By allowing v — 0 we encompass unnormalized update laws. 

Since the regressor x is a matrix, we use the Frobenius 
norm \x \t to avoid the need for on-line matrix inversion, as 
well as unnecessary algebraic complications in the stability 
arguments that would arise from applying update laws 9 = 
Tx(l P 4- with r fixed or updated with f = 

-r A (/„ + ^x T rx)-‘x T r. 

The boundedness properties of the 2 -swapping identifiers 
are as follows. 

Lemma 5.J: Suppose the solution x(t) is defined on [0, tf). 
Tlie update laws (5.7) and (5.8) guarantee that 

1) if v — 0 then 9 E £^.[0, tf) and e E £ 2 ( 0 , */)« 

2) if v > 0 then 9 E £^[0, If) and 


A. - 7f-=F € £2 t°- l f ] n '/>■ 

V* +>'\X\-F 


Proof: (Sketch) Noting from (5.6) and (3.3) that d/dl 
|r|“) = ~ <: i<i - -com 3 it' s c l e ar that the positive 

definite function V = + l/2r.-o |£| 2 can be used as in 

| 6 |, f34J, [7] to prove the lemma. □ 

As explained in [ 6 ], various modifications of the least- 
squares algorithm (covariance resetting, exponential data 
weighting, etc.,) do not affect the properties established by 
Lemma 5 . 1 . A priori knowledge of parameter bounds can also 
be incorporated via projection. 


B. x-Swapping 

A different identifier results if instead of the error system 
(3.2) we consider the plant (2.1) rewritten in the form 

x = Ex + e n /io(x)u -f </>(x) T 0 (5.9) 


where 


0 

; -1 
0 ■■■ 0 


We employ the following filters 


ti 0 = v4(it)(no - x) + Ex -F e n fto(x)u, fto £ R" (5.10) 
a T = + <Hx) T , ■ n e w* n (5.i i) 


where A(t) is an exponentially stable matrix. We define 
estimation error vector 


c — x 

-%-n T e, 

r 6 R". 

(5.121 

and along with it 




e = x 

I 

P 

1 

5^ 

t e R". 

(5.13) 

Then we obtain 

f = Q t 0 + f 


(5.14) 


and, by differentiating (5.13) and substituting (5.9), ( 5 . 10 ) and 
(5.11), recognize that f is governed by 

c = A(t,)L (5.15) 

The update laws for 0 employ the estimation error < and the 
filtered regressor ft. The gradient update law is 

r= '' T>a ' i,i " <5|6 » 

and the least-squares law is 

§. r ^ 

1 + V\il\ 2 jr 

t "- r TTm r r (°) = r J (0) >o,„>o. ,5.i7) 

Again, by allowing v ~ 0 we encompass unnormalized 
gradient and least-squares. Concerning the update law' mod¬ 
ifications, the same comments from the preceding subsection 
are also in order here. 

Lemma 5.2: Suppose x(t) is defined on [ 0 , /./), and A(t) 
is continuous and bounded on [ 0 , tj) and exponentially stable. 
The update laws (5.16) and (5.17) guarantee that 

1 ) if v — 0 then 9 E £^[0, if) and t E £ 2 ( 0 , */), 

2) if v > 0 then 9 E £ 00 [0, if) and 

7=J7W e £ 2[0, tf) n £„,[{). tf). 

Proof: (Sketch) There exists a continuously differen¬ 
tiable, bounded, positive definite, symmetric P : R+ R" Xn 
such that P -f PA 4 A 1 P = V/. E [0, tf), and the positive 

definite function V — |||0|jLi 4 |f| 2 > can be used as in | 6 ], 
134], [7] to prove the lemma. □ 

VI. Stability and Tracking 

Either of the identifiers from the preceding sections can 
now be connected with the ISS-controller (3.1), (3.9). We give 
stability proofs for the resulting adaptive systems. These proofs 
encompass both normalized and unnormalized update laws. 

Theorem 6.7 (z-Swapping Scheme): All the signals in the 
adaptive system consisting of the plant (2.1), controller (3.1), 
(3.9), filters (5.1), (5,2), and either the gradient (5.7) or 
the least-squares (5.8) update law, are globally uniformly 
bounded for all t > 0, and lim^oo z(t) = 0. This means, 
in particular, that global asymptotic tracking is achieved: 
limt-oc [sr(0 -■ Vr(t)} = 0. Furthermore, if lim,-oo r(t) = 0 
and (f>( 0 ) = 0 then limt™* ocx(t) = 0 . 
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Proof: Due to the continuity of x m and die smoothness 
of the nonlinear terms appearing in (2.1), (3.1), (3.9), (3.1), 
(3.2), (5.7), (5,8), the solution of the closed-loop adaptive 
system exists and is unique. Let its maximum interval of 
existence be [0, t /). 

For the normalized update laws, from Lemma 5.1 we obtain 


e,», 




€ ^oo[ 0 , tf). 


an arbitrary exponentially stable ;4(f). We avoid this difficulty 
by designing 

A(t) = A 0 - X(t> T {x)4>{x)P (6.2) 

where A > 0 and Aq is an arbitrary constant matrix that 
satisfies PAq + AqP = P = P r > 0. With this design 
the matrix A(f) is exponentially stable because 

PA(t) + A T V)r=-I-2\Pcl> T <j>r<-I (6.3) 


When the update laws are unnormalized, Lemma 5.1 gives 
only 6 e £<*>[(), tf) and we have to establish boundedness of 

6. To this end, we treat (5.2) in a fashion similar to (3.11) 



< -fotr{)a T }-tr{y ( diag(K 1 |u) 1 | 2 , ■ ,h„|iz'„| 2 )x T } 

+ tr{W r7 \} 

= -oj|x|£- + 5^(-k.|u».| 2 |x.| 2 + u’J \,) 

1=1 

5: -f’olxl^ + —“• (6.1) 

4*0 


Theorem 6.2 (j-S wapping Scheme) All the signals in the 
adaptive system consisting of the plant ( 2 . 1 ), controller ( 3 . 1 ), 
(3.9), filters (5.10), (5.11), and either the gradient (5.16) or 
the least-squares (5.17) update law are globally uniformly 
bounded for all t > 0, and lim f _* cx; z(t) = 0 . This means, 
in particular, that global asymptotic tracking is achieved- 
lim,—^ [//(/) - y 1 (A)] = 0. Furthermore, if limine / (/) = 0 
and 0 ( 0 ) = 0 then linii-*,*, .r (0 = 0 . 

Proof We first considei the normalized update laws. 
As in the proof of Theorem 6.1, wc show that 0, 0. z. j E 
£cx[0, tf) and hence u E £ x [0. if) From (5.10) and (5.11) 
it follows that J2o, 1L and therefore ( are in £^ [0, if) Now, 
by the same argument as in the prool of Theorem 6.1 we 
conclude that if = x. 

Second, we consider the unnormalizcd update laws (5 16) 
and (5.17) with ,4(/) given by (6.2). Along the solutions of 
(5.11) we have 


This proves that \ E £oo[0, //). Therefore, by (5.5) and 
because of the boundedness of f we conclude that r e 
£oo[0, tj). Now by (5.7) or (5.8), 0 E £oo[0, if ). Therefore, 
by Lemma 3.2, z, s E £^[0, //). Finally, by (5.3), \o E 
£oo[ 0 , tf)> 

We have thus shown that all of the signals of the closed-loop 
adaptive system are bounded on [ 0 , //) by constants depending 
only on the initial conditions, design gains, the external signals 
x m and r, and not depending on tf. The independence of 
the bounds of If proves that if = oo. Hence, all signals are 
globally uniformly bounded on [ 0 , oo). 

Now we set out to prove that z E £ 2 < and eventually 
that z(1) —► 0 as t —► oc. For the normalized update laws, 
from Lemma 5.1 we obtain 0, E / \J\ + v\x\r £ £ 2 - Since 
X € £rx- then t 6 £ 2 - When the update laws are unnormalized 
Lemma 5.1 gives t E £ 2 . and since \ E £x> then by (5.7) 
or (5.8), 0 E £ 2 - Consequently in both the normalized and 
the unnormalized cases * T 0 E £2 because f E £ 2 - With 
V = 5 |C| 2 , all the conditions of Lemmas 4.1 and 4.2 are 
satisfied. Thus, by Lemma 4.2, z - x T V € £ 2 * Hence : e £ 2 - 
To prove the convergence of z to zero, we note that (3.2), 
(3.3) implies that z E £oo- Therefore, by Barbalat’s lemma 
z(t) 0 as / —> 00 . When r(t) —► 0 then —► 0 as 

t —> oc, and from the definitions in (3.1) we conclude that, if 
0(0) & 0, then x(t) —► 0 as / -+ 00 . □ 

Now we proceed to prove stability of the .r-swapping 
scheme. With normalized update laws* the proof is similar 
to the proof of Theorem 6 . 1 . With the unnormalized update 
laws, it is not clear how to prove boundedness of all signals for 


npn T ) = - 2A iip^ <i>pn r + u/y + (fiPti 1 

= -nn T - 2A ($pn T - ^i ;i j 


(6.4) 


which implies 
d 


j f (tr{nPil 7 }) < -tr{Hn f } + (6.5) 


Hence 12 E £^[0, tf ). Lemma 5.2 gives 2 0 E £>.[0, //). and 
from (5.14) and (5.15) we conclude that c E £-*,[(), tf) Now 

by (5.16) or (5.17), 0 E £^[0* tf). Therefore, by Lemma 3.2, 
2 . ./ E £oc[0, if)* Finally, by (5.12), 12o E £ x [0, //) As 
before, if = oc. 

Now we set out to prove that z E £ 2 . For normalized update 
laws, from Lemma 5.2, we have that (9, t / yjl + //E £2 
Since il E £-» then < E £ 2 . When the update laws are 
unnormalizcd, Lemma 5.2 gives t E £ 2 . and since $2 E C 0 c 

then by (5.16) or (5.17), 0 E £ 2 * Consequently for both the 
normalized and the unnormalizcd cases, U T 0 E £2 because 
i E £ 2 - Now, as in Theorem 6.1, we invoke Lemma 4.2 to 
deduce that z - \ J 0 E £ 2 - To show that : 6 £ 2 , we need to 
prove that Q 7 0 E £ 2 implies \ T B E £ 2 , or, in the notation ol 
Lemma A.2 from the Appendix, that Tj[<f> T ]B e £ 2 implies 

2 Since A{t) depends on i(f) whose boundedness is yel to bcjjrovcn, in 
invoking Lemma 5 2 we violate the boundedness condition lor 4(f) This, 
however, causes no difficulty because the boundedness condition is required 
only in order to establish the existence of P, and we know P in (6 2) a priori 
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\ r [W T ]6 G £ 2 - To apply this lemma to our adaptive system 
i' note from (3.3) and (3.1) that 



' 1 

0 

... o' 


II 

am 

1 

• ■ 0 

ct> T (x) 



. 1 

.. 1 



L dxi 

dxi 



= M(z, 0, t)cf> T {x). ( 6 . 6 ) 

Since M(z(t ), 8(t ), t) satisfies the conditions of Lemma A.2 
then x T 0 G C ‘2 and hence z. G £ 2 - The rest of the proof is the 
same as for Theorem 6.1. □ 

Remark 6.1: All the above results are presented for the 
parametric-strict-feedback form ( 2 . 1 ) without zero dynamics. 
As in [12], they can be readily modified for the strict-feedback 
systems with zero-dynamics 

ii = tfi+i + ti T tpi(x 1 , • ■ •, Xi, x r ), 1 < i < n - 1 
in = lM-r)u + 0 T tp„(x) 

X r = My, x r ) + *(w, x r )0 

V = -'i (6.7) 


Lemma 7.1: For both the j- swapping (5.7) anti tl 
swapping (5.16) normalized (v > 0 ) gradient update 
the following bounds hold 

he 

*) IlflU = |fl( 0 )| 

(7.1) 

») ii*il < }m\ 

(7.2) 

*«) 

(7.3) 

Proof: The proof is given for the 2 -swapping identifier 
(5.1), (5.2), (5.7). The proof for the x-swapping identifier 
(5.10), (5.11), (5.16) is identical. 

Consider the positive definite function V# = l/27|0| 2 .Its 
derivative along the solutions of (5.5), (5.7) is 

|r | 2 

Vi - - , 1 ' ,, < 0 . 

9 1 + v\x\r ~ 

(7.4) 

i) Due to the nonpositivity of V# we have V$(t) < 
which implies (7.1). 

ii) From (5.7) we can write 

^( 0 ) 


where the x r -subsystem has a bounded-input bounded-state 
(BIBS) property with respect to y as its input. The procedure 
can also be modified, as in [ 12 ], to obtain a local result 
for the parametric-pure-feedback systems, i.e., the systems in 
which ifi also depends on x 1 • As in [40], the subset of 
pure-fecdback systems that can be controlled globally can be 
enlarged using an appropriate filter and parameter estimate 
initialization. □ 


Vll. £00, Mean-Square and C 2 Performance 

For linear systems the issue of transient performance has 
recently received considerable attention (see [4], (20] and 
references therein). For the adaptive schemes presented in the 
preceding sections we now derive mean-square, and C 2 
bounds for the error state z, which incorporate the bounds for 
the tracking error y — y r . 

First we give performance bounds for parameter identifiers 
and use them to establish £« and mean-square bounds for 
v that are valid for both the 2 -swapping and the x-swapping 
schemes. Then we derive an £ 2 norm bound on 2 foT the 2 - 
swapping scheme. For the x-swapping scheme a similar £ 2 
bound is not yet available. 

We analyze in detail the scheme with the normalized gra¬ 
dient update laws and suggest in Remarks 7.1 and 7.4 how to 
aodify the derivations for other update laws. 

Without loss of generality we assume in our analysis, and 
recommend for implementation, that c( 0 ), *(()) (in the 2 - 
*wapping scheme), and fi( 0 ) (in the x-swapping scheme), 
be set to zero. This can be achieved by initializing Xo(0) = 
- 2 ( 0 ), x( 0 ) “ in the 2 -swapping scheme, and f 2 o( 0 ) ■ = 
r( 0 ), 12(0) = 0, in the x-swapping scheme. For simplicity, 
we also let T = 7 /. We explain in Remark 7.3 how the 
performance bounds differ in the absence of initialization. 


r T v T v ( 

|2 < 7 2 _lii_ < ^ 

I — I /1 . 12 — ’ 


\<m% 


r 


(l + "l*| 2 r) 2 

! LI2 


(i + >Ax II-) 2 


" 1 + 'Ax?r ’ 

By using (5.5) we get 

“ 2 '"I 2 UI3- 


(7.5) 


\8\> < < T 


v 1 + 1 Ax?f v 1 + v\x\% 


<7< " 


which, in view of (7.1), proves (7.2). 

in) By integrating (7.4) over [0, oc) we obtain 


v 71 + "UI3- 


< 7 - 7 ) 


Integration of (7.5) over [0, 00 ) and substitution of (7.7) yields 
(7.3). n 

Remark 7.1: The only difference in the case of the normal¬ 
ized least-squares is that (7.3) becomes || 0||2 < \pi!v |0(O)|. 

□ 

Theorem 7.1; In the adaptive system (2.1), (3.1) using 
either the identifier (5.1), (5.2), (5.7) or (5.10), (5.11), (5.16) 
with normalized update laws, the following inequalities hold 


1/2 


0 

«) 


+1*(0)|. 


.—cat 


Jv/coV^o <h)V 2 ) 


V t Jo ) 2 y/co \ «o t ) 


(7.8) 

1/2 


^z( 0)|. (7.9) 


n/K, 
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Proof: i) This bound follows by substituting (7.1) and 
(7.2) into (3.10). 

ti) By integrating the first line of (3.13) we get 


Proof: i) By Lemma A.l-i), and since x(0) = < 
inequality (6.1) is rewritten as 


/ |*(r)| 
Jo 




:oi 5-<u(»)&e- ! '-'+ f 

Jo 


e -ac„(f-r) dT < 

2«o 4r 0 K 0 

(7.17, 


+ dl ) (h ■ (7 - 10) 


Now, to arrive at (7.9), the sequence of integration in (7.10) 
is interchanged as in the proof of Lemma A.!.(?/). 

Remark 7.2: Although the initial states z 2 (0),--- ,2>(0) 
may depend on r,, y ,, this dependence can be removed 
by setting z( 0) = 0 with the following initialization of the 
reference model 

= -r, (0) - r*,_i(7, _!(()), 0(0), 7 m ,, _!(t))). (7.11) 

It can also be proven that in this initialization .r m (0) does not 
depend on c x . g t . Therefore, the bounds (7.8), (7.9) can be 
made as small as desired by increasing r u , and/or Ku , (/ 0 . A 
practical limit to the increase of these gain coefficients is that, 
in the presence of an error in the initial stale measurement, 
they increase 2(0) and the performance deteriorates. As for 
the pure-fecdback systems mentioned in Remark 6.1, the fea¬ 
sibility region may, in general, decrease as c ,, h,. q t increase. 
□ 

Remark 7.3 : The above bounds are readily modified to also 
cover the case when r(0) ^ 0. For example, the C^ bound 
(7.8) is augmented by the term 


and (7.14) follows. 

ii) Now (5.5) implies ||t||oc < |||xMkl|0|k 

(l/2 % /qjKo)|^(0)| which proves (7.15). 
in) The bound on the £2 norm of t is obtained using 


<(i + "ll \x\Ai) 


r( 1 + "ll 


2 


. (7.18) 


2v^o L 1<- « \«u do 1 ' 2 ) a 


If we set both filter initial conditions to zero, namely, x(0) = 0 
and \o(0) - 0, we get e(0) = z(0). (This initialization is 
always exact because it does not depend on the measured 
stale.) In this case, (7.12) shows that, for 2 ( 0 ) ^ 0 the 
performance bound (7.8) is 


By substituting (7.7) and (7.14) into (7.18) we prove (7.16). □ 
Remark 7 4: With the bounds (7.14M7.16) for the 2 - 
swapping scheme we can tighten the bounds on ||0|| 2 and 

H^llfx, in Lemma 7.1 and make them valid for the unnormalized 
update laws with v = 0. It is straightforward to show that I /r 
in (7.2)—(7.3) can be replaced by min{l/z', l/4r ( )Ato}. The 
same is true for (7.8M7.9). We can also show that for the 
.r-swapping scheme \jv can be replaced by min {I /77 (p/2A) 

[^max (Po)/X min (n)i). □ 

Theorem 7 2 In the adaptive system (2.1), (3.1) with the 2 - 
swappmg identification scheme (5.1), (5.2), (5.7) the £ 2 nonn 
of 2 is bounded by 

||..|| 2 <!»JL + _L_ » V" 

y/2r 0 \go lJ> 2 cqKqIs 2 ^ 07 / 

+ ®U-7=M0)|. (7.19) 

V<o 

Prooj: We will calculate ihe C 2 norm bound lor : as 

NI2 < IMI2 + M2, where 


|0(O)| ( 1 


1 + 23 L) 

no dai' 2 ) 


2 v/oi [4< - o 
+ |s(0)|p- r “' 


(L + 22 L) + ^ 

\*o 9ov z J g^v 


l*(0)l 

(7.13) 


Lemma 7.2: For the adaptive system (2.1), (3.1), (5.1), 
(5.2), (5,7), the following inequalities hold 


V' = Xo - \ (7.20) 

A bound on ||r|| 2 ts given by (7.16). To obtain a bound on 
|| 0 || 2 , we examine 

= A £ {z, <9, t)t-D{z, (9, t) r 9-^e. (7.21) 

By using (3.3) and repeating the sequence of inequalities 
(3.11), we derive 


< -%d>\ 2 - — 


? 0 — ~x' & 

l f’o 


0 lllxWU<; 

4 

n) IHI,» < M 


2v^ono 

m\ 


2^/f'oN() 


(7.14) 

(7.15) which gives 




" ,) *' ||,s ^ + ^ 5 ^ |S(o)| - <7i6> 
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> applying Lemma A.L(ii) to (7.23), we arrive at 



where W {) denotes W( 0, 0, t). Likewise, we rewrite If ■ 
(3.3) 

D = A) + [0. fiflji] r 


m substituting (7.3) and (7.14) into (7.24) we get 



where we have assumed that xo(0) = -(H)- Combining this 
and (7.16), and rearranging the terms, we obtain (7.J9). □ 

The form of the bound (7.19) is favorable because it is 
mear in |0(O)|. It may not be possible to make the £2 
iorm of z as small as desired by cq alone because of the 
enn |0(O)|/v/27. With the standard initialization 2 ( 0 ) = 0. 
lowever, a possibility to improve the £ 2 performance is by 
simultaneously increasing r 0 , g {) and '■y. 

VIII. Rapprochement with Linear Designs 

A connection of the adaptive nonlinear ISS-design presented 
•n this paper with linear estimation-based designs will become 
clearer when the ISS-controller of Section Ill is replaced by 
he weaker SG-controllcr developed in this section. The only 
difference between the two controllers is that the SG-controller 
employs weaker nonlinear damping functions s,. For example, 
’or an uncertain term 0^>(j ^) in the first equation of the plant, 
he 1SS and SG nonlinear damping functions are respectively 

*'j SS (-n) = ^(■n) 2 and 


SG 


(■n) = 


^(.'■ 1 ) - ^(0) 


l2 


T] 


= w(.n) 2 . (8.1) 


n this way the growth of .s^ G is reduced by a factor of 
r’f. In the process of backstepping this reduction is even 
nore pronounced. However, the SG-controller can no longer 

guarantee the ISS property with respect to 0 and 0. Instead, 
we reveal a small gain property and prove boundedness with 
i linear-like Gronwall lemma argument. The main interest 
! n the SG-controller is that for linear systems it becomes 
:mear in .r and in that sense is similar to linear estimalion- 
>ased designs. In contrast, the ISS-controller for linear systems 
cmains nonlinear. 

To derive the nonlinear damping expressions for the SG- 
-ontroller we rewrite the regressor vectors yj r as follows 

w t (z t , 0 , /) = w,( 0, 0 , i) 4 U t (z^ 0 , t) T z t (8.2) 


where 2 ? ^ [z\, ■ ■ ■, z t ] r , and u, : R' x x R+ —♦ R ,x ** is a 
natrix-valued function smooth in the first two arguments and 
-ontinuous and bounded in the third argument (with a slight 
ibuse of notation relative to (3.1) we now express w 1 as a 
unction of 2 ,, 0, /). Thus we have 


w = W 0 + [wfzi, ■ • ■. u*z n ] 


where h,: R‘ -1 x R p x R + ->• R 1 "- 1 '*'' are matm-valucil 
functions smooth in the first two arguments and continuous 
and bounded in the third argument, and D n denotes I)(i\. 0, t) 
The SG-controller has the same form (3.1) as the ISS 
controller, but its nonlinear damping functions are defined 
as 

■S = + (8.5) 

As in linear estimation-based adaptive control, ihe SG- 
controller employs an identifier with normalized update 
laws. 

Theorem 8.1: All the signals in the adaptive system con¬ 
sisting of the plant (2.1), SG-controller (3.1) with (8.5), 
2 -swapping filters (5.1), (5.2), and cither the gradient (5.7) 
or the least-squares (5.8) normalized update law (o > 0). arc 
globally uniformly bounded for all t > 0, and lim/—^ ?(/) = 
0. This means, in particular, that global asymptotic tracking 
is achieved: linv-,^ [*/(/) - /y r (f)] = 0. Furthermore, if 
lim/^r^ r(t) = 0 and 0(0) = 0 then lim,_^ . r(t) - 0. 

Proof ■ Using (8.3) we write (5.2) as 

V 7 = + VT 0 r + 1 . • ■ • , ^z ,,} 1 . (8.6) 

In a fashion similar to (6.1) we compute 

< -oiUlJ- - X/.kl^U.I 2 


n 

+ 'jTzjuJ'X, + tr{W',f*} 
1=1 



On the other hand, using (8.4) we write (7.21) as 

0 = A^> - (* + Doff) - [ 0 , blzu ■ ■ ( 8 . 8 ) 

and compute 

ji G^ 2 ) - ~ 

n 

- *l ,T {x "i" Ai) 7 0 “ Y*7-m,6 
1=1 

I — 1 

<-ylV'l 2 + 1 W^I 2 

l t*o 

+ -i-|z| 2 |(9‘| 2 + i|2?o|^| 2 . (8.9) 

4go c 0 


(8.3) 
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The system (8.7), (8.9) is summarized as 

Si -c <iM 3 : -+ jj^M 2 + (8.10) 

^(ivf) < -coi^i 2 +^rixi 3-+2 ~i^i 2 i«i 2 

+ -|D 0 |^| 2 . ( 8 . 11 ) 

Co 

From Lemma 5.1 we have 0 e £nc[0, t f ), so |Ho|3- < * 
and l-Dol^ < fc, where A* denotes a generic positive finite 
constant. From Lemma 5.1 we also have 0, c/yj FT Hx|$r € 
£ 2 ( 0 , tj) n£jo[ 0 , tf). Let us denote by /1 a generic function 
in £][0, //) fl £-*j[ 0 , //). Since r = 2 + 0> then we have 

W ! < 2 T Ji j y(i + Hxl5-) + 2 WI ! 

< 2/1(1 +Hx&) + 2|vf. (8.12) 

Thus (8.10H8.11) become 

< —(^ki ~ h)\x\j? H-|0 | 2 -1“ A* (8.13) 

ar *0 

J t {\ H 2 ) < -(co - hM 2 + h\x& + *■ (8-14) 

This is a loop with small gain because |\|3r appears multiplied 
by /1 in (8.14). To finish the proof we define the “superstate” 

X=\x\% + — 1 (/-| 2 (8.15) 

*00) 

differentiate it and substitute (8.13)-(8.14). After straightlor- 
ward rearrangements and majon/ations, we get 

X <-{^-h)x + l. (8.16) 

By applying the Gronwall lemma we conclude that A r is 
uniformly bounded on [0, t /) In view of (5.5), r is bounded, 
which along with the boundedness of 0 , proves that 2 is 
bounded. Thus // = cx>. The rest of the proof is the same 
as for Theorem 6 . 1 , and again uses Lemma 4.2 for proving 
convergence. □ 

In contrast to the ISS design, the global result has been 
established only with normalized update laws. The issue of 
normalization in adaptive nonlinear design was discussed 
in [26], 

Performance bounds similar to those in Section VII for the 
ISS design are not available for the SO design, nor is it clear 
how to develop its .r-swapping version. 

In Section IX we compare the linear SG design with the 
<ftew ISS design for a linear plant. 

IX. Examples and Discussion 

The first example in this section illustrates the performance 
properties of the ISS design on the relative-degree two plant 

* r l = *2 + M-M) 

(9.1) 


with the objective to regulate x to zero (without a reference 
model). The second example makes a comparison between thi 
ISS design and the SG design. 

We define the error variables 

Zl = X l 

Z‘2 = X2- ni(xu 0)- (9.2) 

The two-step ISS-controller design 

<*1 ~ “<T*1 - Kj<£> 2 2i ~ dip 


(Oai \ 2 2 ( 0«iA 2 

! 2 l '“ 2 ( 37 rJ = ! 


U = —Z\ — V 


results in the error system 

i = A,(zJ)z + 

where 

A 3 = 


d(Y\ , - , 

+ -t— L (J'3 + M (9.3) 

UJ'\ 


l~£>j 


0 + 


0 

Jo 
L d» J 


9 (9.4) 


-O - 


(£f)V-«(^) J 


(9.5) 

The 2 -swapping identifier is designed with the following filters 


\0 = A z ( -• 0) Vo + 
\ T = >Ur, §)x T + 



0 - 

0 

clt*l _ 

_ ctf*| 


- (Jtf - 




(9.6) 

(9.7) 

(9 8) 

T - (9 9) 

i + i'X\ 

The j -swapping identifier is designed with the following filters 

X)o = — (w A(^ 2 )(f2o — j’j) ./ 2 , Sio £ R (9.10) 


which are used to implement the augmented error 

f - - + 

and the gradient update law 


f - + Ao ~ V T ^ 


0 = 7 - 




fl = —(71 -f- \ip 2 )tt -|- ip, 


fie R (9.H) 


which are used to implement the equation error 


( — .ri — fly — iW 

(9.12) 

and the gradient update law 


i ih 

(9.131 

(j — -y- 

7 i + i/n 2 


This reveals that the ./-swapping approach is uncertaint> 
specific in the sense that only the terms <p t multiplying 
the unknown parameter 0 need to be filtered. This opem 
a possibility for a reduction in the dynamic order of the 
identifier. 

In simulations, the only difference between the z-swapping 
and the i-swapping approach was in the value of 7 needed 
to achieve the same speed of adaptation—higher value was 
needed in the 2 -swapping case. Since the responses were 
similar we show them only for the 2 -swapping scheme. 


.r 2 = ti 
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Fig. 1. Dependence of the transients on t*o with *o = gp = 1. (Note an 
expanded lime scale for control u.) (a) 7 — 10 ; (b) 7=0 1 : rn = 1 ; 2 : 
Co = 0 : 3: fu = 15. 

Example 8.1 (ISS-Performance): We consider system (9.1) 
with nonlinearity tp{x\) = x\. The simulations were carried 
out with nominal values c..\ = c 2 = o 0 = K.y = k 2 = ^0 = g 2 
- <j 0 = 1 . 7 = 10, 61 = 5, 0(0) = 0 which were judged to give 
representative responses. All simulations are with following 
initial conditions: ,t(0 ) = -\o ( 0 ) = [ 0 . 10 ] r , *( 0 ) = 0 (to 
set c( 0 ) = 0 ). 

Fig. 1(a) illustrates Theorems 7.1 and 7.2. The design 
parameter c () can be used for systematically improving the 
transient performance. Up to a certain point the error transients 
and the control effort in Fig. 1(a) are simultaneously decreas¬ 
ing as c.q increases. Beyond that point the control effort starts 
increasing. The control u is given in an expanded time scale in 
order to clearly display the main qualitative differences among 
the three cases. Fig. 1(b) illustrates Corollary 3.1. When 
adaptation is switched off, the states are uniformly bounded 
and converge to (or remain inside) a compact residual set. 
Corollary 3.1 does not describe the behavior inside the residual 
set, which may contain multiple equilibria, limit cycles, etc. 
For this example (but not in general), there is an asymptotically 
stable equilibrium at the origin for any value of the parameter 
error. For small yalues of c 0 , this equilibrium has a basin of 
attraction which is strictly inside the residual set. For higher 
values of c u the global asymptotic stability is achieved. 

Fig. 2 shows the influence of kq on transients. According to 
Theorem 7.1 and Remark 7.4, the peak values can be decreased 
by increasing kq, which is confirmed by the plot. The £2 
performance may not be improved, however, by increasing kq 
because the K-terms slow down the adaptation and make the 
transients longer. The effect of the (/-terms was shown to be 
significant only for very small r<) and kq or for very large 7 . 

Fig. 3 demonstrates the influence of the adaptation gain 7 
on transients. Due to the slow initial adaptation, which should 
be attributed not only to the normalized gradient update law 
but also to the fact that the regressor is filtered, there is a 
clear separation of action of the nonadaptive controller, which 




Fig. 2. Dependence of the transients on kq with r n — gp = ]. - — ui. 1: 
cq = 1; 2: ro = 5; 3: cu = 15. 
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Fig. 3. Dependence of the transients on 7 with co = jr/o = M) = 1. 1: 

cn = 1; 2: rp = 5; 3: cp = 15. 

at the beginning brings the state 2 quickly to the residual 
set, and the adaptive controller which takes over to drive 
the state to the origin. The property that the £^ bounds are 
increasing functions of 7, to be expected from Theorem 7.1, 
was exhibited in simulations only with extremely high values 
of 7. This indicates that some of the bounds derived are not 
very tight over the entire range of design parameter values. 

Finally, an explanation is in order about the initial condition 
.t( 0) in our simulations. We used ,t(Q) = [0, l()] T . and hence 
^( 0 ) = [ 0 , 10 ] T which is independent of the design gains 
Co, kq. go. This is why the peak of z\ decreases monotonically 
as any of these gains increases. If, instead, we used x\(0) ^ 0 , 
then, according to Remark 7.2, we would have added an 
appropriately initialized reference model (with r(t) = 0). In 
this way, bad transients would be eliminated by following a 
less aggressive path to the origin. □ 

Example 8.2 (ISS vs. SG); Let us consider system (9.1) 
with (p(x 1 ) = &*i. For this linear system we make a comparison 
between the ISS and SG designs. The only difference is that 
the terms Kx(p 2 ^K 2 (dtx\/dxi) 2 tp 2 ,g2{dot\ld0) 2 in the ISS 
design are, respectively, replaced by K\, K 2 (dai/dxi) 2 ,g 2 
in the SG design. The same design coefficients and initial 
conditions are used as in Example 8.1, except for 6 = 3. 
The adaptation gains, 7 = 5 for the ISS design, and 7 = 1.5 
for the SG design, are chosen so that the rate of parameter 
convergence is the same for both designs. The control law of 
the SG design is linear in x and nonlinear in 0. 

Fig. 4 shows the difference in performance between the two 
designs. The ISS design uses larger control effort and achieves 
better attenuation of the z\ -transient. The dashed responses 
illustrate the underlying nonadaptive behavior (7 = 0). While 
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Fig. 4. ISS design vs. SG design. The dashed lines show ci (t) when 
7 = 0.(Note an expanded lime scale for control u.) 

the response of the SG design exhibits linear exponential 
instability, the response of the ISS design, according to Corol¬ 
lary 3.1, is bounded. Hence, there is a clear trade-off of 
performance improvement versus control effort between the 
new ISS design and the linear adaptive designs such as the 
SG design. □ 

X. CONCLUSIONS 

Recent Lyapunov-based recursive designs of adaptive 
controllers for nonlinear systems transformable into the 
parametric-strict-feedback form [12], [8], [19], [36] achieve 
global stability and tracking, but do not allow a choice 
of parameter update laws. In these designs the wealth of 
knowledge about standard identifiers is not utilized because 
the identifier does not appear as a separate module of the 
adaptive system. 

A complete separation of the controller and identifier mod¬ 
ules is one of the main accomplishments of this paper. It has 
been achieved by a new nonlinear controller with an input-to- 
state stability property with respect to the parameter estimation 
error and its derivative as disturbance inputs. This strong ISS- 
controller remains nonlinear even when the plant is linear. 
For comparison with linear estimation-based designs, a weaker 
SG-controller is introduced, resulting in a small-gain rather 
than the ISS property. For linear plants this controller is linear. 

As a separate module, the ISS-controller can be connected 
with the standard unnormalized or normalized gradient or 
least-squares identifiers, while the SG-controller requires nor- 
malization. The connection of the controller and identifier 
modules is made possible by a nonlinear extension of the well 
known swapping lemma. 

In addition to the global boundedness and tracking, the new 
design also provides explicit bounds on the transient perfor¬ 
mance, which can be utilized for its systematic improvement. 

The results of this paper assume that the full state is 
available for feedback. Relying on the experience gained with 
recent recursive output-feedback designs, such as [22], [14], it 


is expected that the estimation-based design of this paper wil 
be extended to nonlinear systems in the output-feedback form 
The applicability of various designs, Lyapunov-based or 
estimation-based, will ultimately depend on their robustness 
with respect to unmodeled phenomena. This is another impor 
tant topic of current research. 

Appendix 

Lemma A.l: Let v , p: R+ —» R, c, b > 0. If 

v < —a; 4- bp 2 , v(0) > 0 (A.l) 

/) then 

v(t) < t>(0)e~ c * 4- b f c" r ( # “ T )p(r) 2 dr. (A.l) 

J o 

ii) If, in addition, p e £ 2 , then v 6 £00 n £\ and 

IH|i<i(i’(0) + fe|H|i). (A.3) 

Proof; i) Upon multiplication of (A.l) by c ct , it becomes 

jMiY 1 ) < M0 2 e c ‘- (A.4) 

Integrating (A.4) over [0, t], we arrive at (A.2). 
ii) Noting that (A.2) implies that 

v(t) < v(0)(~ ct 4 b sup f p(r) 2 dr (A.5) 

r €:[(),*] do 

we conclude that v e £ 00 ■ By integrating (A.2) over [0, I], 
we get 

f v(r) dr < / t/(0)e~ CT dr + b f j e~ ci<T ~ s) p{s) 2 ds dr 

Jo Jo Jo L./u 

< -u(0) 4- bj^ ( ,ctt p(s) 2 ^ ^ (~ CT dr ^ ds 

< -?;(0) + b f e c V(*) 2 -e- r ' da 

c Jo c 

< - v(0) + b [ p(r) 2 dr (A.6) 

c L Jo 

which proves (A.3). D 

Lemma A.2: Let T*: u 1 —► Q, i = 1, 2 be linear time- 
varying operators defined by 

C i = Ai(1)Q + u (A.l) 

where \ R+ —* R nHn are continuous, bounded and ex 
ponentially stable. Suppose 9 : R+ —> R p is differentiable, 
<£: R+ —♦ R pXm is piecewise continuous and bounded, and 
M : R+ —► R T,X W is bounded and has a bounded derivative on 

R+. If 9 € £2 then 

Ti[p T ]§ € £ 2 =» T 2 [M(f> T ]9 6 £ 2 '. (A.8) 

If moreover, M(t) is nonsingualr VI, and Af -1 is bounded 
and has a bounded derivative on R+ then (A.8) holds in both 
directions. 
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Proof: Suppose that Ti[<t> T )9 e £ 2 . By Lemma 42, 
€ £ 2 ana therefore Cl = Tij [<t> T 9] € £ 2 . 
; will show first that C 2 = T 2 [M(j> T 0] 6 £ 2 - By substituting 
9 = Ci - 4i(0Ci into the variation of constants formula 
,: j applying partial integration we calculate 

,,(*) = $2(L 0)C2(0) + / $2 (t,T)M(T)<j> T (T)6(T)d,T 

JO 

= *a(t, O)Ca(O) + / * 2 (t, r)M(r) 

Jo 

■ [CiW - i4i(r)Ci(r)]dr 

= * 2 (t, 0)C 2 (0) + M(f)Ci(t) - *2(t, 0)M(0)Ci(0) 

+ / $ 2 (f, r)[M(r) + A 2 {t)M(t) - M(t)A x {t)] 

Jo 

Ci (r)dr (A.9) 

where $ 2 (L r ) is the state transition matrix of A 2 (t) that 
satisfies |$ 2 (i, t)\ 2 < kt~ a ^~ T \k, a > 0. It is clear that 
4>2(<. 0)C2(0 )+M(£)Ci(<)-$ 2 (*, 0)M(0)Ci( 0) G £ 2 because 
0) is exponentially decaying, M{t) is bounded and 
(j e £ 2 . Since 

<J> 2 (f, r)[M(r) + A 2 (t)M(t) - M(t)A 1 (t)]Ci(t) ,It 
< \\M + A 2 M-MA x \\l u k 2 f P~ 2n(t - T) \<:i(r)\ 2 dr (A.10) 

Ji) 

then similarly to (4.16)—(4.17) from the proof of Lemma 4.2, 
we can show that the expression (A. 10) is in C 2 . Thus = 
T 2 [M<l> r f)} e c 2 . By Lemma 4.2, T 2 [M</> r 0] - T 2 [M(j> T ]6 € 
C <2 and therefore T 2 [Mcf) T 0] £ C 2 . The proof of the other 
direction of (A.8) when M(t) is nonsingular Vi, and M -1 is 
bounded and has a bounded derivative on R+ is identical. □ 
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Approximate Decoupling and Asymptotic 
Tracking for MIMO Systems 

D. N. Godbole, Student Member, IEEE , and S. S. Sastry, Senior Member, IEEE 


Abstract —This paper presents an algorithm for approximate 
input-output decoupling of nonlinear MIMO systems that are 
cither numerically ill-posed or exhibit nearly singular behav¬ 
ior in the application of decoupling algorithms. Although the 
systems considered are regular, so that the exact decoupling 
algorithms are applicable in this case, they require inversion 
of an ill-conditioned matrix, and yield high gain feedback so¬ 
lutions that may result in actuator saturation and cancellation 
of high frequency zeros. The approximate algorithms of this 
paper are numerically robust, and provide solutions that do not 
cancel far off right-half plane zeros. This latter characteristic 
is especially valuable when some of the far off right-half plane 
zeros are unstable. The algorithms are inspired by and are 
generalizations of some examples in the flight control literature 
II], 12], [3]. 


I. Introduction 

T HE nonlinear control toolbox has grown enormously 
in the last decade. Central to this development is the 
iheory of feedback linearization for nonlinear systems (see [4], 
[5]) and the solution to the multi-input multi-output (MIMO) 
decoupling problem. Several algorithms have been proposed 
in the literature for solving the problem of exact decoupling 
foi nonlinear MIMO systems, sec for example 14]—[9]. These 
algorithms all require the determination of the inverse of a so- 
called decoupling matrix. Practical implementation of these 
algorithms, however, is difficult when the decoupling matrix 
is ill conditioned or close to singularity, in which case the 
decoupling of such systems needs excessively large control 
effort. Further, the algorithm is not numerically robust since 
it requires the inverse of an ill conditioned matrix. 

In this paper, we propose a numerically robust input-output 
decoupling algorithm for invertible nonlinear MIMO systems. 
Our efforts arc motivated, in part, by the use in Hauser et 
(d [2], the work of Singh [1], [31 of such techniques to 
aircraft flight control problems and the work of Barbot et al. 
hOJ with applications to models of electric motors and the 
i'elousov-Zhabotinsky reaction kinetics. In these examples, 
intuition for approximation and the choice of parameters 
1 be approximated is provided by the physics of the prob- 
Tti. This paper attempts to formalize the theory involved in 

Manuscript received March 26, 1993 Recommended by Associate Editor, 
M. Bloch. This research was supported m part by Army Research Office 
irter Grant DAAL-91-G-091 and NASA undei Grant NAG 2-243 
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these examples and provide an algorithm for more general 
MIMO nonlinear systems, whose physical derivation may not 
be explicitly known by the designer. Another recent paper 
by Grizzle and Di Benedetto fll] provides an approximate 
decoupling algorithm for systems that are not decouplable 
by the exact decoupling algorithms by reason of their not 
being regular. The approximate algorithm in this paper aims at 
those systems that are regular and so decouplable by the exact 
decoupling algorithms, but the numerics of the decoupling are 
poorly conditioned. 

In addition to the numerical robustness of our approx¬ 
imate algorithm, it also serves another important purpose: 
The exact input-output decoupling algorithm is essentially 
a pole-zero cancelling control law. Thus, this Jaw cancels 
zeros of the open-loop system regardless of whether they 
are in the left half plane or the right-half plane and re¬ 
gardless of their magnitude. In particular the input-output 
decoupling control law can only be applied to minimum 
phase nonlinear systems. For systems with far off zeros, 
cancellation may not be necessary, since the zeros do not 
play a large role in the system dynamics and indeed may 
cause instability when the zeros lie in the right-half plane, 
In either case, cancellation of far off zeros results in high gain 
controllers. Our approximate decoupling algorithm does not 
cancel the far off zeros of the open-loop system, thereby pro¬ 
viding reasonable gain, practically implementable solutions. 
The price to be paid is the replacement of asymptotically 
exact tracking control laws by approximate tracking con¬ 
trol laws. This connection between regular perturbations of 
nonlinear systems and the far off zeros was first pointed 
out in f 12] for the single-inpul single-output (SISO) case 
and in [131 for the MIMO case. Systems in which these 
far off zeros are in the right-half plane are called slightly 
nonminimum phase systems in [2J. The approximate de¬ 
coupling controller, in this case, results in a stable closed-loop 
system. 

It is possible to develop several different approximate 
decoupling algorithms starting from the different decoupling 
algorithms in the literature. The algorithm presented here is 
based on the Descusse and Moog dynamic decoupling algo¬ 
rithm (see [6], [14]). Section II reviews the Descusse-Moog 
algorithm. In Section III, we state the approximate algorithm 
and Section IV compares its convergence properties with that 
of the Descusse-Moog algorithm. In many cases, input-output 
decoupling is a first step in designing a tracking controller. 
Section V examines the effect of the approximate decoupling 
on the performance of tracking controller. 


0018-9286/95$04.00 © 1995 IEEE 
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II. Decoupling Algorithms for Nonlinear Systems 

Consider the square (i.e„ number of inputs is equal to the 
number of outputs) MIMO nonlinear control system described 

by 


rn 

x =/(*) + 'Y^g,(x)ui 

i —1 

yj=hj(x) j — 1, ■ ■ • ,m 


(D 


where x E 9t n , f {x), gx(x), ■ ■ •, g m (x) are analytic vector 
fields on !K n and hi(x), ■ • ■, h m (x) analytic functions on 9t M . 
For convenience, these equations are written as 



x = f(x) 4- g(x)u 
V = h(x). 


Throughout the analysis, we will assume that xq is an equi¬ 
librium point of the autonomous system, that is f(xo) = 0 . 
We will assume (without loss of generality) that h(xo) = 0 . 
All the analysis in this paper will be local and will be valid 
in a given open neighborhood U of x 0 . We now review some 
algorithms for decoupling of MIMO nonlinear systems. 

Wc assume in what follows that each output yj has a well 
defined relative degree yj, i.e„ there exists an integer yj such 
that 


L gi Ljhj(x) = 0 V/ < 7 j - 1 , VI < i < rn , Vx E U. 

Collecting these calculation, we have 


'vV ' 


'LjhiW ■ 

y? 

— 

Lfh 2 (x) 

-I/m ‘ - 


Lj m h m (x) m 


L al Ly-%{x) 


L L ai L)'"- l h m {x) 
: = b{x) 4 - A(x)u. 


L^Ly-'hiix) 


Lg„, f 


“fm ““ 1 


hm (T’) J 


( 2 ) 


A(x ) is called the decoupling matrix. If A(ar) is invertible at 
every point in £/, then the static-state feedback given by 


•u = (A(a:))" 1 [- 6 (ar) + w] 


(3) 


will result in a closed-loop system that is decoupled from input 
v to output y. This decoupled and input-output linearized 
system is given by 


(4) 


[f the matrix A(x) is singular, we cannot use a static state 
Feedback to decouple the nonlinear system ([41), and we have 


pi' ‘ 


~ v \ m 

v? 


v 2 




■Vm - 


. c n , . 


to look for a dynamic state feedback to achieve input-outpu 
decoupling. 


A. Dynamic Decoupling 

If decoupling of the system 0) cannot be achieved by 
static state feedback, it may still be possible to find a dynamic 
compensator of the form 



z — D(x, z) 4 - E(x, z)v 
u = F(x, z) 4 - G(x, z)v 


(5: 


with z E SH Hc , v E 9t m . such that the closed-loop system 
denoted by (for extended system) 

{ x = f(x) 4 - g(x)F(x, z) 4 - g{x)G(x, z)v 
z = D(x , z) 4 - E(x, z)v ( 6 ) 

y = h(x) 

is decoupled from v to y. The dynamic feedback that decouple? 
the system of (I) is actually a static feedback to decouple 
the extended system of ( 6 ). There are a number oi 
algorithms in the literature for dynamic decoupling. The 
approximate decoupling algorithm we will propose is basec 
on the Descusse and Moog algorithm of ([ 6 ], [14]). We wil 
review the original algorithm to fix notation. 

Descusse and Moog Dynamic Decoupling Algorithm: 
Define the extended system at the end of iteration k - 1 
to be Ylk having x c as its state and equilibrium point xfj. 
The algorithm will start at k = 0 with the given system 
having state x E 9t r< and j ; 0 as its equilibrium point. The 
outputs of the system are unchanged during the course of Ihe 
algorithm. 

Step i: Compute the relative degrees 7 f(i E {1, ■■■,///)) 
for the rn outputs of Define the decoupling matrix A k (x r ) 
to have its ij th entry given by 


(*') - L ^s 


hi(x r ). 


Let Tk be the normal rank of Ak{ f x c ) in an open neighborhood 
of .Tq. If r'k = rn , stop. 

Step ii: If r k < rn , define a square and nonsingular matrix 
Gk(x F ) such that the (m — r k ) last columns of A k (.x e ): = 
Ak{x e )Gk{x r ) are identically zero. Moreover, this process 
can be carried out such that there exists r k rows of which 
the nonzero elements form an r k x r k nonsingular diagonal 
matrix. It is shown in [ 6 ] that ) always exists and is as 
analytic function of x e . 

There are r k columns of Ak(x r ) with nonzero elements 
out of which q k columns have two or more nonzero elements 
Design a permutation matrix I\ such that the first q k column 
of Ak(x c )Pk have two or more elements nonzero, followc 
by the {r k - q k ) columns having only one nonzero elemen 
and finally the last (ra — 7 >) columns having all zero elements 
Denote G k {x e ) = Gk(x*)Pk. Define an intermediate input i 

by 


U = [G t (* c )]- 1 f*. 


(7) 
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■itep iii: The system now is 

x e =f{x e ) + g(x')G k (x e )u 

y =A(x e ). 

A ;.;l integrators in series with the first q k inputs. This creates 
the new input vector u of the form 

r *1 i 


Um J 


Thus the new system after adding these integrators is 


m i e " 


7{® c ) + E?i i9i{x e )ui~ 



Ui 

= 

0 

+ 

*1 

- 


0 




V = h{x e ) (9) 

where g(x e ) = g(x e )Gk(x c ). Call this system X^+i- 

Step iv: Go to Step i and resume the procedure with k 
fe+1, new state variables x e *- {:r e }Uand new 
input u u. Let /, g, h still denote the extended /, g , h, for 
notational simplicity. Let U still denote the open set of interest 
containing the equilibrium point of the extended system. □ 

It has been shown by Dcscusse and Moog that if system (1) 
is right invertible and satisfies the accessibility rank condition 
(cf. [5, p. 86J) at x'o, then the above algorithm converges in a 
finite number of steps, L, to an extended system which 
is decouplable by static state feedback. 

Note: At the end of zth iteration, the above algorithm adds 
(], integrators (q, < r, ) to the system ■. Thus the dimension 
of x c increases by q ,. 

The Dynamic Extension algorithm (cf. [4]) and Singh’s 
algorithm ([7], [15]) are similar, except that Singh’s algorithm 
involves nonlinear transformations of the output space instead 
of the input space as in the Descusse-Moog algorithm. 

The computation of the rank of Ak(x e ) will be greatly 
simplified if Ak(x c ) satisfies a regularity condition (see [16]), 
v hich guarantees that the normal rank of A k (x e ) is the same 
a - the local rank for every x c E U. In this paper, we assume 
1 5Zo * s a regular system. If the given system is not regular, 
( *en one can use the procedure of [11] to get an approximate 
ystem that is regular and call this system 5I 0 . 

Normal Form 

Let us assume that the Descusse-Moog dynamic decoupling 
'gorithm converges after L steps to a system of the form 
f (6). Let us denote this extended system by ^ et 
P, 9 e , h e ) be the triple characterizing £ e> x e = (x, z) E 
t n+Tlc its state, u € its input, and y e its output. Let = 
fo, zq) be the equilibrium point of interest. This system 


has a well defined vector relative degree [M 
at £§■ L®* We can construct a local chan::, 

of coordinates <j>(x e ) = (£, r/) with £ = col (£*), such \\ux 
= 0 , by choosing 

C = col (A?(x e ), Lf')i'(x %. ■ •, L-}r l hUx*)) 

: = coi(€i, ( 10 ) 

and remaining (n + n c - 7 ”) complementary coordinates In 
these coordinates, takes the norma] form (see [ 4 ]) 

ei = e a 

^ 7 ' -1 = 5 r do 

= Kit, v) + EJLi r >) u j 
v = g(C v) + P(l, v)n r 
Vi = « 

for i = 1 ,• ■ ■, rn, where 

bi{^r 1 ) = Lpil{r\^r ) )) 1 <i<m 

v) = L 9 .Lyr l h'i{4>~ l {i, r/)) 1 < i. j < rn. 

The static state feedback that decouples the system 
given by 

u e = (A^ n ))- l [-b^Crn) + ri 02 ) 

The decoupling state feedback renders the r/ dynamics unob¬ 
servable. The zero dynamics of system are the dynamics 
of the rj coordinates in the subspace £ = 0 with the decoupling 
feedback law of ( 12 ) (with v = 0 ), i.e., 

V = 9 ( 0 , V) - P{ o, r/)[(^‘ (0, r/)) _1 6 l (0, r/)]. (13) 

For a detailed discussion of the zero dynamics and the trans¬ 
mission zeros of nonlinear systems we refer the reader to [17], 
[4, Chapter 6], [5, Chapter 11]. 

III. Approximate Dynamic Decoupling Algorithm 

The difficulty in implementing the decoupling algorithm 
comes from the ill-conditioning of the decoupling matrix or 
from a situation in which the decoupling matrix may be 
nonsingular but close to singularity. To be precise, this occurs 
if the smallest singular value of A c (t c ) is smaller than a 
certain prespecified c > 0 for any x E U. In this case, the 
algorithm calls for the inverse of an ill-conditioned matrix. 
In addition to the fact that the inverse is not numerically 
robust, it may cause large feedback gains in the controller 
and also cause the cancellation of far-off zeros (see [13, 
Section 4]). To alleviate these difficulties, we propose the 
following numerically robustified decoupling algorithm: while 
the algorithm appears to have numerical considerations in 
mind, it is, in fact, valuable for the reason that it does not 
cancel far off right-half plane zeros and may help control the 
magnitudes of the control inputs. To state the algorithm, recall 
a few basic facts and definitions about the numerical rank of 
a matrix. 
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Definition J: A matrix A E £H nxn is said to have t 
numerical rank r if 

inf rank {#: \\B - A\\ < r} = r. 

The norm in the above definition is the induced norm of the 
matrix induced by the Euclidean norm. 

Thus, the numerical rank of a matrix is the lowest it can drop 
to in an r neighborhood of the given matrix. In particular, if 
the matrix has (n - r) of its singular values less than e, then 
its numerical rank is r. 

Approximate Dynamic Decoupling Algorithm. This algo¬ 
rithm starts at k = 0 with the given system having j () 
as its equilibrium point. We are given a threshold e > 0. 
Let the extended system at the end of iteration (A - 1) be 
denoted by having P as its state and .r£, the corresponding 
equilibrium point. 

Step i * Compute the relative degrees of the outputs, namely, 
7 *, 7 = 1 ,... and the decoupling matrix A k (P ). Let r k 
be the normal rank of A k (P) in U. If 7 ^ = ni and if the 
smallest singular value of A k (P ) is greater than the threshold 
f uniformly on IJ, stop. 

Step ii If all the nonzero singular values of Ay, (./■') are less 
than ( uniformly on £/, approximate A k (x l ) by a zero matrix. 
Go to Step i and recalculate the relative degrees 7 * with this 
approximation. 

If r k = m, go to Step 111 , with A k (.r { ) = and 

G k (j' ) = AnXm* 

Design a square, analytic and nonsingular matrix G k (r*) 
such that the last m-t k columns of /U( 7 ') : = A k (x ( )G k (x t ) 
are identically zero. 

Step in If the smallest nonzero singular value of A k (x f ) 
is greater than the threshold r, go to Slep iv, with A k (.r* ) = 
>4*(.r e ), G k (.i p ) = / m x rti and w k = 7 k . 

If the smallest nonzero singular value of A k (x *) is smaller 
than f, then there exists a positive integer iu k (< r k ) such that 
the t rank of A k (x e ) is w k uniformly on U , i.e., (r k - tu k ) 
nonzero singular values of Ajt(j') are less than ( uniformly 
on V. 

Design a square analytic nonsingular matrix such 

that 

)•<>, ••■,()]. (14) 

Approximate the r k - w k columns, which are small in norm, 
by identically zero columns. Go to Step iv with 

•'M-O = ),•••,at, t (.!■'), 0,•■•,()]. 

Note. The < numerical rank of A k {P ) may not be constant 
over U. If the ( numerical rank passes through singularity at 
x* 0 , then the approximate algorithm will not work . 2 Throughout 
the analysis, we assume that the e numerical rank of ) 
is constant in U. 

1 The proof of existence ot such a matrix is given in the Appendix. 

2 The problem may be solved by reducing the value of f. Since ,s a 
regular system (cf. |16|), the singularity does not exist for f = 0 However, 
reducing the value ot *■ defeats the purpose of the algorithm, by causing high 
gain solutions to the original decoupling problem. 


Step iv: Out of w k nonzero columns of A k {x e )> q k column 1 
will have two or more nonzero elements. Design a permutatioj 
matrix P k such that the first 4 *. columns of A k (x e )P k (x 1 ) have 
two or more nonzero elements, followed by the (w k - q k > 
columns having only one nonzero element and finally the 
last (m - w k ) identically zero columns. Denote G k (x e ) = 
G k (s c )G k {s f ’)P k . Define an intermediate input by 

u = [Gfc(.r r )]~ 1 u. 

Step v The system now is 

P = f(x l ) + g(P )G k (x c )v 

y = HP)< 

Add an integrator in series with the first q k inputs. This creates 
the new input vector u of the form 


Ml ’ 


ih 

U <lk 



JjL qt +1 


1L, U H 1 

- - 


. »m - 


Thus the new system after adding these integrators is 


" x f 


’f(p) = <h(p)u> 9 


'Er=i M1 

Ml 

= 

0 

+ 

»i 

- 


0 


««» 


y-h(r') ( 16 ) 

where q(r f ) = q(s' )G k (P )■ Call this system Yl k n* 

Step vi Return to Step i and resume the procedure with 
A■ <— A -hi, new state variables ./ r <— and 

new input it <— it. Let /, g , h still denote the extended /, ry, h 
tor notational simplicity. Let V still denote the open set con 
taining the equilibrium point of the extended system. □ 
Let us assume that the approximate dynamic decoupling 
algorithm converges after L steps to a system of the form 
of ( 6 ). Let us denote this extended system by Yl r . Lcl 
{f l ,g e nh c ) be the triple characterizing £ t , P = (.r, zi 
its state, it* its input, and j) r its output. Let i f 0 = (^o, zq 
be the equilibrium point of interest. The system e has a 
well-defined vector relative degree [ 75 *, ■■ , 7 j] at P 0 . Tlnr 
we can construct a local diffeomorphism of the form of (10 
such that will be transformed into the normal form (£, i] 
coordinates given by 

£' = col (h;(s r ), Lj.Ki ?), • ■ • ,Xf r -'iw )) 

s «l(Pi,'"4) (17 

and remaining (r; 7f) complementary coordinate' 

7/. In the course of the approximate dynamic decoupling 
algorithm, we have neglected order f terms at each iteration ol 
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algorithm. Thus the original or exact system in the normal 
m coordinates of will be 

tl = ?2 


- £(<> 

■J ™ 

£ 7 “=£^+i+ 

.1=1 

■j .. ™ 

£ 7 ;-! = £ 7 - + 

■j . . ™ 

% = m, v)+ J>^(£, 

.?=! 

v = </(£,»)) + P(£, r/)u r 

?/,• = £1 ( 18 ) 

for i = 1, - ■ ■ ,m, where 

&,•(£.»/) = £}t*<(0 _1 (£. »/)) 1 < * < m 

«*;(£. v) = Lg* n)) 1 < *. :i < m - 

Note: If we substitute c = 0 in (18), then we get the rep¬ 
resentation of the system J2 V * n lis normal form, coordinates 
(£, //). The static state feedback that decouples the system 
E, is given by 

= U'(£, r))) _1 [-^(£. »j) + 4 09) 

If we compare the approximate and exact decoupling algo¬ 
rithm, we see that 

• If the exact decoupling algorithm converges with a de¬ 
coupling matrix A e (x r ) whose smallest singular value 
is greater than c uniformly on U, then the approximate 
decoupling algorithm yields the same result. 

• If, on the other hand, i4 r (^ r ) obtained by exact decou¬ 
pling algorithm has its smallest singular value of the order 
off uniformly on t/, then the decoupling control law (12) 
will have terms of the order of 1/c that will result in a 
high gain controller that may not be practically feasible 
if the actuators have saturation limits. 

The approximate decoupling algorithm is of use if we can 
11 'i-swer the following two questions: 

1) If the given system (1) is right invertible and locally 
accessible (i.e., the exact decoupling algorithm con¬ 
verges in L steps), then does the approximate decoupling 
algorithm converge in a finite number of steps? 

2) If the answer to the first question is yes, then how much 
error do we introduce in tracking the reference outputs 
by using the approximate decoupling and tracking law 
instead of the exact one? 

These questions will be answered in the following sections. 


IV. Convergence of Approximate 
Decoupling Algorithm 

The approximate decoupling algorithm is based on the Dc •. 
cusse and Moog dynamic decoupling algorithm. Consequently, 
the convergence properties of the approximate decoupling 
algorithm will be compared with that of the Descusse and 
Moog algorithm. Ideally the approximate decoupling algorithm 
should preserve the convergence properties of the Descusse 
and Moog algorithm. 

The algorithm of Descusse and Moog adds dynamics to the 
given system Eo* to get an extended system which is 
decouplable by static state feedback. The following theorem 
[6], relates this notion to the nonsingularity of the decoupling 
matrix Ai(x). 

Theorem 1: System E of the form of (1) is decouplable 
by static state feedback, if, the decoupling matrix for E is 
invertible. 

The approximate decoupling algorithm converges to an 
extended system that is robustly decouplable by static state 
feedback. This notion is defined as follows. 

Definition 2: An rn x m matrix A(j:) is r robustly invertible 
in an open set V with respect to a threshold r if the c numerical 
rank of is rn uniformly on f/. 

Definition 3: A system E of the form of (1) is t robustly 
decouplable by static state feedback with respect to a threshold 
c, if, E is decouplable by static state feedback and the 
decoupling matrix A(x) is robustly invertible with respect to e. 

The following lemma considers the effect of one iteration 
of the approximate decoupling algorithm on a system that is 
decouplable by using the Descusse and Moog algorithm. 

Lemma 1: Suppose that the Descusse and Moog algorithm 
converges for a system Ejt of the form of (I). Apply the 
approximate decoupling algorithm. After one iteration, we get 
the extended system J2k+v ^ ne °* l ^ e following is true for 
Efc+i : 

1) = oc for some i. 

2) A/t+i(:/;) is singular and the Descusse and Moog decou¬ 
pling algorithm does not converge for Efc+r 

3) 7 fc+l = n and yl* ; +i(:r) is not robustly invertible. 

4) Afc+i(a?) is singular, but the Descusse and Moog algo¬ 
rithm converges for Ea-h- 

5) Ak+ x(a;) is nonsingular, not robustly invertible and 

7*'+ 1 < Tiy . 

2 ^ A- 4- J 

6) Afc+i(:/;) is robustly invertible. 

where 7 * +1 = Ei~i7? +J anc * n V ' — the dimension of 

state space of Efc+r 

Proof: This is a list of all the cases after application of 
one step of the approximate decoupling algorithm. It is easy 
to check that the list exhausts all the possibilities. □ 

We will analyze each of the above cases in detail to 
understand why in some cases the approximate decoupling 
algorithm may not converge. 

While applying the Descusse and Moog algorithm, we 
differentiate each output until at least one input appears on 
the right hand side. Some of the inputs show up earlier than 
others making the decoupling matrix singular. Integrators are 
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added in front of these inputs to delay their appearance for at 
least one more step of differentiation. 

In this process, at a particular step, some of the inputs 
might be weakly connected to the outputs, i.e., the functions 
multiplying them are smaller than the threshold f uniformly in 
U . These functions are approximated by zero in the approxi¬ 
mate decoupling algorithm. This modification in the original 
Descusse and Moog algorithm might make the resulting sys¬ 
tems noninvertible. The classification of various cases in the 
previous lemma helps detect such systems in the following 
manner: 

Analysis of Lemma 1. Recall that 7 t fc are the relative de¬ 
grees of the outputs of 53*- We have 



bk(:r) + A k {s)u 


rank of Ak(x) = i'k- At the end of Step ii, (m - r k ) columns 
of Ak( t) are identically zero and there is at least one nonzero 
element in each row of Ak(j'). 

At the end of Step iv, some of the rows of A*(,/;)G*.(j ) 
might be identically zero because of the approximation of 
(rk — Wk) columns by zero. The number of such rows is less 
than or equal to (r k - w k ). Let us denote the set of outputs 
corresponding to these rows by Y k . 

(wk - Qk) columns of Ak(T)C k (.r) have only one nonzero 
element. By construction, these nonzero elements will be in 
(w* “ Qk) different rows. Let us denote the (w k - qk) outputs 
corresponding to these rows by Y k . The remaining outputs 
will be denoted by V*. 

The outputs of 53fc not change in the process. Thus for 
Efr+l' we have 


e n 


7f +1 = st - ft 

7/ +1 = + 1 , V? s.t. y, e __ 

7 , > 7* + l, Vi s.t. y, € Y k . 


n 


Case 1: If the only nonzero entries in the ah row are in 
the j column, then when the jth column of Ak(j')Gk(j;) is 
approximated by zero, the 7 th row is made identically zero. //, 
is only affected by Uj, and after the approximation, u. never 
appears on the right hand side again. This makes 7 , = 00 . 

Case 2: The situation here is that Case 1 recurs, but not 
immediately at the end of the k - 1st step but later in the 
algorithm. Thus the system 53*, l° ses invertibility because of 
the approximation. Some rows of Ak+i(x) will be dependent 
for all l > 0. This case can also be avoided by normalizing 
the jth input. 

In the course of the approximate decoupling algorithm, Case 
2 goes unnoticed until you reach Case 3. Thus the reason for 
Case 3 is in fact the occurrence of Case 2 during one of the 
previous iterations. 

If the approximate decoupling algorithm converges to a 
system 53 l that can be decoupled by static state feedback, 
the outputs y x and their respective derivatives up to the order 
of ( 7 ^ - 1 ) qualify as a partial change of coordinates. In the 
normal form notation these are the £ coordinates. 

During each iteration of the approximate decoupling al¬ 
gorithm, the state space dimension of 53 * is extended by 


q k whereas at least (m - xu* + qk) new £ coordinates ai • 
introduced. The difference between the state space dimensio < 
of 53 *.+i an d the dimension of £ coordinates decreases b 
(rn - Wk) during each iteration of the approximation di 
coupling algorithm. 

Case 3: If we go through (k + l)th iteration, dimension nl 
£ coordinates will exceed the dimension of the state space ot 
]T fe+<2 . Thus we cannot proceed further. 

Cases 4, 5, and 6 lead towards the convergence of approx¬ 
imate decoupling algorithm. □ 

Definition 4: Suppose 53o satisfies the hypothesis of Des¬ 
cusse and Moog algorithm. Apply the approximate decoupling 
algorithm, for given t > 0. Let k > 0 be the smallest integer 
such that either case 1, 2, or 3 of the previous lemma is true 
for 53*+r Then the system 53o is said to be f-unnormalized. 

Theorem 2. Suppose 53o satisfies the hypothesis of the 
Descusse and Moog dynamic decoupling algorithm. Apply the 
approximate decoupling algorithm for a given r > 0. Then 
one of the following is true. 

• The approximate decoupling algorithm converges in a 
finite number of steps. 

• The system 53 0 is (-unnormalized. 

Proof: During each step of the approximate decoupling 
algorithm, the difference between the state space dimension 
and the dimension of £ coordinates decreases by (m - u;*). 
Thus, if the first three cases of the previous lemma arc avoided 
during each iteration, the algorithm has to converge in a finite 
number of steps. 

The discussion following the previous lemma shows that 
the first three cases correspond to the underlying system being 
unnormalized. □ 

In general, il is not possible, a priori , 10 find out whether 
a given system is normalized or not. If the approximate 
decoupling algorithm does not converge in n steps, then the 
system is unnormalized, provided it was decouplable by using 
the exact Descusse and Moog dynamic decoupling algorithm. 

The design parameter f is an input to the approximate 
decoupling algorithm. If the approximate algorithm does not 
converge for a given because of the system being r unnot- 
malized, then the value of r can be reduced. This might result 
in convergence of the algorithm but the original problem ol 
numerical robustness and high gain solutions still exists. In the 
limit that f —* U, one recovers the Descusse-Moog algorithm 
For f = 0, the r unnormalized systems arc noninvertible. Thw 
given an invertible system that satisfies the accessibility ran* 
condition, one can always find an e so that the approximai‘ 
algorithm converges. In particular, the algorithm converges U 
the specific examples of fl], [2], [3). 


A. Multiple Time Scale Zero Dynamics 

Since the zero dynamics of a system does not change b 
addition of integrators to its input channels [4, p. 389], 1 
by input space transformations or by state feedback, the zen 
dynamics of 53 o is same as that of 53 z» where 53 z IS ^ 
extended system at the end of Descusse and Moog algorithm 
The next lemma compares the zero dynamics of and 53 1 
where 53 L is the extended system at the end of approximau 
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coupling algorithm. Note that the approximate decoupling 
;orithm cannot converge In fewer steps than the Descusse 
A Moog algorithm. 

Lemma 2: Suppose ‘ s r *ght invertible (cf. [6|) and 
insfies the strong accessibility rank condition (cf. |5, p. 86J) 
t) r ro- Suppose that the approximate decoupling algorithm 
converges for this system in exactly the same number of steps 
,ls the Descusse and Moog algorithm. Let Y!l anci ^ 
sssiem at the end of the Descusse and Moog algorithm and 
the approximate decoupling algorithms respectively. Then 


dim (t/e, ) < dim (r/v, ) 

where 7 / denotes the zero dynamics coordinates. 

Proof: During each iteration, the difference between stale 
space coordinates and the £ coordinates decreases by (m - w^) 
for approximate decoupling algorithm and by (m — r*.) for the 
Descusse and Moog decoupling algorithm. After L steps when 
both the algorithms converge, the zero dynamics dimension is 
given by the difference between the state space dimension of 
(he extended system and the dimension of the £ coordinates. 
Thus 


dim(//v, ) = dim(f/v, ) - Sf =0 '(/-, - «>,) 
< dim (//£, ). 


□ 


Thus, in general, the zero dynamics of the extended system 
at the end of the approximate decoupling algorithm (i.e., ) 

will have smaller dimension than that of We would like 
to investigate the relationship between r/v and //^ ( . 

Recent results in the area of singularly perturbed zero 
dynamics of nonlinear systems ([12J, [13]) lead us to the 
lollowing conclusion: under some suitable technical hypothe¬ 
ses, the zero dynamics of can be decomposed into two 
or more time scales using singular perturbation theory (cf. 
il8|). The slow or reduced system is described by the zero 
dynamics of J2, and the dynamics that were neglected during 
the process of approximation constitutes the faster time scale 
or boundary layer subsystem. In [13J, the authors have proved 
the above conclusion for a restricted class of two-input/two- 
ouiput systems. The full details and the technical hypotheses 
needed to guarantee the existence of singularly perturbed zero 
ovnamics for this general class of systems remain to be worked 
< ut. We will conclude this section with the following remarks. 

The approximate decoupling algorithm creates an extended 
> stem that does not include the far off zeros of the original 
\ stem. Since the static state feedback that achieves decoupling 
1 is a pole zero cancellation law, we do not cancel the 
>r off zeros of in the case of the approximate decoupling 
•gorithm. The cancellation of these far off zeros requires a 
■rge control effort resulting in a high gain controller. If these 
ir off zeros are unstable, then their cancellation makes the 
loscd-Ioop system unstable. These systems are referred to 
^ slightly nonminimum phase systems in [2]. Application of 
3e approximate decoupling algorithm to slightly nonminimum 
>hase systems results in a stable closed-loop system. 


V. Approximate Asymptotic Tracking 

Input-output decoupling is closely related to tracki.i < , • 
reference trajectoties by the outputs of a MIMO nonius 1 
system. If the desired trajectories to be tracked fall mu* 
a restricted class of functions, say constants 01 sinusoids 
with a finite spectrum, then we can use nonlinear regulatoi 
theory (see [4, Chapter 7], [19J, [20]). If ihe class of desired 
trajectories is more general, for example, functions that arc A 
times continuously differentiable but otherwise arbitrary, then 
according to [21] the decoupling controller forms an inner loop 
of the overall tracking controller. If the given system is not 
robustly dccouplable by using exact decoupling algorithms, 
then we have to use the approximate decoupling feedback. This 
section considers the effects of approximate decoupling on the 
performance and stability of the overall tracking controller. 

Let us assume that the approximate decoupling algorithm 
converges for ^T 0 giving us the approximate extended 
system ]T f . bquations (18) with ( = 0 represent ]T ( 

in its normal form (£. 7 }) coordinates. If the objective of 
the controller is to track the desired reference trajectory 
//,/(/) = [/VdjlOi* (0] 7 which is smooth and hounded 

with bounded derivatives, we design the control input 11 ( to be 

u r = (A' (t'W'l-b' + 






[viz («lz 1 ■ ■ •-* <{»d n 


( 20 ) 


where (.s^ r + rr 1 -1 H-b a,',) is a Hurwitz polynomial 

for / == 1 .•*•, 7 M. 

Let us define the tracking errors to be 


f*i : — £1 - y ( i, 1 1 < / < m. (21) 

Let us define the error coordinates for system to be 


1 < / < m. (22) 


Thus the system with the feedback (20) can be expressed 
in (c, r/) coordinates by 

r L = A 1 ? 1 1 = 1, • ■ ■ , r/v 


■ <■! 


■ t\ 


" Vd, 

1 

1 

1_ 


1- 

\ 

1_ 


1- 

1 ’' 

1- 


V = <l(t fi) + '/)«' (C, V, v) 

where A‘ E given by 


( 23 ) 


A = 


0 1 


0 


0 0 0 1 

L-<*0 -<T i ••• J 
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It can be shown (e.g., see [2]) that if 

• The reference trajectory and its derivatives arc bounded 
and small enough. 

• Zero dynamics of (23) (j.e. f the equilibrium point rj o = 0 
of the system) 

rj - g(0, rj) + P(0, fj)u e (Q, r), 0) (24) 

is exponentially stable. 

• <?(£> V) + v)u ( (C W, l 0 is locally Lipschitz contin¬ 
uous in £, fj 

then limt^o^eKt) = 0 V«. and the states £, rj remain 
bounded. 

The controller of (20) is designed for the approximate 
extended system If we apply this controller to the exact 
system, we get the system equations in the (e , fj) coordinates 
given by 

e’ = 7TV + r(f(x v )n c (x f ) 1 < t < t it 
V = v) + P(t q)u l {t r'i , v) (25) 


where 


0' = 


XT7I 


Ol Xrn 

ftp 


L 01 x m J 

represents the dynamics that was neglected during the approx¬ 
imate decoupling algorithm. Each flj is 1 x in row vector of 
functions of .r, the first elements of which are identically 
zero. 

The tracking control law (20) was designed for a system of 
the form (25) with f = 0. The following theorem shows that it 
works for the approximate system with nonzero f as well. This 
theorem is motivated by and is similar to the one for slightly 
nonminimum phase systems as in [2]. 

Theorem 3: If 

• Zero dynamics of system (25) (i.e., the equilibrium point 
f)o = 0 of (24)) is exponentially stable in a neighborhood 
f/. 

• The functions fP(x t )iY(x r ) are locally Lipschitz contin¬ 
uous in a neighborhood U with ^*(x ( r ,)f/'( xq) = 0 V/ = 
1, ■ ■■, rn. 

• h) + f/)w f, (£, ?;, v) is locally Lipschitz contin¬ 

uous in i), v. 

Then for < sufficiently small and for desired trajectories 
with derivatives small enough, the states of system (25) are 
bounded and the tracking errors satisfy 

IKII = lift - Vd,\\ < K( 


for some K < oo. 

Note: Since we know that /, g , h are smooth functions of 
x to start with, the functions Ii i (x e )u € (x v ) will be locally 
Lipschitz so long as the matrix ) is a smooth function of 
x* at each iteration of the approximation decoupling algorithm. 


Proof: From (22) and the fact that the desired trajectory 
and its derivatives are bounded (by bd), we get 

\\i\\ < IMI + Ad. (26 

The transformation that transforms into (£, rj) coordinates 
is a diffeomorphism, thus there exits /, > 0 such that 

IMI<UIKII + » (27) 

As the functions /^(r p )w f ‘(.r e ) are locally Lipschitz continuous 
with ft l {x t 0 )u*(x o) = 0 , there exists a positive constant 
such that 

||2P/7(x f )f/ e (j-' , )|| < l fi \\a e \\ (28) 

where fl(x r ) is the block diagonal matrix with ft{x e ) being 
its diagonal blocks. Since the zero dynamics 

V = 7(0, fl ) + P(0, v)u c (0, />, 0) (29) 

is exponentially stable, by a converse Lyapunov theorem [22] 
there exists, v(rj) and positive constants frj, A: 2 , A\j, k 4 such 
that 

kiWnW 2 <"(r'i) < k 2 \\v\\ 2 


dv 

Or) 


[f/(0, r'i) + P( 0, r'i)u c ( 0, 7 ), 0)] < -fcj|k)|| 2 


do 
c)rj 


< h\\v\\- 


Thus from the exponential stability of zero dynamics and the 
Lipschitz continuity of q + Pu K , we get 

v = i'i) + P(t n)n K {L V, i0] 

= f)) + -P(0, #;)«*■(0, r'i, 0)] 

+ v) + f))u K (t v, y) 

- {<?((>• v) + P(H '))«'((), n. 0 )}] 

<-A: a ||# + fc 4 ||v||/ t) (|K| + ||,||). 


To show the states of (25) remain bounded, let 


V(f>, if) = c J Pr -1- , il>v(ri) 


be a Lyapunov function for the system (25) where P > t 1 
satisfies the following Lyapunov equation 

a 7 p + ¥a = -i 


and 0 is a positive constant to be specified later. Then 

v = -IMI 2 + 2 tv T P0{s)u k {s) + 

<Hkll’ + r|klMMI 
+ fl-MMI 2 + MMlieil + Ill'll)] 

< -Ikll 2 + iIMKUIMI + b d + Ik/ll) + 0 

■ [-A- 3 |k/|| 2 4- M’/IMIMI + b d + („(|M| + b d )}] 

+( W </) 2 
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Let 


+ (d fl l x + ipk i l v (l + l v ))^f,f 

fj.fwnw _ WrM i + j »)\ 2 

A 2 ki ) 


+ i> 


(kJrMl + l v )Y 


k. 


INI 2 - ;j#s||»?ll a 


-Q-V,)l 

< -(^ - cf/W*) l|e|| 2 

- Q#a - (dpi, + # 4 /„( 1 + /„)) 2 ) ||f?|| 2 

+ {{Ifllfbd) 2 + '0* 


(W + ly )) 2 


___ 

4 (/ tf/; C + A ‘ 4 /^ (1 + /,,)) 2 


this theory. Although this approximate algorithm is fiasco 
the Descusse-Moog dynamic decoupling algorithm, a siuu 1 
algorithm based on the dynamic extension algorithm (sec !■; 
Chapter 7J) can be worked out in similar fashion. 

Appendix 

Existence of an Analytic Matrix G k (x) 

The proof of existence of G k (x) is given by Descussc and 
Moog in |6]. Thus we have a matrix whose c numerical 
rank is w k and the last m - r k columns are identically zero. 

From the definition of t numerical rank of A*(;r), it is 
possible to find a w k x w k minor, say, A*(a;), such that all 
the singular values of A k (x) are bigger than r in V. Without 
loss of generality, we assume that & k (x) is the block formed 
by the first w k rows and the first w k columns of A k (tr), and 
let 6(x) represent its determinant. 

By definition, any minor of A k (x) having size bigger than 
w k , will have at least one singular value smaller than r 
uniformly in U. Thus the determinant of this minor will be 
of the order of e x <5 (;/:). Thus we get 


Then, for all </ ; < 4 } and all t < min ('</?, 1/4/^/j), we have 

^3 


Thus V < 0 whenever \\fj\\ or ||c|| is large. This implies that 
||f/|| and ||e|| and also, ||£|| and ||:/;|| are bounded. The above 
analysis shows that if wc choose the initial condition suffi¬ 
ciently close to :r[, and b,i sufficiently small, we can guarantee 
that the states will remain in U. Using the boundedness of x 
and the continuity of fi l (x)u v (: r), we see that 


f v'(;r) = AV Hh cfr(x)u e (x) 


: Aij(x) 

A k .(x) : : : 

del : 

An{x) ■■■ : Aij(x) . 

Vi G {w k + 1, —, to} , Vji e { v.i k + 1, ■ • ■, r fc ). (31) 
Consider the top left r k x »> block of A k (x). We get 


A 

6(x)Aij(x) + y^A/(:r)A 1 /(x) = order r x 6(x), 

i-1 


are rn SI SO exponentially stable linear systems driven by order 
(. input. Thus we conclude that the tracking errors c r converge 
to a ball of order c. □ 


VI. Conclusion 

A numerically robust algorithm for input-output decoupling 
of nonlinear dynamical systems has been proposed. This algo¬ 
rithm provides low gain, practically implementable controllers 
shat do not cancel far off zeros. The use of this algorithm for 
'■» slightly nonminimum phase system (i.e., one that has far off 
right-half plane zeros), results in an overall stable closed-loop 
ystem. It is shown that the tracking controllers constructed by 
sing this approximate decoupling algorithm result in bounded 
racking with stability. Controllers based on this theory already 
xist for a few specific examples in the literature and this paper 
an be thought of as an attempt to formalize the techniques 
>sed in those particular examples. Detailed calculations to 
stablish one-to-one mapping between the neglected dynamics 
md the formal structure at infinity are still to be worked out. 
Hie ongoing work on a CAD package called AP.LIN ([23]) 
>ascd on this approach will help automate the application of 


V* e {w k + 1, * ■ ■, m), Y; € {1, ■ • •, r k } (32) 

where Aj(:r) is the cofactor of Au(x), calculated with respect 
to the top left r k x r k block of A k (x). 

Define the elementary column operation by 

gj(x): = 

P Ai(ar) 

Ar.(x) 

0 


L 0 U 

where fi(.r) is in the j th row and column. The first r k - ] 
elements in the ,/th column of A k {x)gj(x) will be zero. The 
r k l element will be the determinant given by (32), which is 
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of the order of e. The rest of the elements in this column 
ifiust of the order of e, or else there will be a minor of 
/-having' all its singular values more than c and 
having more than w* columns and rows. This will contradict 
the definition of the c numerical rank of a matrix. Thus this 
particular procedure makes the elements of jth column of the 
order of e as compared with i(ar). We can have r* - Wf. 
matrices of these form making one column of Ak(x)gi(x) 
small at a time. It is clear that the matrix Gk[x) will thus be 
nonsingular, square, and an analytic function of x. 

Thus Gk{x)'.= Gk(x)Gk(x)Pk(x) is a square invertible 
matrix of analytic functions of x. 
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A Generalized Orthonormal Basis 
for Linear Dynamical Systems 

Peter S. C. Heuberger, Paul M. J. Van den Hof, Member, IEEE, and Okko H. Bosgra 


Abstract —In many areas of signal, system, and control theory, 
orthogonal functions play an important role in issues of analysis 
and design. In this paper, it is shown that there exist orthogonal 
functions that, in a natural way, are generated by stable linear 
dynamical systems and that compose an orthonormal basis for 
the signal space f!J, To this end, use is made of balanced re¬ 
alizations of inner transfer functions. The orthogonal functions 
can be considered as generalizations of, e.g., the pulse functions, 
Lagucrre functions, and Kautz functions, and give rise to an 
alternative series expansion of rational transfer functions. It 
is shown how we can exploit these generalized basis functions 
to increase the speed of convergence in a series expansion, 
i.e., to obtain a good approximation by retaining only a finite 
number of expansion coefficients. Consequences for identification 
of' expansion coefficients are analyzed, and a bound is formulated 
on the error that is made when approximating a system by a finite 
number of expansion coefficients. 


1. Introduction 

C ONSIDER a linear time-invariant stable discrete-time 
system G, represented by its proper transfer function 
(r(z) in the Hilbert space Hi, i.e., G{z) is analytic outside 
the unit circle. \z\ > 1. A general and common representation 
of G(z) is in terms of its Laurent expansion around c = r»c, as 

X 

G(z) = '}TG k z- k ( 1 ) 

A =0 

with {G* f a — 0 , 1 , the sequence of Markov parameters. 

In constructing this series expansion we have employed a 
set of orthogonal functions: { 2 °, z ~ ] , z~ 2 , - • - }, where orthog¬ 
onality is considered in terms of the inner product in H 2 ‘ In 
a generalized form we can write (l) as 

T£- 

G(z) = Y,Lkh(~) ( 2 ) 

* *=o 

v ith {/fc(s)}fe=o,i 1 2 , a sequence of orthogonal functions. 

Munuscript received November 17, 1992, revised December 3, 1993. 
commended by Pusl Associate Editor, B. Pasik-Duncan. This work was 
jpporicd in part by Shell Research B.V., The Hague, and the Center for 
‘dustnal Control Science, The University of Newcastle, Newcastle NSW, 
istralia 

P S C. Heuberger was with the Mechanical Engineering Systems and 
'nlrol Group, Delft University of Technology, 2 628 CD Delft, The 
^therlands and is now with the Dutch National Institute of Public Health 
d Environmental Protection (RIVM), P.O. Box 1, 3720 BA Bilthovcn, The 
therlands. 

P M J. Van den Hof and O. H. Bosgra are with the Mechanical Engineering 
vstems and Control Group, Delft University of Technology, Mekelweg 2, 2 
'8 CD Delft, The Netherlands. 

IEEE Log Number 9408269. 


There are a number of research areas that deal with the 
question of either approximating a given system G with a 
finite number of coefficients in a series expansion as in (2), or 
(approximately) identifying an unknown system in terms of a 
finite number of expansion coefficients through 

N 

A - 0 

The problem that will be analyzed in this paper is the 
following. 

Can we construct a sequence of orthogonal basis functions 
-X with £ W 2 , such lhal 

a) to some extent, the basis can be adapted to a linear 
stable system G to be described, implying that G can 
be accurately described by only a small number of 
coefficients in the expansion, and 

b) the basis allows the construction of an error bound for 
the approximation of a linear stable system G by a finite 
length expansion in the basis f k i.e., an upper bound 

on ||G(~) - 52 k *0 II some prechoscn norm, 

whenever G and G do not match exactly. 

The use of orthogonal functions with the aim of adapting 
the system and signal representation to the specific properties 
of the systems and signals at hand has a long history. The 
classical work of Lee and Wiener during the 193()\s on network 
synthesis in terms of Laguerre functions [24|, |46| is summa¬ 
rized in [25]. Laguerre functions have been used in the 1950’s 
and 1960’s to represent transient signals [45), |7|. During 
the past decades, the use of orthogonal functions has been 
studied in problems of filter synthesis [22|, |30| and for system 
identification [23], f32], 131], [6] and approximation [351, [36]. 
In these approaches to system identification, the input and 
output signals are transformed to a (Laguerre) transformed 
domain and standard identification techniques are applied to 
the signals in this domain. Data reduction has been the main 
motivation in these studies. Identification of continuous-time 
models with the aid of orthogonal functions is considered 
in e.g., [38] and [291. In recent years, a renewed interest 
in Laguerre functions has emerged. The approximation of 
(infinite dimensional) systems in terms of Laguerre functions 
has been considered in [27J, [28], [12], [13], and [13]. In the 
identification of coefficients in finite length series expansions, 
Laguerre function representations have been considered from 
a statistical analysis point of view in [43], [42], and [16|. 
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The use of Laguerre-function-based identification in adaptive 
control and controller tuning is studied in [47] and [9]. A 
second-order extension to the basic Laguerre functions using 
the so called Kautz functions [21] is subject of discussion in 
[41] and [44]. 

In this paper we will expand and generalize the orthogonal 
functions as basis functions for dynamical system representa¬ 
tions. Specifically we will generalize the Laguerre functions 
and Kautz functions to a situation where a higher degree of 
flexibility is present in the choice of basis functions, and 
where consequently a smaller error bound as meant in part 
b) of the problem can be obtained. Laguerre functions are 
specifically appropriate for accurate modeling of systems with 
dominant first-order dynamics, whereas Kautz functions are 
directed toward systems with dominant second-order resonant 
dynamics. The generalized basis functions, introduced in this 
paper, will be suited also for systems with a wide range of 
dominant dynamics, i.c., dominant high frequency and low 
frequency behavior. 

We will restrict attention to the transfer function space H 2 
being equipped with the usual inner product. This choice, 
rather than the Hoc -space where orthogonality is abandoned, 
is motivated by the fact that our main intended application oi 
these results is in the area of approximate system identification. 
As the main stream of approaches in system identification is 
directed toward prediction error methods and the use ot least- 
squares types of identification criteria, [26], the choice of a 
two-norm is quite straightforward and natural in this respect. 

Note that the two problems a) and b) should be treated as a 
joint problem. One of the (trivial) solutions to problem a) only 
is the use of a Gram-Schmidt orthogonalization procedure on 
the impulse response of the system (7 itself [1]. In that case 
the system can be described by a series expansion of only one 
single term. In this situation, however, no results are available 
for part b) of the problem. 

In an identification context, the use of the orthogonal 
functions as in (1) leads to the so-called finite impulse response 
(FTR)-model [26] 

v 

.,(/) = £ G*(tf)u(/-fc) + e(0 (4) 

Ar=0 

where f(/) is the one-step-ahead prediction error, and 
{y(t ), u(0} are samples of the output, input of the dynamical 
system to be identified. The identification of the unknown co¬ 
efficients {Gfr(0)h=o, ,jv through least squares minimization 
of e(1) over the time interval is an identification method that 
has some favorable properties. First, it is a linear regression 
scheme, which leads to a simple analytical solution; second, it 
is of the type of output-error-method, which has the advantage 
that the input/output system G(z) can be estimated consistently 
whenever the unknown noise disturbance on the output data 
is uncorrelated with the input signal |26]. 

It is well known, however, that for moderately damped 
systems, and/or in situations of high sampling rates, it may take 
a large value of /V, the number coefficients to be estimated, to 
capture the essential dynamics of the system G into its model. 


If we would be able to improve the basis functions in such i 
way that an accurate description of the model to be estimate 1 
can be achieved by a small number of coefficients in a serk. 
expansion, then this is beneficial from both aspects of bias and 
variance of the model estimate. 

For the series expansion in (1) with /*, = it j s 
straightforward to show that a system G will have a finite 
length series expansion if and only if all system poles arc ai 
z = 0. Moreover, in the scalar case the length of the expansion, 
i.e., the index of the last nonzero coefficient, equals the total 
number ot poles at z = 0. 

As a generalized situation, we can consider Laguerre poly¬ 
nomials [37] that are known to generate a sequence of orthog¬ 
onal functions [14] 

A(') = H < L (5) 

Similar to above, a system G will have a finite length senes 
expansion if and only if all system poles are at z =■ w, with 
the length of the expansion being equal to the total number 
of poles at z = a. 

In dealing with the problem of finding similar results for any 
general stable dynamical system G(z), we have considered 
the question ot whether a linear system in a natural way 
gives rise to a set of orthogonal functions. The answer to 
this question appears to be affirmative. It will be shown that 
every stable system gives rise to a complete set of orthonormal 
functions based on input (or output) balanced realizations, 01 
equivalently based on a singular value decomposition of a 
corresponding Hankel matrix. These generalized orthogonal 
basis functions will be shown to provide solutions to problems 
a) and b). 

in Section III we will first briefly slate the main result of 
this paper. Next in Section JV it will be shown how innei 
functions generate two sets of orthonormal functions that aie 
complete in the signal space ( 2 . This is the basic ingredient of 
the main result. Next an interpretation of these results is given 
in terms of balanced state-space representations. After showing 
the relations of the new basis functions with existing ones, \\e 
will focus on the dynamics that implicitly arc involved in the 
inner functions generating the basis. It will be shown that if the 
dynamics of a stable system match the dynamics of the innei 
function that generates the basis, then the representation 1 1 
this system in terms of this basis becomes extremely simpk 
Consequences for a related identification and approximatu 1 
problem are discussed in Section VIII. 

Due to space limitations, a complete statistical analysis 
the related system identification problems thai result fro 1 
these basis functions can not be given in this paper. ^ 
statistical analysis along similar lines as [43] and [441 s 
presented elsewhere [39]. 

The proofs of all results are collected in an appendix. 

II. Preliminaries 

We will use the following notation. 

(.) r Transpose of a matrix. 
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'■r 

U^/;Xrr» 

/+ 

f 2 [0- oo) 
^ lX7, [0,oo) 




«wS x,n 


IHU 

Krr(.) 


W(G) 


/» 


Complex conjugate transpose of a matrix. 
Set of complex-valued matrices of 
dimension p x m. 

Real-valued matrix with dimension p x m. 
Set of nonnegative integers. 

Space of squared summable sequences on 
the time interval Z+. 

Space of matrix sequences 
{Fk £ C m Xri }^=(, i i l 2 , such that 
£r=o tr(F£F k ) is finite. 

Set of real p x m matrix functions, 
analytic for \z\ > 1, that are squared 
integrable on the unit circle. 

Set of real rational p x m matrix 
functions, analytic for \z\ > 1, that are 
squared integrable on the unit circle. 
Induced 2-norm or spectral norm of a 
constant matrix, i.e., its maximum singular 
value. 

-norm. 

Vector-operation on a matrix, slacking its 
columns on top of each other. 

Kronecker matrix product. 

(Block) Hankel matrix related to transfer 
function G — defined by 

H, } (G) = being the (/, j)-block 

element. 

/th Euclidian basis vector in IR". 
it x n Identity matrix. 


In this paper we will consider discrete-time signals and 
systems. A linear time-invariant finite-dimensional system will 
be represented by its rational transfer function C E 7?7Y ; 2 ,Xm , 
with w the number of inputs in m, and p the number of outputs 
in \f. State-space realizations will be considered of the form 


.r(A- + 1) ~ Ar(A-) -f Bv(k) (6) 

y(k) = Cjr{k) + Du(k) (7) 

vMth A 6 C 7,X7 \ B E C n *”\ C E C vx71 , and D E C' Xm . 
( is an 7/-dimensional realization of G if G(z) = 

f l — A)~ l B + I). A realization is stable if all eigenvalues 
A lie strictly within the unit circle. If a realization is 
tblc, the controllability gramian P and observability gramian 
( are defined as the solutions to the Lyapunov equations 
PA* + BB* = P and A*QA + C*C = Q. respectively. A 
ible realization is called (internally) balanced if P = Q = £, 
ilh £ = diay((T\< ■ ■ ■, a n ), > ■ ■ ■ > a diagonal matrix 
th the positive Hankel singular values as diagonal elements, 
stable realization is called input balanced if P = /, Q — £ 2 , 
id output balanced if P = £ 2 , Q = 7. 

A system G E TlH\ xm is called inner if it satisfies 
' r (z~ 1 )G(z) = /. As G is analytic outside and on the unit 
' rcle, it has a Laurent series expansion 52^1 0 £?**“*. 


III. The Main Result 

We will start the technical part of this paper by giving q. 
basic result first and then consecutively give the analysis tlu 
provides the ingredients for making the result plausible. 

Theorem 3.1: Lei G be an 77 / x m inner transfer function 
with McMillan degree n > 0, having a Laurent expansion 
G(z) = J2kLo G kZ~ h and satisfying ||G'o|| 2 < 1, and lei 
(A, B , C\ D) be a balanced realization of G(z). Denote 

V k {z) = .'(zI-A)- l RG k {z). (S) 

Then the set of functions n%k ^ ^ consti¬ 

tutes an orthonormal basis of the function space 

A direct consequence of this theorem is the following 

corollary. □ 

Corollary 3.2: Let G be an inner function with McMillan 
degree n as in Theorem 3.1, with a corresponding sequence 
of basis functions V k (z). Then for every proper stable transfer 
function H E X? 2 ’ xm there cxisl unique D s e R' ,X7? \ and 
L = {L k \6 /$*"[(), hc), suctl that 

rx. 

JHs)~D, + *~'Y i L k \\{t). (9) 

k=[) 

We refer to Lk as the orthogonal expansion coefficients 
of II(z). □ 

Note that due to the fact that \ 4(c) is an n x m-malrix of 
transfer functions, the dimension of each L* is p x n. 

IV. Orthonormal Functions Generated 
by Inner Transfer Functions 

In this section we will show that a square and inner transfer 
function gives rise to an infinite set of orthonormal functions. 
This derivation is based on the fact that a singular value 
decomposition of the Hankel matrix associated to a linear 
system induces a set of left (right) singular vectors that are 
orthogonal. Considering the left (right) singular vectors as 
discrete time functions, they are known to be orthogonal in 
sensc, thus generating a number of orthogonal functions being 
equal to the McMillan degree of the corresponding system. 
We will embed an inner function with McMillan degree n 
into a sequence of inner functions with McMillan degree A //, 
for which the left (right) singular vectors of the Hankel matrix 
span a space with dimension kn. If we let A■ —> 'x the set 
of left (right) singular vectors will yield an infinite number of 
orthonormal functions, which can be shown to be complete 
in H 2 - 

First we have to recapitulate some properties of inneir 
transfer functions. 

Proposition 4.1: Let G(z) be an inner transfer function 


with a Laurent expansion G(^) = 

iXo Then 


no 

EgJ+.gw 

for 7 = 0; 

(10) 

fr=0 



= 0 

for 7 > 0. 

(11) 


□ 

The Hankel matrix of an inner transfer function has some 
specific properties, reflected in the following two results. 



454. 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. 3, MARCH 19' i 


Proposition 4,2: Let G(z) be an inner function with 
McMillan degree n > 0. Then a singular value decomposition 
(svd) of H(G) satisfies 

H{G) = UqVq 

with Up, Vq € C°°* n unitary, 1 and the pair (Do, Vo) is unique 
modulo postmultiplication with a unitary matrix T 6 C nXn . □ 
The proposition states that an inner transfer function has all 
Hankel singular values equal to one. For continuous-time sys¬ 
tems, this is proven in {11]. The discrete-time version follows 
straightforwardly by applying a bilinear transformation. 

Proposition 4.3: Let G(z) be a square inner function, hav¬ 
ing a Laurent expansion G(z) = G k z~ k . Denote the 
Mock Toeplitz matrices 


Gq 

Gi 

G'2 


0 

G 0 

Gi 

g 2 

0 

0 

Go 

G 1 ••• 

• 


0 

Go •• 

: G 0 

0 

0 

■ 

G L 

Go 

0 


g 2 

G k 

Go 


: 


Gy 

Go ••• 

. : 

• 




( 12 ) 


(13) 


Then 


i) T v Tl = T^T U = /; 

ii) T v \o = TjUo = 0, for any unitary matrices Uq, Vq 

satisfying UqVq = TL[G). □ 

Lemma 4,4: Let G(z) be a square inner function with 
McMillan degree n. Then for all h £ Z + , G k (z) is an inner 
function with McMillan degree kn. □ 

The result in the lemma is quite straightforward if one 
realizes that an inner function has all poles within the unit 
circle, and all zeros outside the circle. 

Considering Proposition 4.2, it follows that the rows of VJ* 
and the columns of (7o, are n mutually orthonormal vectors of 
infinite dimension. Additionally Lemma 4.4 shows that we can 
construct an inner transfer function with increasing McMillan 
degree, by repeatedly multiplying the transfer function with 
itself, and thus implicitly creating an increasing number of 
orthogonal vectors. The following result shows how we can 
increase this number of vectors, by embedding the svd of 
H{G) into a sequence of svd’s of H(G k ). 

Theorem 4.5: Let G(z) be a square inner function with 
McMillan degree n > 0. Then 

a) There exist unitary matrices £/*, V t £ C ocXn , i = 
0, l, ■ ■ ■, such that for every 0 / k e Z+, the matrices 


Tfc = [ Uy Uq) and 


Ti = 


r K 1 

v i 


(14) 


(15) 


L vsl.j;.. 

1 With slight abuse of notation we will use this notation to indicate an 
operator C" —* /a|(). oo). 


constitute a singular value decomposition of H(G k 
through 

n(G k ) = nr k . (i6) 

b) The matrix sequence (I/,, V,} f= o,i r is unique up to 
postmultiplication of each Ui and Vi with one and the 
same unitary matrix. 

c) Let G(z) have a Laurent expansion G{z) r_ 

Git" 1 , and consider the block Toeplitz matrices 
T u , T v as in (12), (13) then the matrix sequence 
{Ui*Vi}i= o.i,- satisfies 

V£ = VUT v (17) 

Uk = T u Uk-\ for A; = 1,2, ■ • ■. (18) 

□ 

The theorem shows the construction of orthogonal matrices 
IX F£ that have a nesting structure. The suggested svd of 
TL(G k ) incorporates svd’s of H(G l ) for all i < k. In this 
way orthogonal matrices T° k and F k are constructed with an 
increasing rank. Note that the restriction on the structure of 
the consecutive svd’s is so strong that, according to b), given 
a singular value decomposition H(G) = UqVq, the matrix 
sequence V/, i — 1,2, ■ ■ ■} is uniquely determined. Nole 
also that there is a clear duality between the controllability 
part T£ and the observability part T k , To keep the exposition 
and the notation as simple as possible we will further restrict 
attention to the controllability part of the problem. Dual results 
exist for the observability part. 

Proposition 4.6: Let G{z) be an rn x rn inner function with 
McMillan degree n > 0, and consider any sequence of unitary 
matrices {V/}i=o,i, satisfying (17) in Theorem 4.5. Denote 
for k £ Z+ 

oc 

V k {z) = Awith M k (i) e C nxm defined by 
?-o 

V k * =: [A4(0) Mfc(l) M,(2) (19) 


Then 


V k (z) = V 0 (z)G k (z). 


D 

The proposition actually is a z-transform-equivalent of 
the result in Theorem 4.5. It shows the construction of the 
controllability matrix T c k . 

In the next stage we show that this controllability matri> 
generates a sequence of orthogonal functions that is complet 
in q. 

Theorem 4 7: Let G(z) be an m x rn inner function wit: 
McMillan degree n > 0, such that ||Go ||2 < 1; consider i 
sequence of unitary matrices {V^} ,— 0 , 1 ^.. as meant in Theorer 
4.5. For each k £ Z + consider the function 4>k - Z+ —► G' 
defined by 

[<M0) 0fc(l) 0fc(2) ■■■].= v k *. 

Then the set of functions \V(G) := {0fc}£L o constitutes ai 
orthonormal basis of the signal space E 
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! 



j i I Series expansion of a transfer function in lernis of an orthonormal 


Remark 4.8: This basis has been derived from the singular 
value decomposition of the Hankel matrix 7i(G). As stated in 
Proposition 4.2 this svd is unique up to postmultiplication of 
l> [h V {] with a unitary matrix. Consequently—within this con¬ 
text—both V£, V k (z)* and the corresponding basis functions 
} are unique up to unitary premultiplication. 

For use later on we will formalize the class of inner 
I unctions that have the property as mentioned in the previous 
theorem. 

Definition 4.9: We define the class of functions: Q ] := (all 
square inner functions (7 with McMillan degree > 0 such that 
i|(»'(iii» < 11. □ 

As a result of the fact that the proposed orthonormal 
functions constitute a basis of each square inner function 
generates an orthonormal basis that provides a unique transfor¬ 
mation of fJj-signals to an orthogonal domain. Similarly, when 
given such an orthonormal basis, each stable rational function 
can be expanded in a series expansion of basis functions V k (z) 
as defined in Proposition 4.6. 

Corollary 4.10 Let G G (7|, and let T'(G) be as defined 
in Theorem 4.7. Then 

a) For every signal ./■ £ ^[O, x) there exists a unique 

transform A' - {A7 }a=oj, € suc h that 

X. 

./•(/) = Y^XkM0- ( 20 ) 

A-0 

Wc refer to AT G C ,{ * " as the orthogonal expansion 
coefficients of ,r. 

b) For every proper stable transfer function H(z) G 

there exist unique D s G and L = {La }a=o,i, £ 

n ,Xrl [(),Lx), such that 

rXj 

7/(s) = D, + 5- 1 53l*V*(s). (21) 

A-U 

We refer to D s ,L k as the orthogonal expansion coeffi¬ 
cients of H(z). □ 

Wc will refer to the sequence {Va (*)}*=(],i, * as defined 
Proposition 4.6. as the sequence of generating transfer 
ictions for the orthonormal basis ty(G). 

The series expansion as reflected in (21) is schematically 
1 dieted in the diagram in Fig. 1, where q reflects the time 
1 ift, qu(i) = u(t 4- 1). 

To find appropriate ways to calculate the orthogonal func- 
ns, as well as to determine the transformations as meant in 
1 * corollary, we will now first analyze the results presented 
far in terms of state-space realizations. 


V. Balanced State-Space Representations 


To represent the orthogonal controllability matrix in .» M 
space form, we will use a balanced state-space realization oi < 
We first present the following, rather straightforward, lemma 

Lemma 5.1: Let G be a square inner transfer function 
with minimal realization (A,B,C,D). Then the realization 
is (internally) balanced if and only if it is both input balanced 
and output balanced. [j 

Next we examine how the property that a transfer function 
is inner, is reflected in a state-space realization of the function. 

Proposition 5.2: Let G be a transfer function with realiza¬ 
tion (A, B , C, D ), such that (zl, B) is a controllable puir, and 
the realization is output balanced, i.e., A*A 4 - G*C = I. Then 
6 ?t (^‘ 1 )G( 2 ) = J if and only if 

i) D*C + B*A = 0 , and 

ii) D'D + B*B = 1. □ 


Note that for this proposition there also exists a dual 
form, concerning the transfer function G 1 with realization 
(A*. C*, U*, D*), that can be applied if G is square inner. 

The characterization of the inner property in the above 
proposition is made for output balanced realizations. Since, 
according to Lemma 5.1 output balancedness is implied by 
balancedncss, it also refers to balanced realizations. 

The class of functions Q\ can simply be characterized in 
terms of a balanced realization. 

Proposition 5.3: Let G be an in x m inner function with 
minimal balanced realization (71, B, (L D). Then G G if 
and only if rank B = m, or equivalently rank ( 1 - m. □ 

The following proposition shows that we can use a balanced 
realization of G to construct a balanced realization for any 
power of G. 

Proposition 5.4: Let G be a square inner transfer function 
with minimal balanced realization (A, BALD) having state 
dimension n > 0. Then for any k > 1 the realization 
(A k .B k .C h ,D k ) with 


A k 


B k 


’ A 0 ■■■■() 

BC A 0 0 

BDC BC 1 0 

: : .’•.() 

BD k ~ 2 C BD k ~ l C BC A 
‘ B " 

BI) 

BD 2 


( 22 ) 


(23) 


Lad*- 1 ] 

G k = [D k ~ x C D k ~ 2 C ■ DC C] (24) 

D k = D k (25) 


is a minimal balanced realization of G k with state dimension 

n ■ k. E2 

Examining the realization in the above proposition, reveals 
a similar structure of observability and controllability matrices, 
as has been discussed in the previous section; e.g., taking the 
situation k = 2 , it shows that the controllability matrix of 
(A 2 ,B 2 ) contains the controllability matrix of (A,B) as its 
first block row. 
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Proposition 5.5: Let G(z) be an n x m inner transfer 
function with McMillan degree n > 0, whose Hankel matrix 
has an svd H{G) = C/ 0 V 0 *; let (A,B,C,D) be a minimal 
balanced realization of G such that V 0 * = [B AB A 2 B ■ • • 
Then the unique sequence of orthogonal matrices }jt=i, 2 , 
as considered in Theorem 4.5 is determined by 

n = [B k A k B k A\B k • • ■ ] (26) 

with A kl Bk as defined in (22), (23). □ 

The above result shows how a minimal balanced realization 
of G actually generates the sequence of orthogonal matri¬ 
ces TJ, the rows of which are the basis functions in our 
orthonormal basis of Pj. 

We will show that there exist recursive formulae for con¬ 
structing the orthogonal functions. 

Proposition 5.6: Let G T be an inner function, G € Q i, and 
consider the assumptions and notation as in Theorem 4.5 and 


Proposition 5.5. Denote 

A r = BC and (27) 

P any matrix satisfying 

PB = BD. (28) 

Then the elements of r* fc are determined by the following 
recursive equations 

Afo(0) = B (29) 

A 

Af*(i + l) = AMk(i)+^2 P ] ~ 1 XMk- J (i), i > 0; (30) 

;=i 

A4(0) = m-i(0) (31) 

with as in (15) With 

VC = [Affc(O) AMI) M k ( 2) ■■■] 

as in (19). □ 


The recursive equations show how we can simply construct 
the set of orthogonal functions. Note that the matrix P in 
(28) is nonunique. The result (29)-(31) however is unique. A 
straightforward choice for P satisfying (28) is 

P = B I){B* B)~ l B*. (32) 

Note that, as a result of Proposition 5.3, the matrix B*B is 
invertible whenever G e Q\. 

The orthogonal functions #( G) generated by an inner func¬ 
tion G can be represented in terms of their generating functions 
V k (z ), as defined in Proposition 4.6. These generating transfer 
functions can also be realized in terms of a minimal balanced 
realization of G. This is reflected in the following theorem. 

Theorem 57: Let G be an inner function, G e Q\, with 
a minimal balanced realization (A,£J,C\ D). Let this inner 
function generate an orthonormal basis with corresponding 
generating functions I4(z), as defined in Proposition 4.6. 

1) Let F be a matrix determined by 

(33) 


with X defined in (27) and P any matrix satisfying (28 
Then, for k 6 Z+, 

a) V k (z) = {{zI-A)- l F{I-zA*j\ k z{zI-A)~'L 

b) Vjt(z) is unique, i.e., it is not dependent on th* 
specific choice of P in (28). 

2) If there exists a matrix K such that B — RC *, then 
F = R satisfies the conditions of Part 1 of this theorem 

a 

Now we come to the construction of a series expansion of 
any stable proper rational transfer function, in terms of the 
new orthonormal basis. 

Theorem 5.8: Let G be an inner function, G € Cj, with 
a minimal balanced realization D). Let this inner 

function generate an orthonormal basis with corresponding 
generating functions V k (z), as defined m Proposition 4.6. Lei 
H be any proper and stable transfer function with a 


minimal realization (A*, B M < C„, D H ), Then 

nu 

H(z) = D, + z~ i Y j ^V k (z) ( 34 ) 

frr-0 

with L k E C p * n determined by 

L k = C\Q k ( 35 ) 

Qi> = (36) 

Q,+i = A^A* + AnQiF* - Q.AF * ( 37 ) 

with F as defined in (33). □ 


In Section VII we will show that specific choices of G(z) 
in relation with /f(z), i.e., specific relations between the inner 
function G producing the orthonormal basis and a transfer 
function H that should be described in this basis, will lead lo 
very simple representations. 

VI. A Generalization Of Classtcai Basis Functions 

In this section we show three examples of well-known 
sets of orthogonal functions that are frequently used in the 
description of linear time-invariant dynamical systems and that 
occur as special cases in the framework that is discussed in 
this paper. 

Pulse Functions 

Consider the inner function G(z) = z* 1 , G £ Q^. Th 
Hankel matrix of G satisfies 

"1 0 ■■■ ■ 0" 

0 0 0 -0 

K{G) - () () () () 

'!] 

= J [1 0 0 ■■■] = l/ 0 V 0 \ (.V 

As a result Vo(z) = 1, and with Proposition 4.6 the generatin 
transfer functions V k (z) satisfy V*.( z ) = G k (z) = z~' 
k = 0.1, • • •. The corresponding set of basis functions 


F^X-PA 



I 
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determined by <t>k(t) = b(1 - k) with b(r) the Kronecker 
,-na function. 

The inner function G can be realized by the minimal 
i .unced realization (A,B,C<D) = (0, 1, I, 0). The equation 
/ I! =* BD is satisfied by P = 0, and the corresponding 
u >ult for F is F = BC — 1. Applying Theorem 5.8 shows 
ilie classical result that L k = C n A*B H . 

iai'iterre Functions 

Consider the inner function G(z) = -——, with some real- 

z — a 

valued a, |a| < 1, and denote // = 1 - a 2 . A minimal balanced 
realization of G is given by (A, B , G\ D) = («, ^/T/, v /7/, -a). 
Equation P# = BD is satisfied by P — -a, leading to 
f — BG - PA = rj 4- a 2 = 1. Taking account of the fact 
that for one-dimensional scalar G\ A//<.(v) = </»^(/), it follows 
from Proposition 5.6 that 

</>o(0) = (39) 

k 

+ I) = U'MO + n)'’ '0* -,(/') (40) 

0fr(O) = -«0*-i(O). (41) 

These equations exactly match the equations that generate the 
normalized discrete-lime Laguerrc polynomials with discount 
factor m, 114|, |32]. 

The corresponding generating transfer functions \\(z) can 
he analyzed with the result of either Proposition 4.6 or 
Theorem 5.7 

= (42i 

This exactly fils with the formulation of the generating transfer 

functions of discrete-lime Laguerrc polynomials in, c.g., 123]. 


VII. Orthonormal Functions Originmiv, 
from General Dynamical Systems 

We have shown that any square inner transfei funetu»h 
G e Q) generates an orthonormal basis for the signal spate f\ 
One of the reasons for developing this generalized bases was 
to find out whether we can yield a more suitable representation 
of a general dynamical system, when the basis within which 
we describe the system is more or less adapted to the system 
dynamics. In view of the results presented so far, this aspect 
relates to the question whether we can construct an innei 
transfer function generating a basis that incorporates dynamics 
of a general system to be represented within this basis. 

There are several ways of connecting general transfer func¬ 
tions to inner functions, as e.g., inner/outer factorization |10|, 
15], normalized coprime factorization |8], [40], |33|, [4], or 
inner-unstable factorization 12]. Even if the corresponding 
inner functions are not square, they can always be embedded 
in a square inner function [II). In this paper, however, we 
will explore a different connection, where a general stable 
dynamical system with input balanced realization (.4, /J. C\ /)) 
will induce a square inner function through retaining the ma¬ 
trices (A, B) and constructing (f\ D) such that (.4. B<(\ D) 
is inner. This implies that the poles of the stable dynamical 
system arc retained in the corresponding inner function. The 
following result shows the existence and construction of such 
an inner function. 

Proposition 7.1: Let (/l, B) be the system matrix and input 
matrix of an input balanced realization of a transfer function 
H e 1\H!> y ' n with McMillan degree n > 0, and with rank 
B - m. Then 

a) There exist matrices C\D such that is a 

minimal balanced realization of a square inner function 
G € t/j. 

b) A realization (A. B.(\D) has the property mentioned 
in a) if and only if 


Kautz Functions 

Consider the inner function Glz) — — z —“—-— 

V z 2 + 6(r- l)z - r 

with some real-valued b,c satisfying \c\.\b\ < 1. 

A balanced realization of G(z) can be found to be equal to 


A - 


, n = 

0 

j1 — 

.<V(i 11,2 ) ~ hr 

IVd -<' 2 )J 

a = j 

1'1 7l ] J) = 




"■"h 7i = -M 1 ~/' 2 ) and ^ = \/(i -* J )(i - &*)■ 

With the expression for Vo(e) from Theorem 5.7-a) it 


: lows that 

*- l V Q (z) = 


x/0^ 3 ) 


z 2 + l>(r ~ 1): 


v/rn 7 ^) 

s - b 


(43) 


»ch exactly equals 


representing the orthogonal 


S' l (z,b.r) ii 

•uu functions, as represented in [41J, |44]. Postmulliplica- 
n with G k (z) is equivalent to the situation in the case of 
•utz functions. 


C = VB*{l n 1 + A) (44) 

D = V ■[«*(/„ -4 -1*) 'B(45) 

with 6 R mxm any unitary matrix. 

c) For a realization satisfying (44), (45) a valid choice of 
matrix F satisfying (33) is given by 

F=[I u +B(V-l m )(B*B)- l H*]U„ + A)U„ 4 A*)~\ 

(46) 

□ 

In the proposition all inner functions are characterized that 
can be constructed in the way as described above, by retaining 
the matrices (A, B) of any given stable system. Note that the 
extension C.D is not unique. The nonuniqueness is reflected 
by a possible unitary premultiplication of the inner function. 
Note also that when choosing U = I m , expression (46) reduces 
tO F = (In + A)(I n + A*)" 1 . 

We will now present a result that is very appealing. It 
shows that when we want to describe the dynamical system 
H in terms of the basis that it has generated, as presented 
in Proposition 7,1, then the series expansion in the new 
orthogonal basis becomes extremely simple. 
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Theorem 72: Let H € have an input balanced 

realization having all controllability indexes 

> 0. Let G € G\ be a square inner function with minimal 
balanced realization (A, 13,(7, D) such that A = A* and 
B ?= B*, generating an orthonormal basis with generating 


transfer functions V k (z). Then 


no 


H(z) = D a + z-'Y,L k V k (z) 

(47) 

k=0 


with 


Lo = C, and 

(48) 

L k = 0 for k > 0. 

(49) 


Proof: The proof follows by applying Theorem 5.8. With 
A — A# t B = B a (36) becomes Q {) = AQ^A* + BB*. 
Since (A,B) is input balanced, the solution to this equation 
is Q 0 = /, leading to L 0 = C*. Substituting Q 0 = / in (37) 
and using the stability of A shows that Q t = 0 for i > 0. □ 

The theorem shows that when we use a general stable and 
proper dynamical system to generate an orthonormal basis 
as described above, then the system itself has a very simple 
representation in terms of this basis. It is represented in a series 
expansion with only two nonzero expansion coefficients, being 
equal to the system matrices C 8 and TV 

In the next section we will discuss the results of this paper 
regarding their relevance to problems of system identification 
and system approximation. 

It has to be stressed that, so far, we have only used the 
generalized orthonormal basis to study the series expansion of 
a given stable transfer function. Similar to the case of the pulse 
functions and Laguerre functions, the presented generalized 
functions induce a transformation of /^-signals to a transform 
domain, compare e.g., with the 2 -domain when pulse functions 
are used. In this transform domain dynamical system equations 
can be derived, leading to transform pairs of lime-domain and 
orthogonal-domain system representations. In the case of a 
Laguerre basis, these kinds of transformations actually have 
been used frequently also in an identification context, by first 
transforming the measured input/output signals to the Laguerre 
domain, and consecutively identifying a system in this domain; 
see e.g., [22], 123], [32], [31]. 

For the generalized basis, results along these lines have been 
presented in [18], [19]. An analysis of the system transforma¬ 
tions between time domain and generalized transform domain 
is treated m |19] and [39]. 

VIII. System Approximation and Identification 

We will now discuss the way in which the introduced 
orthogonal basis functions provide a solution to problem b) 
as mentioned in the introduction, i.e., the quantification of an 
error bound for finite length expansion approximants. 

We will present results showing that the speed of conver¬ 
gence in an orthogonal series expansion can be quantified and 
that an increase of speed is obtained as the dynamics of system 
and basis approach each other. To formulate these results we 
need an alternative formulation of Theorem 5.8 in terms of 
Kronecker products. 


Proposition 8.1: Let H € JJ x m be a transfer functio 
with an input balanced realization (A*, B 9 , C a) D s ), and It 
(A,B) be an input balanced pair that generates an m x / 
inner transfer function G € 5i, leading to an orthonorm, 
basis ®(G). 

Then the orthogonal expansion coefficients L k satisfym 
H(z) = D a + z~ ] ^2r=o L kVk(z) are determined by 


Vtc(L k ) = ZX k Y (50, 

with 

Z = (/»C S )M- 1 (51) 

Y = Vn (B'B*) (52) 

A' = NM~' (53) 

M = / ® I - A® A a (54) 

N = FwA H - FA* ® I. (55) 


Note that due to (50) we can consider Vcr(L^) as a 
sequence of Markov parameters of a dynamical system with 
a state-space realization given by (A r . 0). By examining 
the eigenvalues of this realization, we create the possibility of 
drawing some conclusions on the speed of convergence of the 
series expansion. The following result is taken from [19]. 

Proposition 8.2- Consider the situation of Proposition 8.1 
with H(z) and G(z) having McMillan degree 7i H , /?, re¬ 
spectively, and m = 1. Let //,, i ~ 1, ■ , denote the 
eigenvalues of A«, and p r j ~ 1. - - -, denote the eigenvalues 
of A. The dynamical system Z(zl - A’)' l V has a realization 
(A in r tM Z o ,0) that satisfies 

a) X tJ has dimension 

b) A\, has eigenvalues A,, i — 1, ,// s that satisfy 


n 


w-n 

1 = 1 


/<! ~ Pj 
1 ~ /*»/>, 


(56) 


Since the proof of this proposition is somewhat outside the 
scope of this paper, the reader is referred to 119]. 

The above proposition shows that wc can draw conclu 
sions on the convergence rate of the sequence of expansion 
coefficients {V}A-u, » when given the eigenvalues of the 
original system H{z) and the eigenvalues of the inner function 
G(z) that generates the basis. Note that when the sets of 
eigenvalues {p t }, \p 3 ) coincide, then A, = 0, for all /, and 
consequently the sequence {L k } will have a finite numbci ol 
elements unequal to zero. The above result also enables lU 
determination of an upper bound on the error that is mad.. 
when we approximate a given system H(z) through a fim 
number of its expansion coefficients. 

Theorem 8.3: Consider the situation of Proposition 8.2, ai 1 
denote 

N~ 1 

H n (z) = D,+z~ 1 ^L k V k (z) 

k= 0 

and A := infix, |A,|. Then there exists a finite r £ R such th 
for any 7] 6 R, tj > A 

*.N + 1 
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Since A is 3 measure for the '‘closeness” of system dynamics 
■ id basis dynamics, the above theorem shows that the error 
m u is made when neglecting the tail of a series expansion, 
comes smaller as A becomes smaller. As a result, when 
( siricting to a fixed number of expansion coefficients, the 
t ipnroximation error gets smaller the more accurate the basis 
dynamics is “adapted” to the system. 

In the final pari of this paper we will briefly commeni 
on how these results could be employed in an approximate 
identification framework. As mentioned in the introduction, 
identification of a finite impulse model (FIR) (4), has some 
important advantages; however, it fails to be successful when 
the number of coefficients to be estimated becomes large. This 
may happen in situations of high sampling rates, moderately 
damped systems, as well as systems that have dominant 
dynamics in both the high-frequent and low-frequent region 
(e.g., multitime-scalc systems). An alternative way to attain 
the advantages of this identification method, is to exploit the 
model structure 

A — I 

«(/) - im + Y + e(t) c58> 

k 

where c(t) is the one-slep-ahcad prediction error, P(6)< Lfr(0) 
the parameterized expansion coefficients, and with V*(c) rep- 
lesenting an appropriately chosen basis. 

Note that this model structure can simply be written as 

A 1 

v(i) - 0(8) + Y i+(8)u k (i) + < 59 > 

A-0 

where n k (t) can simply he calculated by applying u(t) to the 
known- fillers !*(</), compare Fig. 1. 

Identifying i9 through least squares optimization of c(/)ovcr 
the lime interval, is a similar problem as in the case of a FIR- 
model. With appropriately chosen basis functions, however, 
the convergence rate of the scries expansion can become 
extremely fast; with only a few coefficients to be estimated 
J very accurate approximate model can be obtained. This 
is of course interesting and appealing from both aspects of 
bias (accurate approximation is possible) and variance (few 
paiameters to be estimated from data). An analysis of bias 
Ji:d variance errors in these identification schemes is presented 
>’< |W|. 

Additionally, when comparing these “orthogonal FIR” 
11 'del structures with nonlinearly parameterized model 
1 ucturc as e.g., a Box Jenkins or ARMAX model ([26]), we 
! oid problems of possible occurrence of local (nonglobal) 
’lima in the quadratic identification criterion. Moreover 
freedom in the choice of basis functions allows the 
1 ‘itful use of “a priori information” concerning the system 
uamics. 

Very often an identification experimenter has a -rough- 
owledge about the dynamics of the system under considera- 
n, e.g,, from previous experiments or from physical insight 
u ihe process dynamics. It would be favorable to exploit 


this knowledge in an identification procedure. The nu uv i 
suggested above, shows that this a prion knowledge uu k 
exploited in tow? of the basis functions that are cIiomv 
W hen we have -rough- knowledge about the poles o( the 
system, we can construct basis functions that are based on this 
set of poles. The more accurate the poles are, i.c., the more 
accurate our a priori information is, the better we can adapt 
the basis functions to the system dynamics. As a result, see 
Theorem 8.3, the estimated model can become more accurate 
when restricting to a prespecified number of coelhcients to be 
estimated. 

Effectively the identification problem now reflects the iden¬ 
tification of the mismatch between the system under considera¬ 
tion and the knowledge that already was available, represented 
in the basis functions. This actually is very appealing, as the 
priori information simplihes the identification procedure. Note 
that in the way described above, the a priori information does 
not have to be exact, i.e., it is not of the type of fixing a 
priori a constraint on the model parameters, as e.g., the steady- 
state gain. The information can be uncertain. The only result 
is that the more accurate it is, the more simple the system 
representation will be. 

This discussion also motivates the use of an iterative 
scheme, where the identification of parameters 0 is performed 
iteratively, using the model that is estimated in step i - l 
for constructing the basis functions for step t. An example of 
such an iterative scheme has been shown in |19]. 

One remark that has to be made in this respect, is a remark 
on the model order of a system represented by a finite number 
of expansion coefficients. The McMillan degree of this system, 
as in the case of an FIR-representalion. will generally be large. 
This results from the following observation. 

Proposition H 4 Consider the transfer function 

A - I 

//*(£)= f)+ Y 

A =0 

with l a (') the generating transfer functions of an or¬ 
thonormal basis ty((r), where the inner function G 6 £?i 
has a minimal balanced realization (A.BA\P) with 
dimension n. Then II s (z) has a state-space realization 
(A\ i, , A\ /)), with A \r defined in (22), 

(23), and K = [ to L\ t 2 - ■ ■ L;v-i ]■ D 

The proof of this proposition follows by inspection. 

With L, being the result of an unconstrained optimization 
in an identification procedure, the state-space dimension of 
the model will generically be equal to Nn. Consequently, if 
one wants to represent the model again in a traditional state- 
space form of low dimension, a model reduction procedure 
will have to be used to arrive at a reduced dimension. This 
also motivates a further analysis of the realization problem in 
terms of orthogonal expansion coefficients {La}- 

IX. Conclusions 

We have developed a theory on orthogonal functions as basis 
functions for general linear time-invariant stable systems. The 
basic ingredient is that every square inner transfer function m 
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a very natural way induces two sets of orthogonal functions 
that form a basis of the signal space 1%. The ordinary pulse 
functions and the classical Lagucrre and Kautz polynomials 
are special cases in this theory of inner functions. 

With this concept we have explored the connection between 
a general dynamical system and an inner function, by letting 
the inner function be determined through a specified set of 
poles. An important property of the resulting orthonormal 
functions is that they—to some extent—incorporate the dy¬ 
namic behavior of the underlying system. We have developed 
a theory on these system based orthogonal functions, both on 
an input-output level and in terms of balanced state-space real¬ 
izations. Furthermore we have shown how the alternative basis 
can be fruitfully used in problems of system approximation and 
identification, leading to simplified identification schemes, in 
which a priori knowledge about the process dynamics can be 
utilized by incorporating the information into the basis. 

Appendix 

Lemma Al: Let G(z), F(z ), and R(z) be stable transfer 
functions wit Laurent expansions G(z) = ]T£l 0 
F(z) = £r=o F k z-\ and R(z) = 0 R k z~ k . Then 

R(z) = F(z)G(z) if and only if 

[RvR, R 2 •■■] = [F 0 F 1 F 2 ■ • • ]T V (A. 1) 

with T v as defined in (12). □ 

Proof: The equality R(z) = (E£L 0 F kZ~ k ) (££L 0 
G k z~ k ) is equivalent to 0 R k z~ k = Efclo Ei=o 
FiG k ..i ]z~ k , and to R k = E*_o F iG k -i, which exactly 
matches (A.l ). □ 

Lemma A2 [11J: Given matrices X E C nxr, \ Y E C mxr , 
r > m, with XX* = YY*\ theh there exists a W E G mxr 
such that Y = XWmd WW* == 1. □ 

UmmaA3: Let C> = ii^rr w ( B = 

C^andEL/ ?') = (* Jit 1 )- □ 

Proof: By simple calculation. □ 

Lemma A4: Let G{z) be a square inner function such that 
||t?o|| < 1, with || ■ || any induced matrix norm. Let G k 
have a Laurent expansion G k (z) = G^z“\ Then 

l|Gi fc) ll < 1 ) for 0 < « < *- 1. □ 

Proof: The proof will be given by induction. For k = 1, 
validity is trivial. Suppose that the statement holds true for 
fc < n. Now we consider two cases, 
i) Consider G- n+1 \ where i < n 

G- n+1) = GqG^ + GiG^i + • • ■ + G i Gj ) n) ; 

||G- rt+l) || < 

< ll G o||||G, (n) || + \\G\ n _\\\ + ■ ■ • + ||G< n) || + ||G< n) 

<Fo||^^ii 1 )||Goir“ i + 

+ <n »-l 2) H G °l r " i+1+ 

+ ‘-- + ( n !l ) l | Go | r 1 + ||Go|r (A.2) 


<ncoir l - , ^( n n ^ J ‘) 

. 7—0 

= ||G 0 |r +1 'f + *) by Lemma A3. (A.3 

Tl 

ii) Consider the case i = n 

6t +1) = GoG^ + GxGjTJi + ■■■ + G n G ( 0 n) ; (A.4) 

ll G n" +1) |l < 

<ll G o|| + ||G^ 1 || + --- + ||G'" ) || + ||G' n) || 


< ll G o|| 


= ll G 


•'oil 


1 + 


1 + 


( 2 ’-i) + ( 2 ri 3 ) ++ 
+ (.- 0 +i ] 

( 2n - 1 )] 

=iig.ii( 2 ;). 

We have shown that ||G T - U+I ^| < ||Go|| n + 1_l ^ + % ^ for 

% < n< which proves the result. □ 

Lemma A5: Let G(z) be an rn x rn inner function such 
that ||Go|| < 1, with || ■ || any induced matrix norm. Let 
G k have a Laurent expansion G k (z) = G-^z“\ and 

Hankel matrix Uk H{G k ). Then for all / 

lirn max ||(njll fr ),j - Ml = U. 

A-—*oo j 

Proof: Consider /?.*(?■) — J3t=o IK*«^II- With Lemma 
A4 it follows that Rk('i) < 


t=() 


L<=o 


iiGoir 


k-i 


= ft i )\\Gu\\ k - i <(k+ 1)^00^. 


Since ||Go|| < 1, this implies that Rk(i) —>■ 0 for A; —► oc. 
Now consider the (i, j)-block element of TI£IIa with j > i- 

(n;n fc )« = E “ 0 ( G i?.)* G fl •= -Ei=I. 

(G[ k) r Consequently ||(II£ n k )„ -6 h J m || ■■= 

ii5:i=o (Gi k) r ii < e;= 0 iiGi fc) n = R k a - n 

—♦ 0 for k —► oo. 

For j < i it holds that Rk (j -1) < Rk(i - 1), which implic s 
that for all j, ||(Ii;TI fc )i. 7 - h 3 \\ < R k {i - !)■ ' 1 

Proof of Proposition 4. J: Denote Li = YltL o^k+i^ i% 
for i e Z , with Gj := 0, j < 0. Then G T (z~ l ) G(z ) = . 

Gj z e Er=o G kz~ k . This expression equals (Efc= > 

G k+J G k ) z 3 = Ejl-oo LjZ J . Since G is inner, G T {z~ ] > 
G(z) = /, and evaluation of the former expression for j > 1 
proves the result. 

Proof of Proposition 4.3: Part i) follows directly fro; ‘ 
Proposition 4.1. For Part ii), consider T V (H(G))*. Applyir^ 
Proposition 4.1, shows that T V (H(G))* — 0, which implit 
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; at T v V 0 U$ = 0 and T v V a U^U 0 = 0, leading to T v V 0 = 0. 
t he proof for T u follows analogously, employing the fact that 
, r (z) is inner too. □ 

Proof of Lemma 4.4: If G is inner, then for any k > 1, 
G k f (z- ] ) G k (z)=* (G k ~ l ) r (z-') G t (z~ x ) G(z) G k ~ l 
I (G k ~ 1 ) T (z~ x ) G k ~ l (z), and by induction it follows 
that G k is inner. A proof for the McMillan degree of G k is 
contained in the proof of Proposition 5.4. □ 

Proof of Theorem 4.5: 

Part A: A constructive proof will be given in three steps. 

i) The choices for Uj and Vf as in (17), (18) lead to 
matrices T£ and F* in (14), (15), that are unitary; 

ii) The constructed matrix F£F£ has a block Hankel struc¬ 
ture; 

iii) Tin = H(G k ). 

Proofs: 

i) Note that the (z,;y)-b1ock-element of F^.(F^)* equals 

With Proposition 4.3 it follows that 
this equals 1 for i ==■ j , and zero elsewhere. 

ii) This proof will be given by complete induction. For 
k = 1 the statement is true by definition. Assume that 
it holds for k - 1, i.e., fl*._i := rj^F^, is a Hankel 
matrix. We have to show that H* is a Hankel matrix too, 

with II, = [17^! r^jipjr 1 . 

L V A: — 1 J 

The Markov parameters of the system G k ~ 1 (z) will 
be denoted by Ho , II \, f/ 2 , ■ ■ 

With Uj and V* chosen as in (17), (18), it follows that 


n, = [7;,r^, f/„] v 


showing that 11, = 7’„Il,_i + UoV ( *T k ~ l - 
The matrix II, has a block Hankel structure if and 
only if Sll, - II, S*. with 

m 70 0 

,9- |() 0 / 0 


Evaluation of 5TI, shows that 511, = 6’T u Il,_i + 

su^rf-' = 

■G } I Kh G; ••.I 

= G a t u n,_,+ G 3 G 4 ••• J*- 1 (A.5) 


77] H 2 H 3 


G 2 G 3 
+ G 3 G 4 


= r„n,_ 1 5*+ 

l-Gi g 3 ■ 

+ g 2 g 3 


l Hr Hi H ;i 


From Lemma Al we can deduct that the nh block m v 
of T k ~ l corresponds to the Markov parameters of tin 
transfer function z~ i+1 G k ~ l (z). So 


rr 1 = 


'H 0 Hr H 2 
0 Ho Hr 

: 0 Ho 


As a result (A.6) can be written as 

sn, = r„n,_!5* + UoV't^'s* = 11 , 5 * 

which proves that II* is a block Hankel matrix, 

iii) The proof follows by induction, similarly as in step ii). 
Consider the first block row of II* = F*F*. This equals 

G()[H 1 H‘2 ■ ■ ■] 4- [G\ G 2 ■ • -]T*" 1 . 

Lemma A l shows that this is equivalent to 

G {) [H , Ho ■ ■ ■] + [W 0 Wi -■-] 

where W, is such that 


J^w k z' k = Yi'Gj+i*- J G k " l (z) = 

k=0 t_J-—0 

CJO * X 

= 5>*-‘ 


Hence the first block row of 11* corresponds to the 
Markov parameters of [S/-o 

= G k (z). 

Part B: Since IT* = [I/*_i FJLj] T/r 1 is an svd of II*, 

„ v k-\ J 

it follows that (/qII* = V k _ v and II*Vo = f7*_i which shows 
the uniqueness of f/*._ 1 and V*_i for a given FJ_ 1 and F£_ r 
Since Uo, Vo are unique up to unitary postmultiplication, this 
holds for the whole sequence of matrices {Uj , },— 0 , 1 ,■ 

Part C: The proof is given by construction in part a). □ 

Proof of Proposition 4.6: With V k = V*_ 1 T v the result 
follows immediately from Lemma Al. □ 

Proof of Theorem 4.7: The result of the theorem follows 
if the set of basis functions is complete in t. 2 , i.e., if for any 
x E 00 ), the following implication holds 

(< <!>k, x > = 0 for all k) => x = 0 

with < ■ > the inner product in ^ 2 - 

If < </>*,./: >= 0 for all fc, then (F£)*F£]f/ = 0 for all A;, with 
y := [;/;(()) .r(l) ■ ■ ■]*. Consider the /th row of this equatioii: 
’[(rj)*r£]i*y = 0 for all A;, with [(rj)T£]i* the /th row of 
the corresponding matrix, then 


(A.6) 


ll([(r£)*n]<. - <)y II < ll[(rj)'rj] j . - e*|| • ||y||. 
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Since, (F£)*r^ = (H{G k ))*H(G k ), it follows from Lemma 
AS that ||((r*Tr^]^ - e*|| 0, which implies that 

Vi-0. □. 

Proof of Corollary 4.10: Part a) follows directly from the 
completeness of the basis. For part b), consider the ith row of 
H(z) — D , with D = Hindoo H(z ), and H(z) - D written 
as SZfcLi h T {k)z~ k , with h(k) 6 R m . 

Consider the scalar time series {/<//(£)},_ 0t i,... defined by 

[w(0) w(l) w(2) ] = [/> T (1) h T ( 2) • • • ]. 

Applying part a) delivers w(t) — W£<t>k(t), with W k e 

R n , M*) € R". As a result h T (j+l) = EE.0 W klM™j+ 
1) ••• <Mm(j + 1)) ]. 

In the notation of Proposition 4.6 this leads to h T (j + 
1) = E“=o W k M k(j)- Consequently E^i h T (r)z~' = 
EZx ZZo W{M k (r-l)z-' = Erie WZV k {z). Since 
this applies to each row of H(z) - D , this proves the result. 

Proof of Lemma 5.1: From Proposition 4.2 it follows that 
for the realization of an inner function, the controllability 
and observability grammians have to satisfy PQ = 7, while 
stability requires that P, Q > (). In a balanced realization 
P = Q and diagonal, which implies P = Q = 1. □ 

Proof of Proposition 5 .2: 

G T (z~ l )G(z) = - A*)~ 1 C* + D*} 

■ [C(zl - A)- 1 13 + D] 

= B*(z-'I - A*)~ l C'C(zI - A)~ l B+ 

+ D*C(zl - A)~ l B 
+ B*(z~ 1 I - A*)~ l C*D + D*D. (A.7) 

Using A*A + C*C = 7, we can rewrite the first term of 
the right-hand side by employing I — A*A = .4*( 2 / — .4) + 
(aT 1 / - A*)A + (2" l 7 - A m )(zl - A). 

Substitution of this in (A.7) shows that 

G T (z~ l )G(z) = (D*C + B*A)(zI - A)~ l B 

+ B*(z~ l I - A*)~ l (C*D + A*B) 

+ B*B + D*D. 

Since (A,B) is a controllable pair, it follows that 
G T (z~ 1 )G(z) = I if and only if B*B + D*D = / 
and D*C + B*A = 0. □ 


with (A*. B\,C],D\) = (A, B, C,D). Validity of the state 
ment for (At, B \, C \, D\ ) is straightforward. Assuming valid 
ity for k - 1. we have to show that the statement holds fos 
k. First we show that (A k . B k ,C k , D k ) is indeed a realizatioi 
of G k (z) 

C k (zI-A k )- 1 B k + D k = [DC k -i C}- 

(zl — Afc-i) _1 0 

(zl - A)~ x BCk-\{zI - Afc-j)- 1 (zl - A)~\ 


= [DC k -i + C(zl - A)~ x BC k -i](zI - Air-i)* 1 B k -i+ 
+ C(zl - A)~ 1 BD k ~ 1 + D k 
= [D + C(zl - A)~ l B]Ck-\(zI - A*_,)- 1 fl fc _ 1 + 

+ \C(zI- A)- 1 B + D]D k ~ l 
= (D+C(zl — A)~ k B)\Ck~x{zl - Ak-i)~ l Bk-\+Dk-\] 
= G(z)G k ~ 1 (z) = G k (z). 

Balancedness of the realization (A k ,B k sC k ) can be shown by 
evaluating; A k A* k + B k Bl. For brevity of notation, we will 
write (Ak-\, B k -\,Ck~uDk-i) = (A.B,C,D) 

A k Al + B k Bt = 

AA* + BB* AC*B* + BD'B* 

BCA* + BDB* BCC* B + AA m + BDD*B* 

AA* + BB* (AC* + BI)*)B* 

” B(CA* 4- LHP) B(C(r + DIT)B * + A A* ‘ 

Employing Proposition 5.2 together with A A* f B B* = I 
shows that the above expression equals the identity matrix. 
In a similar way, using the dual forms, it can be shown 
that A* k A k 4- C k C k = 7, which proves that the realization 
is balanced and minimal. □ 

Proof of Proposition 5.5: Since V 0 * — [B AB A 2 B ■ • ■ J 
and H{G) = UqVq is an svd, it follows that ~ 

[G* A*C * ■ • • ]. Similarly it holds for any A; that 7 i(G k ) = 
rjrj is an svd, with (V° k )* = [CJ AJCJJ • • ■ ]. Since Tg and 
satisfy the recursion property of Theorem 4.5-a), the given 
solution has to be the unique one. EI 

Proof of Proposition 5.6: With X — BG and P an^ 
matrix satisfying PB = B 77, the matrices A k ,B k as in 
Proposition 5.4 will lake the form 


Proof of Proposition 5.3: Using Proposition 5.2, and its 
dual version, it follows that DD*'+CG* = 77*77+ P* 7? = 7. 
Now ||D ||2 < 1 is equivalent to the smallest singular value 
of B*B being greater than zero which is equivalent to rank 
B = rn. The result for rank C follows analogously. □ 

Proof of Proposition 5 A: We use complete induction on 
A; to prove this proposition. Note that we can write 


A k = 

At-] 

BCk-x 

o' 

A 

B k = 

' fifc-i ' 
BD k ~ l 

^ = 1 

pCk~\ 

C] 





0 

A 0 
A r 


B k = 


P k ~ 2 X P k ~ l X 

B " 

PB 

P 2 B 


X 


0 

0 

0 

0 

A 


and 


P k ~ l B 
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v c can write 

r M »u) i 

M x {j) . , 

A J k B k = .... = Ak(A J k ~ l B k ) = 

-Mk-iU)- 
f Mo(j - 1) 

M\ (j - 1) 

= Ak . 

-Mk-iU - 1) 

With the above representation of Ak this leads to the recursive 
relation (30). Relations (29), (31) follow directly from Bk- □ 
Proof of Theorem 5.7: 1-a). Using Proposition 4.6 we 
have to show that 

z(zl - A)~ l B[D + C{zl - A)-'B} k = 

= [(*/ - A)- 1 F{1 - zA*)\ k z(zI - A)~ l B. 

Note that it is sufficient to show that this holds for ft = 1, 
since successive application of the equality for ft = 1 shows 
the result for any ft. The equality for ft = 1 is equivalent to 

{B[D + C(zl ~ A)" 1 B] = F(1 - zA*)(zI - A)' 1 B] 

<S> {BD + BC(zI - A)" 1 B = BC(zI - A)~'B+ 

- PA(zI - A)~ l B - zFA*(zJ - A)" 1 B}, 

«• {BD = — PA[zl - A)~ l B - zFA*(zI - A)~'B). 

With PB - BI) it suffices to show that 

P = -PA(zJ - A)" 1 - zFA*(zl - A)- 1 . 

This is equivalent to {P(zl - A) = —PA — zFA*\ 

{P = -FA*} o {P = -BOA* + PAA*} « {P = 
BDB* + P-PBB*} {PBB* = BDB *} which is known 
to be true since PB — BD. 

1—b) This follows directly from Proposition 4.6. 

2) Take P = -RA*. Then PB - - RA*B , which with 
Proposition 5.2—i) equals RC*D. With B = RC* it follows 
that PB — BD and thus this choice of P satisfies (28). 

Now it has to be shown that for this P, BC — PA — R. 
This follows from BC - PA — BC 4* RA*A — BC 4- R(I — 
C*C) = BC+ R - BC = R. □ 

Proof of Theorem 5.8: Denote the infinite-dimensional 
matrices BBk- ft —> oo t and Aoc : = Ak, ft —> oc. Then 
we can rewrite (34) as 

[< \ V, C a A s B s C a A 2 H B s ■■■] = 

= [L 0 L ! ■ • ■][/?« A^Boo A'iB^ ■ • ■]. (A.8) 

' -cause of the orthonormality of [B^ A^ B^ ■ ■ ■] postmul- 
’ dication of (A.8) with [£*<*> A^B^ ■- ■]* provides 

'\B„ CtA'B, ■ ■ -poo A„oBrv ■■•]* = [L 0 h L 2 ■ ■ ■] 

: -ding to [Lo L x L 2 ■ ■ ■} = 

We define 

Ou 

Q ■■= [Qo Qi Qz ■ ■ ■) = £ A!B.BZ,(A^) k (A.9) 

k- 0 

J as a result, L k = C,Q k , which equals (35). 


Based on (A.9) we can write A„QA^, = Q ~ 13„B* 
X = BC and P any matrix satisfying PB - BD. this 1 
■A* X* X*P* 

0 A* X * X’P* 

A a [QoQi(h • ■•] 0 0 ' r V * ■ 

: : 0 A* 


= [Qo Qi Qi ■ ■ ] - B,[B* B'P* B*(P*)' 2 ■ ■ •]. 

The first element of this equation shows A h QdA* = Q 0 - 
B a B*, which equals (36). The /th element leads to 

i 

A a \QiA* + Y,Qi-jX*(P*y-'} = Qi - B f B*(P*y. 


(A. 10) 


Postmultiplication with P* gives 


A a [QiA*P* + Qi^X*{P*y\ = QiP* - B,B*(P*y +l . 

j=rl 

(A.l 1) 

Writing (A. 10) for i —► i + 1, shows that 

*4-1 

A.[Q f+ i A* + Y^ Qi-j+i X*(P*) j ~ 1 ]=Q;+1 -B h B’(P*) i+1 


and subtracting (A. 11) from (A. 12) delivers 
A'Qi+xA' - A s Qi[A*P* -'A*] = Qn i - QiP V (A.13) 


Note that P = P{AA* + BB*) = PAA* 4- BDB * = 
PAA* - = -FA*, and since F - A' - /VI (A. 13) 

leads to (37). D 

Proof of Proposition 7.1: Part a) A similar result for 
continuous-lime systems is proven in [11). The discrete-time 
version follows by applying a bilinear transformation, as is 
shown in [19]. 

Part b) The proof is based on an equivalence relation as 
employed in [11], based on the bilinear transformation. If 
( A t i , Bd , C (i . D d ) is a balanced realization of a square discrete- 
time system G f \, then (A r - B ( ,O r , D t ) is a (continuous-time) 
balanced realization of a continuous-time system C ( , where 

A c — [Ad ~ J][Ad 4- Z]" 1 Ad = [I 4- Ar][/ “ A r ] 

J3, = n/2[A,/ 4 /]■* :1 Bd = v^2[/ - >»r]‘ 1 B t: 

C c = y/2C d [A d 4 Z]- 1 = V^Vl/ - A-]' 1 

D r -ZV - rV[A (/ 4 Z]- l Bj D rf = A 4 C v [l - 

Furthennore Gj is inner if and only if G c is inner. From 
[11] it follows that G c is inner if and only if the following 
conditions are satisfied: 

1) Ac 4 A* 4 BcB ; = A, 4 A; 4 C?;C r = 0 

2 ) A/7; = zw = z 

3) Z7;a 4 B* = DcB: + C = 0. 

Given Ad-Bj we can construct A r , Z? f with the equations 
above, additionally choosing C7 r Z?*, D< := —Z it follows 
that (A r ,ZZ c ,C r , D c ) is a balanced realization of an inner 
function. 

Now transforming the continuous-time realization back to 
the discrete-time domain, with the expressions for C ( i , D ( i 
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as given before and employing the relation (1 - A c )~ l = 
\{I + Ad), shows that 

C d = B' d {A* d + iy\A d + l) (A. 14) 

Dd^B'MZ + iy'Bd-I (A. 15) 

which completes the balanced realization (A d , B d ,C d , D d ). 

Using the fact that B d and C d have full rank, Proposition 
5.2 now implies that premultiplication of C d and D d with any 
unitary matrix U characterizes the intended class of balanced 
realizations. 

Part c) Consider a realization (A, B, C, D) satisfying (44), 
(45) with U — I. For F - [I + A]\I + A*]~ l it follows 
immediate that PC'* = B, and the result follows with Theorem 
5.7-2). 

Denote F(A, U) = [A/ + B{U ~ A I)(B*B)- l B*]F, with 
U a unitary matrix and A € G. Then by substitution it can 
be verified that F(\,U)C*U* = B. This means that for any 
C = UC we have constructed a A-family F( A, U ) that satisfies 
F(\,U)C* = B. Again with Theorem 5.7-2) and choosing 
A = 1 this proves the result. □ 

Proof of Proposition 8:1: This result follows directly 
from rewriting the Lyapunov equations in Theorem 5.8 in 
terms of Kronecker products: see [31, [19]. □ 

Proof of Theorem 8.3: Consider a single scalar entry of 
the rational matrix function W(z) — Z„(zl - X 0 )~ l Y 0 , 
written as w(z) = u>kZ~ k . Then u/(z) is convergent 

for \z\ > A. Consequently, according to basic theory of 
power series, see e.g., [20], as employed also in [17], there 
exists an a e F such that for each // > A, < 

a , leading to \wk\ < ocrj k . Since this holds for any entry 
of W(z), and W(z) = i)z~ k * there exist 

scalars asuch that \L k (i<j)\ < a»j7/ fc+1 , with L k (i,j) 
being the (z,j)-entry in L k . Denoting E(z) := H(z) — 
H N {z) = L k V k (z), it follows that ||J5(z)|| : *> < 

iiEr=N^.v fc (z)iu < iiv„( 2 )iuEr=A'ii^iu- 

Using the above upper bound for \Lk(i,j)\ together with the 
well-known relation between the oo-norm and the Frobenius 
norm, || • |U < || ■ || F , it follows that 




(A.16) 


Substituting this latter upper bound in the derived upper 
bound for ||F(^ )|| fX) , the result of the theorem follows with 


c=iivo(*)iuyE^. 


□ 
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Hoc Control via Measurement Feedback 
for General Nonlinear Systems 

Alberto Isidori, Fellow, IEEE , and Wei Kang, Member, IEEE 


Abstract —This paper shows how the problem of (local) dis¬ 
turbance attenuation via measurement feedback, with internal 
stability, can be solved for a nonlinear system of rather general 
Structure. The solution of the problem is shown to be related to 
the existence of solutions of a pair of Hamilton-Jacobi inequalities 
in n independent variables, which are associated with state- 
feedback and, respectively, output-ipjection design. The results of 
the paper extend a number of recent achievements in this area. 


1. Introduction 

T HE development of a systematic analysis of the nonlinear 
equivalent of the ffoo (sub)optimal control problem was 
initiated by the important contributions of Ball-Hclton [2], 
Basar-Bemhard [4], and Van der Schaft [JO]. In particular, 
|10| has shown that the solution of the problem in question 
in the case of full information configuration, that is when the 
set of measured variables which are available for feedback 
includes the state of the controlled plant and the exogenous 
disturbance input, can be determined from the solution of 
a Hamilton-Jacobi equation, which is the nonlinear version 
of the Riccati equation considered in analysis of the //^ 
(sub)optimal control problem for linear systems. More recent 
contributions to this area of research are the works [11], [6], 
[3], and [7]. In particular, [6] presents a set of sufficient 
conditions for the solution of the problem of (local) distur¬ 
bance attenuation in the case of measurement feedback, that 
is when the set of measured variables is just a function of 
the state of the plant and of the disturbance input. The paper 
[3], among many other important contributions, discusses a 
number of issues related to the necessity of the (sufficient) 
conditions proposed in [10] and [6]. The paper 17) considers, 
as the paper |6| docs, systems modeled by equations which 
are affine in both control and disturbance inputs, and shows 
that, if a local solution is sought, the sufficient conditions 
given in [6] can be simplified and turned into conditions 
involving two Hamilton-Jacobi inequalities in v independent 
variables, associated with the design of a “state-feedback” and, 
respectively, of an “output-injection gain.” 
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work was supported in part by MURST, by the National Science Foundation 
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In the present paper we study the H w (sub)optimal con 
trol problem for systems modeled by equations which are 
not necessarily affine in the inputs, a more general class 
nonlinear systems already considered in [3] (which discusses 
necessary conditions and a separation principle in the case 
of measurement feedback) and in [12] (which discusses the 
case of state feedback). More precisely, we present a (more 
general) necessary condition for the existence of a solution 
to the problem in the case of measurement feedback and we 
show that, if this condition is strengthened in a suitable way, 
then the construction of a feedback law yielding local distui- 
bance attenuation with internal stability becomes possible. The 
analysis of the necessity extends the results of [3] while the 
analysis of the sufficiency extends the results of [7]. 

Consider a nonlineai system modeled by equations of the 
form 

./ = F(j\ w, a) 
z = u) 

y = Y(.r , w). (1) 

The first equation of this system describes a plant with state 
j ■, defined on a neighborhood A" oi the origin in R n with 
control input u £ R m and subject to a set of exogenous input 
variables w £ R' which includes disturbances (to be refected) 
and/or references (to be tracked). The second equation defines 
a penalty variable c £ R\ which may include a tracking 
error, as well as a cost of the input u needed to achieve 
the prescribed control goal. The third equation defines a set 
of measured variables y £ W\ which are functions of the 
slate plant .; and the exogenous input w . The mappings 
F(.r, a ), Z{x , ?/), and Y(.i , w) are smooth mappings (i.e . 
mappings of class C k for some sufficiently large k) defined in 
a neighborhood of the origin in R" xR r x R m . Wc assume 
also that F(0, 0, 0) = 0, Z((), 0) = 0 and )'((), 0) = 0. 

The control action to (I) is to be provided by a controllci 
which processes the measured variable y and generates th 
appropriate control input u, and is modeled by equations v' 
the form 

£ = y) 

« = 0 (£, y) C 

in which £ is defined on a neighborhood E of the origin in R 
and ?/: E x R /} —♦ R", 6 : E x H p —* R m arc smooth function 
satisfying r;(0, 0) = 0 and 0(0, 0) = 0. 

The purpose of the control is twofold: to achieve closed-lot 
stability and to attenuate the influence of the exogenous inpt> 
w on the penalty variable z. Following numerous authors (sc 
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-g., [13], [5], [3], [11]) the property of disturbance attenuation 
in a nonlinear system can be characterized in the following 
way. A nonlinear system 

•i' ” w) 

y-h{x) ( 3 ) 

said to be locally dissipative near (x, u) = ( 0 , 0 ), with 
respect to a given supply rate /?(u, y), if there exists a smooth 
function V{x), which is nonnegative and vanishes at x — 0 , 
such that 

dV 

— u) + *{u, h(x)) < G (4) 

tor all u) in a neighborhood of ( 0 , 0 ) (note that there is 
some abuse of terminology in this definition: following [13], 
one should say that a system is “dissipative/’ with respect to 
the supply rate s(x< y ), if the integral version of the inequality 
( 4 ) holds, which would not require V'(;r) to be differentiable). 

If (3) is locally asymptotically stable (at the equilibrium 
x - 0 ) and locally dissipative with respect to the supply rate 
,s(u, y) = 7 2 |M | 2 - ||?y|| 2 , then its output response to any 
sufficiently small input, from the initial state x(0) = 0 , satisfies 

o < V(:r(/,)) < /Vm*)I | 2 - ||?/(.s)|| 2 Hs 

Jo 

for all t > 0. As a consequence (3) has an Lo gain which is 
less than or equal to 7 (see [ 5j). 

In what follows, we address the problem of finding a 
controller which renders the closed-loop system ( 1 )—( 2 ) locally 
asymptotically stable [at the equilibrium £) — ( 0 , 0 )] and 
locally dissipative with respect to the supply rale .s (w, z) — 
7 2 IMI 4 -IWI a - This problem will be referred to as the problem 
of local disturbance attenuation with internal stability. 

II. Disturbance Attenuation via State Feedback 

As pointed out by various authors (see, e.g., [2J, [4]), the 
problem of finding a feedback law of the form u = a(:r) 
rendering the closed-loop system 

x = F(x< w, a(j:)) 

2 = Z ( x , «(x)) (5) 

(locally) dissipative with respect to the supply rate s(w, z) = 
7 < 2 || , m ;|| 2 - \\z\\*, can be cast as a two player, zero sum, 
differential game, in which the minimizing player controls the 
input u and the maximizing player controls the input w. 

The Hamiltonian function associated with the game in 

icstion in the present case is a function H : R n x R n x R r x 
—► R defined as 

H{x. p, w, u) = p t F(:X, w, u ) + \\Z(x, w)|| 2 - 7 2 ||'«'l| 2 - 

Suppose plant ( 1 ) satisfies the following hypothesis. 

Assumption AJ: The penalty map Z(x , u) is such that the 

rtrix 

D, = f ( 0 , 0 ) 

* rank rn. 


Then, it is easy to see that, in a neighborhood of the 
(a;, p, w. 11 ) =(0, 0, 0, 0), the function H(x. p. w. v) fob 
unique local saddle point in (w, u) for each (x. p). More 
precisely, there exists unique smooth functions w,(x. v ) and 
u»(x, p). defined in a neighborhood of (0, 0), satisfying 

OH 

— (a:, p. w,(x, p). u*(x, p)) = (), 

OH 

— (.r, p, w+(x, p), u*( j:, p)) ~ 0. 

0) = 0, u.(0,0) = 0 

and such that 

H(x, p, u;, u«(x, />)) < H(x, p, w+(x< p), u„(.r. p)) 

< H(x, P, w*{x, p), u) (6) 

for each (x, p, w, u) in a neighborhood of (x, p, w. u) =((),(), 
0, 0). The existence of these functions and (6) can easily 
be deduced from the observation that H(x, p, w/, i/), viewed 
as a function of (w, //,), has a Hessian matrix which, at 
( x , p, w, u) —(0,0, 0, 0)., is equal to 

- 27 2 / 0 \ 

0 2 DjDi ) 

where DjD\ is positive definite by hypothesis. 

Let now V : R" R be a smooth function, defined in 
a neighborhood U of x — 0 and such that V(i)) = 0 and 
14 ( 0 ) = 0, set 

//*(:/;, p) = H (;r, p. w+(x, p). u,(x< p)) 

«.W = w,(x. V?(x)),. n 2 (x) = «.(*, Vj(x)) (7) 

and observe that (6) implies in particular 

H(x,Vj(x), w. « 2 (x)) < H,(x, V/(*)). 

Now, suppose the function V(;r) is nonnegative (in which 
case V(0) = 0 implies V r (0) = 0) and renders the inequality 

H,(x, Vj(x)) < 0 (8) 

satisfied for each x in a neighborhood of zero. Then, set 
a(x) = ao(:r) in (5). This yields a closed-loop system 
satisfying 

V x (x)F(x, w, €*,(*)) + || Z(x, o a (*))|| 2 - 7 2 ||«'l| 2 < » 

that is a system which, in a neighborhood of (j\ w) =(0, 0), 
has the required dissipativity property. 

Inequality (8) is called a Hamilton-Jacobi-Isaacs inequality. 
If V(x) is positive definite and appropriate additional hy¬ 
potheses are satisfied, it can be proven that the feedback law 
u — «2W is also locally asymptotically stabilizing. These 
additional hypotheses may take different forms. For instance, 
as shown in [7] for the case of affine nonlinear systems, one 
may simply assume that the function V{x) satisfies a strict 
inequality, i.e., that the left-hand side of (8) is negative for 
each x / 0, Or, one may assume that the plant (1) has a 
suitable “detectability” property. In what follows we describe 
this second choice. 
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Assumption A2 Any bounded trajectory i (f) of the system 

i(0-=F(j(/),o »(/)) 

satisfying 

Z(r(t), "(0) = 0 

for all t > 0, is such that lim*—^ r(/) = 0 
Remark Note that this assumption is the nonlinear version 
of the hypothesis that the system with input u and output ^ 
(and w =■ 0) has no finite transmission zeros on the imaginary 
axis □ 

In fact, we have the following result (whose proof, pretty 
similar to the proof of a corresponding result for affine systems 
given in [71, is omitted) 

Proposition III Consider the system (1) and suppose As¬ 
sumptions A1 and A2 hold Suppose there exists a smooth 
positive definite function V(r), locally defined in a neighbor 
hood of i = 0 and such that V'(O) = 0, which satisfies the 
Hamilton-Jacobt-Isaacs inequality (8) Then, the feedback law 
u = o 2 (a’) solves the problem of local disturbance attenuation 
with internal stability 

Remark Recalling the interpretation of the problem as 
a two players, zeio sum, differential game, the previous 
arguments show that if there exists a function V'(i) which 
renders V? (r)) =■ 0, the strategy u = a 2 ( r) is the best 

strategy for the minimizing player and the strategy w = oi( /) 
is the best strategy for the maximizing player Since in the 
present setup, w is a disturbance, the latter can be interpreted 
as the woist possible disturbance affecting the system □ 

III Disturbano r Attfnuation 

VIA TylbASURbMLNT FEEDBACK 
A Preliminaries 

If the state / of the plant is not available for measurement 
then the feedback law proposed in the previous section cannot 
be directly implemented Motivated by the results obtained 
in [7] for affine systems, we consider then a controller in 
which the feedback law m(r) is replaced by a law of the 
form where £ is an “estimate” of / provided by an 

appropriate auxiliary dynamics The latter consists of a copy of 
the dynamics of r corrected by a term proportional to the error 
which is induced by the estimation on the measured variable 
jy, that is a system described by an equation of the form 

t = w W ) + (7(0(V->'(£, m)) (9) 

where the matrix G(£), which will be called the output 
injection gain, is a matrix to be determined 
Since v) is not directly available cither, we substitute its 
actual value by the worst possible one, which, as seen in the 
previous section, at each time t is a function of the value 
of the state of the plant at this time and has the expression 


w+(t) — ai(jr(t)) Thus, in (9) we set u = 0^(0 ami 
w = oi(0 This yields a dynamic feedback law of the form 

c = f(c, «i(o, mo) + 6 ’(0(v - <nm 

« = MO (lOi 

For convenience, wc represent the corresponding closed 
loop system as 

j f =F(in), z = Z f (S) (11) 

with i‘ and F l (j c , w), as shown at the bottom of the page 
and Z l (jr ') = Z(r, a 2 (()) 

As before, we try to render this system locally dissipative 
with respect to the appropriate supply rale, i c , we seek the 
existence of a smooth nonnegative function f (t l ) such that 

", u>) + ||Z'(Oll' -7 2 ||u'|| 2 <0 tor all w 

( 12 ) 

In addition, we want the closed-loop system to be locally 
asymptotically stable In this respect, it is quite easy to 
show that if the above dissipation inequality holds and some 
appropriate conditions are satisfied the closed loop system 
(11) has an asymptotically stable equilibrium at i' = 0 In 
fact, the following result holds 

Lemma III l Consider system (11) and suppose the fol 
lowing 

i) Assumptions A1 and A2 hold 
ii ) the system 

* = /X€ m(0 0)-«(O5 U <m(0) (M) 


has a locally asymptotically stable equilibrium at = 0, 
m) there exists a smooth function U(i ( ), vanishing al 
i ( = 0 and positive for /' ^ 0, which satisfies the inequality 
(12) for w — 0 

Then /' (/ f 0) has a locally asymptotically stable equilib 
rium at j' — 0 

Proof The positive definite function satisfies 


dll (,'(!)) 


■\\A> MO)ll 2 <<> 


and this shows that the closed-loop system is stable at r ( = 0 
To prove asymptotic stability, consider any trajectory r' (/) ol 
0) yielding f/(/ r (/)) - 0 for all / > 0 Then 
the previous inequality implies that 


Z(f(/), (U(W))) = 0 lor all / s 0 


From Assumption A2, we conclude that hm— b 
Moreover, Assumption Al implies that there is a umqu 
smooth function u = //(/), defined in a neighborhood o 
j = 0 such that 


Z( /,(/(/)) = 0 and «(()) = () 

Therefore, lim r ^r( t) - 0 and Z( r(t) (/))) = 0 impl 

hm<_^MOO) = 0 
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This shows that the uMimit set of the trajectory in question 
entirely contained into the set 

A< = {(r. 0 /=0, « 2 (O = 0} 

ny initial condition in this set produces a trajectory m which 
/) = 0, while which is necessarily a trajectory ol 
1 11), is such that lim t->ao£(t) — 0 by hypothesis Thus, by 
1 iSalle’s invariance principle, we conclude that r* = 0 is an 
i ^mptotically stable equilibrium of F e ( r f , 0) □ 

In view of this results wc see that it ihe gain matrix G(£) 
is such that 

ai the inequality (12) holds for some positive definite 
function and 

b) system (13) has a locally asymptotically stable equilib- 
num at £ = 0, then the feedback law (10) solves the problem 
ol distuibance attenuation (with local asymptotic stability) In 
the next two subsections, we discuss the problem of how to 
hnd a matrix G(£) such that these two properties hold 

B A Notessens Condition 

To (paitially) motivate the condition that, m the next sub¬ 
section, will be used to find an expression of the output 
injection G(£) in the feedback law (10), we first describe 
a necessary condition for the solution of the problem ol 
disturbance attenuation via measurement feedback 
Consider the Hamiltonian function h R" xR 71 xR' x W J 
R defined as 


for all (.r, p. h>, </) in a neighborhood of (0 f 0 0 oi a , 0 
K{i j), w(i p i/). y) 

> A(r, p . «»(j p i/.(j p )) ,,)) , 

for all (j p y) in a neighborhood ol (0, 0, 0) 

Finally, set 

u'**(j p) = iv(i p //*< ( p)) 

The functions u>**(r,p) and </*(; p) thus dehned can 
be used to express a necessary condition loi the existence 
of solutions to the problem of disturbance attenuation via 
measurement Iccdback 

Theorem III J Consider system (I) and suppose Assump 
tion A3 holds Suppose the problem of local distuibance 
attenuation is solved by the feedback law 

C - n(C v) M = m 


and let U(i, £) be a positive definite smooth function satis 
fying 


UU f O 


n) {'(' »’ #(0)) 

+m< ^))ii 2 -7 2 n«'ii 2 < o (i7) 


tor all (/ £ w) in a neighborhood ot (0, 0 0) Then the 
positive definite function \\ (/) = U(i 0) satisfies 

Ml W\ r (') !«..(' IV, 1 (a)) </.(< H/(/))) < 0 (18) 


h(i p u i i/) + ]P b (; ii 0) — i/ T } (/ ii ) 

4 II7(i 0)I| 2 -7 2 |M| 2 (14) 

and suppose the plant (1) satisfies the following hypothesis 
Assumption A1 The measurement map ) (/ w) is such 
that the matrix 

Pi ~ (0 0) 

ow 


has rank p 
Since 

p tv ij) 
\ ()w 2 


-) 

/ (j u a 


— - 2^1 


(j p a y)-(U 0 U 0 ) 


there exists a smooth function tv(t p //), dehned in a neigh- 
hoihood of (0, 0, 0) such thal 


()K (/ p w 


du> 


l0}) 

/ u =u (j 


= n ///(o, o, o) = o 


1 ( J p u) 

N' >ieover, it is also easy to check that 

<;2 A(i p m(r, p, i/), </)' 


dij 1 


% 


1 


= 


U v y)=(o o oi 2 T 

os, there exists a smooth function </*(/, p), dehned in a 
^hborhood of (0, 0) such thal 

’^, P ,„)\ = „ (0 0) = „ 

°y v) 

'3y construction, 

K(i , p, uk v) < K(j) p, w(j\ p, </), y) (15) 


for each / in a neighborhood ot zero 
Pi oof Set C - 0 in (17) to obtain 

W,(/)/ (i, w 0) + C c (i »)'/(0 J (i «)) 

+ ||Z(; 0)||' 7 2 ||» II 2 * 0 

Since the function //(() */) vanishes at </ = 0, there exists a vec 
tor of smooth functions P(i i/) such that l $( i 0)f/(0 </) — 
if)tf and therefore 

U,(/)/(/ tv 0) + l^(i ) (i <i))l (i tv) 

+||Z(» 0)11 2 -7 2 ll«ll 2 <n 
Choosing in — in(i W ], i/), the latter yields 
A (i M / h;(j M, r y) P(i l(i «'(/ «? iy)))< 0 
Now, let i/( i) denote the unique solution ol 

v(i) = /■>( i V(r ui(i W7 (/(/))) 1 /( 0 ) = 0 

and set tf — y(i) into the previous inequality, to obtain 

A (/, W , T w(, w! 1/(0) 1/(0) CO 

At this point. (16) shows that A (/ U r ' u (r H / 
</„,(/ W, 1 )) '/*(*! M j 1 )) < 0 and the result follows □ 

Remark In the case of a system in which I (/ 0) 

and } (r «;) are affine in //;, condition (18) reduces to the 
necessary condition I AY I(i) < 0 of |3] Wc observe, 
however, thal the latter was proven under the hypothesis that 
f/(£, ij) is affine in </, while this hypothesis is not required in 
the proof ot (18) On the other hand, (18) is only locally valid 
while the necessary condition of |3| is globally valid □ 
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C. The Design of the Output Injection Gain 

In this subsection we show that, under appropriate hy¬ 
potheses, the feedback gain can be determined from the 
solution of an inequality involving the Hamiltonian function 
K(x,P, w,y) introduced in the previous subsection. The 
inequality in question is somewhat stronger than the inequality 
(18), whose necessity was proven in the previous subsection, 
and has the following form 


b) indicated at the end of Section III-A hold, if G(x) is chosen 
as in (20). As far as condition a) is concerned, observe that 

U x *F{x e ,w) 

+ ||Z c (x e )|| 2 -7 2 M| 2 

= Q,(* - 0[F(x, w, a 2 {0) -F{t, «x(0,-aa(0) 

+ V x F(x, w, a a (0) + ||Z(x,a 2 (0)|| 2 - 7 2 |M| 2 


K{x, W x {x), w,.(x, Wj(x)), y m (x, Wj(x))) 

V?) < 0. (19) 

As a matter of fact, we can prove the following result. 
Theorem III.2: Consider the system (1) and suppose: 

i) Assumptions Al, A2, and A3 hold, 

ii) inequality (8) has a smooth solution V(x ), defined in a 
neighborhood of x = 0, vanishing at x = 0 and positive for 
x y 0, 

iii) inequality (19) has a smooth solution W(x), defined in 
a neighborhood of x = 0, vanishing at x = 0 and positive 
for x ^ 0, 

iv) W(x) — F(x) > 0 for all x ^ 0, 

v) the Hessian matrix of 

K{x, W?(x), u )„(x, Wj(x)), j/,(x, Wj(x)))-H.(x, V" r T ) 


< Q r .{x - 0[F(x, w, « 2 (0) - F{£, «i(0, «2(0) 
-G(OY(x, w) + G(t)Y(t, ttl (0)] 

+ H(x, Vj , w, a 2 (0)-H.(x,V?). 

Let L(x, £, w) denote the expression on the right-hand side 
of this inequality. It is easy to see that, in a neighborhood 
of (x, £) =(0, 0, 0), there exists a (unique) function w(x, £) 
such that 


and 


(m, =0 

\ dw /»-*(«, e) 

L(x, t,w) < L(x, t, u>(x, 0) 


for all (x, w) in a neighborhood of (0, 0, 0). Thus, in this 
neighborhood, 


is nonsingular at x = 0 and the equation 


U x .F(x', w) + ||Z p (x c )|| 2 - 7 2 |M| 2 < L{x, t, w(x. £))• 


(W x (x) - V x (x))G(x) = yj(x, Wj(x)) (20) 
has a smooth solution G(x). 

Then, the problem of local disturbance attenuation with 
internal stability is solved by the output feedback 

i = m «i(a aa(«) - Gitmt, mo)+g(o» 

U = tt 2 (0 

with Oi(x), f>2(x), and G(x ) chosen as in (7) and (20). 
Proof: Set Q(x) = W(x) - V(x) and define 

S(x , w) = Q x [F(x, w, 0) - G(x)Y(x ) w)] 

+H(x, Vj, w, 0) - H.[x, Vj). 

Then, it is easy to check that 

S(x, w) = W t F(x, w, 0) - yj(x, Wj)Y{x, w) 

- V x F(x, w, 0) + H{x, Vj, w, 0) 

- ».(*, Vj) 

= W x F(x, w, 0) - yl(x , Wj)Y(x, w) 

+ ||Z(x, o)H 2 -7 2 IHI 2 -^.(*.v, t ) 

= K(x, Wj, w, y,{x. Wj)) - H.(x, V?) 

< K{x, Wj, tx„(x, Wj),y.(x, Wj)) 

- ».(*, v x T ) 

= x t A/(x)x 

where M(x) is a matrix of smooth functions, which is negative 
definite at x = 0. 

Now set U(x c ) = Q(x - () + V(x). It can be proven that 
the function thus defined is such that the two conditions a) and 


Moreover, it is also possible to show, by means of somewhat 
lengthy calculations (omitted for reasons of space), that 

L(x. t, w(x, ()) = (x - t) T fi(x, t)(x - t) (21) 

where R(x , £), a matrix of smooth functions, is such that 
R( 0, 0) = A/(0). Thus R(x , £) is negative definite in a 
neighborhood of (0, 0) and the result [namely, property a)] 
follows. 

To prove that condition b) holds, it suffices to set w = m (x) 
in the definition of S(x, w), to conclude that 

0 > S(x, «i(x)) > Qx(F{x, o:i(x), 0) - G(x)Y(x, a.i(x))) 

thus showing that Q{x) is a Lyapunov function for (13). □ 

Remark: The output injection gain G(x) is implicitly de¬ 
fined by means of (20). Obviously, the existence of a smooth 
solution G(x) of this equation depends on how W x (x) - V x (x) 
and t/.(x, Wj (x)), which both vanish at x = 0, tend to zero 
as x —► 0. If, for instance, the function W(x) - V(x) has a 
Hessian matrix which is nonsingular at x = 0, then a smooth 
solution of (20) always exists. In this case, in fact 

W T .{x) - V,.(x) = x T /?.,(x) 

where R\(x) is a matrix which is nonsingular for each x ir 
a neighborhood of x = 0. The right-hand side of (20) can bt 
given a similar expression, say 

yj(x, Wj{x)) = x t L(x) 

and the equation in question is indeed solved by G(x) - 
R^ix)L(x). [ 

Remark: It may be useful to stress that the proof o 
Theorem IIL2, as far as property a) is concerned, does no 
just consist of straightforward "second-order” arguments, h 
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this respect, the result cannot be viewed as direct conse- 
( |uence of the corresponding result which is known to hold 
for linear systems. As a matter of fact, the maximum of 
/ x , F(x F < 'w) + \\Z* (r c )|| 2 - 7 2 ||i/;|| 2 in w (that is, the function 
L(t. £, iv(x, £))] is a function of (x, £) whose second-order 
terms vanish at x = £. In other words, the maximum in w of 
the left-hand side of ( 12 ) has a “second-order" approximation 
which is only semidefinitc, and this docs not suffice to con¬ 
clude that (12) holds. The required conclusion is reached only 
after a stronger relation like ( 21 ) is shown, which clearly is 
not just a property of second-order terms. □ 

IV. An Example 

In the previous sections, wc have shown how to construct 
a feedback law which solves the problem of disturbance 
attenuation, using the solutions V(x) and W(x) of the pair 
of Hamilton-Jacobi-Isaacs inequalities (8)-(19). In this sec¬ 
tion we illustrate, with the help of an example, how the 
problem of determining actual solutions of these inequalities 
can be addressed. The first thing to observe, in this respect, 
is that—except for some special classes of systems (like, 
for instance, the systems in which F(,i\ in, u), H(:t . u) and 
}'(./*, w) are affine in id and a) — the inequalities in question 
arc only implicitly defined [because so arc the functions 
i/'*(.r). ?/+(.r). w++(.r). ;/*(./ ) which characterize the left-hand 
sides of (8)-(I9)]. Thus, the only feasible practical way to 
implement the results established so far is to try to estimate 
the solutions of these inequalities by means of appropriate 
numerical methods. 

To this end, we recall that the problem of determining 
polynomial approximations (of some prescribed degree) for 
the solution of a Hamilton-Jacobi equation has been addressed 
and solved by Albrekht in [1J and Lukes in [9|. The results 
described in these papers were developed for the specific case 
of the Hamilton-Jacobi equation arising in nonlinear optimal 
control, but indeed hold (as, e.g., observed in 11 IJ) also in the 
more general case of the Hamilton-Jacobi-Isaacs equation of 
a nonlinear differential game, thus also for the inequalities ( 8 ) 
and (19). 

The approach in question can be briefly described as fol¬ 
lows. Suppose one is interested in the determination of a 
positive definite function V : R fl —► R, defined in a neigh¬ 
borhood of ,r - 0 and vanishing at .r = 0 , such that 

//(.r, I'V), V; T (.r)), V, T (x))) = </>(*) (22) 

‘or some negative semidefinite or negative definite analytic 
‘auction </>(./•). Suppose V(.r) is analytic and set 

V(.r) =fV['' f V) 

t/=l 

nu 

uu(.t) -uu(.r, v'/(,)) = 

d =1 

OC 

«.(x)=Mj:, L T U)) = 2>*V) 

</=l 

oo 

*(■0 = 5 > i, V) 

d =3 


in which the superscript “[rf]" means that a function m q, 
components of a vector are homogeneous polynomials 
degree d. 

Following [I] and [9], it is possible to show that, U 
any d > 1 , r^ + 1 l(j-) and wl J, (jr). ul rf, (.r) depend only 
on VW(,r), ul lj (.r), ■ ■ ■, VM(.r)?rl f '~ 1 ] (.i), 1 ’( J ). 

The function V^(x) is determined by a Riccati equation, 
involving the parameters which characterize the linear 
approximation of the plant at the equilibrium (./, u. w) »((), 
0 , 0 ), while V^ + 1 J(.r), for d > 1 , is determined by 
a linear equation (called homological equation) which 
involves the parameters of V^(./), //>**(.r). (.r). --. 

V r l f/ l(.r). tul <, “ l] (,r),uL rf “ 11 (.r) (see [9f | 11 |, and [ 8 | tor more 
details). 

Thus, it is not difficult to setup up a recursive piocedure 
yielding, after d stages, a polynomial approximation of order 
d+ 1 for V(x) and a polynomial approximation of order d lor 
w*(.r). Note also that the function </>(.r) on the right-hand side 
of ( 22 ) is quite an arbitrary function (the only requirement 
being its sign-definiteness) and, thus, the extra degrees of 
freedom associated with the choice of this function can be 
exploited to the purpose of simplifying the entire solution 
process. 

In the following (elementary) example wc show how the 
solution of a Hamilton-Jacobi inequality can be achieved via 
polynomial approximations. This example may also help, in 
our opinion, to dissipate the impression that the analysis of 
the problem of “local" disturbance attenuation with internal 
stability relies solely on “first order" arguments. Consider the 
system 

(*) = (, r " ^ /wy 

\y) \j * 2 + «»+*// \ d ) 

It is easily seen (by means of simple arguments which arc 
omitted for reasons of space) that for any linear feedback 
law // — ax + by, the equilibrium point (,r, y) =■((), 0 ) 
of the corresponding closed-loop system (with id - 0 ) is 
unstable. Thus, in particular, the problem of local disturbance 
attenuation with internal stability cannot be solved by means of 
a linear stale feedback law. The problem in question, however, 
can be solved by means of a nonlinear state feedback, as shown 
hereafter. 

The Hamilton-Jacobi inequality ( 8 ) assumes in this case 
the form 

V', .ry + Vyjr 2 + ,/ - 2r 2 y + x 4 + \\V 2 < 0 (23) 

4 

where A — (I/ 7 2 ) — 1. 

Set 

V'(.r, y) = y) + .'/) 

with 

V™(x, y) —ax 1 -I- by 2 

V^(.r, y) =rx 2 y + dxy 1 + ex A + f y s . 
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In the notation introduced before, we obtain on the left- 
hand side of the Hamilton-Jacobi inequality (23) a function 
of (/, y) of the form 

0 (r, y) = y) + $ [,] ('> y) + 

in which 

y) = (1 +b 2 \)y 2 

$^(x, y) = ( 2 a + 26 — 2-1- Xbc)x 2 y + 2Xbdxy 2 + 3A6/y J 

Smce <f>(x, y) is required to be sign-definite, we impose 
y) = 0 ,i e„ we impose the following constraints on 
the parameters which characterize V(j , y) 

a = 1 - b - ^bc, d = / = 0 

a 

Moreover, to obtain simpler expressions, we set e = 0. As a 
result, we obtain 

= clt 2 4 - by 2 -i- cx 2 y 

</) s= (1 + b 2 X)y 2 4- ^r 2 + r 4* 1 ^j* 4 + 2cv 2 y 2 

The function V{x, y ) is positive definite, and the function 
y) is negative definite, in a neighborhood of the origin, 
if the following constraints hold 

a > 0 , b > 0 l + b 2 X <0, ^ 

For any A < 0 , ie., for any 7 > 1, these conditions can 
indeed by satisfied by suitable (sufficiently large) choices of 
6 > 0 and c > 0 The positive definite function V(j , y) 
thus found satisfies the strict version Hamilton-Jacobi-Isaacs 
inequality ( 8 ) Thus, according to the results described before, 
the corresponding feedback law 

U = a(x) = ~^V y = -by -‘-I 2 

locally asymptotically stabilizes the equilibrium of the corre¬ 
sponding closed-loop system and renders its L 2 gain less than 
or equal to 7 . 
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Technical Notes and Correspondence 


perturbation Bounds for Root-Clustering of Linear 
Systems in a Specified Second Order Subregion 

W. Bakker, J. S. Luo, and A. Johnson 


Abstract —Sufficient bounds for structured and unstructured uncer¬ 
tainties for root-clustering in a specified second order subregion of 
the complex plane, for both continuous-time and discrete-time systems, 
are given using the Generalized Lyapunov Theory. Furthermore, for 
unstructured uncertainties, a still less conservative result is obtained by 
shifting the center or focus of the subregion along the real axis to the 
origin and by applying root-clustering to the “shifted eigenvalue” system 
matrix, which is obtained by shifting the eigenvalues of the system matrix 
correspondingly. 

I. Introduction 

We study the problem of how to guarantee the location of the 
eigenvalues of a perturbed system matrix in a specified (symmetric) 
second-order subregion of the complex plane. The perturbation may 
be structured or unstructured. The subregions discussed are, in the 
continuous-time case, subregions of the left-half complex plane 
(LHP) and in the discrete-time case subregions of the unit disk 
(UD) centered at the origin. Recently, Abdul-Wahab [1] and [2] 
discussed second-order subregions for discrete-time and continuous 
time systems, but (he results have turned out to be erroneous. The 
errors have been pointed out by Yedavalli 13] for the continuous- 
time case and by Bakker and Luo [4] for the discrete-time case. 
More recently, Yedavalli [5] obtained results for first and second- 
order subregions using Generalized Lyapunov Theory presented by 
Gulman and Jury | 6 ). 

Conservatism of the paper of Yedavalli [5] is reduced in several 
ways. First, the norms used to compute the bound are taken at a later 
stage compared to the bounds of Yedavalli. Next, an important result 
is. lor unstructured uncertainties, that the bound can be improved 
tor subregions with center or focus at (a, 0 ). This improved bound 
is obtained by shifting the center or focus of the subregion along 
the real axis to the origin and computing the bound for the “shifted 
eigenvalue” matrix. Generally, a large a yields a large improvement. 
Finally, from Luo et al. [7] we are motivated to use the square root 
of positive definite matrix Q {= S 1 5) in the Lyapunov equations; 
ibe result can be improved by choosing the 5-matrix appropriately. 
Fsing a personal computer, the optimal 5-matrix can be found by an 
optimization program in Matlab (e.g., fminu.m (9]). 

This paper is organized as follows. In Section II the robust stability 
analysis for state-space models of continuous- and discrete-lime 
'‘Vstems using the Generalized Lyapunov Theory [ 6 ] is described. In 
v ction III we introduce bounds for both continuous and discrete-time 
/stems with unstructured uncertainties. A motivation for “shifting” 
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is given, followed by the “shifting theorem,” In Section IV, bounds 
for structured uncertainties are given. For both cases, the bounds can 
be improved by optimization programs. In Section V. the results are 
illustrated with examples, followed by two general remarks. We end 
Ihe paper with conclusions in Section VI. 

II. Generalized Lyapunov Root-Clustering Theory 
Let the perturbed continuous-lime system be represented by 

x(t) = (A + E)x(t) (2.1a) 

and let the perturbed discrete-time system be represented by 

x(fr + 1 ) = {A + E)x(k) (2.1b) 

where the state vector is x(f) or x[k) E R”, the time-invariant system 
matrix .4 € R" XY ' and the perturbation matrix E E R" x ". If the 
structure of the perturbation is known, the substitution 

mi 

E=Y, f ' E ' < 2 - 2 > 

is made. Here e, (/ = 1.2, v.-,m) are time-invariant uncertain 
parameters which are assumed to lie in the interval around zero 
(f , E [—f, f] where r > 0) and E, (/ = 1. 2. ■ ■ ■. ni ) are constant 
matrices determined by the structure of the parameter uncertainties. 

Any symmetric second order subregion in the complex plane ((* 
or :)=■*' + iy) is described by (.r, y) satisfying the inequality | 6 ] 

il '2 — {( f\ y)' '>rm + T 1 oT -F T'io i‘ 2 4- 7oy?/‘ < 0} (2.3) 

where 77 ,, is a real coefficient (note the difference with the boldfaced 
slate vector x anti the real coordinate r). It is clear that this region 
is symmetrical with respect to the r axis. This inequality describes 
for example circles, ellipses, left parabolas, etc. Let us now define, 
as in | 6 J 

1 1 , , > 
coo = Ton. = ro 1 = -T10. r i 1 ““ ^(T20 -rToy). 

cyo = co 2 = —(720 Toy) (2.4) 

so we can present the Lyapunov criterion for root-clustering in a 
specified second-order subregion Ife [ 6 ]: Let 720 + Toy > 0 in (2.3) 
and let the coefficients e |l(| be given by (2.4). If and only if for a 
given positive definite Hcrmitian matrix Q the following generalized 
Lyapunov equation (GLE) 

C 00 P+ c,o (A T r+ PA) -(■ cnA T rA + c w ((A‘ ) 2 P A PA‘) 

= -Q (2.5) 

has a positive definite solution P, then all the eigenvalues of the 
nonperturbed system [E = 0 in (2. 1 )] are located inside the defined 
subregion (2.3). Equation (2.5) can be solved using the Kronecker 
product; for a detailed description, see [ 6 |. 


0018-9286/95$04.00 © 1995 IEEE 
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III. Perturbation Bounds for Unstructured Uncertainties 

In this section new stability robustness bounds for systems with 
unstructured uncertainties based on the Generalized Lyapunov Root- 
Clustering Theory presented in Section II are given. It is shown that 
“shifting," for a large class of subregions, yields improved bounds. 
These results are organized in the “shifting theorem." 

In the next theorem we give less conservative bounds on the norm 
of the perturbation matrix E so that the eigenvalues are located in 
the specified subregion. There is no knowledge assumed about the 
structure of the perturbation. 

Theorem 3.1: Suppose 

1) The perturbed system is described by (2.1). 

2) The eigenvalues of the nonperturbed system matrix .4 are 
located inside a subregion G 2 defined by (2.3). 

3) S T and 5 are full rank (invertible) matrices satisfying Q = 
S T S. 

4) P is a symmetric positive definite matrix defined by (2.5). 
Then the perturbed system matrix -4 + E has all the eigenvalues 

located in the subregion fl 2 described by (2.3), if 


l|£|l<,/ = ^(' 1 + V 1 + S 


with 

fl = h J |||P||||S- , || 2 + 2h D |||S- 1 ||||PS- 1 || (3.1a) 


b = 2|r 10 | ||5“ 1 1| HPS-'H + 2|r„| ||S- l || ||P.4S _) || 

+ 2|c 2 „|||5-'.4 r || HPS-'H 

+ 2|r a „| ||S-‘|| H.4 1 PS' 1 1| (3.1b) 

and where ||.4||=the spectral norm of the matrix ,4 (the largest singular 
value of A). 

Proof: See Appendix. 

Remark 3.1: For first-order subregions, rn = r 2 o = 0 (2.3) and 
(2.4). In that case the upper bound (3.1) is obtained from (A.2) with 
a = 0 to be ft =■ 1 / 6 . 


Fact 1 . The coefficients n 1 and C 20 and the solution for P (2,5) 
(see Remark 3.2), used in the bound (3.1a), are not affected by 
shifting. 

Fact 2: From (3,3), ||A|| > \a +1|. This implies, with Fact 1, that 
when 5 = / at least the third term in (3.1b) is multiplied by a factor 
larger than |o + 1|. Thus when a is small (a < -1), |a + 1 | is 
large and consequently that ||A|| is large. So a large a causes a large 
6 . For subregions as a left-parabola or a vertical strip, with center or 
focus at (a, 0 ), similar inequalities hold. 

Fact 3: The value of a is unaffected by shifting (follows from 
Fact 1). 

Fact 4: When h becomes large (a unaffected) then the bound 
becomes small (i.e., very conservative). This is even more obvious 
when, with 6 > c, a Tavlor series expansion is used for the square 
root in (3.1): n « 1/6. 

We have shown that the bound is conservative when subregions 
with center or focus at (o, 0 ), a sufficiently large, are considered. In 
the next theorem, it is shown that it is possible to shift the subregions 
(3.2) from (a. 0) to (0, 0) and to compute the bound for the “shifted 
eigenvalue" system matrix with Theorem 3.1. With the facts are given 
above, the improvement of the bound should be clear. 

Theorem 3.2 “Shifting Theorem ” Root-clustering in a subregion 
12 2 (3.2) of the complex plane for a system matrix A is equivalent to 
root-clustering in a shifted subregion (3.4) of the complex plane 


for a “shifted eigenvalue” 

system matrix A, where 


= {(.r, J/): 

(x/i ')' + (y/w)"-u <()} 

(3.4a) 


.r = .v — n 

(3.4b) 


,4 = .4 - a/. 

(3.4c) 


Proof. Using (3.4c) 

A (A + £*) = A( .4 -I- a 1 + E) = A( A + E) + a 
and for the real coordinate of the subregion, with (3.4b) 


Shifting of Subregions 

The bound // depends on the matrices A, P, and 5 and on the 
coefficients that describe the subregion. Consider the role of the 
system matrix in the bound (summarized in Fact 1). Of interest are 
subregions of the form 

ih = {(.r, y): ((.r - o)/r) ; + (y/w) 9 - 1 / < ()} (3.2) 

where (a, r, u\ u) 6 R and r > (), 1 / > 0; / = 1, 2, and 
g = 0. 2. Examples are a circle, a left-parabola, an ellipse, a vertical 
strip, etc. The eigenvalues of system matrix A have to be located 
in this subregion. Next, for convenience, restrict the subregion to 
be a circular subregion of the LHP, with center (a. 0) and radius 
1 satisfying a < -1, / = 2 = g, it = 1 = v = w in (3.2) (for 
other subregions similar arguments hold). Then the eigenvalues of 
the matrix A have to be located within a distance 1 from (a. 0) and 
therefore satisfy 

(a-1)<Rc(A(A))<(a + 1). 

Because a < -1 ^ rv + 1 < 0, taking norms 
|Re(A(A))| > |a +1|. 

From [20] 

IIA|| > |A(.4)| = (Re(A(.4)) 2 + Im (A(j4)) 2 )'^ 2 

> |Rc(A(.4))| > |n + l|. (3.3) 


.1 = -f A. 

Thus, the bound on the perturbation E can be computed for the shifted 
subregion and “shifted eigenvalue" system matrix, being valid for the 
original subregion and system matrix. 

Remark 3.2’ Theorem 3.2 can also be proved using the General 
Lyapunov Equation; that is, when (3.4c) is substituted in the GLE 
of the shifted subregion (with coefficients (2.4) for this shifted 
subregion) after some standard manipulations it can be shown thai 
for subregions that can be shifted the solution P of the GLE of the 
shifted subregion equals the solution P of the GLE of the original 
subregion. 

Remark 3 3 • For convenience and motivated by fact that the 
coefficient no becomes zero in the bound (see (3.1b) and Fact 4) 
the subregion is shifted along the real axis to the origin. For soim 
very specific cases, the best bound is not obtained by shifting exactl; 
to the origin. In that case, one can find the “optimal shift" along tht 
real axis which yields the best bound. 

Remark 3.4’ The result (3.1) is a function of S. In [7] it 1 
shown that for similar cases an optimization (maximization) 0 
li(a(S)< 6(5)) reduces conversatism. This can be done with th< 
program fminu.m in Matlab [9], The power of this method will b 
illustrated with examples in Section V. 

Table I shows the results of this section for some second orde 
subregions. When the regions satisfy (3.2), the parameters holdin 
for the shifted subregions are presented. 
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TABLL 1 

I si IIS FOR Din I RENT SUBRfcClIpNS < IS FOUND WITH (2 4) AND (2 1) I HE ElOKNVALUl S ARF LOl ATFD IN THF SlJBKKJlON V * (2 3) OU O { , , 

EDITION IN THFORFM 3 1 IM1H a AND b AS BFLOW IS &ATISI IFD Till EQUATIONS AlTFR IHE 1 ARI OBTAINFD FROM IHf SHIIHD SlJHKIl.lON < TMl ki 


Subregion 

«2 = (*.y): 

Qj 

QQ 

| 


a 

b 

circle - 

(x/r) 2 +(y/r) 2 -l < 0 

n 

B 

r 2 

0 

1 r 2 1 11 P 1111 S ‘ii 2 

2,r 2 1 US 'llIIPAS 'll 

parabola- 

4px+y 2 < 0 

0 


1 

2 

"4 

illPllllS l ll 2 

4pllS 'imps 'll + llS 'llIIPAS 'u 




H 



+±ns Sups ‘ii 

+i|lS T A T llllPS ‘ll+jllS 'llIIA t PS 1 H 

ellipse - 

(x/v) 2 +(y/w) 2 -l <0 

1 

9 

i(v 2 +w 2 ) 

i(v 2 -W 2 ) 

J(v 2 +w 2 )IIPiihS ’ll 2 

(v 2 +w 2 )liS 1 ll ll PAS 'll 



1 

1 



+i(v 2 ~w 2 )liS ’llIIPS ’ll 

+i| v 2 -w 2 1 ( 11 S t A t II HPS 'll + IIS 1 ll ll A t PS 'll) 

Vertical 

x 2 -(S 2 < 0 

-P 2 


1 

2 

9H 

jllPimS ‘n 2 

IIS 'll ll PAS 'll +pS T A T nilPS 'll 

strip - 





mi 

+ 'nS 'iniPS '11 

+pS 'll ll A 1 PS 'll 

Ride 

i -*V+p* < o 

p\ 


1 


IIPIIIIS ’ll 2 

2llS 'll ll PAS 'll 

quality 

2 x 2 +y 2 -p 2 < 0 

i 

-Pi 


1 


llPll ns 1 ll 2 

2llS 'llHPAS 'll 

[6] 

3 -w 2 +y 2 < 0 

2 

-u 


1 

2 

4 

iIIP 1111 S V+iiiS 1 1111 PS ’ll 

11 S 'll ll PAS 'll + 'llS T A T llllPS ‘ll + 'llS 'lillA T PS 'll 

2 2 


IV Pi RIURBATlON BOUNDS FOR SlRUCIURH) UNC L RIAINTIFS V tXAMFII S TOR Dll tFRFNT Sl( OND C)RDI R SUHKI C.IONS 


For structuied uncertainties it is possible to obtain less conservative 
it suits In the next theorem we obtain a perturbation bound foi this 
use 

Thtottm 4 1 The same assumptions as in Theorem 3 I 1)4) are 
made The system is given bv (2 1) with the perturbation as in (2 2) 
The eigenvalues of the perturbed system aic located in the described 
subregion (2 3) if 


h /'+r 


(4 1) 


wheie 


ED'’*i 


i i • 


h = 


£i 7 > 


p* = s ' v, s 1 />; == s ' r s ' 


i = > m (r/ rh, + rj pc, ) 

4ua((F,/-,)'/>+ r£,t, + (r,r )‘p + pr,r.) 


Root C lusttnm• in a C in le uifh Uint/ui hard Ununawlu \ 

The result will now be illustrated with an example We lake the 
same plant matrix as Ycdavalli \^\ 


with eigenvalues \| — - l 2, \_ = - 3 j lhc circular subregion is 
defined with <\ = - 1 < u — \ u = 1 ) — it - 2 in (3 2) I irst, 
taking Q = S = /, (3 1) gives the upper perturbation bound 


fi s _/ - 0 0370 


where the result in [^1 is p , / = 0 0341 This is an improvement 
of KS<7< 

Secondly we shift the center of circle lo the ongin (Theorem 3 2) 
and apply oui root clustering Theorem 3 1 foi die system matrix 1 
in the subregion 0 4a) This gives 



with eigenvalues \|( 4) — —0 2, \ fc ( l) “ 0 j The uppei bound 
on \\E\\ becomes 


// s / = 0 37S2 


-<n.(£' p+ PL ) + <u(r!r\+ i 1 rr.) 

r+ pf, i + (\r,)‘r + r \r > 


Comparing the icsult ft i lo p , / |S] we sec that the bound is 
improved by 1009% 

Thirdly, for the shilled subregion ih and the shifted eigenvalue” 
matrix 4, optimizing S gives the solutions with (2 S) 


I where | 4| = {|a, y |}, with 4 = {a 3 } a n x n matrix 
Pi oof See Appendix 

\imaik 4 I For first order subregions, (4 I) becomes p = 1 jh 
Remark 3 1 ) 

l imaik 4 2 Optimization of p(u(5). b{$)) in (4 1) reduces con- 
atism (see Remark *3 4) 

similar to Section III, it is possible to work out this general result 
different subregions This is done in Table II 


0 0013 0 0723 
0 0700 OS 49 4 


0 909S 0 29S1 
o 29 si i i m 


With (3 1 ) this gives 

p s S|( =0 3SG j 

Comparing ft s-s Jff and p f s-/ an improvement of 2 2V< is 
achieved 
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TABLE n 

Results for Different Subregions; cpq is Found with (2,4) and (2.3), The Eeigenvalues are Located 
in the Subregion fl 2 if the Conditions in Theorem 4.1 with Pij and P, as Below are Satisfied 


Subregion 

n, *■ (».y): 

Cqo 

, _ 

'Q]| 

«n 

°70 

p u 

p . 

click 

(x-ft) 2 +y 2 -r I <0 

59 

a 

1 

0 

ETpEj+E|PE, 

•«(E>+PE,)+(E|PA+A t PE,) 

parabola 

-4pa+4px+y 2 < 0 



mm 

um 

^eTpej+eTpe,) 

2p(Ejp+PE^+i(EjPA+A T PE,) 





■ 

91 

- j(eJe{p+pe,e,+eTe]p+pE|E 1 ) 

- j(A T E{P+PE,A+EyA T P+PAE|) 





mm 


^(V’+w-^TPEj+ETpEi) 

-«v J (ETP+PE,)+i(v J +w' , )(E'[PA+A T PEJ 





WM 


+j(V l -w J )(EyETp+PE l E j +ETE}P+PE | Ei) 

+ ;(v 2 -w' 2 )(A T E'[ P+PE, A+E[A T P+PAE,) 

Vertical 

(x-«) 2 -e 2 < 0 


9 

■1 

■ 

^EjPEj+EjPE,) 

-a(E|P+PE^+j(E{PA+A T PE|) 

Strip 



■ 

99 

1 

+±(Ej E^P+PE,E i +E| eTp+PE jE,) 

+ 1(a t eTp+pe, a+e|a t p+PAE,) 

Ride 

i. p? < 


■ 


0 

-(EjPEi+EyPE,) 

-(eTpa+a t pe,) 

quality 

2. x l +y l < t>\ 



1 

0 

ETpEi+EjPE, 

e|pa+a t pe, 

[6] 

3. -w 2 +y 2 < 0 


9 

1 

2 


iCElPE^EjPE,) 

j(E|PA+A t PEj) 




9 


■ 

- i(ETElP+PE i E | +ETEjP+PE i E l ) 

- j(A t E{P+PE 1 A+E{a t P+PAE 1 ) 


Root-Clustering in an Ellipse with Structured Uncertainties 
Consider the system and perturbation matrices 



-0.1 

0 .6' 


1 ()‘ 

.4 = 

-0.4 

1 

, El = 

1 0 


0 
0 1 


With X\(A) = 0.2, A 2 (A) = 0.7. Taking the ellipsoidal subregion 
with a = 0.4, v = 0.5, w = 0.4, then the eigenvalues A] and A 2 
are located in this subregion. The bound on f i and f 2 . //, can be 
computed directly with Q = 5 = I ((4.1) and Table 1) 

/if.Sr/ = 0.0395. 


Using the method proposed in [5], taking Q — i, we obtain the 
perturbation bound 


Pr.yrd = 0.0072. 


Comparing the bounds /i tl .s=/ with p e , ye d we see that our bound 
gives an improvement of 449%. A still less conservative result can 
be obtained by optimizing. The optima] $ matrix and the solution 
for P (2.5) become 


0.9144 -0.4853 
-0.8012 1.0582 


1.8312 -1.6732 

-1.6732 1.8692 



Fig. 1. Root-clustering in the ellipsoidal subregion of the unit disk. The roots 
of the nonperturbed system are denoted by “o”, the roots of the system with 
the maximum perturbation bound $~j arc denoted by ‘V’ and the roots 
with the bound p € ,s=s opt are denoted by “x.” 


The error bound then becomes 

Pe,s=s opi = 0.1098. 

Comparing the bounds /i,. s=s opt with /i e , .s=/ wc sec that optimiza¬ 
tion gives an improvement of 178%. Optimizing the Q matrix and 
using the method of Yedavalli, the same bound as yt <i is obtained. 

The results of this example are illustrated in Fig. 1, where the 
location of the roots in the ellipsoidal subregion of the unit disk, 
with the maximum perturbation (A(.4 ± p(E\ + E 2 )), are shown. 

Regarding this plot, it is important to note that the roots marked 
with a “x” are, in general, not the outermost points of a root locus 
determined by varying f from -// to /* in A(A ± f(Ei + E 2 ). 

General Remarks 

Remark 5.1: In remarks 3.4 and 4,1, comment is given about 
the optimal S matrix. This 5 matrix can be obtained with use of 


MATLAB™ programs for the optimization toolbox [9]. Furthermore, 
note that the optimal S matrix is not unique. For example, when 
5 = aS is substituted, the solution of P with (2.5) become* 
(where Q = o 2 Q — S) P — a*P, Clearly, the bound 1(3.! 
or (4.1)] obtained for P and S equals the bound obtained for / 
and 5. 

Remark 52: It is important to note that the bounds derived an 
valid for time-invariant uncertainties. In the theorems given, however 
only the location of the eigenvalues is guaranteed to remain in - 
specified subregion; this is true for each value of the uncertaint; 
that satisfies the bound. So, when the bound is satisfied, at each tinr 
instant, by the time-varying uncertainties, then the eigenvalues remain 
in the subregion. For these time-varying uncertainties, however 
this does not guarantee a certain corresponding performance. 1 
engineering practice we are interested in performance; hence w< 
presented bounds that guarantee, beside root-clustering in a subregior 
also a corresponding performance. 
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VI. Conclusions 

[n this paper some improved bounds for robust performance of 
,,ieur systems are obtained. Systems with structured and unstructured 
ijncertainties are treated. For each case, optimization yields better 
,* suits. For unstructured uncertainties, with shifting of the subregion 
>fid the system matrix, a less conservative result is obtained. We 
tufva presented our results for various different subregions in the 
form of tables. With the method proposed it is possible to determine 
.Klmissible uncertainties of the system to guarantee, for time invariant 
mccrtainties, a certain performance. 


^(kn|||r||||S- , || 2 + 2h„|||5- , ||||/'.S- 1 u j.i / 

+ <2ho|pr'|| || rs _l ||+ 2h, | H.s"'|| ||Ms 
+ 2|,,„|«5“ 7 .4|| II PS - 1 || 

+ 2MII.V- 1 II 11-4' PS’ 'll )|| E||- 1 <n 

« ..||E|| J + 6|| E||-l <0 ,A2» 

with 

^|n.|||P|||| 5 - , |r + 2h u |||.V x \'\\ps 'll 


Apklndix 

Proof of Theorem 3.1: Recall the Lyapunov root-cluslenng cri- 
rrion of Section II. From assumption (2) it follows that there exist 
a positive definite solution P of the GLE (2.5). If the OLE of the 
perturbed system matrix .4 -f E has a positive definite ‘’solution" 
:)i for that solution P then the eigenvalues of the perturbed system 
natrix are located in the specified subregion. Thus, the eigenvalues 
of the perturbed system are located in ih if 

<■1)0 P + <■ ,0((-4 + E )' P + P( l + E)) 

+ ni (-4+ E)'/’(.l + E) + cjn ((( 4 + P) 7 ) J P 
+ P(.l4 E) 2 ) = -(), < 0. 


h = 2|cio| || A - 1 1| ||E.S ,_I || + 2|c,, | H^ -1 1| ||/* 4.V 'll 
+ 2hn|(II.S’ -1 .4 7 || || PS ' 1 1| + ||.S" 1 1| || 1 1 PS 'll) 

o iieii <+ L • yprr;. 

2" -« q 

Proof of Theorem 4.1: The first part of the proof is the same as 
the proof of Theorem 3.1. We continue with (A.l) 

«./>- '{(•„,(E ( P4 PE) + r,,(E'f 14 4'PE) 

+ <■,„((/•: \)‘ P + (AE)' P + /’( 4E + E.4)) 

+ . iiE' PE + <•■„((E' )-'P+ PI -')}.S’ -1 -/<(). (A.3) 


See also Yedavalli |51. Substitution of (2.5) gives 

«• (•„,(/:'P+ PE) + (,,(E' P4+ 4' PE+ E'PE) 

+ CJ,,((.!' E' +E'a‘ +(E'r’)P 
4 ~ 1 ju P{ 1 E ■+■ E 4 ■+■ 77 ) — < 0 

<*r w S '(E'p + PE).S ' 

+ <■,,.S' 1 (E 1 P A + A 1 PE 4 E' PE).<r' 

+ 1 -, t „S-'(( 4' F.' +E'.4 / +| E')')PS 1 
+ r. u S 1 P(-4E + E 1 + E 1 )S ' 1 - 5" ' .S' .S.S” 1 < 0 

<#<wS-'E'PS '+.,,,5 7 PE.S' 1 + ruS’' E 1 PAS 1 
f < n5 1 4' PE.S" 1 + ci 15 ‘' E' PE.S _ 1 
+ c w S-'A'E J PS 1 + r-jn.S 1 E 1 A 1 PS 1 
+ ooA- ' ( E' )-p.S -1 + r jo .S' -7 P AES ’ 1 
+ CJ0.S - 'PE.4.9' 1 +cjo5 ’ PE 2 S~' - I < 0. (A.l) 

Ihe left side of (A.l) is a Hermitian matrix. A Hermitian matrix 
has the property that all the eigenvalues (A/J arc real and that it 
is negative definite if and only if all its eigenvalues are negative, 
further, from |10], the property A*(.4 - /) < 0 <£> A* (A) - 1 < 
11 *= \\A\\ -WO and ||.4|| + ||£|| > ||-4 4- D ||. with ||.4|| - 
'<ie spectral norm of the matrix .4 (the largest singular value of .4), 
"ives 


Now the substitution E —► , t-,E, is made (2.2). Abbreviating 

= $ 1 ]\S 1 ; Pfj = S~ 1 P tJ S 1 

P, = E/ P 4 PE, ) + n,( El P4 + A ' PE,) 

+<• ((E, .4) r P 4 PE, .4 + ( 4 E,)' P + P.4 E.) 

P, =< u(E! PE,+ E; PE,) 

+cj„((E, E,) 1 P + PE,E, + (E,E,)' P+ PEiE, ) 

/II Ml Ml 

i i /_i,i 

The left side of this inequality is a Hermitian matrix, from 110], the 
property 

\k{A - 1) < 0 \k( 1) - 1 <r 0 <^ || 4|| — 1 < 0 gives 




i-l — i 


1 < 0 






- 1 < 0 


-Ihii5-'E'P5 l |H|r 1 o.S-'PES- , |H|ni5 'E 7 P.4,S ’|| 
+ ||c,, S ' APES- 1 1| + ||c, S-'E 1 PE.S -1 1| 

+ |K.S - ' A' E‘ PS 1 1| 4- |ht>S~ 7 E' A' PS 1 1| 

+ He,, S ~ 7 E‘ E‘ PS~' || + ||cjoS ' PAES~ 1 1| 

+ ||c a o5 - ' PEAS' ' || + HrjoS* ' PE 2 S’ ' || - 1 < 0 

-- 2||r,„S-' P|| ||E|| ||5“ 1 II + 2||n, S - ' ||E J \\PAS-'\\ 

+ lki.s-'|l ll£ r ll 11^1111^11 II*"' 1 II 

+2||rj„.S- '.4 7 HIE' 1111^'“ 1 H+2|ho5-' HUE 7 llll 4' PS - '|| 
+ 2||rj(iS“ 1 1| ||(£V|| lira- 1 II - 1 < 0 


<= * 


ElV 


+ r 




r= 1 / • 


- 1 < 0 


^ (if 2 +bf - 1 <0 (A.4) 

wiih </ & |E:i,E"=,i r */i||' ’> = 

M < -T- + -r • 

lu la 

and where |-4| = with .4 = {a,j }, a n x n matrix. 
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Comments on “Strictly Positive 
Real Transfer Functions Revisited” 

H J Marquez and C J Damarcn 


Abstract —In the above paper , 1 the distinction between weak and strong 
strictly positive real (SPR) functions was addressed, and the feedback 
interconnection of a weak SPR system and a passive one was shown to be 
stable. The purpose of this note is to show that the proof of this lemma 
Is actually incorrect. 


I Introduction 

The concepts of passivity and strict positive realness have been 
an important area of research for the last three decades These 
investigations have brought a better understanding of these ideas 
and their applications, but also an ever increasing mismatch in the 
terminology adopted by different authors The mosi widely accepted 
definitions of passivity and strict passivity are the following [1] 
Define a real inner product (r, y)/ by 

<1 .v)i=( i T (t)v(t)<n ( 1 ) 

Jo 

and let L n ir be the space of all functions i H + —► /?" which satisfy 
ll-rr| |i = {■/, x)t < sc, VT € H + (7?+ is the set of positive real 
numbers) 

Passiuty H L'f —» LJc is said to be passive if there exists 
ii € B such that 

(r H r)/ > J Vj €11, vrei?+ (2) 

Manuscript received January 24, 1994, revised Apnl 11, 1994 
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Fig 1 The feedback system S , 


y(t) 


Strict Passivity H L'{ t —► L'i is said to be strictly passive if 
there exists b > 0, and ii E R such that 

(j.H,) T >b\\n\U+ il Vi ELI. VT€if + (1) 


For linear systems these definitions are closely related to the concept 
of stnctly positive real (SPR) See the above paper 1 for the definitions 
of weak and strong SPR From these definitions, it is straightforward 
that a linear time-invariant system whose transfer function is (weak 
or strong) SPR is passive but, m general, not stnctly passive For 
example the system H (s) = A/(s+n), A > 0, and n > 0 is SPR (and 
so passive), however, it is not stnctly passive since Rc[H{ ju .)] —► 0 
as * x, and therefore no b can be found to satisfy (3) The main 
problem with this result is that it renders the passivity theorem nearl) 
inapplicable for linear systems, since only biproper or improper linear 
systems can be stnctly passive 

Motivated by this observation the authors 1 considered the class of 
systems which satisfy the inequality (6b) 1 


/V. 

Jn 


dt + i 


/V. 

/ f) 


> 0 


(4) 


dt 


where </ = Hu, and (4) is valid for all inputs u such that 
IMMMli < x This class of systems was shown to be equivalent 
to those which are weak SPR and played a central role in Lemma 1 
There, it was claimed that the feedback interconnection of a passive 
system and one that satisfies inequality (4) (see Fig 1 1 ) is stable 
The intention of this note is to show that the proof of Lemma l 1 
is not valid We notice here that the condition IMI2/IMI2 < x is 
not used in the proof of Lemma 1, and therefore in the remainder 
of this note it is disregarded 

Using our notation, (4) can be rewritten as 


(»■#»)+1 ,» 
(», u) 


O) 


A number of comments must be made concerning this definition In 
the first place, it ts incomplete since for expression (4) to be well 
defined, it is necessary for the function u to belong to the space L[ 
This is an important point In fact, this issue renders incorrect th 
proof of Lemma 1 1 Notice that in the proof of Lemma l, 1 it 1 
necessary to consider precisely the case where u is not in Lj Yl 
more appropriate is to use extended spaces and rewrite (4) as 


Inn 

7 —r»c 


(» Hu) T ± i > Q 

(«• u)i 


V« e LI, 


<6 


We now analyze the proof of Lemma 1 (Appendix II 1 ) Sine 
u, = -y and y\ = u (refer to Fig l 1 ) 

(ii, + A + (m. Hiui)i + lh 

(«. M )t (ii. Il)l (ii, m)t 


Here Hi is passive, and therefore the second term on the let 
hand side of (7) is greater than or equal to zero, while H satisfu 
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nequality (6) To show stability, the authors reason by contradiction 
s follows assume that u £ L 2 , and take limits on both sides of (7) 
si 1 x In this case, the right-hand side tends toward zeio and 
heiefore, the left-hand side also tends toward zero Hence the authors 
onclude that there is a contradiction since, by (6), the left-hand side 
actually greatei than zero Therefore it must be tree that i/ € I* 
fhis reasoning is fallacious, however, unless condition (6) is 
lengthened by requiring that the following be satisfied 


mf 

" eI Jr 



(</, Htt)i i 
u)i 


>b>() 


( 8 ) 


In other words, there are only two possibilities ot interest in (7) 


I) 


inf 

; 


Inn 

7 — X, 


(u Hu)j 4 i 


(».«)/ 


>/>-><) 


<»> 


It this is the case then indeed there is a contiadiction in (7) 
Condition (9), howevei, implies that the system is strictly passive 
and therefore Lemma 1 becomes a restatement of the passivity 
theorem (see, for example, [I]) 1 e, it says nothing about weak 
SPR functions 


2 ) 


mf 


lllll 

l - K 


(u , H u )i -I- i 
(« »)i 


( 10 ) 


In this case there is no contradiction in (7) since the left-hand 
side also fends toward zero for some function u without violating 
condition (6) (in the same way l/// 7 —* 0 as n -* "v, for all 
1> 6 /?* > 1 ) 

As a hnal remark we make the following observations, which 
emphasize the distinction between weak and strong SPR It is 
relatively easy to show that the feedback combination ol a (possibly 
nonlincai) passive plant and a strong SPR compensator is stable The 
result can be proved by defining the loop translormation shown in 
1 ig 1 and noting that it does not alter the stability properties ol the 
oiigmal system It is then straight!orwaid to show that tor small 
enough > 0, the system H[ = (1 - H\)~' H\ is passive while 
H' — H 4 1 is strictly passive and therefore stability lollows from 
the passivity theorem 

The case ot a weak SPR system is however very different as 
shown in the following example 

hxumpU I Consider the linear time-invauant system H{ s) = 
( s + < )/[( s + rt)( S + and let H'(s) = f/(s)/[l —r//(s)] Wc have 


__ (n6f — f< 2 ) 4 u. 2 {a 4 b — r — r) 

(ah — (( — Z . 1 )“ 4- (n 4 h — t) 1 

il and only it 

abt — n 1 > 0 n 4 b — < — f > () 


(ID 

( 12 ) 


d 4 b r, we can always find an f > 0 that satislies (12) II 
h >wever, a 4- b = < (l e, when //(s) is weak SPR), no such f > 0 
‘s ists 


II Conclusions 

The proof of Lemma 1 1 is incorrect Since this note does not 
sprove that the feedback interconnection of a passive plant and a 
•ak SPR controller is stable, wc conclude that it remains an open 
icstion 
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On Interval Polynomials with No Zeros in the V mt lb , 
V Blondel 


Abstract —We give a necessary condition for an inters at polynomial to 
have no zeros in the closed unit disc. The condition Is expressed m terms 
of the two first intervals. 


The stability analysis ol polynomials subject to stiuuurcd uncer 
lainty has received considerable attention this lust decade (sec [2| tor 
an historical overview, references related to this contribution include 
[II, HI. (U [81, and [9]) 

In this note we give a necessary condition tor an interval poly 
nomial 


P = {rin -+■ a i „ Ar 4 h, i± < n <. n } 

to be D-stable, ic such that all membeis ol P have no roots in 
the closed unit disc Oui condition is expressed m terms ol the two 
first intcivals only 

In a coiolfary we show that if n„ < 77o/2 and a, ' m/9 then 
P cannot be D stable 

The results presented here are easy consequences of a little known 
theorem on analytic functions 

Landau \ Theorem Assume that the function / is analytic in the 
open unit disc |~| < 1 and that ){ ) ^ U, 1 foi all | | < 1 Then 

|/'(0)| < 2|/(0)|(|log|/(0)||+ 4) 

were 4 is a constant which can be taken equal to 4 4 

For a proof ol this theorem (which is sometime referred to as 
Landau-Carathtrodory theorem) see for example Hill? [4 p 221] 
The best possible bound for 4 was given in 1981 by Jenkins [6), it 
is equal to 4 rr /F ’(\) = 4 17 
We now prove our theorem 

Theorem Let P = {r/o 4 n t . 4 n <, a < a [ be 

an interval D-stablc polynomial and assume that no > u 0 > 0 Then 

l«i|<2^,(log + - a ° -+ 1 » 

\ a n o 

where log^ i = max(l) log i) 

Proof Define a* € [n,>, no] by n ( * — mm (2n t) no) and choose 
an arbitrary set ol coefficients a* 6 [«, di] ( / = 2, u) Consider 
ihe polynomial p( ) defined by 

l>(~) “ ---; (iio +* "l - + "1 2 + + "* ”) 

do ~ a i) 

It is easy to see that p{^) never takes the value zero or one in the 
open unit disc Indeed 

p(~) = O^rijj + rh + n] 2 + 4u* ' 

and 

p{ ) = 1 ^ dj + fi"i- + + 4 a i — ^ 
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Those two polynomials belong to P % hence pU) ^ 0, 1 when |~| < 1 
Applying Landau’s theorem on p(z) we obtain 


o 1 


< 2 


sio "o 


|iog 


+ 4.4^. 


Since a£ > > 0 we get 


l«i| < 2flo^ 




+ 44 


) 


The coefficient a 5 is defined by r/J = min (2^, cfo). If 2^ < a 0 
then oq = 2oq and 

|o, | < 2oo 4.4 = 2a*, (log + ~ - + 4.4 ) 

\ °0 - 0o / 

whereas if 2oq > Tfo then «; — no and 

l»i|<22of|lOg^V +4 4) = 2 !l0 (log + F ^-+4 4). 
VI £Lo / \ ~~ Ho / 


The theorem is thus proved 

Remarks 

1) A corresponding theorem can be derived for other stability 
regions For Schur stability (no roots outside the open unit disc) 
we obtain a necessary condition for the stability of interval 
polynomials with uncertainty in the highest order coefficient 

2) It is clear from the proof of theorem that, if pi (-) = o -I- a i; + 

n 2 ~ 2 + + <?„- 7< andpj(c) = i + n 2 z 2 + + n„~ M 

are both D-stable polynomials, then 


hi 



This inequality can be used to derive bounds for other struc¬ 
tured uncertainties descnptions 

Corollary Let P = {ri 0 + «i - 4- • <J_ t < n 1 < «,} be 
an interval polynomial and assume that 0 < 2^ < a 0 and 9^, < 77i 
Then P cannot be D -stable 

Proof Assume by contradiction that P is D-stable Since 
2«o < "n, the theorem gives |oi| < 2^4.4 = 8 8^ But this is 
a contradiction since 9a,, < The result is thus proved 
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The Logical Control of an Elevator 

Derek N Dyck and Peter E Caines 


Abstract —This paper presents a detailed example of the design of a 
logical feedback controller for finite state machines. In this approach, 
the control objectives and associated control actions are formulated as 
a set of axioms each of the form X implies Y, where X assets that i) 
the current state satisfies a set of conditions and il) the control action y 
will steer the current state towards a given target state; Y assets that the 
next control input will take the value y. An automatic theorem prover 
establishes which of the assertions X is true, and then the corresponding 
control y is applied. The main advantages of this system are its flexibility 
(changing the control law is accomplished through changing only the 
axioms) and the fact that, by the design of the system, control actions will 
provably achieve the control objectives. The Illustrative design problem 
presented in this paper is that of the logical specification and logical 
feedback control of an elevator. 

I Introduction 

The COCOLOG system (from Conditional Observer and COn 
troller LOGic) [1], [2] is a logical system for the state estimation 
and control of finite stale machines In this approach, the control 
objectives are lormulatcd as axioms (i e, necessarily true logical 
formulas) which relate the current state to a target state The axioms 
are each of the form Y implies 3 , wheie \ asserts that i) the cunent 
state satisfies a set of conditions and n) the control action i/ will 
steei the current state towards a given target state, 3 asserts that 
the next control input will take the value y An automatic theorem 
prover establishes which of the assertions \ is tiue, and then the 
corresponding control y is applied The main advantages of this 
system are its flexibility (changing the control law is accomplished 
through changing only the axioms) and the fact that, by the design 
of the system, control actions will provably achieve the control 
objectives 

This paper applies the COCOLOG system to an idealized veision 
of an elevatoi control problem Section II describes the state, dy 
namics and control of the elevator Sections III—V briefly outline the 
COCOLOG system and present a new logical framework specific to 
the control of an elevator Sections VI and VII present the results 
of computer simulations using an Automatic Theorem Prover (due 
to Mackling, sec [3]) and the conclusions which can be drawn from 
these results 
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II The Elevator Control Exampif 
The logical control of an elevator is a good example with which 
to illustrate the operation of the COCOLOG system because it shares 
with many other discrete event systems the features of i) simplicity ol 
dynamics, a) combinatorial complexity of state description, and in 
great variety of possible control strategies and resulting trajectories 
This section desenbes the basic set-up of the elevator control problem 

A The Stare 

The elevator control problem studied in this paper consists o 
a single elevator m a building with five floors, numbered zen 
to four The elevator can handle up to three demands, with th< 
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TABLE I 

Elevator State Variables and Their Values for the Initial State of the Example Trajectory 


Current Floor * 2 

Location,, * 1 

Location! = 2 

Location * 3 

Destination = 2 

Destination, = 0 

Destination = 4 

Frustration = 3 

Frustration, = 2 

Frustration * 1 


ernands numbered zero to two. A demand is characterized by the 
emand state which is given by a triple consisting of a location, a 
eclared destination, and a frustration level. To simplify the state 
^presentation, a frustration level of zero is used to indicate an 
iaclive demand. Table I contains the complete set of state variables 
Td their values for the initial state of the example to be presented 
ater. 

I System Dynamics 

The dynamics of the elevator are quite simple. At each time step, 
'ie elevator either moves up or down one floor, or stays at the 
,ame floor, according to the control input. This means the elevator 
effectively stops at every floor it passes. New arrivals are treated as 
sturbance inputs: when a person enters the world of the elevator, 
le or she immediately announces a destination and starts with a 
Tusiration of one. The frustration is then incremented at each lime 
,iep until the individual reaches his or her destination, ll is assumed 
hal there are never more demands arriving than the maximum number 
if demands allowed by the system (if there are, these demands are 
.imply ignored until an active demand is satisfied). 

Any person at the same floor as the elevator moves with the 
levator (that is, the elevator lakes on everybody at each floor it 
arrives at, even if they are traveling in the opposite direction). When 
person's location becomes equal to his or her destination, that 
person immediately leaves the world of the elevator. That is, when the 
location and destination of a demand become equal, the frustration 
of the demand is set to zero, indicating an inactive demand. 

The example presented in this paper involves satisfying the active 
demands in the sense that the controlled trajectory of the elevator 
results in a state in which the location and destination of each active 
demand are equal, and the elevator is at rest. This formulation is 
easily extended to the case where new demands occur at random 
instants and enter the system if fewer than three demands are active 
at that instant (see, e.g., |3|). 

C i 'antral Law 

' he control problem for an elevator may be viewed as a disturbance 
u'l lion control problem. The basic objective is to empty the 
etc itor, though other goals may temporarily or permanently override 
t*i one. The basic control law must therefore guarantee that in the 
al ace of disturbances and overrides the elevator empties in finite 
ll1 Our control strategy for the basic control problem is that of 
f 'tizing persons with a high frustration level. 

le simple control law which satisfies these objectives is the 
L -»1 law “Max first” which always attends to the person with the 
h st frustration level. That is, at each time step, the person with the 
h* st frustration is selected and referred to as Max. If Max is not in 
di' evator, the elevator moves up or down in the direction of Max's 

II -on. If Max is in the elevator, the elevator moves in the direction 
f( ax’s destination. Since the dynamics guarantee that Max will 

icted when the elevator reaches Max’s floor, this control law 


satisfies the basic objective of emptying the elevator in finite nmc in 
the absence of fresh inputs (i.c.. new demands or overrides). 

To illustrate the basic COCOLOG property of flexibility with 
respect to control objectives, two override control objectives are 
added. In the first, the fourth floor of the building is designated as a 
hospital, and it is assumed that any person whose destination is the 
fourth floor is an emergency case who takes priority over all other 
persons in the world of the elevator. In the second, the objective is 
changed altogether in case of a fire: as is common practice, when the 
fire alarm is triggered, the elevator stops at the nearest floor (so that 
people can take the stairs to safety). In our control strategy the fire 
objective overrides the hospital emergency objective. 

The ease with which these control objectives can be added to the 
basic control law illustrates the power of the COCOLOG approach to 
control. It should be emphasized that the “Max first” basic control law 
is not designed to be an optimal one, but rather one which is simple 
enough to clearly illustrate its implementation using COCOLOG. ll 
is to be noted that logic control laws of the type: “serve the most 
frustrated customer first” treat large sets of stales as equivalent, since 
the stales of the demands not being served are ignored. Consequently, 
the logic control law described here succeeds in handling a system 
in which the number of possible states is estimated at 10 million. 

III. COCOLOG 

The COCOLOG system is a family of first-order logical theories. 
The theories provide a logical system for describing and reasoning 
about the state estimation and control of a given finite input-state- 
output machine. The COCOLOG system consists of a family ot 
theories because at each time step a new theory is generated, since 
some axioms are used to describe the inputs and observed outputs of 
the system (or the next state in the case of complete state observation). 
More formally, at each discrete time instant k, the A’th first order 
logical theory Th k is used to determine the next control input f {k) 
through a set of Conditional Control Axioms (CCA’s) with Th k , 
The CCA’s imply certain values for the control input when certain 
conditions C f k " are fulfilled. In logical syntax, the CCA's are of the 
form 

Cr -> (C(k) = n m ). 

This formula, in English, would read: “If formula C k ' in theory k is 
true, then this implies that the control input at time step k is equal 
to the constant i/ r ".” 

In COCOLOG, the control input V(k) is assigned a particular 
value n ?n if a proof exists, in T7u, of the predicate V{k) ■= u . 
Since each CCA™ is an axiom, a sufficient condition for this to hold 
is the existence of a proof of C™ in Th k . We observe that for finite 
machines in general, and the elevator example in particular, a finite 
model (in the technical sense of model theory, see 12] and f7J) for 
any theory is readily constructed. This establishes the consistency 
of COCOLOG theories in general and ELCOLOCi (see below) in 
particular. 
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A. Markovian Fragment 

The family of theories in a COCOLOG all have in common a subset 
of the axioms which are this same in all theories; these axioms are, 
by definition, the axioms in Tbo. The transition from theory Thk- 1 
to Tfn is governed by mela-level rules of inference, which cannot 
be represented within these theories. In the general case, each theory 
Thk follows from T7i*..i by the addition of axioms, for example ihe 
axiom which asserts the equality of the output at time instant k to 
the observed output value. In practice, however, it is very expensive 
(in terms of proof time on an automatic theorem prover)to keep 
increasing the number of axioms as the system evolves over time. 
An alternative is to restrict the set of axioms to a moving window 
which only includes the most recent additions to the axioms in Tho- 
This is a so called Markovian fragment of COCOLOG theory, which 
is described in more detail in |4]. It can be shown that, provided 
certain conditions are satisfied, the Markovian fragment has the same 
power to make control decisions as the original COCOLOG (this is 
the principal mathematical result of 14]). 

The rest of this section focuses on the application of COCOLOG 
to the special case of the elevator control example. More details of 
the operation of COCOLOG in the general case can be found in [2], 
[5], and [6]. 

B. Markovian Fragments for the Elevator Control Example 

In the case of the elevator control example, the Markovian fragment 
is implemented by effectively starting from a new initial state at each 
time step. Each theory Th k therefore consists only of the axioms in 
77i 0 together with a set of axioms which describe the state at time 
instant k\ In the general case, the state description axioms would in 
fact describe the current state estimate (CSE) set of possible current 
states which satisfy the predicate CSE*. However, since the elevator 
system has full stare output, i.e., complete stale observation, the 
current state is known exactly, and hence CSE* is satisfied only 
by a single state value. 

Our axiomatic formulation of the elevator system docs not conlain 
a complete axiomatization of the system state transition function. 
Although this approach reduces the cxpressibility of our syntax, 
this approach has several advantages at both the logical and the 
implementation level: 

1) A complete logical model of the system is not used in the 
elevator example to reason about the behavior of the controlled 
elevator. This reduction of the axioms to a subset of the Mar¬ 
kovian fragment increases the efficiency of the logic controller. 
We note that the actual (computed) state transition of the system 
(occurring ut each control action) is realized outside the ATP 
program. This is consistent with our ontological assumption 
that the controlled dynamical system is an entity distinct from 
the logic controller and the theories it carries. 

2) Whenever the system undergoes exogenous disturbance inputs 
(in our case new passengers and their destinations) the resulting 
state value is efficiently accepted into the flow of COCOLOG 
theories. This is because Ihe complete state observation hypoth¬ 
esis implies the new state value is immediately available to the 
COCOLOG controller. 

3) Since the current state is treated as an initial state within each 
theory, the time index can be dropped altogether within each 
theory. (This would not be possible, however, if a model of 
the dynamics of the system is used to predict the state of the 
system at future time instants.) This results in a significant 
simplification of the conditional control axiom set. 

The logic resulting from our restrictions of a complete Markovian 
fragment specification is sufficiently different from COCOLOG that 


it warrants the distinct name ELCOLOG (for ELcvator COntrolK -• 
LOGic). The next section presents the language of ELCOLOG. 


IV. The Elevator Controller Logic ELCOLOG 

ELCOLOG is a first order logic which is used in elevator control in 
prove or disprove control formulas, given a specified state. Associated 
with any logic is a syntax and a semantic interpretation. The syntax is 
a set of symbols and the rules for formulating formulas. The semantics 
is the meaning attached to these symbols. 

A. The Language of ELCOLOG 

The ELCOLOG language consists of constant symbols, variable 
symbols, function symbols, and predicate symbols, as well as the 
punctuation symbols “(," “),” and ‘V\ The precise syntactic rules 
governing formulas can be found in any logic text, e.g., [7]. This 
section gives an informal description of the syntax, and a semantic 
interpretation. 

Constant Symbol Set: 

The symbols representing zero and the first four positive integers: 
0 12 3 4 

The Skolem constants (used in place of existentially quantified 
variables): Vic Max 

Vic is the index to the demand of the person whose destination is Ihe 
hospital, in the case of an emergency (i.e., the victim), and Max is 
the index to the demand of the person with the maximum frustration 
level. 

Variable Symbol Set: 

General variables: a b c 

Variables used for indexing the location, destination, and frustra¬ 
tion: m n 

Ihe indexing variables may only assume values in the constant 
symbol set: {0, 1, 2. Vic. Max}. Normally, this restriction would 
imply the use of types to constrain the semantics and to increase 
the efficiency of the ATP. However, in our simple example this is 
instead accomplished through the use of domain predicates, which 
arc added to the relevant logical axioms to restrict their application 
to instantiations obeying this rule. 

Function Symbol Set: 

The following are the state projection functions. They are all implicit 
functions of the time index k. Note that arg is an index and must be 
one of {Vic. Max. 0. 1, 2, m, n(. 

Pn - the position of the elevator (at the current time 
instant). 

Pl(arg) - the location of the person associated with 
demand #arg. 

Pd(arg) - the destination of the person associated with 
demand #arg. 

Pf(arg) - the frustration of the person associated with 
demand #arg. 

And the input at this time step is also an implicit function of 1 ' c 
time index k: 

U - the control input. 

Predicate Symbol Set: 

Logical operators: 

-» arg - True iff arg is not true. 

arg L A arg 2 - True iff both arg, and arg 2 are true. 

arg A V arg 2 - True iff either arg, or arg 2 is true. 

arg! —* arg 2 - True iff arg 2 is true whenever arg, is true 
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\rithmetic relations: 

arg t = arg 2 - True iff arg A and arg 2 are equal. 

arg, < arg 2 - True iff argj is stnctly less than arg 2 . 

Predicates, with the state as an implicit argument, used in the 
L M ditional control axioms: 

Fire - True iff the fire alarm is on. 

Emergency - True iff someone's destination is the fourth 
floor (the hospital). 

To reduce the clutter of parentheses, the following operator prece¬ 
dence is used, listed from highest to lowest: =, <, -i, A, V, — 

B Rules of Inference 

Rules of inference determine how new formulas can be derived 
from a set of formulas (some of which will be axioms) while 
preserving truth in the process (that is, if all the formulas in the set are 
true, then the new formula is also true). Formally, for ELCOLOG to 
be a logic, it is required that the rules of inference be defined so that 
proofs of formulas can be constructed. For this formal requirement, 
(he two rules of inference used in ELCOLOG are Modus Ponens and 
the Generalization rule. Stated in English, Modus Ponens is simply: 
“If A is true, and .4 implies B is true, then B is true,” where .4 
and B are formulas. The Generalization rule is: “If .4 is true, then 
4 is true for all i\" where r is a variable, and .4 is a formula in 
which r is free (i.e., is not governed by a universal or existential 
quantifier). Also, along with these two rules, several Logical Axiom 
Schemata are also defined. In these details ELCOLOG is identical to 
COCOLOG (see [21), so they will not be repeated here. 

V. Axiomatic Theory of ELCOLOG 

As in COCOLOG, the axioms in Th () include all the axioms except 
the axioms describing the current state. The complete set of axioms 
ml In is given in Appendix I. The axioms in Th 0 divide naturally 
into number relation axioms, control axioms, and override control 
axioms. The set of axioms in 7'/u is formed by adding the current 
state axioms. 

A Number Relation Axioms 

The number relation axioms define the domain of positive integers, 
and also define the properties of the equality and inequality predicates. 
To make automatic theorem proving tractable, the arithmetic axioms 
also include several theorems. In particular, included as “oracle 
information” are the properties of the "less than” predicate (<) for 
all pairs of integers in the domain. 

Control Axioms 

I he control axioms are built around the simple control law, 
u bribed in Section II-C, which attends to the person with the highest 
1 ’‘iration level. These axioms are in the form of Conditional Control 

oms in COCOLOG 

C m - U = u"'. 

first of these is the axiom which expresses the law 
^ ' put if all demands are inactive: 

Pf(Max) = 0 —► U = 0. 

^ other axioms in this form are the following. 


Go down if Max is waiting at a floor below the elevatn. 

-(Pf(Max) = 0) A PI (Max) < Pn - U i 

Go down if Max is in the elevator and wanting to go down 

^(Pf(Max) = 0) A PI(Max) = Pn A Pd(Max) < Pn — II i 

Go up if Max is waiting at a floor above the elevator: 

-.(Pf(Max) = 0) A Pn<Pl(Max) ^ U ^ 2. and 

Go up if Max is in the elevator and wanting to go up: 

-.(Pf(Max) = 0) A PI (Max) = Pn A Pn < Pd(Max) — U = 2. 

In addition to these axioms, two other axioms are required to uniquely 
determine the value of the Skolem constant Max. They are 

Pf(n) < Pf(Max) 

and 

Pf(n) — Pf(Max) —♦ ->(n < Max). 

The next section discusses the override control axioms and explains 
how they are added to these basic control axioms. 

C. Override Control Axioms 

The full power of ELCOLOG, as for COCOLOG, lies in the ease 
with which it allows control objectives to be modified. In this case, 
the two override control objectives each use a predicate symbol to 
indicate whether the override is in effect or not. These predicate 
symbols are then logically OR’ed into all the basic conditional control 
axioms. For example, the control axiom in the case where the elevator 
is empty is: stay put if all demands are inactive; this becomes 

Fire V Emergency V (Pf(Max) = 0) —* U = 0. 

So in each case the value of the override predicate must be deter¬ 
mined, either through the current state axioms in the case of a fire, or 
through additional axioms in Th o, as in the hospital override case. 
The axiom in this second case, for the hospital override, is 

—«(PT(n) = 0) A Pd(n) = 4 -► Emergency. 

The complete set of axioms, including the hospital override axioms, 
is given in Appendix I. 

D. Current State Axioms 

The current state axioms are added to Tho to form T/u. These 
axioms are very simple, as they consist only of a set of equality pred¬ 
icates which equates each state projection function to the observed 
output value (recall this system has full state output, i.e., complete 
state observation). For example, one axiom in this set for the state 
shown in Fig. 1 is 

Pl(l) = 2 

which simply asserts that the location of the person associated 
with the demand #1 is the second floor. The current state axioms 
corresponding to Fig. 1 are given in Appendix I. 
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Fig. I. Elevator control example—hospital override, 

VI. IMPLEMENTATION 

A. The Automatic Theorem Prover 

To determine the control input given an initial state an Automatic 
Theorem Provcr (ATP) is used. This section briefly describes the 
“BliizenstumT resolution-refutation theorem prover, developed by 
Mackling [3]. which was used for the simulations. In the propositional 
logic case (i.e., no variables) resolution is an inference rule which 
is a generalization of Modus Ponens, This inference rule applies to 
formulas in canonical clausal form: the formula consists of terms 
(or the negation of terms) which are OR'ed together. Resolution 
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works by first searching tor two formulas which share a term tl» >i 
appears negated in one formula but not in the other. The new formu i 
is derived by deleting the shared term from both formulas ai J 
combining Ihem with the OR operator. For example: ihe lire overru - 
control axiom: Fire —♦ U = 0 can be re-wrilten in canonical clau 1 1 
form as: -»Fire VU = 0. If the current stale axiom set includes i! ' 
formula Fire, then by applying the resolution rule, the new formi i 
U = 0 can be derived. In the first-order logic case, the resolutu i 
rule involves the process of unification (see [9]). 

A proof by refutation starts with a consistent set of axioms, > 
which is added the negation of the formula to be proven. If 1 






























I< h TUANS ACTIONS ON AUTOMATIC CONTROL. VOL 40. NO 3, MARCH 1995 


c itradiction can be derived using the rules of inference, then ai 
» jst one of the original formulas must be false, since inference rules 

Finol derive false formulas from true ones. Since the original set of 

ioms is consistent, it is the negation of the formula to be proven 
Much must be false, and therefore the formula must be true. For 
v ample, the formula to be proven might be U = 0, which asserts 
rat the control input is STAY. To prove this, the negation of this 
wrmula ">(U — 0) is added to Th In the case where one of the 
Liment state axioms is Fire, U = 0 can be derived by resolution, as 
shown above; then U = 0 with -’(U = 0) yields by resolution the 
empty formula, i.e., a contradiction. The steps leading to the empty 
formula constitutes a proof of the formula U = 0. 

A resolution-refutation ATP, defined in terms of these concepts, 
is, an algorithm which recursively applies the resolution inference 
rule to a set of axioms together with the negation of a formula 
in order to find the empty formula. In the case where the original 
formula is, in fact, false, the theorem prover will fail to terminate, 
since a contradiction cannot be derived from a set of true formulas. 
Therefore, in practice, it is necessary to run the ATP on all possible 
control actions simultaneously, and terminate them all as soon as one 
finds a refutation proof. 

Wc say formula A subsumes formula D if A —► D. For example, 
C] V D subsumes C\ V C 2 V D. It is a fact that, if A subsumes 
/?, then any formula which can be derived by recursively applying 
the resolution inference rule to .4. B } and S can also be derived 
from only .4 and 5, where 5 is a given set of additional formulas. 
The “Blitzcnsturm” ATP extensively applies the subsumption rule 
which, in its simplest form, deletes subsumed formulas from the 
list of formulas at any stage in the resolution process (sec |9J). For 
example, the current state axiom -<Fire subsumes the fire ovemdc 
control axiom; Fire —♦ U = 0. In this case, the fire override control 
axiom can be deleted by the ATP, since it is not necessary to prove 
any control actions. Subsumption is especially useful in keeping the 
size of the set of formulas manageable while searching for a proof. 
In particular, note that the combined effect of the resolution rule 
and the subsumption rule in the above examples is to delete the 
irrelevant axioms from Tin depending on whether Fire or -’Fire 
appears among the current state axioms. This applies to override 
controls in general, and so the addition of override controls does not 
necessarily imply longer proof times. 

The ATP has no built-in provision for handling objects of different 
types. Specifically, the restrictions imposed on indexes are not obeyed 
by the ATP. However, as explained in [8], the set of axioms can be 
converted into a single typed language by using domain predicates, 
vvhich, in ELCOLOG, is the inequality predicate. For example, the 
tust axiom to determine Max, in which n is universally quantified, 
becomes 

n < 3 —► --(Pf(Max) < Pf(n)). 

1 >‘se domain predicates are included in the input tile for the ATP. 


r he Trajectory 

he initial state is as shown in Table 1. Note that there is one 
m whose destination is the fourth floor. Appendix 11 shows the 
cment of the elevator with the hospital override in effect. In this 
l >hical representation of the state, the elevator travels horizontally, 
■' - the triplets of integers display each person’s location, destination, 
' frustration. This trajectory was computed using the Blitzensturm 
' of Mackling [3]. 


VII. Discussion and Conclusion 

A. Computational Complexity 

At the time this paper was written, the time required 
single proof was of the order of 10(1 seconds on a Sl'N Spa 
workstation. This is clearly not suitable for the real turn u’-m, 
of an actual elevator. However, significant improvements m pu>, 1 
times are continually being made for T. Mackling’s Blit/ensunn. 
automatic theorem prover (see |3]), and in view ol predicted advances 
in computer processing speeds it is not unreasonable 10 expeel real 
time performance for systems of significant complexity in the future 

With regard lo more complex systems, it is a fact that (he 
computational complexity of refutation proofs is exponential in the 
number of axioms f 10], This is the central motivation lor the theory 
of Markovian fragments of COCOLOG [41 (which avoids increasing 
the size of the axiom set over time). An additional measure to combat 
increasing complexity is that of a hierarchal control strategy. In this 
approach, the states of complex systems are aggregated into the state 
sets of simpler systems at a higher hierarchical level. The computation 
of control actions will be carried out in a manner that respects this 
hierarchy. 

B. Conclusion 

In this paper the elevator control problem is employed to illustrate 
and apply the underlying concepts of COCOLOG. The essential 
features of this approach are i) the ability to formulate descriptions of 
dynamical systems and their control objectives by the use of logical 
axioms and (li) a flexibility which permits the easy modification 
of such axioms, in particular the control axioms. In the elevator 
example this modification was achieved by the addition of override 
axioms. In our view, the ease with which we could write in and 
computationally implement the “emergency” and “fire” overrides 
to the basic “max first” ELCOLOG control objectives provides 
evidence for the flexibility of COCOLOG systems, and we suggest 
the term “objective adaptability” for this property. It should also be 
emphasized that the use of axioms to formulate control objectives 
enables the resulting control system to be engineered so as to 
be provably correct. In general, this property is viewed as being 
difficult to obtain for finite state machines; for example, in current 
practise considerable time is spent developing and testing elevator 
controls. 

Finally, we shall mention that experiments are in progress with 
different COCOLOG formulations of the elevator problem which 
include simulations of exogenous destination demand flows and 
permit cyclical ATP iterations which attempt to prove different 
conditional control theorems. 


Appendix 1 

Complete Axiom Set for thf ATP 
Equality Axioms 

Reflexivity: a = a, symmetry; a = b —> b — a, and transitivity: 
a = bAb = c-+a = c 

In addition to these axioms, the appropriate substitution axiom is 
required for every function and predicate symbol. 

Strict Inequality Axioms 

Nonreflexivity; -»(a < a) and transitivity; a < bAb < c —* a < c 
Substitution axiom for inequality: a = bAb<c-► a <r c 
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Domain Characterization Axioms 

The entire domain consists of positive integers: a > 0 
The entire domain is ordered: a = bVa<bVa>b 
Axioms defining the relation between zero and the four positive 
integers: 0 < 1. 1<2, 2 < 3, 3 < 4 
Theorems to aid the Automatic Theorem Provcr (oracle informa¬ 
tion): 

0 < 2, 0 < 3, 0 < 4, 1 > 0, 1 < 3, 1 < 4, 2 > 0. 2 > 1, 
2<4. 3>0'3>1'3>2'4>0, 4>1'4>2'4>3 

State Projection Functions 

Pn is the number of the current floor, 

Pt(n) is the location of the person associated with demand #/«. 
Pd(n) is the destination of the person associated with demand #//. 
Pf(n) is the frustration of the person associated with demand #//, 
Substitution axioms for equality of the state projection functions 

m = n A Pl(m) — a —► Pl(n) = a, 
m = n A Pd(m) = a —► Pd(n) = a. 
m = n A Pf(m) = a —> Pf(n) = a 

Substitution theorems for inequality of the state projection func¬ 
tions 

m = nA Pl(m) < a —* Pl(n) < a. 
m = n A Pd( m) < a —► Pd(n) < a, 
m = nAPf(m) < a -> Pf(n) < a 

Axioms for Control 

Fire Override If there is a tire, stay at the current floor: 

Fire —> U = 0 

Fire is OR’ed into all of the following control axioms, for clarity, 
however, it will not appear in the following. 

Hospital Override' If someone's destination is the fourth floor 
(the hospital) then that person has priority 

n < 3 A -i(Pf(n) = 0) A Pd(n) = 4 —♦ Emergency. 

If there is an Emergency, then the following axioms determine Vic 

Vic is (an index to) a person: 

Emergency —* (Vic = 0 V Vic = 1 V Vic = 2) 

Vic’s destination is the fourth floor: Emergency -+ Pd(Vic) = 4 
Vic’s frustration is not zero: Emergency —* ~»(Pf(Vic) = 0) 

If there is more than one person whose destination is the fourth 
floor, then the one with the lowest index is treated hrst 

n < 3 A -»(Pf(ti) = 0) A Pd(n) = 4 —► -•(n < Vic). 

Control in case of emergency: deliver victim to the fourth floor 

Pl(Vic) < Pn - U = 1. -i(PMVk) < Pn) -+ U = 2 

(since Vic will exit on arrival, this last axiom will not cause an 
attempt to go up from the fourth floor) 

Basic Control: The basic control is: attend to the person with 
maximum frustration level. This is the control action to be taken 
in the absence of a Fire or an Emergency. To specify this, each 
of the axioms A 4 in the basic control is to be interpreted as: 
-•Fire A ->Emergency —► A,. For clarity and because of space 
limitations, only the nonoverridden axioms A, are specified in the 
following. 


Max is (an index to) a person: Max = 0 V Max = 1 V Max = 2 

Axioms which uniquely determine Max: 

a) No one is more frustrated than Max: -»(Pf(n) > Pf(Max)) 

b) Of those who are equally frustrated, none has a lower demant 
number than Max 

Pf(n) = Pf(Max) -> -i(n < Max). 

Stay put if all demands are inactive: Pf(Max) = 0 —► U = 0. 

Go down if Mux is waiting at a floor below the elevator, or if Mav 
is in the elevator and wants to go down 

Pf(Max) / 0 A PI(Max) < Pn —> U = 1 

Pf(Max) ^ 0 A PI(Max) = Pn A Pd(Max) < Pn —► U = 1. 

Go up if Max is waiting at a floor above the elevator, or if Max 
is in the elevator and wants to go up. 

Pf(Max) ^ 0 APl(Max) > Pn — U 2 

Pf(Max) ^ 0 A Pl(Max) = Pn A Pd(Max) > Pn —>* U - 2. 

Axioms Describing the Current State 

Current state at time instant A- = 1 is as in Fig. 1 

-Fire. Pn = 2. Pl(0) = 1. Pd(0) -= 2. Pf(0) ^ 3, Pl(l) ^ 2. 

Pd( 1) =r 0, Pf(l) “ 2. Pl(2) =- 3, Pd(2) = 4, Pf(2) r: 1. 

With the Hospital Override in effect, the next control input should 
he up (T = 2). This is proved by a refutation proof (a contradiction 
is derived from the negation of the formula to he proved -(U = 2)). 
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Robust Stability Criteria for Dynamical 
Systems Including Delayed Perturbations 

Hansheng Wu and Koichi Mizukami 


\bstract— In this note, we consider the problem of robust stability 
of uncertain time-delay dynamical systems. A new robust stability cri¬ 
teria for linear dynamical systems subject to delayed time-varying and 
nonlinear perturbations is derived. The results obtained m this note 
are less conservative than the ones reported so far in the literature. 
Some analytical methods are employed to investigate the bound on the 
perturbations so that the systems are stable. A numerical example is given 
to demonstrate the utilization of our results. 

1. Introduction 

It is well known that time delay is often encountered in various 
engineering systems, such as chemical processes, hydraulic, and 
rolling mill systems, and its existence is frequently a source of 
instability. Therefore, the stability problems of lime-delay dynamical 
systems have received considerable attention over the decades (see, 
eg., 111—|8]). Because time-delay dynamical systems often include 
some perturbations, it is necessary to investigate the problems of 
robust stability of uncertain time-delay dynamical systems. In |9], 
lor example, the problem of quantitative measures of robustness for 
linear dynamical systems including delayed nonlinear perturbations is 
investigated, and by using Razumikhin-type theorems, some bounds 
on the perturbations are obtained so that the systems remain stable. 
In [10], a delay-dependent stability condition for the linear uncertain 
time-delay dynamical systems is presented. 

In this note, we consider the problem of robust stability criteria for 
linear dynamical systems subject to delayed time-varying and nonlin¬ 
ear perturbations. A new bound on the perturbations is presented so 
that the systems remain stable. Some analytical methods are employed 
to investigate this bound. Our results show that the robust stability 
criteria presented in this note is less conservative than the one given 
m [9]. In addition, since the robust stability condition developed in 
this note is independent of the delay, the results of this note may be 
applicable to dynamical systems with uncertain time-delay. 

The rest of this note is organized as follows. In Section II, the 
problem to be tackled is precisely stated. In Section III, the main 
results of this note are proposed. As an application, an illustrative 
example is given in Section IV. The simulation of this example 
demonstrates the utilization of our results. 

11. Problem Formulation 

We consider the linear dynamical systems including delayed per- 
lurhations described by Ihe differential-difference equation of the 
tom i 

~ = A.r(t) + Af(.r(t-h(t)).1) Cl) 

(It 

Wl ‘e / e R is the ‘time,' ,r{t) E R" is the current value of the stale, 
^ H llXn is an asymptotically stable matrix, A/(-, ■) : R n x R -* 
^ a time-varying nonlinear continuous function. The time-delay 
»s any nonnegative, bounded, and continuous (unction. That is, 
( h{t) < h, where h is any constant. 
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Here, it is worth noting that we do not require foi ih. , 
of the time-varying delay h(t) to be less than one, i.e. , 

II is well known that such an assumption is often needed 
papers dealing with stability problem of systems with tni ■ 
delay (see, e.g., |8]). Therefore, as will be seen in the next st. ■ ,, 
results of this note may be applicable to the case where the tun 1 * 
function is uncertain. 

The initial condition for system (1) is given by 

■•(*)— <y’(0- t 6 [to — hj [t ] (2) 

where ?.»(•) is a continuous function on [/» - hj n \. 

We assume that A/K-) represents the uncertainty and perturba¬ 
tions acting on system (1), and that there exi us a nonnegativc constant 
3 such that for all (jr.t) € R" x R 

\\±f{.v(t-h(t))J)\\<,l\\jit-h(t))\\ (3) 

where || ■ || denotes the Euclidean norm. 

Now, the problem to be tackled in this note is to hnd some 
conditions on the perturbations so that system (1) is stable. 

III. Main Resuits 

In this section, we derive the stability condition for system (l). 
which gives the bound on the perturbations. Before stating our main 
results we give the following definitions. 

Definition 1. For any similarity transformation matrix 37, we 
define 


/i = ii.vini.v- , ii. 


(4) 


Definition 2: For the asymptotically stable matrix .4 given in 
system (1), wc define a constant n which satisfies 

0 < n < min {|Rp A,(.4)|. / — 1, - ■ -. /i} (5) 

i 


where A,(-) denotes the ith eigenvalue of the square matrix (■), 
ReA,(-) its real part. 

Definition 3. For the symmetric positive definite matrix P E 
J? ,,x, \ we let 


A, n „x(P) 

An ».,(/') 


( 6 ) 


where A llltix ( ) and A im „(-) denote the maximum eigenvalue and the 
minimum eigenvalue of matrix (■), respectively, and P satisfies the 
following Lyapunov-type equation 

(.4+ .»/)' P + r{A + nI) = -Q (7) 


where 1 denotes the identity matrix, (} E R” is any symmetric 
positive definite matrix, and 

A = \r l AM. («) 


Remark I: Since the matrix .4 is asymptotically stable, by Defini¬ 
tion 1 the matrix (.4 + o/) is also asymptotically stable. Therefore, in 
the light of the Lyapunov stability theory. Lyapunov-lypc (7) exists 
for any symmetric positive definite matrix Q a solution P which is 
also a symmetric positive definite matrix. 

Then, for system (1), we have the following stability condition 
which results in a bound on the perturbations. 
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Theorem I: If the following inequality is satisfied 

0 <!>■= — (9) 

litr 

then system (1) is stable. More precisely, we have 

INM||<f^||D||«p{-T(f-f 0 )} dO) 

1 — a 

where 

d-.= d^)=zr l 1, -€[(),«) (11a) 

« - 1 

||D||:= sup p/ _, i'-(r)|| (lib) 

||‘.(f)||:=||A/" l j-(0|| (Uc) 

where M is any similarity transformation matrix. 

Proof; We transform system (1) with the aid of a similarity 
transformation matrix A/, and get 

= A-.(t) + M~' A/(,Vt<( - h(t)).t) (12) 

iIt 

with an initial condition 

i{t)= t€[t»-h.to] (13) 

where ; = M~ l .r. 

Let c(f) be the solution of system (12) with the initial condition 
(13) and define a positive definite function VU) as follows 

V(:) = iUt)Piit). (14) 

In the light of Rayleigh principle 111J. from (14) we have 

A im „ (r)||i(f)|| J < ru) < a,„„ (D||.(f)|| J . d5) 

From (7), (12), and (14) we obtain 

2nV(;)- -’(OQ-At) 

ill 

+ 2 z 1 (t)PM 1 A/(il/:(f - /»(()),(). (16) 

Since both Q and P arc positive definite, we also have 

< —2<»1 U) + 2,i 1 X .,(r)||:(f)||||i(f-/<(0)||. (17) 

(it 

Therefore, from (17) we obtain the following inequality on Y(\) 

V(i) <\,^(P)\\D\\ J v\v\-2o(1 - fi,)} 

+ f 2// )A„ m , (/ > )pxp {-2n(t — r)} 

Jt (i 

■ |U(r)|| ||-(r ~ (18) 


Furthermore, from (19) we obtain 

= -nS(t) + /i/iA M »(P)Myi||i(f - hit ))II 

<- nS {t)+'^d£l\\i(t-h(t))\\. 

yAjinn ( P) 

Therefore, from (22) we also have 

Sit) < v /A„„.,(/’)||D||«ci>{-o(f-ro)} 

+ 7Cmr ,H '" 11 

■ ||~(r - ft(r))|| (It. 

Comparing (20) with (23) yields 

||-(0|| <ff l/a ||D||rxp{-o(f-f 0 )| 

+ (1(7.1 / exp { -o(f - 7)} 

Ji o 

■ ||-(r - A(r))||r/r. 


+ fiaJ 


Here, we first define the following continuous function 

*f( ■) 6 [().<»). 

<\ — ~ 

It is obvious from the condition (9) that 

,/(«) = '^<1. 
n 

Then, according to the property of continuous function, there cxisls .1 
constant T>() (“< n) such that r/(") < 1. Here, for such a constant 
T> 0, we define 

rf:=i/(“-)< 1. (25) 

Continuing with (24), multiplying both sides of (24) by exp 
{”(/ - to )) yields 

< <t ]/2 \\D \\exp - to) + T(f - /»)} 


+/i(T i f exp {-n(f - r) H- ;(/ - t n )} 
Jt., 


■ ||i(r - h(r ))|| (h. 

With trivial manipulations, from (26) we obtain 
||--(/)||exp {T(^ ~ h)\ 


Here, we define an auxiliary function 5(f) as follows 
f ^A P)\\D\l t E [tu — h.to] 

S[t) ~ 1 JH7). t > (0 


Letting 


< a^ 2 \\D\\ + f exp { - /1 (7-)} 

Jit) 

- exp { — (n — ~)){t — 7“) }|| -(T — Ii{t) 
■ exp {^(r - h(r) -fn)( dr. 


1 m(t):= sup [||M/;)||exp {-,(/» - to)}] 
re\ii)-h q 


from (27) we have 


S(t) = A.,(/’)||/7|| J oxi) (—2rv(/ - tn)\ 

+ f 2/i,(A mux (/') exp {—2rv(/ — 7 )} 
Jilt 

■ ||-(r)|| ||~(r - A(t))|| dr. 

Then, comparing (18) with (19) yields 

Sit) > s/vT~) > V / A ln „ 1 (r')||:(f)i|. 


In the light of the definition of V,„(f), from (25) and (28) we fun er 
obtain 

r,„(f) Ka'^WDW + .^-^rWjt) 

(i ~ 1 

= a' /2 \\D\\ + 7lY,„(t) 


That is 


II--mil < 

Sit ) - 


V,.A*) < —-=l|£||. 

1 - a 
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TABLE I 

The Comparison of the Results Obtained in [9] and This Note 



The Results of [9] 

Our Results 

Not Transformed 

p = 0 1458 

/>* 0 1835 

Transformed 

p = 01780 

P = 0 2383 



Time 

Fig 1 Time delay history 

From (29) we can obtain that for t > to 

IMO|| <lr (f)c\p{--(f-fo)} 
ff '/‘ 

< —-ll/^ll exp {--(/-fo)} (30) 

1 — a 

It is obvious from (30) that system (12) is exponentially stable This 
implies [noting - = V _l i] that system (1) is also exponentially 
stable □ 

Remark 2 The robust stability condition (9) gives a bound on 
the perturbations It is worth noting that (9) is independent of delay 
Therefore the results obtained here is applicable to the case where 
the delay function h(t) is uncertain 
Remark ? The lobusl stability condition (9) does not include 
\ (Q)/\ui ,*(/*) which is included in the bound given in [9] Then 

our results may be less conservative than the ones given in [9] This 
will further be illustrated in the following numerical example 


IV Numerical Example 
W e recall from [9] the following matnx 


4 = 


r-3 

L i 


-2 

0 


(31) 


i 19] selecting the similarity transformation matnx M and Q 
n identity matrix l yields for the perturbations the bound p = 
( 1-58 Further improvement is obtained by applying the following 

11 ^formation suggested in [12| for the perturbations 


0 99904 -0 26217’ 

0 02G60 095937 


(32) 


yields a robust bound /> = 0 178 

our results this bound can further be improved Here, if we 
s l a = 0 3, M = /, and 




15 000 

3 000 


3 000 
2 G47 


( 33 ) 



Time 

Fig 2 Simulation of example for ? = 0 22 


then we can have p = 0 1835 Furthermore, we also employ the 
transformation (32) and let 


Q = 


'8 000 1 000 ' 
1 000 0 120 


(34) 


then we can obtain a bound p = 0 23893 which improves the bound 
reported in [9] for the perturbations (by 34%) 

The comparison of our results with the ones obtained in (9] is 
shown in Table I It is obvious from Table I that the results obtained 
in [9] are more conservative than our ones To further confirm this the 
simulation of this example foi i = 0 22 (</> = 0 2383) is depicted 
in Fig 2 Here we have employed foi the simulation the following 

[11 20 sui [ii(f-/i(f))]' 

M } [0 22sin [ri(f- h(t))] m 

t, (O = [3 cos (/) 2 c os (f)] 7 


and the time varying delay h(1) is given in Fig 1 Note that h{t) is 
not defined at t = 0 25?i, n = 1, 2, 

It is shown from Fig 2 that the system is indeed exponentially 
stable 
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An Efficient Method for Unconstrained 
Optimization Problems of Nonlinear 
Large Mesh-Interconnected Systems 

Shin-Yeu Lin and Ch’i-Hsin Lin 


Abstract —We present a new efficient method for solving unconstrained 
optimization problems for nonlinear large mesh-interconnected systems. 
This method combines an approximate scaled gradient method with a 
block Gaus&-Seidel with line search method which is used to obtain 
an approximate solution of the unconstrained quadratic programming 
subproblem. We prove that our method is globally convergent and demon¬ 
strate by several numerical examples Its superior efficiency compared to 
a sparse matrix technique based method. In an example of a system of 
more than 200 variables, we observe that our method is 3.45 times faster 
than the sparse matrix technique based Newton-like method and about 
50 times faster than the Newton-like method without the sparse matrix 
technique. 

I. Introduction 

In this paper we consider the following unconstrained optimization 
problem for a nonlinear large mesh-interconnected system 

min J{jr) (I) 

where the objective function J: ft" — ► ft is continuously differen¬ 
tiable, bounded from below and satisfies the Lipschitz condition that 
there exists a constant K > 0 such that ||V./(.n ) - < 

A' ||j‘i - 11-2- j*2 € ft". A general descent algorithm, which is 

called a scaled gradient method in (1], for solving problem (I) uses 
the following iterations 

J- Ul = / + -vV (2) 

where k denotes the iteration index, is a step-size, and is the 
solution of 

min s* 1 C{,c k )h+ K?J(jr k )* s (3) 

in which C{.r k ) is a positive definite matrix. 

For a large mesh-interconnected system, if C{j k ) is selected so 
that (2) is a Newton or Newton-like method, C(.r A ) may be a sparse 
matrix. The solution of (3), which is the solution of the linear system 

C(r‘ ).s = -VJ(.r* ) (4) 

can be obtained using the sparse matrix technique, which is a very 
powerful means for solving linear equations in a circuit system [2] or 
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an electric power system |3J. This technique effectively reduces th 
amount of computer memory needed and improves the computation 
time dramatically because it stores only nonzero elements and ignore 
operations involving zeros in the solution process [2]. Thus, a spar^ 
matrix technique based Newton or Newton-like method is much mor 
efficient than a method with quadratic convergence rate. 

In this paper, we will present a new method for solving (1) for ,i 
large mesh-interconnected system to compete with a sparse matrix 
technique based method. Our method combines an approximaic 
scaled gradient method with a block Gauss-Seidel with line search 
(BGSLS) method. Although each of the above methods is well 
known, combining them into one method is a new approach. The basic 
idea behind this method is to preserve the advantages and discurd the 
disadvantages of the block Gauss-Seidel method. We have observed in 
many numerical experiments [4] that the block Gauss- Seidel method 
approaches its convergence point very fast in the first few iterations 
and then slows down around that point. This fact indicates that the 
block Gauss -Seidel method is better suited for use as a descent 
direction generator rather than as an optimization algorithm by itself. 
To improve the quality of the descent direction further, wc use an 
exact line search at the end of each cycle of the block Gauss-Seidel 
method and thus form a BGSLS method. Executing the BGSLS 
method for a finite number of iterations with appropriate stopping 
criteria will generate the descent direction needed for the approximate 
scaled gradient method. Wc will show in this paper that our method 
is a globally convergent descent algorithm. It is difficult to give an 
analytical convergence rale for our method because of the nature of 
the method. However, since the major compulation required hy our 
method lies in the execution of the BGSLS method, our method 
will be efficient if the BGSLS method can generate an effective 
descent direction in several iterations. Intuitively, our method should 
be efficient and effective, for two reasons: i) only small minimization 
problems are involved in each iteration of the BGSLS method and ii) 
the exact line search used at the end of each iteration of the BGSLS 
method greatly improves the quality of the descent direction. 

II. Solution Method 

A. The Approximate Scaled Gradient Method 

The approximate sealed gradient method Ml is 

/ fl =.l ■ k +- k s k (S) 

where is an approximate solution of (3) and ' k is a step-size, 'lit 
ensure global convergence, we use Armijo-type rule to determine the 
step-size by 

-/ = i"' k A <h) 

where 0</*<l, A>(), and nu is the smallest nonnegativc integer ei 
that makes the following inequality hold for some positive constant 

A" 2 

./(/ + ir\s k ) - j(/) < ~ii m ah** 113. «<) 

Remark 2.1: a) Condition (7) is to ensure a sufficient decrement >1 
the objective function obtained in each iteration of (5); the satisfacti n 
of this condition serves as a terminating criteria for our Armijo-t> * 
step-size rule (6). b) As will be shown in Theorem 2.1, the mat ' 
^C(.r A ) - Kil being nonnegaiive definite is a sufficient condiu a 
for (5) to converge. This sufficient condition provides a value for I 
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emma 2.1: Suppose (3) is solved by a descent iterative algorithm 
s ting from « = 0 for any arbitrary number of iterations, and let ** k 
(I -,ote the final value of *. Then VJ(x k ) T s K < 0. 
t his lemma can easily be verified by the fact that ) 7 C{* k )s k 
7J{i' k ) 1 * k < 0. Thus, our approximate scaled gradient method (5) 
v 1 be an efficient descent algorithm for solving (1) if the descent 
, l orithm we employ to obtain s k ' is efficient. 

B Block Gauss-Seidel with Line Search (BGSLS) Method 

1) One Block Gauss-Seidel Cycle: Let us partition s into p sub- 
vectors such that ft = [*i*2 ■ ■ ■ ■‘v] / . Then one block Gauss-Seidel 
c>cle is to perform 

min C(.r k )s + VJ(. v k ) r n (8) 

from / — 1 to p. In (8), the subvector ft, is taken as the vector of 
minimizing variables while the variables in the subvectors *j, ■ • ■ , 
,s, — i, .s J+ i, ■ ■ ■ , s p are held fixed at their current values. Note that 
compared to (3), (8) is a small unconstrained minimization problem 
for every /. 

Remark 2.2. On the partition of the ^-vector, there are two ex¬ 
tremes corresponding to p = 1 and p = n. The case of p = 1 is not 
the interest of this paper. For the case of p = n, the descent direction 
generated by (8) has very poor quality. Thus, a good partition should 
take the following two factors into account: a) computational burden 
of solving (8) and b) the quality of descent direction generated. 
In fact, the above two factors have conflicting interests; at this 
stage, we have not yet achieved an optimal way to partition the s- 
veclor. Nonetheless, for a network-structure like system, it would be 
beneficial if each partitioned subnetwork is mesh-interconnected, and 
the sizes of all subnetworks do not differ much. 

2) Optimal Step-Size Let .s (/) denote the value of s after solving 
(8) for Then .s (0) represents the initial value and s iv) the final 
value of -s for one block Gauss-Seidel cycle. Suppose s ({)) is not the 
optimal solution of (3). Then the following inequality holds 


is obtained by solving the following one-dimensional mmn 
problem 


mill - * (n) )]' C(.r‘ )[*"’> + „(*' 


+ VJ(/)V 01 +n(s , '"-» 


J«) v 


3) One Iteration of the BGSLS Method. The following thuv 
erations form one iteration of the BGSLS method: i) execute mu 
block Gauss-Seidel cycle; ii) determine a: and in) update > wifi 
be the initial value * (0) for the next iteration of the BGSLS method 

4) Convergence of the BGSLS Method: Tins iterative BOSKS 
method will converge to the solution of (3), as described m the 
following lemma. The proof of the lemma is given m the Appendix. 

Lemma 2.3: Assuming that there exists a constant h' 2 >0 such 
that \C{x k ) - Kil is nonnegative definite for all .r A , then a) the 
BGSLS method is a descent method, and b) any limit point ot the 
sequence generated by the BGSLS method is a solution of (3). 

5) Stopping Criteria of the BGSLS Method: As pointed out in 
Section I, the BGSLS method approaches a solution point of (3) 
very quickly in the first few iterations; however, it then slows down 
around that point. In fact, it was this characteristic of the BGSLS 
method that gave us the idea of combining it with the approximate 
scaled gradient method. Therefore, to obtain the H k needed in (5), we 
do not need to execute the BGSLS method until it converges. In fact, 
we can stop the BGSLS method after a finite number of iterations. 
Consequently, one of the stopping criteria of this method is if the 
improvement of the objective function satisfies 


Q(t) 


x 100 ( / < (</ 


(15) 


where Q(1) = ^(l) 1 C(s k )s(t) + V J{.r k ) j s{t)< t denotes the 
iteration index of the BGSLS method, and £'/ is a preselected 
percentage. To ensure that (5) converges, another stopping criteria 

IHf + i)||j < A'i ||VJ(/)||, U6) 


i(.- (!,| ) 7 C(j-V ni + VJ(.r*) , *"’> 

> C(j-‘ )s , ' ,) + V./(.r*)' (9) 

Furthermore, because C’(./) is positive definite, Is f C(.r k ) « + 
V .7(.r* )' s is a convex function in *. Based on this fact, we may 
verify that 

> £(.s (ll) )'n.rV 01 + v./(.iV.s (0) 

+ [C(/ ).S (0 » + V )]' (.S 1 '” - .S (,,) ) J . (10) 


for some I\\ > 0, for all t > / , where t is a finite positive integer, 
should also be satisfied. Then, the s{t + 1) which satisfies stopping 
criteria (15) and (16) will be set as s k . An explanation of the need 
for (16) will be given in the proof of Theorem 2.1. The existence of 
A’i and t is ensured by the following corollary, the proof of which 
is also given in the Appendix. 

Corollary 2.1: Let {«(/)} denote a sequence generated by the 
BGSLS method. Under the assumption of Lemma 2.3, there exists 
a t such that ||.s(/)||j > A'j ||V7(/)||<j for some A'i > 0 and tor 
all t > t . 


Then, combining (9) and (10), we obtain the following lemma. 

Lemma 2.2: Let : .s (0) and w (/,) denote the initial and final values 
Oi ‘^-variables, respectively, for one block Gauss-Seidel cycle of the 
SLS method. Suppose s (l,) is not an optimal solution of (3). Then 

[CU k h iu) + V7(/)]' (* l,,) - .v (u, )<0. (11) 

i * above inequality implies that * (/j) - is a descent direction 
1 \s r C(.v k )s + ^JU K )'s at .s = * (ni . 

herefore, we can determine the exact optimal step-size ri to update 
variable s by 

ii (0) i (/.) (OK 
s — .s + f|( V' — s' ) (12) 

" re a" denotes the updated *, and the optimal step-size 

. _ [r(.i'*)* (0l + O(/)] 7 (« (,,) -* ln) ) 

a ~ [„<,.) _ „<o)]/C(j.,,(")] 11 


C The Algorithm for the Solution Method and its Convergence 
Combining Lemma 2.1 and Lemma 2.3 then proves that our method 
is a descent method. 

Lemma 2.4: The combination of the approximate scaled gradient 
method (5) and the BGSLS method, supplemented by the stopping 
criteria (15) and (16), is a descent method for solving (1). 

/) The Algorithm- We are now ready to state our algorithm as 
follows: Given data: i) * = [*i, ■ • ■ , -s /( ] 1 , ii) the values ol A( >0) 

and ,f(0 < ,i < I) in Armijo-type rule, iii) the value of £'/(> 0%) 
in (15), and iv) a positive constant hf which meets the assumption 
in Lemma 2.3. 

Step 0: Set the values .r (0) and set k = 0. 

Step /. Set f = 1 and set *(/) = 0. 

Step 2: Set i = I and * (n, (f) = s(t). 

Step 3: Solve (8) to obtain If / = />, go to Step 5; otherwise, 
set i =./-*- I and repeat this step. 
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Step 4: Compute n^{t) - * (0) (f) and determine *>(/) by (13). 
StepS: Update + 1) * * <0) (f) + r>(f)|* ( ^(/) -*< 0) (f)]. 

Step 6; If (15) and (16) arc satisfied, set s k = *(t + 1) and go to Step 
7; otherwise, set t = t + 1 and return to Step 2. 

Step 7: Set m * 0. 

Step 8: t \f J (s' + -J(.r A ) < -(A*/2) * ,m A||A a ||g. set 

7 = ,^"A; otherwise, set m ^ »« + 1, and repeat this step. 

Step 9: Update .7 +l = .7 + 7 , set k = A 1 + 1, and return to Step 1. 

2) Convergence of the Algorithm: The following theorem ensures 
the convergence of our algorithm. A proof of the theorem is given 
in the Appendix. 

Theorem 2.1: Let {.7} denote the sequence generated by our 
algorithm. Under the assumption of Lemma 2.3, liiiu-* ^ V./{7) 
= 0. 

Remark 23: The values of A, J, and £ { /< do not affecl the 
convergence but will influence the efficiency of our method. Thus, 
these values can be determined empirically for individual systems. 

3) Stopping Criteria of the Algorithm: From Theorem 2.1, we see 
that as A* —> :x, & k —► 0; however, for practical considerations, our 
algorithm will stop when \s k < * for a reasonable accuracy. 

III. Examples 

Most electric power systems are nonlinear large mesh- 
interconnected systems. The weighted least squares problem in 
power system state estimation is a typical unconstrained optimization 
problem and thereby is an adequate example for demonstrating the 
computational efficiency of our method. 

A . The Weighted Least Squares Problem in 
Power System State Estimation 

The power system state estimation problem [61 can be briefly 
described as follows: For an /-bus 1 power system, the voltage 
magnitudes and phase angles of all buses constitute the states of the 
system. Because the bus phase angle cannot be measured, the states 
of the system have to be estimated based on available measurements, 
which arc functions of the states, such as power transmission line 
flow, bus voltage magnitude, bus power injection, and transformer 
tap. All such measurements can be expressed in a general form 
as ; = h[.r) 4- //, where .r is a 2/-dimcnsional vector of state 
variables, : is an rn -dimensional vector of measurements, h denotes 
tn nonlinear measurement functions which are twice continuously 
differentiable, and // represents an in -dimensional Gaussian random 
vector of measurement errors with an m x m diagonal covariance 
matrix /?. A common formulation |5| of this state estimation problem 
is to solve the following weighted least squares problem 2 

mill ./(.r)(= i[ ; -/,(.,•)]'''/?-'(; -/'(.<•)])• (17) 

J 

Since h(.r) is twice continuously differentiable, J(.r) in (17) 
is twice continuously differentiable and thereby satisfies Lipschitz 
condition. Taking n = 2/, the J(.r) meets the assumptions given in 
Section 1. 

Note: For an authentic power system state estimation |6], bad data 
processing has to be associated with the solution of (17); if bad 
measurements are present, they should be eliminated, and the states 
re-estimated. However, for the purposes of this paper, we will focus 
on solving (17) only. 

1 The buses of the power systems considered here are similar to nodes in 
an electrical network. 

2 The phase angle of the slack bus is considered to be a known value. 


B. Conventional Approach 

In general, the Newton method may fail to solve (17) because th 
Hessian matrix of (17) may not be positive definite. Thus, the conven 
tional approach in the power system literature 16] uses a Newton-lik 
method, which is (2) with C(.c k ) — 2H(.r h )* B~ l H{j k ) in (3 
where H{.r k ) — V//(.r A ), However, H{.r k ) l R~ l H{.r k ) may b 
singular or ill-conditioned, since it is only positive semidefinite. 1. 
cope with this difficulty, we can modify the above C(7 ) to ensure 
the positive definiteness by setting C(.r k ) = 2H(.v k ) 1 ff(.r k ) f 
where b is a small positive real constant and I is the identity 
matrix. We see that under this modification, the only assumption 
needed in Theorem 2.1, -C(.r A ) - K?1 is nonnegative definite, is 
satisfied by taking A\> = (A/2). Furthermore, the numerical stability 
of our algorithm is guaranteed if b is not too small. Consequently, 
(4) becomes 

[2£T(/) , 7?’ 1 ff(/)-b bl]s = -VJ(7). (IK) 

The matrix 2H(.r k ) 1 U “ 1 H{.r k ) 4* bl and the matrix 
tf(.r A ) y /T7/(.r A ) are sparse, and (18) can be solved using 
the sparse matrix technique. 

Remark 3 l: a) Because H(j- k ) ! 7? l H(.r k ) may be ill condi¬ 
tioned, methods that use an orthogonal transformation technique to 
solve R [ H(.r k )s = -V\/(7) has been developed [7] 

However, these methods arc computationally very inefficient, b) In 
[6], the slep-si/e 7 is set to be 1 for all k. In fact, with a constant 
step-size such as this there is no guarantee that (2) will converge, 
c) There are other methods in the power system literature that can 
obtain an approximate solution tor stale estimation using a decoupling 
approach [8). However, these methods are beyond the scope ol this 
paper, since we are concerned only with methods that solve (I) 
exactly. 

C. Application of the Proposed Method 

Let us partition the power system network into j>{ > 1) subnetworks 
such that each subnetwork is a connected graph m the topological 
sense. Let B, denote the set of buses of the /th subnetwork; then 
denotes the subvector of * corresponding to states of the /th 
subnetwork. Replacing ,s,, C(.r k ), and V,/(,r A ) in the proposed 
algorithm by , 211 (jl k ) ! //“' ff(.r k ) -|- bl, and H{.r k )R 1 (: 
//(.r A )), respectively, we can apply the proposed algorithm directly 
to (17). Analogous to (8), we see that /!«,,■■■, ,, ah, f ,, ■ ■ ■ , 

bH fl are held fixed in the /th minimization problem 

min ^s I C(,r k )s + V./(.r A )'s (W 

"H, 

whose solution can be obtained by solving a small set of linear 
equations involving only the /th subnetwork and the boundary bu^ s 
outside the /th subnetwork (9). 

D. Test Results 

We applied our method, the sparse matrix technique based Newh i 
like method, and the Newton-like method without sparse mat. ' 
technique to the weighted least squares problem in state cstimati* n 
of the IEEE 30-bus power system and the IEEE 118-bus pov. *r 
system. For the cases under consideration, the IEEE 30-bus sysu n 
was partitioned into three subnetworks, as shown in Fig. I, in wh» h 
each subnetwork is indicated by closed dashed contours. The IF F ’■ 
118-bus system is partitioned into eight subnetworks. Because of 1 i ‘ : 
limitation on the length of this paper, we can not include here 1 
figure of the IEEE 118-bus system, however, this figure can be foil d 
in [10|. Nonetheless, we list the bus number of the buses of e*i I 1 
subnetwork in the following. Subnetwork 1 contains buses I to ' »• 
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TABLE 1 

Comparison of Our Method with Newton-like Method with and without Sparse Mafrjx Technique 
_ Final objective value _ Average CPU time (second-) _ Speedup ra no 


System 

nlwts (A| 

Nl.WS (U > 

Our method 

nlwts (A) 

NLWS ,U) 

Our method 

i 

/ 


method 

method 

method (I) 

method (II) 

(III) 

77 

Til 

UiEE-30 bus 



mam 

0.99 

0.46 


2.15 

3.67 

LEE-118 bus 




76.53 

5.49 

1.59 

13.94 

48.13 




Fiy 1. The IEEE 30-bus system and three partitioned subnetworks. 


and bus 117. Subnetwork 2 contains buses 15 to 19, buses 27 to 
12, and buses 113 and 114. Subnetwork 3 contains buses 20 to 26, 
buses 70 to 76, and bus 118. Subnetwork 4 contains buses 33 to 
47, Subnetwork 5 contains buses 50 to 61, and buses 63 and 64. 
Subnetwork 6 contains buses 48, 49, and 62. buses 65 to 69, buses 
77 to 81, and buses 97, 98. and 116. Subnetwork 7 conlains buses 
H2 to 96. Subnetwork 8 contains buses 99 to 112. Note that each 
subnetwork contains about 10 to 20 buses in mesh-interconnccted 
structure though the bus numbers are not consecutive. 

In both cases, wc took the real-power flow and the rcaclive- 
power flow of all transmission lines of each individual system as 
the measurement data and assumed that some of these are bad data. 
Wc set the parameters in our algorithm as follows: = 0.01, 1 = 
lor both systems, £% = 50% for the IEEE 30-bus system, and 
ft -■= 10% for the TEEE 118-bus system, A = 1.05, 1.10. 1.15 for the 
IHi F 30-bus system, and A = 1.25, 1.30, 1.35 for the IEEE 118-bus 
s > 1 ' m. Note that each value of A corresponds to one computer run. 
W used various values of A to test Armijo-type rule and average the 
t - time. The accuracy e in the stopping criterion of our algorithm, 
h is described in Subsection II-C-3), was set to be KT 2 for 
cases. We used a Sun 4/60 workstation to test our algorithm. 
1 ' Emulation results for the case of the IEEE 30-bus system and 
ase of the IEEE 118-bus system are shown in Table I. The 
lv se CPU time of our algorithm is the average CPU time of the 

Cl uter runs with various values of A reported on the Sun 4/60 

v, ‘ station. Wc also solved the same cases for both systems using 
( 7 ne sparse-matrix technique based Newton-likc method, with the 
s:|; setup of Armijo-type rule and same initial guess as our method 
0,1 c same Sun 4/60 workstation. We used |./|, x < - = 10" 54 as 
^ Upping criteria for the sparse matrix technique based Newton¬ 


like method, the test results for which arc also shown in Table 1. 
We also applied the Newton-like method withoul using the sparse 
matrix technique to the same cases on the same workstation with the 
sume setup of Armijo-type rule, same initial guess, and same stopping 
criteria as ihe sparse matrix technique based Newton-likc method. 
The resulting average CPU times are also reported in Table 1 for the 
cases of both systems. We see that our algorithm achieved the same 
Anal objective value as the Newton-like methods. As expected, the 
sparse matrix technique based Newton-like method was much faster 
than the Newton-like method without the sparse matrix technique in 
both cases, especially in the case of the IEEE 118-bus system, which 
has more sparsity. Compared to the sparse matrix technique based 
Newton-like method, our method also performs better with Ihe larger 
system. From the speedup raLio shown in 'table I, we see thal in 
the case of the IEEE 118-bus system, our method is 3.45 times faster 
than the sparse matrix technique based Newton-like method and 48.13 
times faster than ihe Ncwton-like method without the sparse matrix 
technique. This demonstrates the dramatic increase in efficiency 
provided by our method. In order to better appreciate the merits of 
our algorithm, in Fig. 2 we describe the details of the progression 
nf our algorithm and ihe Newton-like method with the sparse matrix 
technique in solving the case of the IEEE 118-bus system when A = 
1.35. The result of our algorithm is shown by the curve marked with 
circles o and associated with the Arabic numeral iteration index. Each 
circle indicates the CPU time accumulated (horizontal axis) at that 
iteration versus the corresponding value of the objective function 
(vertical axis) in ihe progression. The curve marked with asterisks * 
and associated with the Roman numeral iteration index corresponds 
to the Ncwton-like method with the sparse matrix technique. Our 
algorithm takes 12 iterations to meet the stopping criteria |s A |* < 
KT 2 , while the sparse matrix technique based Newton-like method 
takes only 11 iterations to meet the stopping criteria |.s A | „ < l(P J . 
However, we sec that the amount of CPU time taken up by each 
iteration of our algorithm is far less than that for each iteration of the 
Ncwton-like method with the sparse matrix technique. This shows the 
effectiveness and efficiency of using the BGSLS method to generate 
the dcsccnl direction. Furthermore, when our algorithm has nearly 
reached the optimal objective value, the Newlon-like method with 
the sparse matrix technique has just finished its first iteration and the 
corresponding objective value is slill quite far away from the optimal 
value. The efficiency of our algorithm is obvious. 

IV. Conclusion 

We have developed a globally convergent algorithm for un¬ 
constrained optimization problems of nonlinear large mesh- 
interconnected systems. Although, due to the nature of our method, 
an analytical convergence rate is not available, the dramatic increase 
in efficiency provided by our method can be observed both from 
the method itself and from the simulation results. It is worth noting 
that the BGSLS method can be processed by parallel processors if 
the order of the partitioned subnetworks is suitably arranged 1111. 
Thus, Ihe speed of our method may be further increased by using 
parallel processors. 
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Fig. 2. Details of the progression of our method and the sparse matrix 
technique based Newton-like method in solving the weighted least squares 
problem for the IEEE 118-bus system. 


Appendix 

Proof of Lemma 2.3: a) Let T = {« E ft 2 " |C(.r* )* + V ,7(.r*) 
= 0} denote the solution set of (3). Then if a £ F, the descent 
property of the BGSLS method follows directly from Lemma 2.2 and 
(12M14). According to the Global Convergence Theorem [12, pp. 
187-1881, b) can be proven if we show that i) the BGSLS method is 
a descent method, ii) the sequence generated by the BGSLS method 
lies in a compact set, and iii) the mapping of any iteration of the 
BGSLS method is closed. Clearly, i), which is a), has been shown. 
In the following, we will prove ii) and iii). 

Because C(.r k ) is positive definite, the unique minimum solution 
of (3) is 

„* = -C'(/> ’V.7(/). (Al) 


Lei the set S = {.s G |’-iVJ(x 4 ) ; <V )“' V.7(.r‘) < 
2* 1 C(.r k )x + VJ(.r k )‘ s < 0 ). Then S / K, since 0 £ 5. We 
claim that S is compact. Clearly, S is closed. Thus, it is enough to 
show that S is bounded. Suppose noi, 6 5 9 


IW |2 > HiA* 


/ lir J(.r* )l|j -fl 

V * * 

|irJ(/) / C(.r*)- , V.7(./) +1 


(A2) 


For this, s and by the assumption that jC'(.r k ) - I\‘i / >0, we have 

= (A’,|Hb- ||V.7(.r < )||,)|H| 2 . (A3) 

From (A2) 

(A- a |Hb-IF.7(/)|| I )|Hk>INI, 

>|lVJ(.r fc ) / C(.r*)-’ 1 VJ(/)|. (A4) 

Thus, from (A3) and (A4), we obtain that \^s l C , (.r k )» + 
s\ > j|W(.r* ) l C[j k )’' l VJ(j‘ A )|. This inequality con¬ 
tradicts s € S. Hence S must be bounded. From (a), the BGSLS 
method is a descent method, Thus, every point in the sequence {*(f)} 
generated by the BGSLS method starring from ssO should satisfy 

i*(0'CU*)*(/) + r.7(.r*M7)<0, w (A5) 


moreover, by the descent property 

i«* r C(.r‘>* + VJ(.r* )V 

< ±s(1) T C(.v k )s(t) + VJ(jt k ) 1 *(t), V/. (A ,) 

From (Al) the left-hand side of (A6) equals — £ V J(.r k ) r C'(.r k ) 1 
V.7(.r*), thus from (AS) and (A6), the sequence {«(*)}, Vf, most 
lie inside the compact set 5. This proves ii). Next, we will prove 
iii). Based on the fact that the composition of closed mappings is a 
closed mapping, and the exact line search method is also a closet! 
mapping [12, p. 210], it is enough to show iii) if we can show that 
the mapping of Step 3 of our algorithm is closed. This is true because 
(8) is a bounded, unconstrained quadratic minimization problem with 
positive definite C(.r k ). This completes the proof. □ 

Proof of Corollary 2.1: Since by Lemma 2.3, the sequence 
{*(f)} converges to a point satisfying C(-i* Ks* 4- V.7(.r*) = 0, 
then||C(./)||,||.s*||, > 11V J(.r k )||^. Let A' 0 = (l/mipk IKV* 1||/ 
We have ||.s *||2 > 7v'u||VJ(.r*)1. Let K\ = (A'n/2)>0 and lei 
t = A'i ||V,7(.r* )||j. Then r > 0 since we only need to consider 
the case ||VJ(,r* )||i* ^ 0. Because [*(/)} converges to s* , there 
exists a t such that ||«* - *(f )||2 < r for all t > t . Consequently, 
IMO||i>||»*||a - r > 7v„||V.7(/)||, - 7v, ||V./(.r‘)||, . 

A',||V.7(t*)|| 2 for all t>t □ 

Proof of Theorem 2.1: From Corollary 2.1, we see that VA 1 , with 
V.7(./) / 0, there must exist a t such thal the slopping criteria (16) 

11 -S’ ( 7 ) 11 j > A' 1 ||VJ(./ )||-j, f01M)lll<‘ K\ > 0, Vf > / 

(A7) 

is satisfied. Furthermore, hy assumption, wc have j.s(f ) 1 C(.r k )s(f) 
— tiff) 1 K 2 s(t) > 0, Vf; and from (A5), we have it) 1 C(.r k Nfi 
— V.7(.r A ) J s(f) > 0, Vf; these two inequalities lead to 

-A*2|Wf)||; > V.7(/) / .s(f), V/. (AS) 

Then, the following proof mostly follows the proof of the conver¬ 
gence theorem for descent algorithms given in (1, pp. 203-204] 
A Descent Lemma given in |l, pp. 203 2041 slates that if J( j is 
continuously differentiable and there exists a positive constanl h 
such that ||VJ(.i’) - V.7(</)|| < K\\.v - i/||j, V.r.// € ft", then 
.7(.r + //) < J{.v) + V.7( .v k ) J f/ + ( K/'2)\\y\\i, V.r,.»/ E ft". By the 
assumptions on .7(■) given in Section 1, wc may use the Desceni 
Lemma and (A8) to obtain 


J(.r k + ; k s k ) < .7(.r‘ ) + */V.7(.r*)' * k + ^ 

\\i 

<j(. r k )-^(^K,- h -ty, k wi 

(A9I 

If 0 < -)* < (K 2 /K), we can derive that 


1 

VI 

CM 

1 

71 

1 

(A 10) 

By (A9) and (A 10), we have 


Ju k + -, k s k )< J(.r l )-^' ( ‘||.s*n:i. 

(AH' 

If A > (K-t/K), 0<,(< 1, since { <”'A}, /» = 0, 1, J 

monotonic decreasing sequence approaching 0, there must exist an 


ni such that }T n \< (K 2 /K ). Let nu be the smallest nonnega ^ 
integer satisfying the above inequality, then the following holds 

)—r < (A 

Jv 7v 

If 0 < A < (AVA), 0 < /i < 1, then 

ja<a< 4^- < a:1) 

/v 
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]- his case, we can view 774 =0 which is the smallest nonnegative 
i, ger for /i ,ri k A < ( Ki/K). Now, for any A > 0, 0 < i) < 1, and let 
„ be either the 774 determined by (A12) or m k = 0 by (AI3); from 
nil), there must exist an m' < m k (or 771 ' = 0 if w k = 0) such that 

J(/ < •/(/)- y-;r'A||H*||,. (A 14) 

Note that 0 < -> A < (A\.//i) is sufficient for «. A11) to hold explains 
why 7)j < 777 a. This shows that Step 8 of our algorithm will 

t * 

terminate for certain m . Since i1 w > -T' 1 *, from (A12) and 
(A 13), li 1 " A > niinf^A, ,i{K' 2 /K)}. Let r = inin^M, 

UK 2 /K)}* then t is finite and positive, and 

./(.«•* + ,r A*‘) < J(.r*)-r||«*||^. (A15) 

Then, each iteration of our algorithm ensures a decrement of the 
objective function by at least the amount r||.s*||!j. Since J(.v) is 
hounded from below, wc assume r £ ft is a lower bound of J(.r), 
then from (A15), wc have 

() < ,/(.r M 1 ) - r < ./(./ ) - r - r||.s A \\l VA\ (A 16) 

Then, by (A 16), () ||.s*||-j < (./(a- 11 ) —r/r) < 'v. and (A7) shows 

that liniA— „ V./(.r A ) = 0. □ 
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On the Possible Divergence of the Projection \L 

Erjcn Lefebcr and Jan Willem Puldernian 

Abstract — It is shown by means of an example that the tl , 

algorithm does not always converge. 

I. Introduction 

It is well known that parameter identification of linem v vi L m 
depends very much on the excitation of the signals. General]^ -qv.ik 
ing, all identification algorithms require the signals to be suli * icntK 
exciting. In applications such as adaptive control, however, cxuUmn 
is often not possible. The question then arises how useful the standaid 
identification schemes are. In this note we consider the case where the 
data can be modeled exactly by a linear lime invariant discrete-time 
model. It is a fact, that for such systems recursive least squares always 
produce a convergent sequence of parameter estimates, although it is 
of course not guaranteed that the limit will be the true parameter f 1). 

For the projection algorithm a similar result or its negation is to the 
best of our knowledge not available in the literature. Properties that 
can be derived without any assumptions on the signals can be found 
in [1]. Nothing is said about convergence there (see also [2, Problem 
12.141). In [3], the algorithm is used for adaptive pole assignment. 
Since the adaptive algorithm could be analyzed without proving 
convergence of the parameter estimates, the possible convergence 
is not studied there either. 

In this note we show by means of an example that the projection 
algorithm does not necessarily converge. This is in contrast with 
recursive least squares. 

The construction of the counter example is as follows. Firstly wc 
construct a sequence ol real vectors that satisfies at least some of the 
properties of the projection algorithm and which does not converge. 
Secondly we show that the sequence could as well have been obtained 
by applying the projection algorithm to an appropriate input/outpul 
system. Hence, rather than fitting the estimates to the data, we fit the 
data to the estimates. 

II. The Projection Algorithm 

For the sake of completeness, we briefly describe the projection 
algorithm. Let the system be described by 

y{k+ 1) = b l 0(k) 0 £ R'\ (I) 

The projection algorithm is defined as follows: Suppose thal the 
estimate of 0 at time k is 0 a, define (h + j |0 € R" | !l{k -hi) — 
b 1 (b(k)}. Define 0 a- i i as the orthogonal projection of 0 a on (»a + i- 
The recursion is given by 

H = + ir—Uu/U- + !)-»* Oik)). (2) 

Notice that Ch -i i contains the true parameter 0. Regardless of the 
input sequence, the following two properties hold. 

Property 2.1: 1) For all k\ ||0 - 0a + i|| < ||0 - 0a ||- 
2) liniA —-x (0a-+i - bk ) = 0. 

It is obvious that from Property 2.1 we cannot conclude that 0a is 
a fundamental sequence, and in fact we will see that it need not he. 
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III. A Counterexample 

The idea of the counterexample is that we will first construct a 
sequence € R 2 with the properties that; 

1) (fli + i.fu + i) i s obtained from (<nJ)k) by orthogonally pro¬ 
jecting the latter onto a line passing through a fixed point. 

2) The sequence does not converge. 

Notice that this sequence is constructed in a similar way as the 
sequence of estimates in the projection algorithm. Subsequently we 
will show that the particular sequence is equal to the sequence 
of estimates produced by applying the projection algorithm to a 
particular first-order system. That will establish the claim that the 
algorithm does not necessarily produce a convergent sequence of 
estimates. The key idea is that we fit the data to the estimates rather 
than the estimates to the data. 

A . Construction of the Sequence 

The sequence {fu.fu} will be defined inductively 

* (<io,M := d/2,4). (3) 

Suppose now that («*,/»*) has been constructed. Let Lk be the line 
passing through (1/2.1) and (ru,M- Define (r/ui'^+i) the 
orthogonal projection on a line £* + i yet to be defined. £a + i will 
be a line passing through (1/2,1) with the property that the distance 
between ((ikJjk) and its orthogonal projection on £* + 1 is exactly 
l/(k + 1). There are two possibilities for Lk + i, one which requires 
a clockwise rotation of Lk to obtain Lk 41 and one for which this 
rotation would be counter clockwise. This freedom of choice will 
now be used as follows. Define the region 5 := {(«,fc) | -1 < 

-2<i < b - 2 A b > 1). See Fig. 1. Determine the two possibilities 

for (ujL + i, b k f 1 ). When both points are in 5, rotate Lk in the same 
direction as Lk-] was rotated to obtain £*, to get Lk + 1 , otherwise 
rotate Lk in the opposite direction. £1 of course requires a counter 
clockwise rotation of £ 0 . Notice that now every [fnJn) 6 5. Of 
course, the recursion could in principle be written in formulas; we 
feel, however, that this would not add much to our understanding. 

Lemma 3.1: i) The sequence is well-defined, ii) The 

sequence {(«*-.&*)} does not converge. 

Proof: 

i) D efine r k ■- y/(l/2 - o k Y 2 + (1 - b k ) a and b k M 

y/(aK+\ - ok )' 2 + - bk )*. From the construction it 


follows that 

r A + 1 + Afc + l — rf. • 

(4) 

Since <?>a+i = 

l/( A' + 1) it follows that 



rjf+i ^rJ-l/^+l)*. 

(5) 


If we disregard for a moment the restriction imposed by S , w 
conclude that (<u+i. 1 ) can be constructed from (ft*,6* 
provided r 2 - l/(A* + l) 2 > 0. Now, from (5) it follows that 

k 

'■*+. ^-^l/U + D 2 (ft. 

0 

hence we should have r% > n 2 / 6, since in our case r 2 = 9 
this condition is satisfied. 

As a byproduct we obtain that lira* —-x. r k = 9 - tt 2 /G. It 
should be clear that from this we can also conclude that the 
requirement that (<nJn) 6 S does not impose a restriction on 

the existence of the sequence. _ 

ii) From the fact that r k —♦ y/9 - n'*/6 and since b k + \ = 
1 /(k -I- 1), it follows that Z(£n, £* + 1 ) is ()(l/k + 1). There¬ 
fore the sequence of lines {£*} docs not converge and hence 
nor does {((i k J>k )}■ □ 

Lemma 3.2. Consider the i/o system 

i/(A- + l) = (l/2)j/(*)+ m(*). 1/(0) = 1. 

There exists an input sequence {u(A)j, such that the projection 
algorithm, initialized in (a 0 . ) generates { {<i k . b k )} as the sequence 

of estimates. 

Proof This is now easy. All wc have to do is make sure that 
at time k + 1, G k - £a + i, or equivalently. (u* + i. i ) € Cn \ i 
and y{k) ^ 0. Otherwise staled u(k) has to be such that 

l/'2y(k) + u{k) = (iK + iy(k) + /h-m “(k). (7) 

Hence we should take 

«(*)« ( 8 ) 

1 - b A-f , 

Since ((ik+iJ>k + \ ) £ .S\ this can indeed be done. 

To complete the proof we have to check that for all k the outpul 
</(£) will be nonzero. From (8) it follows that 

,,{k 41 )= ( 1/2 + ~ ;; 2 ),(A-). ( l »t 

Since iy(0) = 1, and since {tnu-^k + i) 6 S, it follows from (^) 
that y(k) ^ (I. 

Notice that since (<n < b k ) 6 5, we actually have that the sequences 
u and y are bounded. 1 J 

We have now proved the following theorem. 

Theorem 3.3: There exists a system of the form (I), a bounded 
input sequence u and an initialization of the projection algorithm, 
such that the resulting sequence of estimates does not converge. 

IV. Conclusion 

By means of an example, we have shown that the sequence >1 
estimates generated by the projection algorithm does not necessai > 
converge. Of course, the sequence of inputs needed for the exanii lc 
is fairly artificial. In applications such as adaptive control, howe\ '■ 
it is most desirable to derive as many properties of the identificati n 
part as possible without having to rely on the specific nature d 
the input. For the input will depend in a highly nonlinear fash 1 n 
on the estimates. Our construction shows that convergence is > >1 
autbmatically among the properties that can be derived with* Ji 
additional assumptions on the input sequence. 





r £ TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. 3, MARCH 1995 


References 

11 G. C. Goodwin and K. S. Sin, Adaptive Filtering Prediction and Control. 

Englewood Cliffs. NJ: Prentice-Hall, 1984. 

12] G. C. Goodwin and R, H. Middleton, Digital Control and Estimation . 

Englewood Cliffs, NJ: Preniice-Hall, 1990. 
i'<j J. W. Polderman. k, A slate space approach lo the problem of adaptive 
pole assignment," Math. Contr., Signals, Syst vol. 2, pp. 71-94, 1989. 


A Comment on the Method of the Closest Unstable 
Equilibrium Point in Nonlinear Stability Analysis 

E. Noldus and M. Loccufier 


Abstract —A counterexample is presented to a theorem which has been 
proposed as a theoretical basis for the method of the closest unstable 
equilibrium point to estimate asymptotic stability regions in nonlinear 
systems. An additional condition is formulated under which the theorem 
is valid. Its implications on the applicability of the method are discussed. 

I. Introduction 

The method of the closes! unstable equilibrium point (c.u.e.p.) is 
a well-known direct method of the Lyapunov type for estimating re¬ 
gions of asymptotic stability (RAS) in nonlinear systems analysis. The 
method has been described, among others, by Chiang cl al. [1], [2] 
and various applications, for example to the power system transient 
stability problem have been reported [3]-[51. Its basic principle is the 
following: Consider an autonomous nonlinear dynamical system 

■r-/(.i) (1) 

where ,r £ IF' represents the slate and /(■) satisfies the sufficient 
conditions for the existence and the uniqueness of the solutions for 
given initial conditions. Suppose that a scalar function V(.r) 6 C . 
/ > I, can be found such that along the solutions of (I) 

V’(.r) < (),V.r 6 n n 

= 0 .i* - 0. ' 



Fig. I. Phase portrait of the system (5). (6) and level sets t/Si for varying 
level values A\ for the numerical values a = 2, h = I, /# = 0.8. 

one of which, say Si contains .r„. This subset S i is a RAS for .K 
Sj C 

The largest stability region S\ is obtained for A- -- \ \ luu . In 111 Chiang 
and Thorp have reported a theorem pertaining to the existence of 
the minimum U„ m , and a scheme for computing the corresponding 
stability region Si based on it. 

Theorem /V/. If system (1) has a Lyapunov function V(.r) in /?" 
which satisfies (2) and if S2(.t\) is not dense in if", then V milJ as 
defined by (3) exists and r, is an unstable equilibrium state. 

The proof relies on the property that if for A- =. q the set S\ is 
a closed and bounded neighborhood of .r, which contains no other 
equilibria, and if for some p > q there are no equilibrium slates in 
the set S\\k_,, - S ,|a then 

S\ In-,, is also closed and bounded. (4) 


By (2), \‘{.r) is a Lyapunov function of (1) in i?". Let i\ be a locally 
asymptotically stable (l.a.s.) equilibrium state and let il(.K) C /?" 
be its exact RAS. Suppose that on the stability boundary t)S2(.i\), 
I ( i) reaches an absolute minimum at a — r, and let 

min V(.r) - V(.r, ) = T mm. (3) 

r i_<M{ j .) 

Then it is well known that .i\ is an unstable equilibrium point of the 
\vstem (1) 11]. Furthermore for any k in the interval 

Y(j\) < k < r„ lin 

1 • set 

.S' = V(.r) < A } 

ihe union of a number of connected, disjoint subsets 


In Section II a counterexample to this result and to the property (4) 
is presented. It is pointed out, however, that the theorem is valid 
under the additional assumption that all Irajcctorics on the stability 
boundary are bounded for t > 0, Section 111 discusses the 

implications of this proposition for the c.u.e.p. method. 

II. Example 

Consider an example of the form 


where .r € J? 2 and 

V(.r) = <~ J 1 - ( 2 . 1-2 - ) (6) 


5 = Si U So U ■ ■ ■ U S ,: 
S,nS, =0 for i^j 


with 

<’i (-/ i) = [(" J1 + at _ '" 1 4- 1>] 
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J ig. 2. Phase portrait and level sets OS, for the .system (5), (8) and a): for the numerical values « = H. I, -- I. < = 0 75. ii = 5 O'... does not 
exist.); b): « = 20, ft = I, <■ = l, ji = I (I’ nllI1 = \'(S,„)). 


such that Y(.v) is a Lyapunov function in /?' which satisfies the 
:onditions (2). Differentiating r<j ) w.r.t. n and ,r 2 yields the state 
equations 

■r i = <•"'- (2 j'j - j'l) 

•K' 1 +2 fl n.r, f -"'i] 

■h = 4.r u (l - j-ij) 

•[«'■■’ 1 + ar ~ , “* + ft ], 

"he phase portrait is symmetric w.r.t. .the . 1 * 1 -axis. There are two 
equilibrium states, x,« = (0, 1) and .v, = (0, —1). Computing the 
acobian J(.r) = df{x)/0x and substituting the coordinates of the 
equilibria produces .7 (j\) = ,J(x, ) = diag f-2//f/.-8(l + u + />)], 
:uch that j\ and x, are stable nodes. Furthermore 

x^ = I) => Jr‘2 = 0; 


J'2 = ±1 => .i’*> = 0; 

j-i = — t ~ ,l/1 


uch that there is a single trajectory coinciding with the .n-axis, 
vhile the straight lines x-i = l and r± = — 1 each consist of two 
ajectories converging to, respectively, x 9 and x, for t —► + :*. 
Anally all solutions are bounded for t > 0, except the solution whose 
/ajectory coincides with the .ri-axis (see Appendix A). Because 
f (7) all solutions which are bounded for t > 0 converge to an 
quilibrium state as t —► 4-x. Fig, 1 displays the system’s phase 
jrtrait. Replacing (6) by 

r(.r) = - rr“ /1J * - (2 j\, - x'£)H 2 (.r, ) (8) 


'here a > 0, h > 0, c > 0, n > () and 

•roduces a slightly modified example with state equations 

/•'] =r ; i - + (2 j^ - 

I . , '^1 

l .r 2 = 4 j- 2 (1 - 1 ) 


(9) 


'here r/i' 3 Ui )/dx\ = + 2p(« - 


If a > r and r is sufficiently large, then the structure of the phase 
portrait is identical to the previous one, except that there are two 
additional equilibrium states 0) and .»■„((/. 0) on the positive x, - 
axis. It is easily verified that (closest to the origin) is a saddle point 
and r r is an unstable node. Now there are three trajectories on the -r 1 - 
axis (see Fig. 2(a) and (b)). In both exumples S2(.r„) = {x;xj >0} 
and il(x, ) = {./■; j j < [)\ while dil(x H ) = Oil{x, ) = {r:.r 2 = ()}. 

For the first example Fig. I shows the set .S'i for increasing values 
of k>V(x 9 ) — -(a + /)). Note that liuij, 4 x V(.r) = \'1 () 

= —ft( 2 .r -2 - .r-J), which reaches an absolute minimum V..(.i 2 ) = 

-h at x -2 = ±1, and a relative maximum V\ lllflx (. v 2 ) = 0 at x> = 0. 

Recall that the stability boundary 0U(.i \) = <M2(.r, ) coincides with 
the axis xj = 0. Hence S\ remains bounded for -it 1 -f h) < k < -h, 
but it becomes unbounded for -b < k < 0. It remains a region of 
attraction for .r„ as long as k < 0. The unboundedness of S, for 
—h S k < 0 contradicts (4). Although the system has a Lyapunov 
function in P 2 which satisfies (2) and il{x s ) is not dense in Jf 2 ,1 (.#■) 
does not have a minimum on Oi which is counterexample to 
Chiang’s [I] theorem quoted above. 

III. Discussion 

In the procedure sketched in the introduction the set .S', grows 
monotonically and continuously for increasing k>\ r (.i\), since 
T(.r) 6 C 1 where r > 1, 5i remains a region of attraction of x„ as 
long as OSi does not intersect the stability boundary Let 

lo = inf V(x). (10) 

‘C-iUHi „) 

Then the largest obtainable stability region of the form 5) corre¬ 
sponds to the critical value k t1ft = \ r u . In the first example Vo = 0. 
If V(x) possesses an absolute minimum V llim for x 6 then 

Vi = r imu . The existence of a minimum is guaranteed if 

All trajectories on dil{s 4 ) are bounded forf > 0 (11) 

and hence by (2) converge to an equilibrium state as t —♦ +x- 
(see Appendix B). So (II) constitutes an additional (sufficient) 
condition which guarantees the applicability of the c.u.e.p. method. 
Property (11) certainly holds if it can be shown that i2(.rj is 
bounded, or il it can be shown that all trajectories are bounded 
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Fig. 3. Phase portraii and level sets OS\ for the system (5), (8) and for the 
numerical values a =■ 2, h = 1. // =“ O.K. 


for / > 0. The latter property can often be proved using a suitable 
Lyapunov function. For example if V(.r) salislies (2) and is radially 
unbounded, then all solutions are bounded for / > 0. In |6|, [7] the 
structure of the stability boundary has been studied under a set of 
assumptions including the convergence of all solutions on to 

an equilibrium slate. 

The second example shows that even when there are equilibrium 
states on Oil{r. ), \does not necessarily exist. If 

V(.r„ ) = < -/J — rr > liin V(.r) = 0 

r 2 _U 

then r„ M „ does not exist, while Vo = 0. This case is illustrated in 
Fig. 2(a). If V(.i-„ ) < 0 then l mm exists and the c.u.e.p. method can 
be applied with k, llt = V(.r„ ) (sec Fig. 2(b)). 

If V.m. docs not exist then in general k, l1t —- V 0 cannot be 
determined since in (10), cM2(.r\) is unknown. In such cases the 
only conclusion possible is that for increasing k the set Si remains 
a RAS as long as it is bounded and does not contain other equilibria 
than .i To illustrate the latter point consider a final example of the 
form (5) where 



- yfU 

,)[<• 

" ,J 4 ar 4 l>] 

(12) 

r-(-rj) = 

2 A - 


for .r-j > 0 


A 

0 


for r -2 < 0. 



Note thut V(.r) 6 f ,J . This system’s phase portrait is identical to the 
phase portrait of Fig. 1 in the half plane r 2 > 0. while for r 2 < 0 all 
trajectories are straight lines parallel to the .r i -axis (r 2 = constant; 
.r i +-* for / —♦ +>:). The system has a single equilibrium state 
i*(0, 1) with stability region S2(.i\) = {.r:.r a >()}. 

For increasing k the set 5 1 remains a RAS of r» as long as it is 
bounded. For k > 0, 5i is unbounded and contains trajectories that 
converge to infinity for t —► 4oc, hence it is not a RAS any more 
(sec Fig, 3). The critical value k ,, ,< =0 cannot be determined if the 
stability boundary 012(.r„) is unknown. 

Finally it should be stressed that (11) is only a sufficient but not 
a necessary condition for the existence of an absolute minimum 
of V(.r) for r E Any other sufficient condition for the 

existence of the minimum yields a valid alternative version of the 
fundamental theorem of the c.u.e.p. method. In [2, Theorem 9], and 


in [8, Theorem 4.1], Chiang has provided such an alternative which 
is suitable for studying the power system transient stability problem. 
An energy function V(.r) is introduced, which for the application to 
the power system model plays the role of a Lyapunov function, upon 
which three conditions are imposed: 

1) Along any trajectory r(t), V( •/■(/)) < 0. 

2) Along any nontrivial trajectory the set {t E /?: V(.r(f) = 0} 
has measure zero in i?. 

3) Along any trajectory, if \ r (r(t)) is bounded then ./-(f) is 
bounded. 

The conditions of Theorem 4.1 in [H] guarantee the existence of an 
absolute minimum of the energy function on the stability boundary. 
In fact Chiang has shown that under these conditions the c.u.e. point, 
where the absolute minimum is reached, is generically unique. 

Appendix 

A. Proof of the Boundedness of Solutions 

Since the Lyapunov function (6) is a special case of (8) consider 
the system defined by (5), (8). Let 

tn.r) i (i -4) z . 

Then using (9) 

iru) = -10.cj(l -Af rA-n) 

<-lfi-riu - .rZ) i h < 0 (13) 

since a > r. (13) implies that 

min [-rj(O). 1] < .ryV) < infix [.r](0). 1] (14) 

Vf > 0 such that 

IV(.r) < -A|ir(.r) 

hence 

\\'(.r) < [l-.r*((>)] 2 <~' 1 '; V/>() (15) 

where Ai ^ Kj/iinin [.r}(0), 1] > 0 if .i'j(0) ^ 0. Furthermore by (9) 
.r i = (1 — ,rj )*’[< J 4- 2//(ri — t ' ,,J — 2/ifiJ*i i “ l (16) 

hence 

,i j >0 for j'i < —m and m sufficiently large. (17) 

Finally by (16), r\ = 0 and ri j- 1 imply that .ri >0, such that 
.ri (0) > 0 implies that .r, (t) > 0 for all t > 0. Now by (16) 

•n < (1 - ri ) J (1 4 n )r 11 for n > 0 and sufficiently large 

or, defining p = (1 4 ##)[1 — .i** 2 (()))■% y = r /J and using (15) 

ii < i» ■ 

Integrating the latter expression yields 

i/(f) < go - ~-r ~ Sl1 < go; Vf > 0 
Ai 

and for some finite i/o. It follows that 

.rj(f) < In </o < x: Vf > 0 (18) 

if .i*j (0) > 0 and .r_»(0) ^ 0. (14), (17) and (18) prove the bounded¬ 
ness for t > 0 of all solutions for which .^(O) ^ 0. 

B . Existence of V m i„ 

Let A' = {,r i. ■ • ■, r „,} be the set of equilibrium points on 0H{ r M ) 
and let 1 m,„ = min V(r ,) for / = 1 • ■ ■ m. If -t’o € ) then the 

trajectory starting at .r 0 at / = 0 converges to an equilibrium point 
i\ € Since along this trajectory V(.r) is nonincreasing it 

follows that V(.r 0 ) > V(j\) > V„ mi for all r 0 £ This 

proves the existence of the minimum of V(.r) on t)n(.i\). 





soo 
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Boundaries of Conditional Quadratic Forms—A 
Comment on “Stabilization via Static Output Feedback” 

D. Cheng and C. F. Martin 


Abstract —Motivated by the above paper, 1 this note considers the 
oundaries of a quadratic form with all possible constraints over a given 
subspace. Essential upper (or lower) bounds are presented provided they 
wist. It mends a mild incompleteness in the proof of the main result. 


J. Introduction 

It is well known (see, c.g., [2]) that for a real symmetric matrix M 


niinrr(.\/) < 


.rM U 

.r f jc 


< nirLX<r( M), 




(I) 


where rr{M) is the set of eigenvalues of M. 

In the above paper, necessary and sufficient conditions for the 
existence of a stabilizing static output feedback gain matrix were 
presented. In the proof of the main result, Theorem 3.1, the following 
fact was used (for the sake of consistency we use our notations). 

Given a real symmetric matrix: M nyn , and a matrix A',„ Xf( with 
rank(A') = r < n. Assume 


Then 


.r T ( M )j■ < 0; V.r E K( r[K). (2) 


nup .r l M,r 
•r ,r T h r h'.r 


< "jc. .r g K( r{ A'), 


(3) 
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A mild incompleteness in the proof 1 is: They did not claim and prove 
that the o (defined in (3.4), which is the same as in (3)), is upper 
bounded, i.e., r* < +oo. It is essential for constructing E (E~ ] > 
<>/). It was pointed by a nominated reviewer that this boundedness is 
also assumed in [21 without proof. From the following discussion one 
that this fact is not trivial. We call it the problem of boundaries of 
conditional quadratic forms. It can be considered as a generalization 
of (1), because when dim(A’) = 0, our results in this note will 
coincide with (1). 

In the next section we prove (3) by giving the essential upper 
bound, which may be of independent interest. Then in Section III we 
discuss all other constraints. 

II. Main Result 

Let the matrices 3/, lX A'„, x „, with rank(A') = r < n, be as 
above. Then there exists a linear transformation $ such thal 

A'<I> = (5 | 0) 

where S is the first r columns, and thus S has full rank. It follows 
that S 1 S is a positive definite matrix, so we can define a posilive 
definite E as 


£ = (S / S) l/ - > 0. 


(4) 


It is easy to prove that (2) is equivalent to ihe fact that after the linear 
transformation d\ M has the following form 





(5) 


Then we detine a characteristic matrix C as 

C= E~ l (A + I1Q 'I3')E (6) 


Using the above notations, we can prove the following theorem. 

Theorem I: Under condition (2), we have the essential upper 
bound as 


sup ./■ 1 M.r 
jt 


.r £ Kt r(Ji) 


= n\nxtr{C). 


(7) 


Proof: It is clear that .r£K( i*(A ), if and only if 

Cl> 

Since Q is positive definite, Q'*~ > 0 is well defined. Using (5). a 
straightforward computation shows that 



_ »u>> riiA + DQ-'n'^-WQ'Oto-Q-'OB 1 m || a 

U yJ'S'Su, 

- * uv 12U1 'JlLlm («i 

j/i * <> nl s 1 ~siii ( 

1 A. Trofino-Ncto and V. Kuccra, IEEE Trans. Automat. Contr., vol. 38, no 
5, pp. 764-765, May 1993. 
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where 


= = y, * 0. 


U2 


The last equality is obtained by setting yi - Q ' B 1 y\. 
Recalling the definitions of E and C in (4) and (6), we get 

* «"P yUA + DQ-'B 1 )y t 

l /j ^ 0 yj s 1 Sy\ 
ft up Z l CZ 

~ z± oW where z = Lyi ' 

From (1). it can be seen that 

sup x 1 Mx . 

- f T- r ' r. ..r^Arr(A) 

x x 1 A 1 A r 

= iu<ix<r(C) < -f x. 


(9) 


Since the transformation 4* is not unique, the last thing we have 
to do is to show that the essential upper bound obtained in (9) is 
independent of the choice of Theoretically, essential upper bound 
is unique. But, we use a particular $ to get it, and the parameters 
A , BX\E , etc. in the expression depend on 4>. k So we must show 
that the upper bound is independent of Lei $ = (ojto^), where 
Spanks) = K 1 , be another suitable linear transformation, and 4> = 
4>7 = (ui \a> )T. Since Span(c>j) = Span(C^) = K . ii follows that 



Now, a straightforward computation shows that the corresponding 
expressions under the new transformation are 

( s u* :)*<**>'<«■ 

= T'[ K •!>)'( K*)T= ^ S ' () ST ' 
and 

f .4 n \ _ /r/ t! \ ( a n \ (i\ o \ 

\ti' -q)~\ 0 Ti )\n‘ -q)\t 2 Ti) 

_ (T{ AT\+T-! D' 7,+T/ IJT.-Tl'QT. T / M\-Tj QT, \ ... u 

~\ T! n' 7’, -Tl QT, -r! QTi ) ( 0) 

Using them, we finally have S' S = T' .S ' 1 ST\ . 4 + DQ 1 D 1 = 
T! (,1 + DQ~ [ ). 

Therefore, the parameters obtained by the new transformation 
provide 

A«i> y[(A + rny'D' ) ih 
1/1 y ( S / S y\ 

=* . z • where Z = ET 'V' ■ 

Q.E.D 


111. Generalization 

As we mentioned before, conditional quadratic form (3) is a 
generalization of the famous result (1). So the boundary problem of 
expression (3) has both theoretical and practical interests. To make 
(3) meaningful x $ Ker(7\) . Moreover, without constrain on Ker 
(1\ ) expression (3) has no boundary. (2) is a particular constrain. This 
section consider all other possible constraints on subspace Ker(7v'). 
They may be applied to mini-max problems of quadratic forms, 
Case 2: Condition (2) is replaced by 

.«- 7 (A/).i- > 0:V.r 6 Kcr(K). (11) 

In this case, replace -Q by Q in (5) and redefine C in (6) accordingly. 
Then a parallel discussion shows the following corollary. 


Corollary I: Under condition (11), we have 
sub Jr 1 M.r 

•r ^ Ker A = mlnrT{C 

Case 3: Assume now we have 

.r r (M ).r = (); V.r 6 Ker(7i). 

In this case replace Q by zero in (5). Equation (8) becomes 
sup x 1 Mx 


( 12 ) 


(13) 


K r 7v 


x g Ker(/v) 


= sup ^ Mjjri + 2 y] B 1 t/i 

y y[ S r Syi 

From (14) one sees the following corollary. 
Corollary 2: Under condition (13), if B ^ 0 

^ Ker(A ' ) 

has neither upper bound nor lower bound. If 27 = 0 


(14) 


liiiiiirfT?” 1 ME ~') < 


j- 1 M.r 

~ .r 1 K r Kjt 

■r £ Ker( ft'). 


< iiuix<t{E ME 1 ), 


Proof Observe (14). If B = 0, the conclusion follows from the 
standard result (1). If B jz 0, choose 

\li = pB 1 i/i .k G B. 

Letting // go to cither - x or +x, one secs that neither upper bound 
nor lower bound exist. Q.E.D. 

Note that in Case 3 we have from (10) that B = T\ T BT\, and 
T j and T are nonsingular. It is clear Lhal the conclusion in the above 
corollary is independent of the linear transformation. 

Case 4. Assume 


•r 7 (.17).c < 0; V.r 6 Ker(/k). 


(15) 


In this case the matrix Q in (5) is positive semi-definite. Then we 
have the following corollary. 

Corollary 3. Under condition (15), if there exists a matrix II with 
suitable dimension such that B = IIQ, then 

7 (prnb) = + HQH 7 )E- 1 ), 


.r i Ker(Tk ). 


Otherwise 


.r 1 M.r 
r> K‘ K.v 


. -i’ $ Ker(7k ) 


has neither upper bound nor lower bound. 

Proof: Let B = HQ, Then we can choose 


-(//' /)• 


From (10), new 37 is block diagonal, and then (8) becomes 

sup ,r ! M.I- . 

t i ~ i t - ’ ^ Kcr( 7v) 

x x 1 h 1 h x 

_ soil y[ ( a + 7/Q77 1 )j/i - I/i Qwi 

y y{S l Sy t 

sup y[ [A + HQjT )j/i 

y t uTS^Syl 

The conclusion follows. 
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If D ^ HQ , the rows of D are not all in Span row Q, Without On Robust Stability of 2-D Discrete Systems 

oss of generality we assume 

W.-S. Lu 


y = (j)‘ q) 1 where Q\ > 0 

and partition y -2 as 



Now (8) has the form 


^ Abstract — This note presents a study on robust stability of two- 
dimensional (2-D) discrete systems in the Fornasini-Murchesinl (F-M) 
state space setting, A measure of stability robustness of a stable F-M 
model is defined. Relation of this measure to its counterpart in the 
Roessor state space and related computational issues are addressed. Three 
lower bounds of the stability-robustness measure defined are derived 
using an one-dimensional parameterization approach and a 2-D Lyapunov 
approach. A numerical example is included to illustrate the main results 
obtained. 


.v[ (.4)y)l + 2(yJ| 1 y2a)Q , »i 

tfS T St, 


I. Introduction 

In this note, wc present a study on robust stability of two- 
dimensional (2-D) discrete systems under unstructured perturbations. 
Throughout the concerned system is modeled in the Fomasini- 
Marchesini (F-M) local state space [1] as 


lie condition: “rows of D are not all in Span row Q" implies that 
3ya2 is not always zero. It is obvious that this term can make the 
/alue of the fraction be both positive and negative unbounded. 

One can also see from (10) that the condition B = HQ is 
ndependent of the linear transformation. 

Q.E.D. 

Case 5: Assume 

.1 ■' (M ).!■ > 0: V.r £ Ker< A'). (19) 


n this case replace -Q by a positive semi-definite Q in (5). Similar 
:o the proof of Case 4, we can show the following. 

Corollary 4: Under condition (19), if there exists a suitable di¬ 
mensional matrix H such that B = HQ. then 


Hub ( .r T M.r \ 
* {.r'K 1 K.r) 


= min <t{E~'(A + HQH‘ )E _I ). 


.r £ Ker(A'). 


Dtherwise 


•r' Ms 
.r' iv ' A> 


. .!■ £ Ker( A) 


ias neither upper bound nor lower bound. 


IV. Conclusion 

In this note we discussed the problem of finding the boundaries 
rf conditional quadratic forms with all possible constraints over a 
ubspace. In all cases, the necessary and sufficient conditions for the 
,xistence of upper and/or lower bounds are presented. The essential 
xmnds are obtained whenever they exist. 
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.»■(/ + l.j + 1) = Au'ft.j + 1) + .4j.r(/ -|- I,.;) (1) 

where .r(i.j) 6 H uX] . .4i, -L £ /?"*". Recall that system (1) is 
asymptotically stable if and only if 

//(c.'-jlsilrtl/., - :,-4 ( - for CT (2) 

where V =- {Ui.:*) : |:i| < l,|: fc *| < L. } |1|. To date, results 
on robust stability of 2-D discrete systems in a local stale-space 
framework are only available for the Roesser model [2|-|6], and 
one might attribute this to the lack of a Lyapunov stability theory 
for the F-M model. The objectives of this note arc twofold. First, we 
propose in Section II a quantitative measured, v, for the unstructured, 
stable perturbations of a given stable 2-D F-M system, and derive in 
Section III a lower bound for Issues on numerical evaluation of the 
bound obtained and the relation of the proposed stability robustness 
measure with its counterpart in the Roesser state space will also be 
addressed. Second, we propose in Section IV a Lyapunov approach to 
analyzing the robust stability of (1), leading to two lower bounds ot v . 
The proposed approach makes use of the 2-D Lyapunov equation [7| 
which is a generalization of the 2-D Lyapunov equation investigated 
recently by Hinamoto |X). A numerical example is included in Section 
V to illustrate the main results of the paper. 

In the rest of the paper wc write H > 0 or IT > 0 to mean that 
the symmetric matrix H is positive definite or positive scmi-definitc. 
For a real matrix P > 0, one can always write 

p = r'w 


where C is orthogonal and £ = rliag {cr i, ■ ■ ■, rr n } with o k > I), 
for 1 < < n. If we denote = diag . ■ ■ ■, n\J 1 }, then 

p — p 1 /- pd ' 2 

where 

pi/i _ au(1 pin = { pi/^r 

Such a P 1/ ' is called the nonsymmetric square root of P. The larges) 
and smallest singular value of matrix H is denoted by <t(/ 7) and 
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v(H ), respectively, ||£f || denotes the induced 2-norm of H , which 
is equal to <r(H ). Throughout vta shall use A to denote the n x 2 1 ; 
real matrix [.4, At] where A, and A 2 are from (1) and call it the 
system matrix of (1). .4 is said to be stable if (1) is stable i.e., -4 
satisfies (2). 


Assuming (9a) is stable, the commonly used stability robustness 
measure of the system (in the Roesser state space) is 

£ = l|Ai|| 

A •U'/t’u 

where 


II. A Measure of Robust Stability of System (l) 

For asymptotically stable system (1). let .4 = [Ai At] and denote 
by S„ the set of all complex perturbations A.l = [A.4, AA 2 ] € 
C n * 2 " with .4 + A.4 unstable, i.e., 

S„ = {A.4 : [A.4 1 AAt] € ,4 + A.4 unstable}. (3) 

Two sets of matrices can be induced from set S tl as follows 

S UI = {A.4, : A.4, £ [A.4, 0] 6 S u \ (4) 

S HJ = {A.4-2 : A.4 2 G C'*'\ [0 A.4,] G S«}. (5) 

In words, 5 (ll and S u , are the collection of the lirst and the second 

n x 11 blocks, respectively, each of which itself causes system 
instability. 

A measure for robust stability of system (1) is defined as 

" = aVps„ H a, 4 H’ (6) 

Obviously, a perturbed system with system matrix \ + A.4 is stable 

if ||A4|| < 

With the sets .S',,, and S„ 2 defined in (4) and (5). two auxiliary 
quantities can be introduced as 

,'1 = . l|A-li || iunl in = inf ||A4,||. (7) 

A \\* - A l.f\. 

Since S n] C S„ and S„ 2 C S'„, we have v < /n, v < (i ■#, and hence 
\> < mi ii(/ i i. /ij). (8) 


Shortly we shall sec that a relation similar to (8) also holds when the 
system is modeled in Roesser stale space. 

Like the l-D case, v defined in (6) will be referred to us the stability 
radius for unstructured complex perturbations for state-space model 
(1). Since perturbations for a physical 2-D plant are always of real 
value, a more realistic robust stability measure can be defined as 

v, ~ inf 11A.411 

A Kf l u 

where T„ is the set of all real perturbations A.4 = [A.4| A.4 2 ] G 
with .4 4- A 4 unstable, i.e.. 

T„ = {A.4 : [A. 4 1 A .4 2 ] G .4 4- A 4 unstable}. 


Note that T„ is a proper subset of S„ and consequently the stability 
radius for unstructured real perturbations, v ,, is always no less than 
/*, This means that /', is indeed is less conservative measure for 
stability robustness. For the l-D case, the real stability radius has 
been investigated | J7J-|22) and a complete solution has been obtained 
very recently by Qiu et ai [221. The generalization of the approach of 
[22J to the 2-D case is however far from straightforward and seems 
adequate to leave it as a separate lopic for research. 

Concerning the relation of the proposed stability measure to that of 
Roesser model, we recall [ 11 that if a 2-D discrete system has been 
modeled in the Roesser state space as 




4, 

.4 ," 

UJY 

= A 

O.j)' 

/"(»'. j +1). 


/i.) 

A i 

/'(/J). 

/"(/../). 


it can be remodeled in the Fornasini-Marchcsini slate-space as (l) 
with 




V'(m) 

y'a.j) 



and .42 


4, A 2 
0 0 
(96) 


/?„ = {A.4 : A 4- A A unstable} 


with 


; [Ai, Aij 
Ai., Ai., 


According to the structures of A\ and At in (9b), perturbation matrix 
Ai in the Roessor model is the sum of perturbations for A\ and At 
in the F-M model, i.e.. 


A.4 = 
Hence 


0 0 
Ai, A.l, 


A.4, A.4 2 

0 0 


= A.4, 4- A.4 2 . (10) 


£ - inf 11Ai11 = inf ||A.4, + A.4 2 | 

A \£H U A4(ES U 


where S„ is the set of unstable perturbations A.4 = [A.4, A 4 2 ] 
with A.4,. A.4 2 defined by (10). From S„, C 5„ and S„, C S u , 
it follows that 


t = Jnf, l|A4, + A .-la 11 < Jnf IIA.4, + A .4a H 

= . .'"f I|A4 j||=/m 

•A ' * j F ■* u j 


and 


4 = A inf s ||A.4, + A4,|| < ^ inf ||A4, + A4*|| 
= .. i ‘»f l|A.4a|| = 

A \ | fc s t, j 


Hence 


C < min(//,. /it). (11) 

On comparing (11) with (8), we see that v for the F-M model is in 
a certain sense similar to £ in the Roesser model. In the rest of the 
paper, our attention will be focused on investigating bound v. 

III. A Lower Bound of v and Its Evaluation 


A. A Low er Bound of v 

Let il = {(u. 1 1 . jJt ) : — 77 < a’, < 7T, -:t < < 7T}, and define 

1 = inf £[/,. -(^' J .4i (12) 


where I„ is the identify matrix of dimension o. As 
a[I„ -r'-'M, = - .1, + —’>4,] 

where // = ^ and ^ we have 

1= inf inf a[( y ~7„ - .4(//)] 

= inf inf rr — -4(/i)] (13) 

0 ,1 < fl “ T < yj < TT 

where 


A(/i) = 4, + f J,, 4 a (14) 

and the last equality in (13) is due to the fact that 
/(/<)= inf <r — 4(/i)] 

— rr < < 7T 


( 15 ) 
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is an even function for // € [-it, tt]. With the definition of f{p) 
in (15), (13) becomes 

■v = inf /(/<)• 06) 

0< t*<ir 

To explore the quantitative relation of 7 to the robust stability 
measure u, we note that ( 1 ) is asymptotically stable if and only if all 
eigenvalues of A(/i) for each p € [ 0 . n] are strictly inside the unit 
circle [9J. Hence ,4+A-4 represents a stable system matrix if and only 

is a stable perturbation of ,4(j/) for each p € [0. 7 r]. 

Since the exact 2-norm bound of complex, stable perturbations of 
A{p) is equal to f(p) = min <r[r /Ur 7 - .4(/0) [10HI2), it 

— n < i*> <. ir 

follows that A.4 is a stable perturbation of .4 if 

A.4 < f(p) for all // e [ 0 . n], (17) 

As A.4 < \/2||A.4|| and f(p) > (17) holds if 

||A.4|| < -^ = A, (18) 

This proves the following proposition. 

Proposition I: Define - by (12), A) = ‘ / \/2 is a lower bound of 
the robust-stability measure v defined by ( 6 ). 

Two immediate consequences from above analysis are as follows. 
Proposition 2: A.4 = [A.4i A .4a] is a stable perturbation for 


system (I) if 

IIA^IKIIA^IKv (19) 

Proof: For any ft 

A.4 J ltI <||A.4,|l+||A.4,||. ( 20 ) 

So if (19) holds, then /(/<) > 7 leads (20) to (17). □ 

Proposition 3: A.4 is a stable perturbation for system ( 1 ) if 

i < 7 (21) 

where 

1= sup ||A/4i +f ,,, A-4 a ||. (22) 

0 < n ^ n 

Proof: If (21) holds, ihen for any p 

A.4 lj <-,<*,</(//) 

which leads to (17). □ 

B. Computational Issues 


We see from (13) that computing 7 is a minimization problem 
with the objective function given by F(lj,j/) = <r[r iu 7„ - A(;i)]. 
Because of the periodicity of F{^,p) with respect to ^ and //, this 
minimization can be considered as an unconstrained problem, hence 
efficient optimization methods such as quasi-Newton algorithms can 
be used to find . As F{ p) represents the smallest singular value of 
a two-parameter matrix, there is in general no closed-form formula to 
compute the gradient of F( ^.p). For any b > 0, however, elementary 
properties of singular values [13, Chapter 6 ] imply that 

|JV + A./i)-F(u../i)| < '«->+»)_ r**\ < b 

and 

|F(*./i + A) - F(^./ 0 | < ||.4(p + b) - ,4(//)|| < A||.4,|| 


indicating that for any b > 0, ratio [F(*> + h p) - F(w,/t )]/6 and 
[F(jl'.p + 1 *) - //)]/A are bounded by unity and 11 Aj ||, respec¬ 

tively. Hence the gradient of F(*\p) if it exists is bounded and reli¬ 
able approximation of it can be obtained using simple numerical dif¬ 
ferentiations (i.e., the above difference ratio) with sufficiently small b. 

*An alternative approach to the numerical evaluation of 7 is to 
make use of (16) in conjunction with the fact that for a fixed p, f{p) 
defined by (15) is the exact bound for stable, complex perturbations 
of the system with ,4(/«) as its system matrix. Consequently, f(p) 
for a fixed p can be computed using the efficient bisection method 
proposed by Byers [14]. Again, due to the periodicity of f(p), (15) is 
indeed a scalar minimization problem, which can be solved using, for 
example, the modified threc-point-pattem algorithm 115, Section 7.3]. 

We shall touch upon these computational issues again in Section 
V through an illustrative example. 

IV. Lower Bounds Derived Using the 
Generalized 2 -D Lyapunov Equation 


A. The Generalized 2-D Lyapunov Equation 
In [7] the following proposition was given. 

Proposition 4 [7]: Equation (1) is asymptotically stable if there 
are positive definite F, If,, and \Y -j such lhal 

,._r r'mwr'f* 0 1 ... 


and that 


r i/*\y 2 p'/i 


/„ - U i - ir. > 0 . 


, -.l'/MX) (2.1) 


This proposition is a generalization of the 2-D Lyapunov theorem 
proposed recently by Hinamoto [8], It was shown in [7J that as 
compared to Hinamoto’s result, the generalized Lyapunov equation 
(23) can be used to confirm siability of a broader class of 2-D discrete 
systems in the F-M state space selling. As [7| has not been available 
to the reader, for the sake of convenience a proof of Proposition 4 
is given in Appendix A. 

B Two Lower Bounds of v 

Consider a stable system (I) that satisfies (23) and (24) with some 
positive definite If H i and \\\>, and let the perturbed system be 
given by 

.!■(/’ + lnj + 1) = (,4| + A.4 1 )r(i,j + 1) 4 (.-In 4 A.4j )./■(/ 4- l.j )■ 

(25) 

The matrices I\ U’j, and H ' 2 are employed to construct the Lyapunov 
function 

0(L j ) = 01 (/,./) 4 Oj (/, j ) (26) 

where 

Oiii.j) = U.j + 1 )P r/2 \V t P' / ’.rU.j + 1) (27) 

O Ai-j) - r 1 U + 1../)/' 1 +1.7). (28) 


01i (L./) =• 0][i 4 1../) + 0a(L j + 1). (29) 

By (24) 

0,1 ((. j) = . r r u + 1.7 + dp' / 2 <ir, +irjp l/ V(/ + 1.7 + d 
< ,r' (/ + 1.7 + 1 ) P . r(i + 1.7 + 1). ( 30 ) 


Now using (23) and (25H30) we compute 

A tp ( inj ) = 01 I (/,./) - 0(/../) 

< r‘ (iJ)[{A r + A.4)P(.4 + A.4) - P].i-(i.j) 

< -\e.M)-29 l/a {r-Q)# ,/3 {P)\\±A\\ 
-«T(P)||AA|ni|.r((.7)|| a 
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... , n fP T ' a WlP 7 '* 0 

• r( '” y) r{i + l.j) aiK P ~ 0 P 1 l2 W 2 P' 12 • 


Hence if 

then &0(i*j) < 0 for all i, j, and a typical argument (see, e.g., [16]) 
applies to show the stability of the perturbed system (25). 

An alternative Lyapunov function may be defined as 

>Ai.j) = [0 i U.j)+O J U.j)}' /2 . (33) 

Similarly we define 

i’n(i.j) =■ + Lj) + Oi(iJ + 1)] ,/2 (34) 

and use (24) to write 

i’iiO'../) £ [■<•' (i 4- 1 ,j + l)P.i(/ + 1 J + 1 )] ,/v - (35) 

Now (23), (25), (33), and (35) imply that 
Ai'(ij) = <’n(f .j) - r(i.j) 

< [s'(>\jHA + A 4)' P(A + A l).r(*.j)] l/v 
- ( t.j)P.i(i.j )] 1 

<-[ ■ —- g M -^ -* I/2 (P)||A4|| 

Thus if 


constrained minimization problem as follows. Note that (23) and 
(24) are equivalent to 

i/l^Po rj >0 (40) 


i-vr'v : 1 


lv ' rr' > o 


respectively, where T — P l/,a . ,4 = [A| Ai), A, = T 1 A,T, 
U', = r-'r," 1 (/ — 1,2), which are in turn equivalent to 


iinnmnzi* 

\ nonniniiulfti 
/ , 1 . - / . 1 


;p'i 0 

A 0 r 3 


IIA 4|| ^ --- r (36) 

iri(P)|rr4(P-y) + ffl(D] 

then Si'ii.j) < 0 for all i.j , and therefore, the perturbed system 
(25) will remain stable. 

Although our numerical experience indicates that the lower bounds 
of v given in (32) and (36) usually stay quite close each other, the 
bound given in (32) is always better than that in (36) as is shown 
in the next proposition. 

Proposition 5 ■ 

U. 2 < U , (37) 

where 

, _ [<r(F-Q) + <T.((})\' rJ -*' ri (P-Q) 

J, -‘ = - 1&W) 

,), , = — ; --— (39) 

with P given by (31). 

The proof of the above proposition can be found in Appendix B. 

C, Remarks on Numeric al Solutions of the 
Generalized 2-D Lyapunov Equation 
To compute lower bound or one must find positive defi¬ 
nite P, H’i, and ir 2 that satisfy (23) and (24). Although constraints 
(24), P > 0, H F i > 0 and 11'* > 0 can be put together as a set 
of linear matrix inequalities (LMI’s), it is difficult to formulate the 
problem of solving (23) as a convex programming problem such 
as those investigated in |23| and [24], since solving (23) cannot be 
formulated as a problem of minimizing A with AZ?(.r) - .4(.r) > 0 
where r is the vector collecting all the parameters in P, 11",, and 
H' 2 , and D(,r) and A{.v) are matrices depending on x affinely. The 
approach taken here is to reformulate the problem as conventional 


Solving the constrained minimization problem (42) is in general 
numerically intensive in the sense that it usually requires a fairly large 
amount of computations to obtain a local minimum, due primarily to 
the iterative nature of' the algorithm. When the norm of system matrix 
.4 or the norm of a weighted version of *4 is small in a sense specified 
below, however, solutions to the generalized 2-D Lyapunov equation 
(23) and (24) can be found easily. For example, if 

11-411 < (43) 

then U i — U'j = + /„ and P — f„ satisfy (24) and lead Q in (23) to 

Q=- \h„ -A 1 4 

which is positive definite. The above choice of 11 \, If'i, and P leads 
10 P= ' 1 h„.*(P-Q) = p||\ £.((,))= j~||.4|| 2 . and <f(P) = 1. 
The two lower bounds in this case arc found equal, being 

hi = = Tf-IMH. (44) 

Another simple case is when weights <\ > 0 and .1 > 0 can be found 
such that 

f + -Jr = 1 (45) 

l\~ 1- 


II [o.4, Uj]||<l. (46) 

Obviously, H’i — A /„, 11 j = tj/m, and P = T„ satisfy (24) and 
lead (J in (23) to 

Q= W iliH 11 <47) 

which is positive definite if (46) holds. The two lower bounds in this 
case are given by 


=[IMI|-‘+£<«)] ,/2 - 11-411 

(48) 

j _ ZlQ) 

||.4|| + max(n _ 1 , )' 

(49) 

V. An Examplf. 


fourth-order F-M model with 



0.0056 

0.2853 

-0.2432 

0.124G 

-0.0064 

0.2377 

0.1 GOG 

-0.1404 

-0.0084 

-0.0217 

0.1785 

0.1 GG2 

-0.0245 

0.0884 

-0.0321 

0.2428 

0.0495 

0.0G87 

0.2030 

-0.3013 

0.1G30 

0.4817 

-0.0993 

0.1814 

-0.2842 

0.1790 

0.5551 

0.0799 

0.2112 

-0.1887 

-0.0895 

0.5G04 
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As HAH *= 0.7791 exceeds \/s/2 « 0.7071, we attempt to find 
positive scalars o and d that satisfy (45) and (46). Using (45) we 
write = o/\Av a - 1 and define the objective function 

f(o) = ||[a„4i —7==2====.4^]|| for 1 < n < sc. (50) 

Applying the modified three-point-pattern minimization algorithm 
(15, Chapter 7) to the bracket (i.e., an interval of n) [1.1, 31, it 
took 12 iterations with 17.796 Kflops (10‘ J floating point operations) 
to find the minimum of ^(o), r(n') = 0.9001 < 1 at a* = 2. This 
leads to a solution to (23H24) with = 0.25/4, U 2 = 0.7574, 
and P =s h. It follows from Proposition 4 that the 2-D system is 
Stable, From (47M49), lower bounds and 7/^ were found to 
be Jii = 0,0310 and fi A = 0.0300. The total number of flops 
Used in computing /// 4 or U , is about 21 Kflops. To evaluute the 
lowef bound fo in (18), a quasi-Newton optimization method was 
used to evaluate where the gradient of F{uj,p) was performed 
numerically with h = 10“‘\ It took nine iterations (with 241.6 
Kflops) to obtain *> = 0,1775 at [a,’*,//*] = [0.0252 0.3856], which 
leads to = 0.1255. It is noted that hound is considerably less 
conservative as compared to .h. L and >h.> although computing Jo is 
far more expensive in this case. 

VI. Concluding Remarks 

A robust stability measure has been proposed and three lower 
bounds of v for F-M 2-D discrete systems have been derived by two 
distinct approaches. It appears from the numerical example discussed 
in Section V that the bounds h 1 and derived using the Lyapunov 
approach are quite conservative. This could be improved by carrying 
out the optimization problem (42) rather than using (50). As a matter 
of fact, the norm in (42) becomes t (o) in (50) if T = />, V] = n/ 4 
and V 2 = | ^ with n > 1 arechosen. With more parameters in 

(42), better lower bounds J/ M (i = 1,2) can be found at the expense 
of higher computation intensity. 


Appendix A 

Proof of Proposition 4 : We prove the proposition by contradic¬ 
tion. Suppose the conditions of the proposition are satisfied but (l) 
is unstable. Then there is (- 1 ,- 2 ) E V such that 

drt(/„ - ~i.4, - z-jAi) =0. (51) 


Hence there exists vector r ^ 0 such that 


Using (23) and (52), we compute 


(52) 


i*Pp = 


p // 2 vr,r ,/2 

0 


0 

p 1 /J w 2 p l/,i 

,-2/n . 


**Qv t 


where v * denotes the complex conjugate transposition of t\ *, is the 
complex conjugate of i, (i = 1,2), and r' = r*[z\J n 12 /«]. It 
follows that 

v’r T/IJ (in - i~.| 2 m - |; a | a ir 2 )i* ,/ *i' = -i>:qv,. (53) 


By (51) we note that (< 11 . 32 ) ^ (0,0), hence v s ^ 0. So Q > 0 
means that the right-hand side of (53) is strictly negative. On the 
other side of equation, |-i| < 1 and \z 2 \ < 1 and the positive semi- 
dcfiniieness of /„ - Wi - W 2 [see (24)] imply that I - \z\\ 2 Wi - 
|ia| 2 W”a > 0. Therefore the left-hand side of (53) is nonnegativc, 
leading to a contradiction. This completes the proof. □ 


Appendix B 

Proof of Proposition 5: l&l a = £ {Q), h — ff(P), r = fr(P - 

Q ), and d — <r(P). Note that a > 0, b > 0, c > 0, d > 0, and that 

<HP - Q) < <HP) - a(Q) 

Which implies that 

(« + c)' ri + r 1/2 < <- l/2 + rf )/2 . (54) 

Since h > 0 and d > 0, it follows from (54) that 

a ^ + 

ftl/3( r 1/2 + ,/l/2) - " fcl/^ □ 
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Multivariable System Identification via 
Continued-Fraction Approximation 

Rolf Johansson 


Abstract —This paper presents theory for multivariable system iden¬ 
tification using matrix fraction descriptions and the matrix continued 
fraction description approach which, in turn, yields a lattice-type order- 
recursive structure. Once the matrix continued-fraction expansion has 
been determined, it Is straightforward to obtain solutions to both the left 
and right coprime factorizations of transfer function estimates and, in 
addition, a solution to problems of state estimation (observer design) and 
pole-assignment control. An Important and attractive technical property 
is that calculation of transfer functions on the form of right and left 
coprime factorizations, calculation of state variable observers, and regu¬ 
lators all can be made using causal polynomial transfer functions defined 
by means of matrix sequences of the continued-fraction expansion applied 
in causal and stable forward-order and back ward-order recursions. 


I. Introduction 

A long-standing and somcwhal embarrassing problem in system 
identification is how to extend identification methods developed for 
single-input single-output systems to the case of multi-input multi- 
output systems. One important reason for this situation is that a large 
variety of ARMAX-lype system parameterizations are suitable system 
descriptions and that no unique parameterization can be easily defined 
113]-| 14). 116], 118], |20). This leaves a user with uncertainly as to the 
validity and numerical aceuiacy of calculations ~e.g., state estimation 
and control, based on such transfei function estimates. For instance, 
consider the following (wo factorizations 


{ 

{ 


7" and 

-l/ih 'W(. '){-» 

ii = lhA - 1 )Q( - ')C 


Q uni modular 


(I) 


where the partial stale £ provides a link to state-space realizations 
via the controllable canonical form (14. p. 4031. Clearly, the transfer 
functions of the two factorizations in (1) cannot be distinguished and 
there is no obvious choice of factorization to be preferred. As £ is 
not known, there is also little hope to estimate {An. Bn) by ordinary 
methods of identification. As a result there are no general purpose 
methods nor software available whereas many control design pro¬ 
cedure actually require coprime matrix fraction descriptions (MFD). 
As there are no established methods tq find MFD\ such as (I) from 
data, it remains a relevant problem in system identification to develop 
effective identification methods for such model structures [ 131. [16]. 
[20]. A technical problem in this conlexi is that statistical consistency 
of estimates of the matrix fractions becomes a meaningless issue 
as there is no unique parameter set towards which estimates may 
converge. Another as yet unsettled issue is the relationship between 
transfer function estimation, slate estimation, and control system 
design. 

A suitable starting point is to focus interest on transfer functions 
that are matrices of rational functions and transfer functions that can 
be approximated by a rational approximant (or a series of approx- 
imants for a sequence of increasing model orders). Approximation 
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Fig. 1. Expansion ol a transfer function into ihe coefficients 

ancJ a rcsi( j ua ] transfer lunction block G^(-) as practiced 
in continued fraction approximation. 
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Fig, 2. Interpretation of the continued 1 rad ion expansion as a recursive 
transfer function relationship in a forward (right) or a forward/backward 
manner (left). 


of complex-valued functions can be made by means of polyno¬ 
mials (Taylor series expansions) or rational functions (continued 
fraction approximation). Like infinite power series, continued fraction 
approximation can be used to represent certain types of analytic 
functions. Contrary to representations by power series, continued- 
fraction (CF) approximation may converge in regions that contain 
isolated singularities of the function to be represented and the 
approximants take on values in the extended complex plane CU(x}; 
see (8] and [23]. The method is well known from circuit theory 
and from the theory underlying the Routh criterion and the method 
has also been used for model reduction of linear systems [3]-[4], 
[6], [9]-[ 12], 1151, multivariable systems 111-[2], [7], and realization 
theory [2], 117], although recent focus in model reduction has been 
shifted towards methods based on balanced realization. 

Consider an asymptotically stable sysiem with a transfer function 
£»'(-,) in the forward shift operator or z-transform variable Then, 
the transfer function G {;) may be expanded according to continued 
fraction which for a scalar G {:) takes on the form 


G{i) = 


B{- 3 ) = _1_ 




r<" + 


,.C'l 


r , " + 


r ■ 


+ C. (•-) 


( 2 ) 


with coefficients — or more generally polynomials 

{f ( ' 1 (; -1 ) —and a residual transfer function block G* 

The extension to multi-input multi-output systems is straightforward 
and a even a block-diagram interpretation is possible; see Figs. 1-2, 
If = 0 (and thus l' 1 ' 1 = 0) at some stage /, the continued 
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fraction approximation terminates and results in a rational function 
approximant. Terminating continued fraction approximation are of 
particular importance in model reduction where methods can be 
proposed according to various principles of approximation—for 
instance, the' approximation G* TW *(i) « 0, which truncates the 
sequence of coefficients after 2 m t G*' n) (z) » G (,,,) (l), which 
preserves the static gain of the system, or the Pad£ approximation 
which is known to sometimes providing poor approximants 112]. 
All such alternatives result in reduced-order models of order m 
with the approximant 




D (,r,) ( .-X j 


(3) 


with continued-fraction coefficients obtained from polynomial coef¬ 
ficients by means of calculations reminiscent of those tor the Routh 
criterion of stability. Recursions of the continued-fraction approxi¬ 
mation therefore induce a ladder-type circuit theory reminiscent of 
lattice structures in spectral estimation (Fig. 2) and with the transfer 
matrix 1 ) relating stage i to stage t -1 in the order-recursive 

equation 







) 


where 




/1 + ) 



(4) 


A major difference between the present approach and the lattice 
fillers used for whitening operations in signal processing is the 
property of its lattice transfer function 

I\fi Kh = reflection coefficients (5) 



which is not a unimodular matrix. 

The main purpose of the present paper is to show how to obtain 
multivariable transfer function models from data on the form of 
coprime MFD’s and to clarify the close relationship between the 
continued-fraction approximation and solutions to the Diophantinc 
equation which is important in control system analysis for control 
design and observer design. 


II. Multivariable Modeling and Control 

Let JR[.r] denote the polynomials in the indeterminate x with 
coefficients in IFt, and let the set of n x w -matrices with entries in 
JR [or] be denoted JR t,x,t, [j’]. Consider the input and output variables 
U (t) € lR n \ € 1R P for i = 0, 1, n and matrices 
C (*') e JR pXrr ', C<^ j) € ffT Xp 0 * l n) which exhibit 
the dependencies (Fig. 2) 


and with denoting the backward shift operator or ^-transform 
variable. A pure forward-order recursive equation for the continued 
fraction in (6) can be obtained as 




_(lpx v + 





) 


( 8 ) 


Let C (2,) (r) e IR" *"’[•••]. C ,(2, - 1) ( J) e IT Xf, [.r] for i = 1. 2, • ■ ■, 
ii. The polynomial matrix 


A/ ( ’V) = ( 


_ /7 Mp + C (2 ' ) C (2 '- ,) .r 


£ X U«+P)[ r j 



(9) 


is unimodular with a unit determinant (i.c., ilrt = I) and the 

inverse of A/ ( '*(.r) is 


(A/^’U))-’ 


( W C*"x \ 


( 10 ) 


The matrix of (10) is unimodular since its inverse is also a polynomial 
matrix and from (10) is concluded that the order recursion of the 
continued fraction expansion is invertible so that 



An important technical property is that forward as well as backward 
order recursions are to be made using causal polynomial transfer 
functions which both are defined by means of the matrix sequences 
and (C" 2 ’*}. 

Now introduce the following notation for the matrix product 

M < / , (-' 1 ) -D\‘\r')\ 

= A/ (,, (-.' , )Af (, ' l, (--" J ) (12) 


and its inverse 

/ /?«(-■ l ) u‘, 

l-S};'!:- 1 ) 

= (A/ (1J (i -1 ))- 1 •■.(Af (,- 1 , (-. -l )r l (A/ 0 , (:~ , )r l 

(13) 


with the backward polynomial transfer matrix (A/ (,, (i _l )) -1 de- 
fined by (10). 

Lemma l ■ Bezout identity and coprime factorizations 
i) Any solution Xi € R pX,J [.r] and Yl € R rNXp [j*] to the 
Diophantine equation 

A^{x)X,Ax) + &l\x )Vi (x) = I,, x „ (14. 


with a mixed backward/forward dependence in the recursion order i 
and with boundary values 



(7) 


can be written 

-Yl + 

Y,.=^-A^N 

for some polynomial matrix A’ £ IR’" *"[.!•]. 


(IS 
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ii) Any solution X H €,n m *" , [jr] and Yu € WC nxp \r\ to the 
piophantine equation 


Theorem I — Pole-Assignment to the Origin: Assume that the con¬ 
trol object be described by (20). The control laws 


-Yn(.r)A { i } , (jr) + YuUWtf (x) = J wX ,u (16) 
> can be written 

Xr = R\‘) - .Y/)'" 

Yn = S\[ ] + A\A<" (17) 

for some polynomial matrix N E lR”' )<, '[,r]. 
iii) The malrix factorizations {(*4^(,r), ^/' ) (.i , ))}!L J for / = 1,2, 
•■*,// are left coprime and the matrix factorizations {(.4,^ (./-), 
K'=i for i = 1,2,*- *, ?/ are right coprime. 

A proof for the Bezout identity for general rings and—as a special 
case—for polynomial matrices is to be found in 122, Chapter 4], 
According to the definitions in (12) and (13), it can be concluded 
that for any i we have the generalized Bezout identity (cf. 114, p. 
382]) 


A 


■+ 1>) y ( rn +■ p) 



n { ; t ' n ( ; t ] \ 

i? 1 / 1 A-Sj;' A\‘>) 


H 


(') , nHciO 
n(0 _ n(') 

if n n i 


(*) n(') 


*/ n 

H l 

c (/l rt (/) 


- B 
+ 11 



with 


<1('K 

.aV’ 

= iIor.a',;’ - ; 

'1«'K 

b\ ,] 

= «‘«Ti B\\' = , 

rl'-fi 


= <i«'r = i -1 

(U-R 

.si" 

1 

II 

ZL 

II 


(0): 


4? 

CO) = 

= /?'/’(()) = J„,x™ 


Note that particular and homogenous solutions to the Diophantine 
equation (14) can be extracted from (18) and that direct substitution 
of (15) into (14) gives 


a'/’.y, + B}°r, = .A'/'t/?;;’ + o<;’.v) + b'/’isJ;' - .a';’ y» 


- (.a'/'/?<,;> + ( v/’bj;’ +1>‘," a';’) v 


■i(') i •, 


— A'X/j 4* 0, 


]> X I 


V = /, 


(19) 


Coprime matrix fraction descriptions originating from terminating 
continued fraction expansions of polynomial order n are suitable for 
multivariable system modeling. Let the control object be represented 
as 



where 


-D\" ) C.-')\(Y(,)\ _ /V (B) (;)\ 
: -')\(Y^C.)\_(Y(-.)\ 




/T ( ’"(-)\ _ 

Vf ,(,,) (-)J U<->/ 


( 20 ) 


i) n;) = -flr l (--* , )Si(i- , )V(a) 

=M5/ /?,.)(},’) = 0 >r > X 1 

... f f?ii (^“ 1 )£kU) = > r U) 

\ f (-) = —Sr{z~ 1 )(n(z) 

•( 5 >-Cr) 

both achieve pole assignment to the origin and £ = 0. 
Proof: For control law /) is found that 


0 ? „xi = {St Hi )^ 

= (Sj Hi)’ 

= (Si fin - Hi Sn Si Bn + RlAu) 

= 0 * £ ( 22 ) 

with pole polynomial Si Bn + Jlj An = as follows from (18) 

and the Bezout identity. Similarly, for control law ii) is found that 
with system dynamics determined from 





with the pole polynomial Ai.Rn + Bi Sn = Rearrangement 
gives 


CrMul) «*> 

which effectively decouples the state £ from perturbation E. More¬ 
over, the partial state £*■ of the control law ii) provides an observer 
for the disturbance E. □ 

It can be concluded that matrices with real- 

valued elements generate left and right coprime factorizations of a 
strictly proper control object and associated controllers. Using system 
equivalence results [21, pp. 73-74] there exist procedures to generate 
the corresponding controllable and observable state-space realizations 
and controllability indices and observability indices can be obtained 
[21, p. 51]. 


with £ as the partial state [14] and E(:) as disturbance inputs and 
modeling error, E(z) ideally being zero. In fact, (20) provides a 
means of correspondence between various system representations 
and, for instance, incorporates pole-assignment control which is easily 
calculated: 


III. A Multivariable Identification Algorithm 

It remains to determine the continued fraction expansion matrices 
{C (2 ' , {C (2,) }r =1 for nonunique transfer function estimates 

originating from standard identification methods—e.g., least-squares 
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identification For the case of least-squares identification, it is suitable 
to organize model and data sequences {vul and { ju } according to 


— —/4j ]/A „ 1 “ — i/A-n + B\ If A- 1 + + Bn lU-n, 


Vk € IR" 

<t>k = (— J/jiT— L * -Uk-n u[ 1 u[^ n ) r , 

<t> k € JR ,,(, " +/J) 

*=*Mi 4„ B t B„) r 

0 £ r»( X 


(25) 


which for V samples and model order n suggests the linear regression 
model 




with 

/*{ \ 

vi 

= 

vJ 


(/») H)) (r) 

= (e ro" ,x,, ['])' 

/ = n n — 1, 1 


n) r u(,, - ,+n u (r) 

I 

= -*•* 1 >-'J,^ (e m' x ’ [.]) 

1 = n n - 1, 1 


t) 


4'/ l) (0 

= 4 < / , (i)-B<'W i< - ,4n -" 
4 1 ," 1 =4(.) 


( 0 . 


<0 


oi ,_n (0 


= D) >(0- 4j "(-)C U ’ +l,l (r)i 
fl 1 / ’ = B(i) 


(28) 


and 


*\ = 


/*(\ 

<>2 

vJ 


(26) 


As a result of the nonuniqueness of parameters the normal equations 
of the associated least-squares estimation of 9 may exhibit rank deficit 
and it is natural to apply the least squares solution 




(27) 


where ( ) ^ denotes a generalized matrix inveise □ 

Dieoiem 2—Tctmmation of Continued hr action Expansion Let 
(4(a), D(i)) be a stnctly proper nonsingular left MFD of poly 
normal order n with 1(0) — J t x/ Algorithm 1 determines the 
terminating continued fraction expansion matrices \C ( l) ( 1 )} , 

{C (ji ) (r)} f 1 and unimodular matrices { W ( ^i)} 1 of Lemma 1 

in no more than 11 steps with 

Ql «)(/,xi <»,*..) = ( 4'/” -B}°) = (MO -B( O) 

( )/< 11 ) 1 ( U 1 1 ) ‘ ( \/‘ ’) 1 

(29) 


where ($i, $\ ) + denotes the matnx pseudo inverse of’ 4 *{ $ \ The 
associated least-squares estimate thus obtained has the smallest 2 
norm of all possible minimizers of the least-squares criterion and 
provides its result in the form of a left matrix factorization 

A{. ’)> (-J = B( ~')l (-) + « (-) 

-4(- 1 ) = I) xji + 4i. 4- + 4„. ' 

D{r i )=D,.~ l +B,. ’ 


with input l , output ) , and disturbance \\ 

Algorithm 1 Let A^’U) = l,,*, + E?-, 4* , ° i J € ]R' ,x ''[i] and 
Bj^U) = E, =1 D 1 ’" t J € ]R ,,xni [j] denote some nonsingular left 
MFD (A(a), B(r)) of polynomial degree ?i Compute for i = n, 
n - 1, ,1 


t) C [ 


l2(..-*+l)-l) 


-"->y 


7 = 0,1 , 1-1 


-,(2(..-<+l)) 


=1e/J,) + U* - £ eiM'"-™). 
\ / 


On I, ■*,1-1 


with y( i) E 1R* Xf [i] being a left divisoi of ( 4( i ) 77(0) 

Pi oof Steps v)~vi) of the algorithm embody for / = n n - 1 
, 1 the order recursion 

(4j 11 

= (4'/’ -B}’) 

/ / C*-‘ +l ". \ 

i-H )— l) I+C<*( +1) HfU H)) | I 

( 10 ) 

The construction similar to a Routh scheme of {( (J 'UOlLi 
, and ihc full-rank condition of ( 4( 0* B( 0) seive to 
eliminate terms of degree i from 4 ( / _l) and D\ ~ i} so that 


0 = Aj'V 
0 = B f (l V 


A=() 


( 11 ) 


as long as B\ l) # 0 The order recursion of B\ l) for / = u, n - 1 
, 1 with deg B[ l U i) = i and 2?[ 0) = 0 r , X 7 r and can always be 
made as A^°(0) = I rXj for all i The resultant A\ l)) terminates as 
a nonsingular left divisor of (4( /), B(i)) as {M {l) (jr)\\L i of (29) 
are unimodular □ 

After determination of C {2t ~ [) and C (2,) in (10) tor / = n, n - 1 
, l, rt is straightforward remove the left divisor Q(t) and to 
recursively calculate the approximants {( 4 ( / ) 1 i?/ 0 )}ILi of poly 
nomial degree / according to (12H13) The resultant approximant^ 
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Time |i] Time fsj 

Fig. 3. Input-oulpul data used in Example 2 with apparent cross-coupling 


Fig. 4. Simulated data including partial state, output error, prediction error, 
and model output with input-output data as used in Example 2. 


properties. 


represent coprimc factorizations according to Theorem J with exact 
representation of the original transfer function for / = u. In particular, 
the sequences }, \C (2, ~} are constant matrices whenever the 
matrix sequences and { 4 ( /*}”_i are full-rank matrices. 

Example I—Common Factors: Consider the following left matrix 
factorization with the common left divisor Q(:~ l ) = J + D :~ l {D 
being assumed invertible) 

A ( ? ] =(/+Z>i-')(l + 

= / + (!?+.4,): ‘+/X4,-.- 2 

b\ 2) =(i + Dr')B t r' 

= lir,' + DD t r\ (32) 


Application of the algorithm gives 
C"' ={DI), ) + /J>.4, 

.4 ( , n =/+(D + .4,). _1 +DAr- 2 

- + DDr.~‘ 2 )C ( ' > = I + Di 1 
C (21 = D~ l DV i = 0, 

B\' ] =Bm~' + DB,-- 2 

- (I + Di~')C u, £''- 0. (33) 

As B \]* = 0 this algorithm terminates after one recursion only 
with the resulting: model order u - 1 = 1 of the transfer function 
approximanl. The common left divisor (1 + Di~ ] ) appears as the 
upproximani as shown in Theorem 3. □ 

Example 2—A Simulated Example: Consider the multivariable 
system 


Uk 


( 1.2 -0.29 \ / -2.0 

^(>.25 0.7 ) in ~' + V-10.0 

(U.j n-u'k e 1R" 


0.07 \ 

1.0 )"*-' + “'*• 

(34) 


with input {if a }. zero-mean while noise {»'*) with } = 

and output f,(/n}. Simulaied data are according to Fig. 3. 
which exhibit obvious cross-coupling properties. The estimate &n lor 
model order n = l and N = 1000 samples of input-output data is 


#n — Mi &i ) 7 

_ / —1.2012 0.290S -2.1053 0.0910 \ r 

"+-U.2490 -0.7010 -9.9715 1.0123 ^ 


and the continued-fraction approximation matrices arc 

r d) /0.9797 -0.2945 \ 

C \9.4039 -3.5929 ) 

and 

r ( 2 ) _( -2.1053 0.0916 \ 

C “ —0.0715 1.0123 ) 


(36) 


Polynomial matrices of the right coprime factorization can be com¬ 
puted from the real-valued matrices C f(n and C’ ,J) as 



)) " V/ + c ,l) c C2) i- 1 

/ - 2 . 1053 - 1 
- 9 . 9715 :; * 1 
1.0 + 0 . 8736 - 
\ 16 . 0285 - ’’ 


0.091 Gi ~ 1 
1.0123c" 1 
—0.2084c" 1 
1.0 - 2.7759c" 1 


(37) 


Once the continued-fraction matrices {C (2 ‘ , and 

have been determined it is straightforward to obtain the prediction 

error - and the partial stale £ according to 



The partial state £ and up to n shifted values of £ determine the 
stale of the system described according to the controllable canonical 
state-space realization [14, p. 4031, [2], Thus, the state observer 
embodied in (38) provides a state estimate that is obtained along 
with the residual (or prediction error) of transfer function estimation. 
Simulated data including partial state, output error, prediction error, 
and model output are shown in Fig. 4. 


IV. Conclusions 

We have shown thal the continued-fraction expansion provides an 
alternative parameterization of multivariable systems in the context of 
system identification. The close relationship between the continued- 
fraction approximation and solution to the Diophantine equation is 
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demonstrated, Hence, this parameterization provides a computational 
link between various matrix factorizations, between transfer function 
estimation and state estimation, between prediction-error methods and 
output^error methods, and berween model reduction and control. The 
lattice structure provided by the continued-fraction approximation 
facilitates calculations and computations can be organized by means 
of order recursions backwards and forwards. In future work it is 
intended to investigate identification properties in the presence of 
colored noise and adaptive control. 
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All Fixed-Order Hoc Controllers: Observer-Based 
Structure and Covariance Bounds 

T. Iwasaki and R. E. Skelton 


Abstract — This note obtains a parameterization of the set or all 
stabilizing controllers of order less than or equal to the plant, which 
yields for the closed-loop transfer matrix a specified H^. norm bound. 
The algebraic results of covariance control are applied to the H x 
control problem to yield a parameterization in terms of the Lyapunov 
matrix, which carries many system properties (such as Hi performance, 
covariance bounds, system entropy at infinity, etc.). All low-order 
controllers are shown to have observer-based structure for "reduced- 
order models" of the plant and are characterized by two Riccati equations 
with a coupling condition. 

1. Introduction 

The main objective of this note is to provide a new derivation 
of the standard H-*. control theory [1|, |2] using the algebraic 
approach in the covariance control literature [3H6]. The idea is 
to consider the bounded real equation for the closed-loop system 
as an algebraic problem to be solved for the controller parameters. 
Similar algebraic approaches in the literature include [4], [7]-| 10J. 
In particular, [7] and [4] (appeared simultaneously) extend the result 
of [9] to obtain a parameterization of fixed-order output feedback 
//rx, controllers. The "standard" case (full control penalty and full 
measurement noise) was considered in [7] where the existence 
conditions are characterized by coupled Riccati equations similar to 
those of [9]. These Riccati equations contain controller parameters, 
however, and thus the parameterization is implicit Reference [4] 
(see [5] for its journal version) solved a singular case (no con¬ 
trol penalty and no measurement noise) where existence conditions 
are given by three matrix equalities withom controller parameters, 
and all fixed-order II x controllers are parameterized explicitly. In 
[7J and (4], computations of fixed-order controllers are extremely 
difficult. 

Contributions of this note over the existing literature are the 
following. We consider the "standard case" and remove the controllei 
parameter dependence of the Riccati equations in |7] and obtain 
explicit formulas for all H^ controllers, which allow us to prove the 
observer-based structure of all H x controllers, including nonccnlral 
controllers of order equal to or less than the plant (note lhat [ 11 shows 
the observer-based structure for the central (full-order) controller, hul 
nor for any other strictly proper // x controllers). Another contribu¬ 
tion is extending the results of [7], [4|, ro provide a parameterization 
of all H-x controllers in terms of the solution to the bounded real 
equation, which we call the H „ Lyapunov matrix. Recall that the H * 
Lyapunov matrix is an upper bound for the state covariance [ 111 and 
can be used to define several other system properties including system 
entropy [12| and the (dual) mixed H^/H performance measure 
defined using power semi-norms [ 131- (An improved upper bound 
on Hi norm is provided in the mixed Ih/H^ solutions in |I4|.) 
Thus, such a parameterization is useful (especially for the full-order 
controller case) to incorporate II j related performance specification^ 
in addition to the H K norm bound. 

Manuscript received September 14, 1992; revised May 18. 1994. 
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Finally, we should mention that our result still has a computational 
difficulty, as In [71 and [4], for solving the coupled Riccati equations 
except for the full-order controller case. Thus the computational 
aspects of the fixed (low) order H ^ controller design problem needs 
a more thorough exploration. 

We will use the following notation. For a matrix -4, .4' denotes the 
transpose, p(-4) and (A) the spectral radius and the maximum 
singular value, respectively, and .4 + denotes the Moore-Penrose 
pseudo-inverse of ,4. If A is nonnegative definite, A i/2 is the (unique) 
nonnegative definite square root of .4. || ■ || x and || • || ? denote the 
Hz* and Hi norms, respectively. 


II. Preliminaries 

Consider the class of dynamic systems described by the slate space 
realizations of the n^th-order plant (£ p ) and the /» t th-order controller 
(Sc) 


f.r = Aa 

-1- B\w 

+ Th a 


(SpK* = 


+ I>i2« 


[y = CV 

-f D 2l ir 



*>{';: 

A, .r, + 

B,u 


Cjt. + 

D, 1 / 


lo give the closed-loop system 

(S.i) 



1 v \fi ,i = -4,/.r t f 

-1- B, 1 1 r 


,r 

\ ^ =-C 

+ D,( ii' 

J’,1 

jr, 


where ./■ G ft 1,1 is the plant state, ir 6 ft"" is any external input, 
including plant disturbances, measurement noise, etc., u € ft”" is 
the control input, ; € ft"' is the regulated output, y £ ft” w is the 
measured output, and.r, £ ft " 1 is the controller state. We denote 
the closed-loop transfer matrix from ir to : by T-„ . The following 
assumptions are made for (£>,): 

API) (.4. Bj ) is stabilizable and (( j. .4) is delectable. 

AP2) (.4,/!i) is stabilizable. 

AP3) D\j [ C\ Oia ] = [ 0 V ]. V > 0 

On [B\ Di, ] = [() V]. V > 0 . 

API) is obviously necessary for the existence of a stabilizing con¬ 
troller. AP2) and AP3) are technical assumptions which are standard 
in the II ^ control literature (e.g., [ 1 ]). Note that AP2) and AP3) 
imply that {A fl ,B l( ) is stabilizable if and only if (.4,./?,) is 
stabilizable. This fact can easily be verified by standard manipulations 
once we notice that 


[D, ( 



B>D, 

B, 


r. 



D' 2] 


0 


where T is nonsingular due to the second part of AP3). To state the 
x control problem, we need the following definition. 

Definition I: Given a positive scalar 7, the controller CH ,) is said 
to be an H (Xl controller if the following four conditions hold: 

Cl) (A r . B t ) is controllable and (.4,, C, ) is observable. 

C2) A, i is asymptotically stable. 

C3) ||J:,,|U < -). 

C4) V 2 / - D l( n' 1( > o. 

I hc first minimality condition reflects the fact that we .seek a low- 
irder controller. Note that conditions C2) and C3) imply V 2 / - 
V'tD' r( > 0, To utilize Lemma 1 below, we impose condition C4) 
vhich excludes the case <r, un ,(D t( ) = 3- Nevertheless, the set of 


all stabilizing minimal controllers which yield the strict inequality 
H 2 " 1 ; «i* ||«j < 7 is contained in the vset of all H ^ controllers defined 
above. Without loss of generality, we will set the norm bound 7 
to one to facilitate the rest of our presentation. Now the H x control 
problem can be* stated as follows: 

Find the necessary and sufficient conditions for the existence of 
an He*, controller. If one exists, obtain an explicit formula for all 
such controllers 

This problem has been solved [1], [2J. Our objective is to solve this 
problem by a different approach based on the bounded real lemma 
given below, which turns out 10 give an explicit parameterization of 
all low-order H ^ controllers in terms of a physically meaningful 
quantity. To this end, let us state the bounded real lemma modified 
for controller synthesis. We need the following definitions 

Q-.= AuX't+X^X'+B.rB:, ( 1 ) 

+ (A-.,c:, +JJ.,A'f)ir 1 (-Y„C'( +D, l D' ll )\ 


A :=r( 2 ) 

C := { (.4,. D,. C,. D,): R> 0 ami 3X,, > 0 s.l. Q = 0 }. 

(3) 

Lemma I: Lei a plant (D,.) and a controller (3D.) be given. 
Suppose AP2) and AP3) hold. Then the following statements are 
equivalent. 

i) The controller (D,) is an H x controller. 

ii) The controller (E,) is a minimal realization of some controller 

<S,) € (\ 

Proof: Suppose i) holds. Then it is a standard fact (see, for 
example, [151 for a detailed proof and (16] for relaxation of the 
controllability assumption in [ 15J) that (E, )£ C. Hence ii) holds. 
Conversely, suppose ii) holds. Let matrices associated with (S t ) 
be denoted with . If ( A,.B,) is stabilizable, then (A t f.B if ) is 
stabilizable and, by Lyapunov theory, .4,/ is asymptotically stable. 
In this case it is straightforward to show || T lt ||^ < 7 . Hence, any 
minimal realization (II,) of (D. ) satisfies all four conditions Cl)-C4), 
and we conclude i). Now. when {A ,, B ,) is not stabilizable, it can be 
shown by a coordinate change that a controllable realization of (E,), 
denoted by (£, ), is such that (D, )£ C and the corresponding pair 
is stabilizable. Then the same argument as above applies. 
This completes the proof. 

If we view the (Riccati) equation Q = 0 as a Lyapunov equation 
with a forcing term quadratic in A\ f , a nonnegative definite solution 
X, ( can be used to prove closed-loop stability, while also defining 
the H v performance. Hence we call A', i an " H^ Lyapunov matrix." 
As mentioned in Section I, the "covariance control approach" to the 
Hr >. control problem taken in this note is the following. Consider the 
matrix equation Q = 0 as an algebraic problem to be solved for the 
controller parameters. In this case, solvability conditions characterize 
the set of all i/>_ Lyapunov matrices which can be "assigned" to 
the closed-loop system by some controller (H,). Then the general 
solution to Q = 0 provides an explicit Formula for all controllers (H,) 
which yields a specified X r < as an H < x Lyapunov matrix. A controller 
constructed in this way may turn out to be such that {Ani,B<() is not 
stabilizable, in which case, the usual bounded real lemma (e.g., [15]) 
cannot guarantee closed-loop stability. Lemma 1 stated above shows 
that, even for such a case, the closed-loop system is guaranteed to 
be internally stable. 

Recall that the Lyapunov matrix A\< is related to the H 2 
performance. As shown in [II], for the error signal e := Cox, the 
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Bi norm of the transfer matrix T, w from w to a is bounded above and there exist Lv G L\ G 1Tp x "v and Ln G 

by a function of X c t as follows such that 


\\T, m \\l <J= traceCo-YpCo 


AXr+X p A'+X p C[CxX v + DiB\--BiV- l B\ + Li L[ =0 (6) 


where Y p is the Il-block (n,, x n P ) of X ti The following lemma 
shows a necessary condition for an controller to be "minimal 
Order" in die H 2 related sense 
Lemma 2 Given an n t th-order controller (E, )G C Let 


A. Y p + X,,A' + - rii-'GIA; + BiB\ + L\ L\ = 0 

(7) 

(/ - A„ V+ )[Li L{ - r, i?r'r[ }(1 - Y„,.Y+) = 0 , (8) 


A' 


t( := 




(4) 


be any solution to Q = 0 Suppose X' pi Y,„ ? 0 Then there exists 
a controller (E t )G C of order n< < n t which yields the same Hi 
performance bound J 

Proof Using the singular value decomposition of Y r „ (A\ l / 2 ) + , 
define a matrix ^ as 


:= 


r/( v. l/ V 


Y P< (A, I/ - ! ) + = riS 1 i 1 ' 


where \\ G Since rank(A' /J( )< n t , we have n t < v, 

Noting that A\< > 0 is equivalent to 

A\ > o. X p - Xy, x;x'y, >0, (/ - Yr V/ ) \; t = 0 


\\D\iLnDi] || < 1 


(9) 


where 


Y — v _ V v -1 v' 

\f, — -\fj \ fU .1 


i? ( = u~ l - l„\ l'„, r, = x v c' 2 l' d + b 2 i 1 

The following theorem shows that £ is indeed the set of all H x 
Lyapunov matrices Y ( which can be used to prove that a controller 
(E,) is admissible It also gives an explicit formula for all controllers 
which yield a given H ^ Lyapunov matrix for Ihe closed-loop system 
A proof is given in the appendix We shall need the following 
definitions 


P = \„ Y \ /’+ = \, 


computation of Q tyQty verifies that a controller (E ) of order 
n, given by 

A, = v,'<A-yv 4,A ', i/2 1 , B, = r,'( Y l/2 ) + B, 


C,=C D, = D, 


satisfies Q = 0 with 


Y, f 


' Y p f,E, 
iiH i 


> o 


(*5) 


Since the 11-block of Y ff is identical to that of \ lff the controller 
(E r ) yields the same Hi performance bound. 

By mathematical equivalency, if X' vt \ p , ^ 0, then there exists 
a controller of lower order which has the same entropy at infinity 
when Ct = Co [121 By duality, the same statement can apply to the 
mixed HifH * cost of Doyle et al [13] 

Lemmas 1 and 2 motivate us to define the following 
Definition 2 A controller (E,) is said to be an admissible 
controller if (E, )G C and there exists a matrix X tf > 0 which solves 
Q = 0 and satisfies Y', Y,„ > 0 
Since we look for H\ controllers of low order, we will restrict 
our attention to admissible ff ^ controllers in the rest of this paper 
It should be noted that the conditions X' pi X f)l > 0 and A\< > 0 
imply that A”, > 0 


4 = P+ 4J\ = P + 02 C 2 =CP (10) 

n v =1 -l' n l In I» = BJn + \,C" 2 \ 1 
r, =x i ,c' 2 l'i > + bj n = b 2 Ii,+ \ i C\ 1 

\ + r, = <r)L, Hi = (->1 , n , 1/2 

Fv =(-)/» =. «r I 7? v ,/ ' e 

Theorem I The following statements are equivalent 
1 ) There exists an admissible If contioiler 
n) C ^ v 

In this case, let (E ) be any controller Then the following statements 
are equivalent 

in) (E ) is an admissible H^ controller 
iv) There exist X, ( € C and orthogonal matrices Ti and F\ 
such that 

4 =4 + DiC, -D'Ci + BiD Ci ( 11 ) 

+ P+[Y,C;r, - l\ lil'UB, + A Hi 'D )'Y+]/* 

B, = P+(L\ r, R\ fi +T\ ). 


Ill All Low Order Controllers 

This section provides a parameterization of all admissible H ^ 
controllers in terms of the H ^ Lyapunov matnx Mathematical tools 
developed in covariance control theory are useful here [5] To state 
the result, let £ be a subset of real symmetric matrices defined as 
follows X,e 6 is an element of £ if 


V 


fC 



X P , 

x< 


> 0 , 


A fi ( A pr > 0 


C, = [Li V, py 1 - r, )'(A';,)', D, = Lu 
where Lp satisfies (9) and 

N .= X, Cl + A-;,. C' 2 Dl + D, 1 -Dl 


V, = E( Hi + NnrFt N'„t , l\ = -EtH\ + N, v F, N'm 

(12i 

where columns of A>r , Xhv, Np \ , and Nn\ are orthonorma 
bases for the null spaces of Ev* Hi , E\ , and H\ , respectively. 
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Theorem 1 shows the observer-based structure of all (low order) 
Hoo controllers. In particular, if we consider strictly proper controllers 
(Z), - 0 ), ( 11 ) yields 

,i\ =5 A# ( + Bui + D, (y — CiJf) -I- ti\ 


Necessity: Suppose (E,) is an admissible # x controller. Then 
Q = 0 holds for some X r t > 0 , or equivalently 


I 

0 


-x rt at 1 


Q 


-x; 


-i y' 


0 

I 


= 0 


u = C,.r,, w := P+(XpC[Ci - L\ UvV~ [,i D[X+)P*.. 

where (4.are defined in (10). Thus any low order 
strictly proper admissible H n « controller has the structure of state- 
estimator plus estimated-state feedback for the "reduced-order model" 
( A , Bi-Cv) of the original plant, obtained by keeping only the states 
belonging to the range space of A',„ (or X p - A',,). As in the full- 
order case (see [ID, a low-order ZTx estimator also has an extra 
term w, which can be thought of loosely as an estimate for the worst 
case Li disturbance. 

Note that the two Riccati equations ( 6 ) and (7) are the familiar ones 
which describe the existence conditions for an H * controller [ 2 ] if 
i) X v is invertible so that ( 6 ) can be multiplied by A“ J from both 
sides, and ii) the free matrices Li and L\ are set to zero. Since the 
controller order is fixed (n, is specified by the dimension of A", r), we 
have another condition ( 8 ). The familiar spectral radius condition in 
[1 j is implicitly embedded in the set C: by the definition of X p and the 
positiveness of AT, we require X p > X v > 0 , which is equivalent 
to p( X^Xp) < 1 when A',, > 0. Notice that the third condition 
( 8 ) disappears if we consider the full-order (/<, = n p ) controller 
case since A' /Jf becomes square and invertible. Thus a full-order H^ 
controller can be designed by solving two Riccali equations. The 
more detailed exploration of the full-order case, including explicit 
upper bounds on the plant state covariance and the control input 
covariance in terms of the II ^ Lyapunov matrix, can be found in the 
conference version of this note [17J. For the general case (», < //,,), 
however, finding X t( > 0 that satisfies all three conditions (6)-(8) 
does not seem to be tractable. Homotopic continuation methods [18J 
may be useful to solve the coupled Riccati equations. 


whose partitioned blocks are given by (c.f. [7]) 

AX„ + X„A' + .V„ic;c, - C' 2 V - l C 2 )X p + 73, D\ 
+{X P ,xr'D, - f, )Ry ! (X,„x,~' b, - f v )' = o. u ’ 


(A + A- ( ,C;G ).V,„ + X,,C 2 B: - Xp.X.-'A.X, + 

-a',,, xr'B, ( CtX + vb: + vL' n n;,'\“) = o. 

(14) 

a.x, +x,a: +A';.(c;r,- c;v-'c 2 )A',„ + 

mb . + A-;,c5r- 1 )r(B. + .Y;,csr"‘)' = o. 

Also, it is necessary that 11 -block of Q is equal to zero, i.e., 

AX,, + + X„C\ CiX p + Bi B\ - B 2 UD ' 2 

+( Xp. a + r, ) R-„ 1 (A',„ C + IV)' = 0 . 

Since the last term in (16) is nonnegative definite with rank < n u , it 
is necessary that ( 6 ) holds for some Lr 6 i?"satisfying 

L,L' r = (Xp,c; + ru)R r '(X r ..c: + r, )'. (17) 

Similarly for (13), il is necessary that (7) holds for some L\ € 
W""*"’' satisfying 

L\ I.\ = (Xp,Xr'D. - 1\ )R; x (Xp,Xr'B, - Tv)'. (18) 

Now, applying the result (Appendix C) of [51, there exists a C ( 
satisfying (17) if and only if the equality in ( 8 ) holds, and all solutions 
O are given by 


IV. Conclusion 

The set of all H controllers of order equal to or less than the 
plant is parameterized in terms of the II\ Lyapunov matrix, which 
carries many system properties such as II >-related performances 
and the system entropy. The 7/ x Lyapunov matrices, which can 
be "assigned" to the closed-loop system by some controller, are 
characterized by the two Riccati equations and the coupling condition 
in Theorem 1 , which are numerically nontrivial to solve for the 
general low-order controller case. Hence, the computational issue 
remains open fot; further research. The coupling condition disappears 
for the full-order case, and thus the computational problem becomes 
much easier as in the literature [ 1 J. Our novel results show that 
any low-order Hr x controller has observer-based structure. This has 
been shown [|| only for the full-order (central) controller. The 
observer part of the low-order II ^ controller estimates the states of 
the "reduced-order model" of the original plant obtained by some 
model reduction methods based on a projection formulation. 

Appendix 

Proof of Theorem J: Recall that all H ^ controllers are char¬ 
acterized by Q = 0 and i? > 0 in (I) and ( 2 ). Note that R is 
Positive definite if and only if \\D\iD, Dn\\ < 1. So we consider 
ihe following mathematical problem: Given D, (=: Lp) such that the 
above norm bound constraint is satisfied, find necessary and sufficient 
conditions for the solvability of Q = 0 for (.4, ,D, ,C,). The proof 
begins with necessity. 


C = [LrVvR\ /l — IY)'(A'+)' (19) 

where Ui is defined by ( 12 ) with Fv being an arbitrary orthogonal 
matrix. Thus we have established the necessity of (6)-(9). 

Sufficiency: Now, suppose conditions (6M9) hold. We need to 
show the existence of a controller which satisfies (13H15). In the 
necessity part of the proof, we have shown that C, given by (19) 
satisfies (16) under supposed conditions. To prove sufficiency, we will 
construct D, and A, satisfying (13)—(15) together with C, given by 
(19). 

Using the result of 15] again, there exists a D ( satisfying (18) if 
and only if 

(7 - A',„ Xp, )[L\ L\ - T, i?r‘ f \ ](/ - A',, ( Xp .) = 0 ( 20 ) 

holds and all solutions D t are given by 

B, = X, Xp, (L, T\ 7f{ /a + f i ) ( 21 ) 

where U\ is defined by ( 12 ) with Fy being an arbitrary orthogonal 
matrix. It can be verified that the conditions (6H9) imply that (20) 
always holds. Thus, B, given by (21) solves (13). 

Next, note that (14) is solvable for A, if and only if (/- 
X pt X^ )Qn = 0 holds where Qi 2 is the left hand side of (14). 
In this case, the unique solution A t is given by 

A, = A\.A-+Qi 2 A7'. ( 22 ) 

Now we claim that the above solvability condition is always satisfied 
by the choice of B ( and C f given by (21) and (19), respectively. To 
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prove this, it is sufficient to show that 

Qu~(l- -Y„, X+ )Qu\r'K = o (23) 

since A\ > 0 and X f pt X p( > 0 Substituting (19) and ( 21 ) into (23), 
and using (12), after some manipulations, we obtain Qu = 0 Thus 
A,, B t and C given by (22). ( 21 ), and (19) solve (13) and (14) 
provided (6H9) holds. 

Finally, straightforward algebraic manipulations show that (15) 
al$o holds The basic steps are the following First solve (14) for 
Xju XT 1 A t X t and then substitute into 


Intrinsic Difficulties in Using the Doubly-Infinite 
Time Axis for Input-Output Control Theory 

Tryphon T Georgiou and Malcolm C Smith 

Abstract —We point out that the natural definitions of stability and 
causality in Input-output control theory lead to certain inconsistencies 
when inputs and outputs are allowed to have support on the doubly- 
inftnite time-axis. In particular, linear time-invariant systems with right- 
half plane poles cannot be considered to be both causal and stabllizable. In 
contrast, there Is no such conflict when the semi4nhnite time axis is used. 


Qu = A' p< AT 1 QuXr 1 -Yp, 

where Qu denotes the left hand side of (15) Using (7)-(9) and (17)- 

(18), after some manipulations, it can be verified that Qn = 0, which 
implies (15). This completes the proof Q E D 
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I Discussion 

Consider two systems P,(t * 1, 2) defined in lerms ol convolution 
representations 



where hi(t) = r' for t > 0 and zero otherwise, and hi(t) = -V 
for t < 0 and zero otherwise Both systems have Laplace transfer 
functions l/(•>-!), but with differing regions of convergence The 
first system is unstable and causal, and the second is stable and 
noncausal (in fact anticausal) according to the usual definitions 
To view these systems abstractly in the input -output setting (or m 
the behavioral framework of [3]) we need to work out the system 
trajectories for signals in some I unction space, say £_> This amounts 
to finding the graph, i e, the set of input-output pairs in £<. We 
mention that an explicit working out of such an appioach for the semi 
infinite time-axis is given m (21 The rcmaiks below are concerned 
with the case of the doubly-inhnite lime-axis 

We consider first Pi From (Ip 158] / € C \ and <j £ C, implies 
/ * g 6 £, and 


\U*o\\, s ll/ll. Ml, 

Thus, P 2 can be alternatively represented by the following graph on 

( ■*. K) = ^ ] j ( 1 ) 

after transforming to the frequency domain (As usual, denotes 
the Fourier transform which maps C>(-x x) isometrically and 
isomorphically onto £z(~/x. / x) and Ci |0, x ) onto the Hardy 
space Hi of the nght-half plane (1, Chapter 19] Functions in H * 
can be extended analytically into the right-half plane by replacing 
J* by *) 

Now consider Pi We will first restrict the inputs to the space 
£j[ 0 , x) Since 

y(t) — j t l ~ T u(t) (It = c ( ( t t u(t)(It 

J o J I) 

a necessary condition for »/(/) € £ 2 10 x) is that 

f c ~ T u(r) r/r —t () 

/ o 
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Fig, I. Standard feedback configuration. 


as / —► r>o, which is equivalent to (t u) - 0 . In fact, this is also 
sufficient since then 


V(t) = 



u(t) dr — h 2 * u. 


Thus the domain of Pi in £i[Q, x) is equal to the orthogonal 
complement of r“'C and, moreover. Pi coincides with Pa on this 
domain. After taking transforms, the orthogonal complement of l/(* 
+ 1 )C is (k— l)/(# + 1 )'H >. Therefore the graph of Pi restricted to 
signals with support on [ 0 , x) becomes (in the frequency domain) 


4 'P, 


[o •%) 



Hi =: GH<> 


Since P\ is shift invariant, the graph of Pj on £j( — x . x ) contains 
(again, expressed in the frequency domain) the infinite union 


[j ( 2 ) 

/ -0 


The graph of Pi on £ 2 [ - x. x ) is actually slightly bigger than (2) 
(see below) but we will not need it for the next part of the discussion. 

We now turn our attention to the requirement of stabihzabilily 
in the feedback configuration ol Fig. I. Our definition of stability 
is the usual one: for all external £> disturbance inputs, there must 
he solutions of the feedback equations in £ > so that the closed-loop 
operators arc norm bounded. If P\ is stabilized by some compensator 
C on £j(-x, x ), then it turns out that the graph of Pi must be 
closed. (This is a standard argument which proceeds as follows. Take 
a Cauchy sequence in the graph of Pi and set this equal to ('}*). Since 
by assumption the feedback equations have unique solutions they 
must be given by: u i = r/|. ij\ = dj, u* = 0, i/j = 0. Now take the limit 
of the Cauchy sequence at the external inputs. Since the closed-loop 
operators arc bounded, this pair of signals must also appear at u i, i/i.) 
Thus, the graph of P\ must contain the closure of (2). But the closure 
of the infinite union ( 2 ) equals the graph of Pj ! Moreover, since P> is 
an anticausal operator, then there are L’j( — x. x) input-ouput pairs 
which satisfy the convolution representation for P- but not for P|. 
Such a pair is: = r *(/ > 0), 0 {t < 0) and u(1) = — 

which equals 

\(.s - 1)(* + 1) / 

m the transform domain. Thus, it seems incorrect to close the graph 
of P, and use the ordinary definition of input-output stability. On 
I he other hand, slabilizability of P\ would require that any pair of 
possible disturbance signals d 1 , d 2 could act on the feedback system 
in Fig. 1 and produce a bounded response. If, however, a disturbance 

(£M0 

with i7, y as in (3), is allowed to act on the input ports in Fig. 1, 
then there exists no solution which is consistent with the feedback 
equations and the integral representation of Pi- 


Of the following possible remedies, none seem to be satisfactory: 

1) To consider Pi to be nonstabilizablc on jCi(-x, oo), and to 
restrict attention to open-loop stable systems only. 

2) To seek an alternative definition of closed-loop stability for the 

— «x ) case which would agree with the common belief 
that Pi is stabilized by proportional negative feedback of gain 
greater than one. 

3) To identify the systems Pj, P 2 . 

It should be noted that the use of extended spaces does not improve 
the situation, since the difficulty is to determine the correct behavior 
for signals in x). 

Option (1) is a correct conclusion based on existing definitions. 
This would mean, however, that the doubly-infinite set-up is of 
limited interest for control purposes, since systems which in the usual 
sense are "open-loop unstable” would have to be excluded. Option 
( 2 ) does not seem to provide any satisfactory alternatives. It is, of 
course, possible to restrict attention lo the subspace of £1 consisting 
of functions which have support on some interval [J\ x:) for some 
arbitrary finite T. This would treat the graph of P\ precisely as (2) 
and would work with driving signals in Uv which is not 

a closed subspace of jx). This appears to be a rather 

cosmetic solution, however, which more or less amounts to working 
on the half line. A more natural avenue would be to consider the 
aciual £j(-x. x ) graph of the convolution operator Pj. Similar 
reasoning lo the above shows that this is the same as the graph of 
Pj, but with the restriction that the inputs satisfy 

j 1 ' T u(t) (It - 0 . 

Again, this means that the graph Pi is not a dosed subspace of 
r.j(-x. v) and the problem is then to find a suitable subspace 
for the external driving signals. The option of trying to work on 
some subspace of Cj (- x . x ) with signals which “decay sufficiently 
fast” towards minus infinity again does not seem lo offer a satisfying 
resolution. Option (3), although unsatisfactory, is perhaps not as out¬ 
rageous as would first appear. If we consider the system represented 
by the differential equation 


(J = </ + <' 

we can reproduce the trajectories of Pi by solving this equation 
forwards in time and those of Pj by solving it backwards. This 
more or less amounts to abandoning any notion of causality (compare 
with the need to consider the anticausal trajectory (3) as a valid 
input-output pair for Pi). This is not a natural option, however, if 
the direction of time is well defined (which is a basic assumption if 
we consider questions of control.) 

II. Conclusion 

The purpose of this note has been to point out certain features 
of input-output control theory on the doubly infinite time axis 
which appear intrinsically unsatisfactory. The difficulties are not of a 
mathematical nature—a consistent picture is obtained with a variety 
of definitions. The problem lies in trying to escape from conclusions 
which limit the engineering relevance of the theory, c.g., among the 
causal systems, only the stable ones are stabilizable. If the definitions 
lead to such a conclusion, then it would not seem worthwhile to 
develop an elaborate theory of stabilization in that context. In contrast 
we remark that input-output systems theory on the doubly infinite 
time axis does have many important uses, e.g„ in discussions of 
fundamental limitations in filtering imposed by causality conditions 
or in system approximation using Hankel operators. 
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Regional Observability of a Thermal Process 

A. El Jai, E, Zerrik, M. C. Simon, and M. Amouroux 


Abstract —The realization of a ceramic protector covering a subregion 
of a substrate and using chemical vapor deposition (evd) techniques needs 
the temperature field in the deposition zone to be controlled. A feedback 
control of this temperature is based on the knowledge of the temperature 
in the considered subregion. The difficulty occurs because measurements 
can only be obtained out of the deposition zone. The purpose of this paper 
is to give an original approach for the reconstruction of the state in the 
deposition subregion. 


I. Motivation 

Thermal treatment is a usual technique to elaborate materials. 
The quality of the produced material depends on the control of the 
temperature field. The purpose of the treatment may be either the 
improvement of mechanical properties of surfaces (to be reinforced by 
metallic or by ceramic substrate) or the prevention against corrosion 
or oxidation. 

In the considered problem we try to realize a ceramic protector 
using chemical vapor deposition (evd) techniques [7]. Only a part of 
the substrate is concerned with the deposition. Thermal and mechan¬ 
ical qualities of the obtained material depend on the homogeneity of 
the temperature field of the material. The control of the temperature 
field in the concerned zone is determined by observations outside the 
concerned zone. Such a problem can be solved with using a regional 
observability concept [4], 

The aim of this paper is to give an extension of parabolic systems 
observability connected to the above problem. If the system is defined 
in a space domain lh we are concerned with the state observation 
not in the whole domain U but only in a given subregion jj c 
tl. This situation occurs in many practical applications where the 
knowledge of the state is necessary only in a critical subregion. The 
measurements are given by means of a finite number of sensors which 
may be pointwise or zone and located inside U or in its boundary 
W = T. Let y be the state of a system (5) with a state space 
A = L*( 12), and assume that a state //u at a given time to is 
unknown. Suppose now that measurements are given by means of 
an output (depending on the number and the structure of the 
sensors, the measurement interval, ■ ■ *)• The studied problem concerns 
the reconstruction of the state j/o on the subregion 

The next section is devoted to the regional reconstruction of the 
state, then we develop a numerical approach, and in the last section 
the results are applied to the considered thermal process. 

Manuscript received February 2, 1993; revised December 8, 1993 and June 
17, 1994. 
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A. Problem Statement 
Let 

• be a regular bounded open set of R”, (n = 1,2 or 3) with 
boundary T = Oil. 

• u- 1 be a nonempty given subregion of il. 

• [0 stj] with tf > 0a time measurement interval, we denote 
Q = S2x]0 J,[ and £ = Tx}0<tj[. 

■ A be a linear differential operator defined by 



where oo, 6 D(i1 x [0, f/]), the space of C* functions with 
compact support in I>(H x [0. t /]). We assume that 4 is elliptic; A* 
is the adjoint operator of A. 

We consider the system described by the state-space equation 

[ y* A Ay = 0 in Q 

S y = 0 in £ (2.2) 

[yW — Do supposed unknown in il 

and the output function 

-(f) = Cy(t) (2.3) 

where 

(': y e L 2 ({U,a 2 (ii)) — 4 e L J (lHj.Z). (2.4) 


Equation (2.3) gives measurements of the state of the system. These 
measurements can be obtained by pointwise or zone sensors which 
may be located in D or in T. 

Let us recall that a sensor is defined by a couple {D. f) (sec [1]) 
where: 

i) D C 12 is the support of the sensor in the /one case, while 
D = {ft} in the pointwise case; and 

ii) / € L 2 (D) is the spatial distribution of the sensor in the zone 
case, while / = ft/, in the pointwise case (ft/, is the Dirac mass 
in ft). 

Let 

{ </d ^ to be estimated 

2 n \ , . . (2.5) 

flo u\uj undesired 

The problem consists of reconstructing //,*, with the knowledge ot 
(2.2) and (2.3). From (2.2)-(2.3) we can write - = Ky {} where h 
is as an operator: L*(il) -> I 2 ((U,:Z). We recall the following 
definitions. 

Definition 2.1: 

1 ) System (2.2) with output (2.3) is weakly observable if kt rK - 
{()}. The sensor is then said to be strategic |2]. 

2) System (2.2) with output (2.3) is weakly regionally observable 
in a,' C D if ftvrh — {0}. In this case the sensor is said 
to be -j- strategic with 

:L 2 (U) —* L 2 ^) 
v-i = -k (2.t 

(-1* is the restriction of c to -j.) the adjoint of u- 

vMVl-tt’dl) (2.7 
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is defined by 



The concept of regional analysis has been recently developed by 
E$l Jai and Zerrik, and various papers have been devoted to this new 
approach of distributed parameter systems [31, [4|. 

B. Reconstruction Method: Case of a Poinrwise Sensor 
In this case, system (2.2) is observed via the output 

-(f) = y(Ki) (2.9) 


Proposition 22: If the sensor (Mi>) is ^-strategic then (2,18) 
has a unique solution y* 0 E G which corresponds to the regional 
state yk to be observed in u\ 

Proof: 

1) Firstly we show that if the sensor (h. fy,) is u/—strategic then 
(2.14) defines a norm on G. For y?° E G, ||v?°lk; = 0 

= 0 for all t E [0, //]. As the system is autonomous we 
have € /wrA’\* and as the sensor (Mj,) is u.-strategic 
then = 0 in SL 

2) The uniqueness of (2,18) is immediate if A is an isomorphism 
from G —♦ G J . 

We have 


where h 6 H is the sensor location. Consider the set 

G = {r/ E L 2 m | (I = 0 in (2.10) 


(A/./) = <P, 7 x(. ,’(0»./> 
= (.'(()),/> 


and 


G = { tje rr(il) I {1 = Oin + } ( 2 . 1 1) 


and denote (■. ■) the A 2 (12) scalar product. It is then easy to see that 
for g 6 G and g 6 G 




( 2 . 12 ) 


For y° E G, consider the system 


v' + V = u Q 

y = 0 £ 

yil)) = / S 2 


(2.13) 


and a formal use ol Green’s formula gives, by multiplying 
(2.15) by and integrating by parts 


- f i'IV + jVW 

+ / </E 

,/y (Am ,/y 

= - / ' f{h.t) 2 ,it 

Jo 


where r is the outward normal derivative to <9S2 and, for the 
operator .1 defined in (2.1) 


which has a unique solution y* E A‘(G)> Moreover is continuous 
on (<A Then we can consider the operator || * || ( . defined by 


u 

Y - ” 1 


E G 



(2.14) 





c).r, 


<'os(//. . 1 ’, ). 


From boundary and initial conditions we have 


which is a semi-norm on G. Consider now the system 

-v' + Ti; Q 

l' =:() £ . (2.15) 

l'(//) = U 12 

For y' 0 E G, (2.13) gives y"(&./) and with (2.15) we can obtain a 
corresponding i. • (0) . 

Let A be the operator defined by 

A/ = P„ i(r(0)) (2.16) 

where P a j is the projection on G 1 (G 1 may be identified to the 
dual G 1 of G). Lei 2 be defined by the solution of 

-2' + .4*2 = --())/’( -r-b) Q 
2 = 0 E. (2.17) 

Z(i,) = 0 

Finally for a convenient choice of in G, (i.e., such that -fib.1) = 
-(/)), system (2.15) is the adjoint of the observed system (2.2)-(2.9) 
and hence, the observation problem in the subregion a- will be 
naturally solved by the equation 

A/=P*a(Z((»). (2.18) 


(W> = (P„ A (r< ()))./> 

= ll/llr.- 

One deduces that A is an isomorphism from G to G 1 (all this can 
be made completely rigorous without too much difficulty). Finally 
(2.18) has a unique solution y* n = which ensures the observation 
of the initial state in the subregion jj. 

Remarks: 

1 ) In [41 we show that (/>,fy,) is ^-strategic <=> y\(fc) / 0 for 
all /, where (y? f ) is the set of eigenfunctions of .4, with Dirichlel 
boundary conditions, in L l U 1 ) associated to the eigenvalues A,. 

2) These results may be easily extended to other kinds of mea¬ 
surements |4J. 

III. Numerical Approach 

The numerical approach may be achieved very easily when one 
can calculate the eigenfunctions of the system; this is the case of 
invariant systems and will be developed in the next subsection. In 
the general case an adapted technique is given and detailed later. 

We have seen that the solution of the regional observation problem 
is obtained by the solution of the equation 


The considered approach is derived from Lions Hilbert uniqueness 
method [5] which has been developed for the usual controllability 
concept but here the context is somewhat different. We have the 
following result. 


A/ = P,; L (2(0))/6G. (3.1) 

In the next sections we give an implementable approach for solving 
the above equation. 
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A. Invariant Systems Case 

tn this section we consider the case where the system is described 
by an equation with constant coefficients. The idea, in this case, is 
to calculate as shown below, in a suitable basis (<pj) of L 2 (Q), the 
components' A tJ of A. 

We assume that A has in L 2 (ft) a complete set of eigenfunctions 
such that 


Ay?j — Aj <pj ft 

= o r . ( 3 . 2 ) 

iur = i v, 


Without loss of generality we suppose that the eigenvalues A, of A 
are of multiplicity one. As the sensor is pointwise, we have 


Electrode I Electrode 2 



Fig. 1. Longitudinal section of the reactor 


(A* 0 ,/) = ll/ll?. = f' *>( h.tM (3.3) 

J 0 

and with 

X> 

I(j(M) = M (3.4) 

< “= I 

we have 

(A/./) = 

* J 

(A, 4 Aj )t j ___ i 

x jT+A-' (15) 

One deduces 


Algorithm, 

1 ) Solve (2.17) (- 4 . 7(0)). 

2) Initiate </° in G. 

3) Solve (2.13) (-* ^(hj)). 

4) Solve (2.15) <- i/(0)). 

5) If ||0(O) - Z(0)\\ f > - return to Step 2). 

6) The observed state on the subregion + is given by t/d = </°. 

Remark The choice of y" 0 can be made by an optimization process 
which does not need the calculus of the gradient [9], The initial 
choice of y' 0 must be as close as possible to the stale suggested by 
the real system. But any arbitrary choice will lead to the same result, 
with more computations. This is applied to the thermal considered 
problem. 


A = (Azj), /,7 = 1, 


A.j = ^ - (3.6) 

hence, the regional observation problem is transformed in the linear 
system 

M 

£a,,/ = T(0) , = 1.--..A/ (3.7) 

J~ l 

where 2T(0) (respectively, ^) are the components of (re¬ 

spectively, <p°) in the basis (</j). Let us remark that by construction, 
the observed state </d will vanish in 

B . General Case 

In the general case, it is not easy to calculate the eigenfunctions of 
the operator A. Here we shall give a direct approach which allows to 
overcome this difficulty and leads to the state to be estimated in u>. 
We have seen that the regional observability is equivalent to finding 
such that ^’(0) = 7(0) in u, where 0(f) and 7(f) are solutions 
of (2.15) and (2.17). So the problem turns up naturally to find 
which is the solution of the following problem 

min||V>(0) - 7(0)||* J( ., 

o _ • W>°) 

/ € G 

The resolution of this problem can easily be achieved by the direct 
minimization algorithm. 


IV. Application to thl Thermal Deposition Problem 
A System Description 

The deposition phenomenon is produced in a reactor with cold 
walls where a substrate is placed. The temperature is controlled by 
the energy supplied by electric current (Joule eflect) between two 
electrodes maintained at a constant temperature (they are thermally 
controlled by a refrigerating fluid circulation). The Joule effect 
produces a temperature elevation in the substrate which exchanges 
heat with the surroundings essentially in the radiative form (around 
a temperature of 1300 degrees C). The deposition concerns only a 
subregion jj C ft. Sec Fig. 1. 

The substrate is first steered to the deposition temperature, and 
then the covering process starts. The surface of the substrate is 
modified, its emissivity evolves, and then thermal exchanges start 
The modification of the thermal profile is to be compensated with 
action on the electric current. Then it is important to know the 
variation of this profile around the previous equilibrium state. One of 
the difficulties is due to the fact that only the temperature out of the 
deposition zone jj can be measured by optical pyrometry. In view of 
this, we have to conceive a precise estimator of the temperature profili 
in the sensitive subregion. This is made by the regional observattoi 
techniques developed in the previous sections. 

B. Thermal Model 

The modelization of the thermal deposition problem can be cantei 
out in one-dimensional space with ft =]0, f[. Let -j C ft be thi 
deposition zone. The radiative phenomenon introduces a nonlineui 
term in the thermal balance which takes the conductive exchange 
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Fig. 2. Comparison of the real state 7o(.r) (supposed to be unknown) and Fig. 3. Zoom ot the real and estimated stales in the deposition subregion, 
the estimated state 7J), (r) on the deposition subregion uj = [0.3,0.7]. 


into account. This leads to the following moderation of the system 


OT(. vj) 

r(o,n = iy 

T(f,t) = Ti 
T(.r.(l) = r,,(j-) 


2 ■<r(( + <)T'UJ)+ £/‘* 


] 0 .([x]<M,[ 

10 , M 
]0. M 


where 

x : space variable, 

nr. mass element by unit length, 

A : thermal conductivity, 

P : emissivity, 

( : depth ot the element, 

T : temperature of the element. 


t : time variable. 

v v \ calorific heat element. 

t> : electric sensitivity. 

( : length of I he clement. 
a : Stephan-Boltzman constant. 
I : current intensity. 


C Linearized Model 


The previous model is nonlinear but the nature of the problem 
allows the study to be made in a linear context. Indeed the deposition 
is produced around a temperature 7o. The modification of the thermal 
profile has to be compensated with a control of the electric current 
around To. Therefore (4.1) can he linearized around 7b corresponding 
to current intensity / = Jo. 

Let A T ~ T - 7b, A/ = I - h. The linearized model is then 

s dATUJ) 

0 f 

< = a P -jj tJpH + liT^T(.v.t) + -> ]0.([x]0./,[ (4.2) 

AT(().0 = AT(M) = 0 jn.fjl 

< A7(.r. 0) = 0 j(U[ 


where a, ,1, and 7 are given by 


Xfe 
mc v ' 


J = 


-8 :(t{( -I- r) 


nic p 




-AJ 


(4.3) 


where AJ is the control variable and A, p, 7, are 

constants to be specified for numerical simulation. 

The problem is to determine the variation of the temperature profile 
only on the deposition subregion around the previous equilibrium 
rating. The measurements are given by the output function 


D. Simulation 


For numerical simulations we consider the following values for 


the different parameters 



n 

=]0.1[ 

uJ 

= ILtl 

7b 

= T-i = 900° C 

la 

= 38 A 

1 

= 0.510"* m 

( 

= 10“'* m 

- 

= 0.2 

<’r 

= 61.10-'* JKg" 1 K - ' 

A 

= 1.03210* Wm -1 K" 1 

in 

= 1.0 Kgm " 1 

a 

- 5.7110' M Wm K" 4 

/> 

- 2.8710"'Sim 1 . 




(4.5) 


The sensor is located out of the deposition zone with b = 0.85. 


Using the algorithm of Section III-B, we obtain the following 
results: Fig. 2 gives the estimated state (dashed line) and the real 
state (continuous line). 

The regional state in the deposition subregion is estimated with a 
good accuracy 


lift, -ft|iL (w) 

iiftii?^, 


~ 1 . 110 " 4 . 


(4.6) 
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Towards a Generalized Regulation Scheme for 
Oscillatory Systems via Coupling Effects 

Kevin L. Tuer, M. Farid Ck)iiiaraghi, and David Wang 


Atomic!—The vibration suppression laws presented In this work utilize 
an energy transfer phenomena that is evident in some linearly and 
nonllneaiiy coupled systems. The first iteration of Uie formulation Is 
presented in this paper. These control laws are unique in that the states 
approach the operating point on the order of a cosine function, the 
nonlinear control law renders a unidirectional control input, and the 
intuition associated with the energy transfer analogy aids the controller 
design. 


I. Introduction 

Many techniques have been formulated to suppress oscillations in 
both linear and nonlinear systems. The standard linear time-invariant 
state-space representation of a system has the form 


Jr = Ax ■+■ Bu 
y = Cjt + Du 


(I) 


where .4 € JR "* n , B £ C £ JR**" and D £ JR''* 7 ". In 

contrast, a typical nonlinear system is represented by 


Jr = f(j\ u) 
y = gl*n »0 


( 2 ) 


based characteristics of a system. The control input to the plant is 
chosen to ensure that the closed-loop system is dissipative which 
translates to a decrease in the system energy. Other nonlinear control 
techniques involve the use of a variable feedback law including 
gliding mode [4] and gain scheduling [6|. For example, the former 
involves specifying a sliding surface in the state space and defining 
the appropriate control parameters such that a sliding mode exists and 
the surface is locally or globally attractive. Implementation typically 
involves high speed switching feedback which can induce spillover 
effects. Gain scheduling is similar except the transition between 
feedback structures is smoother |6). Some nonlinear systems support 
the use of feedback linearization techniques [4]. Using this approach, 
the designer develops a feedback law that essentially linearizes the 
plant. Then, a wealth of linear control algorithms can be utilized 
to complete the control design. These techniques, however, tend to 
suffer from a lack of robustness in the presence of uncertainty. More 
recently, neural networks [7] and fuzzy logic [7] have been utilized 
for vibration suppression applications as well. 

Implementation of the control techniques outlined above may 
render a system that is stable in an Lj, 1-^, or exponential sense. 
Stability in this sense, however, does not preclude the possibility 
of one or more of the states of the controlled system exhibiting 
an unacceptable rate of convergence to the desired state and/or 
unwanted oscillatory behavior. The techniques presented in this work 
are intended to reduce or eliminate these effects. 

For the purposes of this paper, the investigation is conducted using 
a canonical second order system with a stare- space model given by 


where f:JR"'+ n -*■ JR" and g:JR m +" —* JR l> . The corresponding 
control techniques can also be classified as linear or nonlinear. 
Included in the linear realm are techniques such as the standard PID 
control [ 11, pole-placement control |l], and linear quadratic (LQ) 
techniques 11]. The latter involves the minimization of a quadratic 
cost function to define the control law. More recently, the H * 
optimal control technique [2] has been used quite extensively. It 
entails the selection of a controller from the set of all possible 
stabilizing controllers by minimizing the infinity operator norm of 
the transfer function between the exogenous inputs and exogenous 
outputs of a standard augmentation. Structurally similar approaches 
include the //2 {2J and L\ [3] control methods. The field of nonlinear 
control includes techniques such as the Lyapunov approach [4], the 
passivity formalism [4], and dissipative system theory [5]. These 
particular strategies have energy-based scenarios. The Lyapunov 
approach involves specifying a control input such that the candidate 
Lyapunov function is locally positive definite and its time derivative 
locally negative (semi) definite. A similar approach is taken using 
the passivity formalism which involves the use of a Lyapunov-like 
function that is an indicator of the energy content of the system at any 
point in time and space. The control input to the system is selected 
such that the system is passive and/or dissipative. In addition, closed- 
loop Stability of a feedback system can be ascertained using the well 
known passivity theorem [4|. Dissipative system theory, in contrast, 
employs energy storage and supply functions to model the energy 
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0 


1 




•» /> + //* ( x p ) + 


0 

h m 


(3) 


where C/m £ JR 4 are the damping coefficient and natural fre¬ 
quency respectively, ftai € JR, fp(x) contains weak nonlinear terms 
and the state vector is {x p := [.rj jj] = [x .i ]f. Equation (3) 
can be considered as the model of one oscillatory mode of a multi- 
degree-of-freedom (MDOF) system. Thus, the intention is to develop 
the theory for a canonical oscillatory system with the ultimate goal 
of utilizing the theory for multi-mode oscillation suppression. For 
notational purposes, the system given in (3) will be denoted as the 
plant. 

The essence of the proposed techniques is to exploit the energy 
link that can exist between two coupled, second order, oscillating 
systems. If one equation models a system to be controlled and the 
other models a controller, then it is plausible to alter the dynamic 
characteristics of the plant using the controller. Under this scenario, 
the plant and the controller dynamics are characterized by amplitude 
and/or phase modulation giving rise to a beat phenomenon. As a 
result, the envelope of the plant response is periodic and approaches 
a minimum at approximately the same rate as a cosine function. 
Subsequent controller action can then be instituted to "remove" 
energy from the entire system. Using the energy transfer analogy, 
the design is well directed and efficient. The designer is able to 
incorporate physical meaning with the mathematics to yield a control 
strategy that is intuitive and relatively easy to troubleshoot. 

The use of coupling effects to suppress structural oscillations 
has been addressed recently in [8]-[14], Golnaraghi was one of the 
first researchers to use modal coupling effects to control structural 
oscillations [8], [9]. Tuer et al. [10). Duquette et al. [11], Duquette 
[12], and Tuer [13] examined the use of coupling effects to control 
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Fig, I. Experimental Responses [12]. (a) Coordinate coupling, (h) modal 
coupling. 


oscillations in a flexible cantilever beam on both a theoretical and 
experimental level. The desired coupling effects were achieved via 
physical components. A DC motor was attached to the free end of 
the beam. A rigid beam-mass assembly was connected directly to the 
shaft of the motor. This assembly was designated the secondary link. 
The position of the secondary link was regulated at 90 degrees to 
the flexible beam via position and velocity feedback from the motor. 
Fig. 1(a) shows the experimental plot of the regulated response of 
the tip of the flexible beam using the coordinate coupling control law 
after subjecting the tip to an initial displacement. Similarly, Fig. 1 (b) 
illustrates the up response employing the modal coupling control 
law. Both strategies function very effectively. These experimental 
results illustrate that physical systems can exhibit both coordinate and 
modal coupling effects under the proper conditions. This investigation 
also showed that these phenomena can be used, rather effectively, 
for control purposes. This physical insight provides the primary 
motivation for the current research. 

The disadvantage of the aforementioned work is the need lor 
unconventional actuation devices to apply the necessary control. 
This problem has been addressed by Tuer et al. [14]. They have 
initiated the generalization of the use of modal coupling effects 
to control a canonical system using more conventional actuation 
devices. In this paper, both linear and nonlinear coupling effects are 
employed as paradigms in the development of control laws to regulate 
an oscillatory state of a system. As well, stability issues for both 
control strategies are addressed. A numerical example is presented 
to illustrate the use of the proposed techniques and corroborate the 
mathematical predictions. 


II. Controll.hr Implementation 

To suppress the oscillations of a system of the form of (3) using the 
proposed techniques, a second system must be introduced to which 
the plant can be coupled using feedback. This is accomplished via 
the introduction of second-order differential equation with a linear 
structure similar to that of the plant. This system becomes the 
ontroller in the closed-loop system. Thus, the closed-loop system 
ynamics arc dictated by equations of the form 


’•h ' 


J‘2 _ 





Wr 


+ /jj Uv) + 


0 

l>2 1 


It (4) 


Disturbances 
MO) 


x 

* = Ax + Bu -*0 


y = Cx 4 Du 


Controller 
ft, « f.(x) 




Design Parameters 

Fig. 2. Block diagram of controlled system. 


-1 1 


■ o r 

.r \ 


0 

.i* i 

— 

_-*■? 0 

/ 1 . 

4* 



where (4) models the plant, (5) models the controller and 6 2R* is 
the coniroller frequency. The function C(-) is a damping mechanism 
utilized to effect control and /,,(•) contains the plant nonlinearities. 
The plant control input «, which lakes the form of a state relation in 
the formulation, and /, (>) are used to establish the coupling between 
the plant and the controller. The coefflcicnt of the relation for u, 
the elements ot /,(■), ihe form of C( ), and the controller frequency 
are the design parameters of the system. A block diagram of the 
closed-loop system is illustrated in Fig. 2. As evident from Ihe figure, 
the disturbance is modelled by initial slate conditions acting at the 
output of the plant. 

The design of the controller is a two-stage process. The first 
step involves establishing the required coupling effects. The second 
step revolves around specifying the remaining design parameters to 
establish the frequency of amplitude modulation and defining the 
appropriate parameters of the energy removal mechanism such that 
the disturbance induced motion is quelled in accordance with the 
given design specifications. 


A. Establishing Coupling Effects 

The proposed technology hinges on both linear and nonlinear 
coupling effects to function. In the past, the phenomena resulting 
from coupling effects were examined primarily from an analysis 
standpoint. However, the focus of this work is to utilize the desirable 
features of these phenomena for control purposes. 

J) Coordinate Coupling: In a mathematical sense, coordinate cou¬ 
pling refers to Ihe nondiagonal nature of the inertia and/or stiffness 
matrices of a system of second order, ordinary differential equations 
of motion. In certain situations, the system response may exhibit 
periodic amplitude modulation or a beat phenomenon. Since the 
solutions of (4) and (5) are predominantly oscillatory, this amplitude 
modulated response must be the result of the superposition of "tuned" 
periodic functions. This can be illustrated using the analogy of the 
pointwise addition of two periodic functions with the same amplitude 
and nearly the same frequencies of oscillation. There are well-defined 
points in lime where the two functions reinforce each other to yield a 
doubling of the amplitude. Conversely, there are points in time where 
the two functions act to cancel one another yielding a net amplitude 
of zero. On a physical level, this is analogous to a transfer of energy 
between the coordinates of the system. 

The development of the control law based on coordinate coupling 
was conducted by first recognizing that a similar phenomena can 
be established between a second order, single-degree-of-freedom 
(SDOF) plant and second order, SDOF controller through the proper 
selection of the controller parameters and the control input to the 
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plant. That is, the control law was established so as to generate a 
closed-loop system with inertial coupling. To this end, the closed-loop 
equations have the form of (4) and (5) with 

, U = - J’4 

/,(■) = - R\h 

/„(•) = 0 (6) 

where K € J7T 1 ". In generaL the last condition in (6) need not be 
satisfied. It is implicitly assumed, however, that any nonlinearities 
contained in /,,(•) are at least one order of magnitude smaller than 
the linear terms of the closed-loop system equations. The solutions 
Of (4) and (5) with (6) invoked, assuming zero damping, are readily 
found to have the form 


JTl 


'C,.4l 

CiA'V 

ros(ufit + tf j ) 

/ 1 


C,.4J 

ClAi 

VOs(jJ2t + $ 2 ) 


where Al £ JR is the ah entry of the jth eigenvector corresponding 
to the jfth natural frequency of the closed-loop system, The 

constants Cj E B and tf, £ JR are defined by the initial conditions 
imposed on the system. The eigenvalues of the undamped system 
A 1,-2 £ C\ are 

. _ K’ + S) ± ygg+gg- 4(i - Tn^Tj ... 

Al2 2(1 — A') (8) 

The degree of coupling and frequency of amplitude modulation is 
contingent on the magnitude of K since this parameter dictates the 
proximity of the system’s natural frequencies. 

The first solution in (7) models the plant response whereas the 
second solution pertains to the controller response. Using the design 
parameter, A, the natural frequencies of the closed-loop system 
can be placed arbitrarily close to one another. Since the plant and 
controller responses are the result of the superposition of two periodic 
functions with nearly the same frequencies, a linear beat phenomenon 
can be established. Un(ier these conditions, the periodic functions 
will completely cancel one another provided the amplitudes of the 
two functions are equivalent. In terms of the solution for the plant 
response, this translates to enforcing the condition C\ .4] = Ci A'i in 
(7). In [12], ii was shown that this condition is satisfied if 

a. 1 , = uJp. (9) 

To ensure a relative plant energy minimum at a specific point in lime, 
the parameter K is chosen so as to satisfy the relation [ 12 ] 

^ — ( 10 ) 
^'2 - 

where the user-defined n £ 1 T \0 dictates the amplitude modulation 
frequency. Both (9) and (10) are necessary conditions to establish 
the desired amplitude modulated response. The time at which the 
first cancellation occurs, //,, can be shown to occur at [ 12 ) 

tf. =——■ 01) 

*2 ~ ^'1 

Thus, satisfying (9) ensures that C\A\ = CzAf to foster complete 
cancellation at time fy r , the value of which is dictated, primarily, by 
the magnitude of n as expressed by (10). That is, the parameter n is 
used to define the point in lime where the position and velocity of the 
plant are simultaneously zero which corresponds to zero plant energy. 

In the foregoing formulation, acceleration feedback was utilized 
to establish dynamic coupling effects. Analysis shows that a static 
coupling paradigm can also be utilized and that both (9) and (10) 
hold under this scenario as well [15], [16]. Since both types of 
coupling render the same characteristics, the remainder of the analysis 
is conducted for the dynamic coupling scenario only. 


2) Modal Coupling: A nonlinear system may exhibit modal cou¬ 
pling effects if it is in a state of internal resonance. Internal resonance 
has its foundation in the field of nonlinear oscillations. Mathemati¬ 
cally, a necessary but not sufficient condition for the existence of 
^internal resonance is defined by the commensurability of the linear 
natural frequencies of the nonlinear equations of motion, u>, [17], 
That is, there exist constants {tf i, • ■ ■. E H\0] such that 

*Mi + ^2 + -" + « 0 . ( 12 ) 

Under such conditions, an energy bridge is formed between the 
commensurable modes of oscillation. If the left-hand side and right- 
hand side of ( 12 ) are identically equal, an exact commensurability 
condition exists which fosters a state of maximum energy transfer. 

The type of internal resonance condition is contingent on the order 
of nonlinearities. For instance, if quadratic nonlinearitics are present 
in the system, a 2:1 internal resonance condition may exist. The 
specific commensurability relation under this scenario is tfi-i>i 4 - 
$ 2 ^'2 0 where the associated constants are {tfj =■ ± 2 , tf 2 =- q=i} 
or{tfi = ±l.tf^ = 4-2}. A state of 2:1 internal resonance is 
employed in this paper. 

As with the coordinate coupling-based control law presented in 
the previous section, the internal resonance-based control law was 
developed by recognising that a state of internal resonance can be 
established between a second order, SDOF plant and a second order, 
SDOF controller. Thus, using this paradigm, the closed-loop system 
equations are of the form given by equations (4) and (5) with 

(/ = — -1 ./■ } 

/«(■) 
fv(') =0 

Ur 't (13) 

where - l. 12 £ B^ are design parameters. In general, /,,(•) need 
not be zero. In this case, if the structure of the nonlinear terms, 
f p (‘), parallel the nonlinearity introduced by the control law, they 
can be used to effect the control. If these terms have a structure 
differing from that of the nonlinearities introduced by the control, thc> 
will not affect the integrity of the controller provided they are weak 
relative to the control law nonlinearities. Finally, the last condition in 
(13) is employed to satisfy the commensurability requirements and 
is necessary for establishing modal coupling effects. 

Other forms of nonlinear feedback can be utilized in the formu¬ 
lation of the control law [15]. That is, the nonlinear terms remain 
quadratic but the individual terms may include position, velocity, 
and/or acceleration feedback. The analysis and resulting relations 
differ slightly but, nevertheless, have essentially the same structure 
Therefore, as with the linear case, the potential to use other forms 
of feedback renders more freedom to the design. In this paper, the 
system given by (4), (5), and (13) is analyzed and discussed in detail 

To illustrate the modal transfer of energy and to derive a relatim 
indicating the degree of energy transfer, a perturbation analysi 
technique known as the method of multiple scales is utilized [J7| 
Using this approach, an assumed solution for the undamped nonlmea 
system is constructed in the form of an asymptotic series. Sever* 
relations indicating the existence of internal resonance are derive 
in the process of formulating the exact form of the elements of tl 
series solution. The system solution is of the form 

-n =.rio(7o, T \) + F.rn (7oi Ti) + ■ ■ ■ 

•tj =J*ao(To, Ti ) + ^31 (To, Ti) + ■ ■ ■ (^ 

where T t are new independent time scales and the correspondin 
asymptotic sequence is constructed using powers of {f € B + | 0 
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Fig. 3. Undamped Modal Response. - . /?i.-: ft? 

f < 1}. Substituting (14) into the undamped form of the nonlinear 
equations of motion yields a set of conditions that must be satisfied in 
order to satisfy the constraints implicit on the series solution. These 
are known as the secular terms or solvability conditions and can be 
expressed in terms of the modal amplitudes, 7J|. /? 2 , and associated 
phases, oi.d-/, of the system 

/?, = — 0.0625*) i -t'|, 7 ?]mij(<;j ) 

/?; =0.25*20/,,7? i i?‘j.siii(c>l 
R}0 =(0.5 - )2-*/ # ,- 0.0625*)ij.> # ,7?i)ros(o) 

(> =2n j - n i (15) 

where the overdot ( ) indicates differentiation wilh respect to the 
time variable T\. The variables B i and U> model the envelope of 
the response of the plant and controller, respectively. Equations (15) 
model the response of the envelopes of the modes of oscillation of the 
system. Simulation of these equations indicate that the desired transfer 
of energy between modes is indeed occurring as illustrated in Fig. 3. 
The corresponding parameter values are = 30. *i = - j = 0.5). 
Equations (15) are used to generate an energy relation which has the 
form 

+ 06) 


B. Achieving Vibration Suppression 

To effect control over the system, the energy must be transferred 
from the plant to the controller and "removed" before it has an 
opportunity to transfer from the controller back to the plant. To 
accomplish this task, two fundamentally different approaches are 
proposed [14]. The first method is the disabled input method (DJM). 
Using this method, the existing energy bridge is severed once the 
energy has transferred from the plant by disabling the control signal 
to the plant. The second method is termed the dissipated energy 
method (DEM). This approach, in essence, dissipates the energy 
as it is transferred from the plant to the controller via a damping 
mechanism associated with the controller. 

I) Disabled Input Method (DIM): An outline of the design proce¬ 
dures for the coordinate and modal coupling controllers are delineated 
separately beginning with the former. 

a) Coordinate coupling controller: Using this paradigm of con¬ 
trol, the governing equations of the closed-loop system are given by 
(4)“(6) with ('(■) = 0. The first step of the procedure is to set the 
controller frequency, -t 1 ,. equal lo the natural frequency of the plant, 
- 4 to facilitate a maximum energy transfer. The desired response 
frequencies are established using the controller gain K since the 
natural frequencies of the closed-loop system are explicitly a function 
of K. The frequency of amplitude modulation and primary response 
frequency are 



(^ ■i + *’i 

1 2 ) 

\ 2 / 


respectively, where the former is an indication of the settling lime 
of the system. 

Since the proposed techniques hinge on energy transfer, it is 
necessary to construct an energy function for the plant to effect 
control. In general, the energy function has a quadratic form and 
is given by 

Xp (^Xp (20) 


where E t is representative of the system energy and Bf is 
indicative of the energy of mode i. Equation (16) shows that, provided 
ll\ is a periodic function, a continuous exchange of energy occurs 
in this conservative system. The secular term equations are further 
analyzed to generate a relation used as an indicator of the degree of 
energy transfer within the system. This relation takes the form 

r . 4 1 .[fl,(0)- ! /?i(0)ros(o(0))]‘ ! 

A ] — ----- n n 

-,j [*i^7?i(0)* + 7f2(OP 


where 7v'i E In general, the smaller the value of A’i. the better 
the transfer of ertergy between modes. Perturbation analysis shows 
•hat 7v i = 0 corresponds to an unstable equilibrium point and is 
therefore inadmissible. It appears after extensive numerical analysis 
(hat (17) can be extrapolated to the full-order nonlinear system 


^2 [.r i (0) 2 .f i {())<'os(cp( ()))]“ [.rj(0) a .n (0) ] _ 

-,,[41i.ri(0)-’+.r'.,(()) 2 ]'’ ” 11 [l^'ilO ) 2 +.r,«))-’] ' 


A’i • 


(18) 


fhus, (18) illustrates the effect that the disturbance and the design 
parameters have on the degree of energy transfer. This is an important 
result because 7 ii and (18) can be utilized in the formulation of the 
controller. To guarantee a particular level of energy transfer, 7v i is 
elected arbitrarily small and the design parameters are chosen to 
satisfy (18). In addition, actuator constraints can be incorporated into 
(he design at this stage since a smaller value of h i translates to 
arger control effort. Further detail of the perturbation analysis and 
he derivation of (17) is contained in [14]. 


where Q is a real, diagonal, positive definite weighting matrix 


Q = 



o' 

l 


( 21 ) 


The final task of the design involves monitoring the plant's energy 
on a continuous basis and disabling the input to the plant when 
the magnitude of the energy function falls below a predetermined 
threshold value, 7?/ )/A/ E 1R+. That is 


a — 



Q-' r 

■>,, Q*v 


> E nni 

< F ,jni . 


( 22 ) 


b) Modal coupling controller ■ Under this scenario, the closed- 
loop dynamics arc dictated by (4), (5), and (13). The procedure 
resembles the latter with a few fundamental differences. First of all, 
the frequency of the controller. , is set equal to one-half the natural 
frequency of the plant, Next, with (^) = 1 and an arbitrarily 
small A\ the required controller initial condition, (0), is calculated 
using (18). To attain the desired amplitude modulation frequency, 
maintain (^) = 1 and set )io 2 such that the point at which the 
amplitude envelope achieves a minimum value corresponds to the 
desired settling time. At this point, double peak symmetry in the plant 
response is achieved to maximize the dwell time at the plant energy 
minimum thus rendering the system more robust to errors in disabling 
time. This is done by adjusting the parameter .r.i(0). Using this new 
value of .i-riO) and the specified design parameters, the degree of 
modal energy transfer. K i, is recalculated using (18). If the energy 
transfer corresponding to this value of 7vi is insufficient, the value 
of 7\j is reduced and the previous steps are repeated. Otherwise, 




m 
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the final Mage of the design entails monitoring the system energy 
and disabling the input in a similar manner described in the previous 
section. That is 



*ZQ*p > E? iM 
< Ei ,ni 


(23) 


Although effective, this technique suffers from lack of robustness to 
errors in disabling time. When the input is disabled, the energy bridge 
between the plant and the controller is destroyed. Thus, whatever 
energy is in the plant mode when the input is disabled remains in the 
plant mode unless controller action is reinitiated. 

b) Dissipated energy method (DEM): The design procedure for 
the DEM is similar to the DIM except for the means of oscillation 
suppression. Using this approach, a damping model is incorporated 
into the controller equation. The energy link facilitates the transfer 
of energy from the plant to the controller where it is dissipated 
through the controller’s damping mechanism. This technique can 
be implemented in either a time invariant fashion or a time-varying 
fashion. The time invariant approach involves invoking the clamping 
mechanism associated with the controller in conjunction with the 
disturbance. That is, given that the disturbance is applied at time t c i 
and the damping mechanism is invoked at time then f, mp = tj- 
Thus, the system approaches tfie operating point slower than a cosine 
function but quicker than an exponential function. The time-varying 
approach involves the use of a function that provides an on-line 
indication of the plant energy. Once this indicator function drops 
below a user-specified threshold, the damping effects are invoked. 
Thus, given that t mtn denotes the time at which the plant reaches 
its first energy minimum, then {t imp = t mil , - d : 0 < d < 
(tr,un - fdM € 1R+}. Alternatively, the damping invoke condition 
can be energy based. That is 


C(-) 


f 0, 4 Qx p >EP rht 


(24) 


where Ep EM is the user-specified energy threshold and C f / e ,( ) is 
the tuned damping mechanism rendering the desired response. This 
implementation allows the envelope of the response to approach 
the operating point with the speed of a cosine function until the 
switching condition is invoked at which point the damping effects 
are implemented. From this time forward, the speed of approach 
becomes similar to that of the time invariant approach. The threshold, 
EP C u must be set at a value that guarantees that the damping 

effects are employed. That is, if the plant energy does not reach the 
threshold value, the damping effects will not be invoked and the 
desired response will not be achieved. 

Unlike the DIM, the DEM technique is relatively robust. More 
details of the design procedure can be found m [14]. 


plausible to conjecture stability based on intuition. A similar stability 
conjecture can be made for the time-varying DEM since the closed- 
loop system is stable before and after the damping mechanism is 
invoked. 

* 2) Modal Coupling Controller: The fundamental approach to as¬ 
certaining the stability of a nonlinear system is to use Lyapunov’s 
indirect method [4]. In this case, the closed-loop system equations 
are linearized about the origin of the state-space. If the time invariant 
DEM is utilized and thus ( p > 0, C > 0, the system is asymptotically 
stable. If, however, C = 0 and/or C P = 0, then the stability of the 
system cannot be ascertained since at least one of the eigenvalues 
acquires a zero real part. Thus, Lyapunov’s direct method is adopted 
[4]. The existence of quadratic nonlinearities in the closed-loop 
system suggests the use of a candidate Lyapunov function of the form 

V(.r) = aa'i + bxi + r.r] + d.v\ (25) 

where £ 1R + . Taking the time derivative of (25) and 

substituting the expressions for the derivatives of the state variables 
yields 

V(.r) = -4C,^V'2 ~ 4(/C•■’I + (2c - 

+ (2rij2 — 2ft")i + (2 r — 2rfu.',‘).ri.i’-i. (26) 

Selecting the variables of the candidate function to be 

a = 4 - b = 1, c = ( — U',\ (}-(—) (27) 

induces the derivative of the Lyapunov function to take the form 

t’(JT) = -4C P .^ - 1( — (28) 

12 

which shows that the closed-loop system is stable over the entire 
state-space provided 

C„C >0. ->,,.->,,(21) >0. (29) 

12 

As with coordinate coupling, stability for the modal coupling DIM 
and the time-varying DEM is difficult to prove. Closed-loop stability 
can be conjectured based on a similar argument as given in the 
previous section. 

Asymptotic stability for the nonlinear feedback configuration can¬ 
not be ascertained due to the fact that once the plant is motionless 
or the equation associated with the controller becomes inactive, 
the modal energy bridge is destroyed. The energy bride can be 
regenerated at any time, however, provided both the plant and 
controller are active. Thus, it may be possible to prove asymptotic 
stability if a small, persistently exciting signal is applied to the 
controller equation for the duration of plant activity. 


C. Stability Analysis 

The stability of the closed-loop system can be ascertained for both 
the coordinate coupling and modal coupling scenarios. For the DEM, 
a viscous damping mechanism (i.e., (?(•) = 2( c jj, x* ) is implemented 
in the controller equation. 

1) Coordinate Coupling Controller: System stability using the lin¬ 
ear feedback approach is attained by examining the eigenvalues 
of the closed-loop system which, in the absence of damping ( 
C p = 0, C = 0) are dictated by (8). Thus, the undamped closed-loop 
system is globally stable provided 0 < K < 1. If the time invariant 
DEM approach is utilized with ( p > 0, < t > 0, it can be shown that 
the system is globally asymptotically stable provided 0 < A' < 1. 
Stability for the DIM and the time-varying DEM is more difficult 
to show. Since, however, the closed-loop system is stable when the 
control is applied and after the control is disabled using the DIM, it is 


III. Numerical Examples 

In this section, the proposed control techniques are applied to *i 
plant of the form 

[ 0 run 

t= [-900 oJ' + [lJ" (30) 

where C P ,/ P (-) = 0 have been assumed for convenience. Thi 
uncontrolled plant response is given in Fig. 4. If coordinate couplin' 
effects are utilized, the control structure is 

u = -j\|, /, (•) = -A r i* 2 , u\ — uj p . (31 

The plant and controller responses when (31) are enforced and h 
= 0.0017 are given in Fig. 5. Clearly, the desired transfer of energ 
is occurring. If the value of K is increased to A'® 0.005, whic 1 
corresponds to 425, the first energy minimum of the plant occur 
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Fig. 4. Uncontrolled plant response. 
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(a) 
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hg. 5 System response using coordinate coupling (a) Plani, (b) conlrollei. 
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Time (sec) 


Fig. 6 Plant response, coordinate coupling, with desired amplitude modu¬ 
lation frequency 

at approximately 1.5 seconds as illustrated in Fig. 6. Utilizing the 
DIM results in the controlled response and input profile illustrated 
in Fig. 7. The steady-state oscillation is an indication of the amount 
of energy in the plant when the input was disabled. If the DEM 
approach is adopted and a viscous damping mechanism is utilized, 
a time invariant implementation results in the plant response and 
input profile given in Fig. 8 whereas if the time-varying approach is 
'itilized, the plant response and input profile are as given in Fig. 9. 

If modal coupling effects are employed, the control parameters are 

i/ = /,(■) = = 2^ ■ (32) 

Hie corresponding plant and controller response with (32) invoked 
is illustrated in Fig. 10. The magnitudes of the nonlinear coeffi¬ 
cients are then increased to render a plant response with the first 
v nergy minimum occurring at approximately 1.5 seconds (Fig. 11). 
>o achieve symmetry in the response, the initial position condition 



to 
20 
10 

? 0 

i -10 

-20 
30 

0 05 l 15 2 25 3 35 4 

Time (sec) 

(b) 

Fig 7. DIM controller using coordinate coupling, (a) Plant response, (b) 
input. 



Fig. 8 DF.M time invariant controller using coordinate coupling, (a) Plant 
response, (b) input. 

is increased. The resulting response is illustrated in Fig. 12. Finally, 
implementing the DIM control law yields the plant response and 
input profile given in Fig. 13. With the given choice of nonlinear 
feedback, the control input is always on one side of zero. This may be 
advantageous for some applications. If the plant is controlled using 
the time invariant DEM, the plant response and input profile are 
as given in Fig. 14. If the time-varying approach is implemented, 
however, the plant response and control input profile are as given in 
Fig, 15. Fig. 16 illustrates the controlled response and input profile of 
the plant subject to a proportional-derivative control law. These are 
included for comparison purposes. A complete list of parameters for 
the coordinate coupling and modal coupling controllers is contained 
in Tables MIL 

In all cases, the proposed control strategies function very effec¬ 
tively. Unlike many techniques, the controller design is aided by the 
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■(b) 

Fig. 9. DEM time-varying controller using coordinate coupling, (a) Plant 
response, (b) input. 
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Fig. 13. DIM controller using modal coupling, (a) Plant response, (b) input. 


(a) 



0 1 2 3 4 5 6 7 8 
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Fig. 10. System response using modal coupling, (a) Plant, (b) controller. 
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Fig. 11, Plant response, modal coupling, with desired amplitude modulation 
frequency. 

intuition accompanying the energy transfer analogy upon wmen me 
derivation of the control laws were based. Thus, troubleshooting and 




IV. Conclusions 

In this paper, new techniques were outlined to suppress the 
oscillations of a canonical second-order system. In general, the 
oscillation to be suppressed may result from the application of anothei 
control technique or may be disturbance induced. 
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Fig. 15. DEM time-varying controller using modal coupling, (a) Plant re¬ 
sponse, (b) input. 
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(a) 
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(b) 

Fig. 16. PD controller, (a) Plant response, (b) input. 

There are several advantages affiliated with these techniques. First 
T all, a great deal of intuition can accompany the mathematics in the 
ormulation of the design as a result of the energy transfer analogy, 
his leads to more efficient controllers and renders troubleshooting 
-latively easy to perform. Secondly, the plant response approaches 
’C operating point on the order of the cosine function whereas most 
v isting techniques render responses that approach the operating point 
'ponentially. Finally, the nonlinear feedback paradigm proposed in 
his paper yields either a positive or negative acting control input 
hich may be useful for certain applications. 

The work presented in this paper represents the first stage in the 
^velopment of generalized vibration suppression laws using linear 
'id nonlinear coupling effects. The extension to the multi-degree-of- 
"edom case involves the transformation of the plant equations to 
form that facilitates the repeated application of the SDOF theory. 


TABLE I 

Coorthnatl Coupling Controller Parameters 



TABLE 11 

Modai Coupling Coni roller Parameters 



Thus, the concepts presented here are integral to the development 
of the MDOF theory. The details of the extension are contained in 
f 15J and [16]. 
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/i-Synthesis of an Electromagnetic Suspension System 

Masayuki Fujita, Torn Namerikawa, 

Fumio Matsumura, and Kenko Uchida 


Abstract —This paper deals with //-synthesis of an electromagnetic 
suspension system. First, an Issue of modeling a real physical electro¬ 
magnetic suspension system is discussed. We derive a nominal model as 
well as a set of models In which the real system is assumed to reside. 
Different model structures and possible model parameter values are 
fully employed to determine unstructured additive plant perturbations, 
which directly yield uncertainty firequency weighting function. Second, 
based on the set of plant models, we setup robust performance control 
objectives. Third, we make use of the D-/v iteration approach for the 
controller design. Finally, implementing the controller with a digital signal 
processor, experiments are carried out. With these experimental results, 
we show robust performance of the designed control system. 

1. Introduction 

Electromagnetic suspension systems can suspend objects without 
any contact. The increasing use of this technology in its various forms 
makes the research extremely active. The electromagnetic suspension 
technology has already applied to magnetically levitated vehicles, 
magnetic bearings, and so on. Recent advances on this field are 
shown in 11], and [5). 

Feedback control is indispensable for magnetic suspension systems, 
since they are essentially unstable systems. To synthesis a feedback 
control system, a precise mathematical model for the plant is required. 
It is known, however, that a design model can not always express the 
behavior of the real physical plant. An ideal mathematical model 
has various uncertainties such as parameter identification errors, 
unmodeled dynamics, and neglected nonlinearitics. The controller 
is required to have robustness for stability and performance against 
uncertainties on the model. 

Recently, //-synthesis , which is constructed with both H^ syn¬ 
thesis and //-analysis, has been developed for the design of robust 
control systems [7), [8]. Beyond the singular value specifications, 
the //-synthesis technique can put both robust stability and robust 
performance problems in a unified framework. Applications of the 
//-synthesis method have been reported in |3|-[4|, and [9). This 
electromagnetic suspension system is a simple S1SO system, but in |3] 
and 110], the authors also applied the H /// synthesis to a magnetic 
bearing, and the effectiveness of this design method was evaluated 
for a MIMO system. In the case of applications of fJ^/n control to 
real physical systems, it is quite important to selecl appropriate design 
parameters. These parameters construct some parts of the generalized 
plant, e.g., uncertainty and performance weightings. 

In this paper, we will evaluate //-synthesis methodology exper¬ 
imentally with a real electromagnetic suspension system. We will 
model the additive uncertainties and decide the frequency weighting 
function for uncertainty accurately and reasonably. We will show 
that the closed-loop system with a // controller achieves robust 
performance experimentally. 
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/ : declroma^netie force 
m ■ nui of the Iron ball 
J : steady current of the electromagnet 
i 1 1 displacement from the steady current 
E : steady voltage of the electromagnet 
t : displacement from the steady voltage 
X : steady gap between the electromagnet 
and the iron ball 

* : displacement from the steady gap 
K : resistance of the electromagnet 
L : inductance ofthe electromagnet 


Fig. 1. Schematic diagram of the electromagnetic suspension system. 


II. Experimental Setup 


A. Electromagnetic Suspension System 

The structure of the electromagnetic suspension system is shown 
schematically in Fig. 1. The objective of our control experiments is 
to suspend an iron ball stably and firmly without any contact by 
controlling the attractive forces of an electromagnet. Note that this 
system is essentially unstable. 

In Fig. 1, a cylindrical electromagnet as an actuator is located at 
the upper part of the experimental system. Mass of the iron ball 
is 1.75 kg, and it has a diameter of 77 mm. A gap sensor of our 
own producing is placed at the bottom of the system to measure the 
gap length between the iron ball and the electromagnet. The sensor 
is scaled for a gap of 2.4 mm per volt. It is a standard induction 
probe of eddy-current type. Physical parameters of this experimental 
machine are shown in Table I. 


TABLE I 

Parameters of Electromagnetic Suspension System 


Parameter 

Maximum 

Nominal 

Minimum 

Value 

Value 

Value 

m [kg] 

i 

i 

1.75 


x (ml 


5.00 x 10-’ 

4.50 x Iff 3 

/ IA] 

1.18 

1.06 

0.93 

x [m] 

5.00 x 10 4 

0 

- 5.00 x 10 4 

i [A] 

1.18 x 10 4 

0 

-1.26 x 10 1 

L [H] 

5.57 xlO 4 

5.08 x lO* 1 

4.65 x 10 4 

« mi 

2.37 x 10*' 

2.32 x 10* 1 

2.27 x 10*' 

* lNm J /A 2 ] 

3.35 x lO 4 

2.90 x 10 4 

2.53 x lO 4 

*o [m] 

-3.32 x 10 4 

-6.41 x 10 4 

-9.42 x 10 4 

Q [Hm] 


5.79 x 10 4 

5.06 xlO 4 

mSEM 

-3.32 x 10 4 

-6.41 x 10 4 

-9.42 xlO 4 

| ^ [HI 

3.96 x lO’ 1 

3.75 x 10 4 

3.54 x 10 4 


A. Model Structures 

We will employ four different model structures for the system 
depicted in Fig. 1. All of the models are finite-dimensional, linear, 
and time-invariant of the following state-space form 

T — Ac Bu. y = Cx 

•»•=[•*■ ']' ■ « =<*• V = -r- (3.1) 


B. Digital Controller 

The experimental machine is controlled by a digital controller using 
a DSP (digital signal processor). The experimental setup basically 
consists of the DSP which is sandwiched between A/D and D/A 
converters. Real-time control is implemented with a processor NEC 
/*PD77230, which can execute one instruction in 150 ns with 32- 
bit floating point arithmetic. This device has enough fast processing 
speed to stabilize a relatively simple magnetic suspension system in 
Fig. 1. The control algorithm is written in the assembly language for 
the DSP and a software development is assisted by a host personal 
computer NEC PC-9801 under: the MS-DOS environment. The data 
acquisition board MSP-77230 consists of a 12-bit A/D convener and 
a 12-bit D/A converter with the maximum conversion speed of 10.5 
;/s and 1.5 //s, respectively. 

The sensor outputs arc filtered through an analog low-pass circuit 
and then converted to digital signals by A/D converters. The DSP 
calculates the control input signals. These digital signals are converted 
to analog signals by D/A converters with a range of ±5 V. The 
converted signals and the steady current signals are added and 
amplified by 10 times to actuate the electromagnet. Steady-state 
voltage of the electromagnet is 24.6 V, and the maximum voltage 
of a regulated DC power supply is 70.0 V. 


III. Model of Electromagnetic Suspension System 
Our purpose in this section is to introduce an ideal mathematical 
model and an uncertainty weighting function for the system. See [4] 
for details. 


First, we introduce ideal mathematical models for the real electro¬ 
magnetic suspension system. Due to the idealizing assumptions that 
we make, two types of ideal mathematical models can be derived 
hereafter, which are composed of nonlinear differential equations. 
We define them as TypelAJ and Type[B], respectively. 

Since the behavior of the electromagnetic force is nonlinear, we 
then employ the linearization procedure around an operating point. 
To account for the neglected nonlinearity, we derive two types of 
linear model, respectively. Thus, we will derive four linear models 
according to the following manners: 

• Model[Al): L = CONSTANT; and the nonlinearity of the 
electromagnetic forces are approximated up to the first-order 
term in the Taylor series expansion. 

• Model[A2]: L = CONSTANT; and the nonlinearity of the 
electromagnetic forces are approximated up to the second-order 
term in the Taylor series expansion. 

• Model/Blj: L = £(,r); and the nonlinearity of the electromag¬ 
netic forces are approximated up to the first-order term in the 
Taylor series expansion. 

• Model[B2]: L = 1(0*); and the nonlinearity of the electromag¬ 
netic forces are approximated up to the second-order term in the 
Taylor series expansion. 

J) Ideal Mathematical Model—Type [A]: We will derive ideal 
mathematical models for the real electromagnetic suspension system, 
where the following assumptions on the electromagnet are considered. 

• A.l) Magnetic permeability of the electromagnet is infinity. 

• A.2) Magnetic flux density and magnetic field have not hystere¬ 
sis, and they are not saturated. 

• A.3) Eddy current in the magnetic pole can be neglected. 
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Using A.1) and A.2), we can treat the coil inductance I as a 
function of variable x. Then, the system can be written by the 
following nonlinear differential equations 


m jp *»** - f' 


=.f-L_y 

r-h.ro / 


> ~ ±{L( x )i] (3.2)* 

where the coefficients k and Jo in (3.2) are constants determined by 
identification experiments. Further, we introduce another assumption 
for TVpe[A]. 

A.A) The coil inductance is constant near an operating point. 
Furthermore, the electromotive forces due to the differential of 
gap can be neglected. 

Then (3.2), we get 

r = Ri + L,j f . (3.3) 

The ideal mathematical model: TypefAl is represented by (3.2) and 
(3.3). 

ModelfAl]: In view of (3.2) und (3.3), we can obtain the linear 
model (3.4) 


2 kl 7 

= VI (X 4 XoY* 


ni( X 4 xq )■* 

R 


In this way, we deal with the deviation x and i as fixed numbers, 
at the second-order term in the Taylor series expansion and include 
them in the matrix A as Aar, At and A y. 

2) Ideal Mathematical Model~-Type{B]; For the ideal mathemat¬ 
ical model Type[Bl, we also consider the assumptions A.1), A.2), 
A.3) and here in addition to them, we introduce the next assumption 
A.B) instead of A.A). Using this assumption, we can obtain more 
accurate model than one of TypcfA]. 

• A.B) The coil inductance L is a function of a gap j, and written 
as follows 


L(j’) = V - = Lo (3.6) 

x 4 A c* 

where the coefficients Q, Xoc and Lo are also the constants deter¬ 
mined by identification experiments. For any given current i in a coil 
with inductance I, the magnetic co-energy is shown as ^ Li' 2 . Hence 
electromagnetic forces between the electromagnet and the iron ball 
in (3.2) is equal to the change rate of co-energy with respect to the 
distance .r, i.e.. 


= JL / 1 

D.c { 2 


f v j 1 1 ,2 0 L { j) 

l ' )f = 2 ' ST 




1 

C= 0 
0 


Model IA2]: We can further obtain another linear model (3.5) 


Comparing (3.2) with (3.7) 

Ax, — .r<>, Q = 2k. 
Then from (3.2), (3.6) and (3.8), we get 


A - rn( X 4 :* o) 


t A if 0 - 


= Q . c = o 
0 


A 4 Jo 

Sy = l- : ±Ar+±M. 


vi(X 4 Jo ) 2 

n 


2ki <lv , 


Now we obtained the ideal mathematical model: Type[B] which is 
constructed with (3.2) and (3.9). 

ModellBl]: From (3.2) and (3.9), the linear model (3.10) (as 
shown at the bottom of the page) is derived. 

Model [B2j Moreover, the linear model (3.11) can be derived, as 
shown at the bottom of the page. 

Thus, now we obtained four linear model structures: Model! AI ]. 
Model[A2], Model[Bl|, and Mode1|B2]. 


m( A' 4 Jo) 3 


^ 0 
A 4 .i‘o 

,2fc 4 lo (A 4 Jo) 


(A 4 Jn){2fr 4 Io(A 4 Jo)} 

1 ni 7 


C = 0 
0 


m ( X 4 Jo Y 2 
R( X 4 Jo) 

2k 4 Io(A 4 Jo) 


2kP 

rn(X 4 Jo) 3 


21/(1 - 2Aj4 AV) 

(X 4 x-a ) {2 k (1 — A j ) 4 l o (X 4 J o)} 


xU 


MX 4 .ro) 4 2k{ 1 — A r). 


C = 0 
0 


0 

2 kl 

m {X 4 Jn ) 2 U 

R(X 4 Jo) _ 

2A*( 1 — Aj) 4 Io(A* 4 Jo) 


(3. M) 
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TABLE II 
Perturbed Models 


Perturbed Model 

Model 

Structure 

Parameter Change 

model(la) 

Model[Al] 

k -+ kjto&x 

model(lb) 

ModelfAl] 

k kjmin 

model(2a) 

Model[All 

^_max 

model(2b) 

ModelfAl] 

•*0 -> 

model(3a) 

ModelfAl] 

R -* /?_max 

model(3b) 

Model! Al] 

R -> /?_min 

model(4a) 

ModelfAl] 

L -> L_max 

model(4b) 

Model[Al] 

L L _min 

model(5a) 

Model [A2J 

x' -* x'_max 

model(5b) 

Model|A2] 

x xlmin 

model(6a) 

Model|A2] 

/' /'_max 

model(6b) 

Model [A2 J 

f -+ f_min 

model(7a) 

Model[BlJ 

k -> k _max 

model(7b) 

ModellBl] 

k 

model(8a) 

ModellBlj 

*o -> \_max 

model(8b) 

Model[B1] 

^min 

model(9a) 

ModellBl] 

R -> R __max 

model(9b) 

Model [Bl] 

R -> R _min 

model(lOa) 

ModellBl] 

-■> A)_max 

model(lOb) 

ModellBl] 

^Ltnin 

model(lla) 

Model [B21 

x ' xlmax 

model(l lb) 

Model |B2] 

x' -» x'_min 

model(12a) 

Model! B2] 

T i'_max 

model(12b) 

Model [B2] 

r -> F_min 


B. Model Parameters 

To account for unpredictable perturbations in the model parameters, 
we will set the nominal value as well as the possible max./min. 
value of each parameter in every linear model. To obtain the possible 
max./min. value of each parameter, consider the steady-state gap .Y 
= 5,0 mm (nominal). Now let us perturb it with A' = 4.5 mm and A" 
- 5.5 mm (perturbed ±0.5 mm). And, for these cases, we measured 
the three sets of the parameter values. The results ol measurements 
are shown in Table II. 





Fig. 2. Uncertainty weighting. 


D. Modeling Unstructured Uncertainty 

To account for unstructured uncertainties, we should consider not 
only a nominal model but also a set of plant models in which 
the real system is assumed to reside. Considering only unstructured 
uncertainties, we get all unstructured uncertainties together into one- 
full block uncertainty. 

To estimate the quantities of additive model perturbations, we 
employ differences of gain between the nominal transfer function and 
the perturbed transfer function with only one parameter changed and 
the others fixed, where we did not consider that plural parameters 
change together. In such a way, 24 perturbed models have been 
employed. They are shown in Table II. With these notations, we 
can define the corresponding perturbed transfer functions G,j in an 
obvious way 

Xj := Gij - Gn om (1 < i < 12, j = n,6). (3.14) 

Frequency responses of these additive perturbations |A;/(,/x , )| are 
plotted in Fig. 2, with 24 dotted lines. Now let us consider the set of 
plant models. Here we assume the following form 

G := { G nom ± A rtt V</Jl add ■ |jA n <id||ao < 1} (3.15) 

in which the real plant is assumed to reside. All of the uncertainties 
are captured in the normalized, unknown transfer function A rtt /t/. 
It is natural to choose the uncertainly weighting W a ad as follows 
(shown in Fig. 2). Here it should be noted that the magnitude of 
the uncertainty weighting W,,dd covers all the model perturbations 
shown in Fig. 2 

_ 1.4 x 10 -r ‘(l + */8)(l ± .s /170)(1 + s/420) 
a ' ld ~ (1 + s/30)(l + s/35)(l + k/38) 

(3.16) 


C. Nominal Model 

We will form the nominal model using the simplest ModelfAl] 
structure and the nominal model parameter {X= 5.0 mm case). Its 
state-space form is then of the following form 


A nu m — 


0 

4481 

0 

0 

0 


0 

-18.43 

-45.69 


'1 

Dn oin = 0 . Gii orn — 0 

1.969 J L°_ 

And the corresponding nominal transfer function is 

-36.27 


G ii „m — 


[s + 66.94) (a - 66,94)(«-j- 45.69)' 


(3.12) 


(3.13) 


IV. Design 


A. Control Objectives 

Electromagnetic suspension system is essentially unstable. We 
must design a robust controller to stabilize the closed-loop system; 
furthermore, we would like to design a controller to maintain the 
performance against unpredictable disturbances and the uncertainties. 

Let us consider the feedback structure shown in Fig. 3. The box 
represents the set of the models; G of the real system. Robust stability 
requirement for the additive uncertainty can be evaluated using the 
closed-loop transfer function A'5, where 5 := (/ + GK ) -1 . Hence 
robust stability test for G € G is equivalent to 

11 Wadd r A ( / + Gnntn A ) 1 | |oo < 1 - 


(4.1) 






It is noted in Fig. 3 that we factor the uncertainty weighting as 
= W„m, x H„, w ,. where 

=1.0 x lO" 5 . 

_ 1.4 X (1 + a/8)(l -fa/170)(l +a/420) ^ 

" ,Wr (l + .s/30)(l = */35)(l-M/38) 


To reject the disturbances at low frequency band, the performance 
weighting function W pt ,/ is now chosen as 


H ln r f 


200.0 

1 + a/O.r 


(4.3) 


We also factor the performance weighting as \V ptr f = Wx 
W /n ,j r f where 

H p,, /, = 1.0 x l(T r \ H„„ fr = (4-4) 

In practical situation, however, we would like to achieve this perfor¬ 
mance specification for all the possible plant G € G. A necessary 
and sufficient condition for this robust performance is 


[I + GK JU <1. VG€ G. (4.5) 

Now the control objective is to find a stabilizing controller K which 
achieves the following two conditions 

• The closed-loop system remains internally stable for every plant 
model G £ G, 

• The weighted sensitivity function satisfies the performance test 
(4,5) for every plant G € G. 

The design objectives have been specified as the requirements for 
particular closed loop transfer functions with the frequency weighting 
functions Watt'd and 11 The above control objectives exactly fit 
in the /^-synthesis framework by introducing a fictitious uncertainty 
block Rearranging the feedback structure in Fig. 3. wc can 

build the interconnection structure shown in Fig. 4. 


B. ji -Synthesis 

We first define a block structure Ap as 


: ={ : Xm € C, X„ j 6 cj. (4.6) 


Next, consider a generalized plant /’ partitioned as 

. . P = 


Pit Pvi 
Pit Pvi 


(4.7) 



Obviously in Fig. 4, we can get a lower linear fractional transforma¬ 
tion Pi ( P. A’) on P by K 

Pi(P. A):=/>,, + P, 2 A'(/ - P n A')" 1 Pi. (4.8) 

Finally, robust performance condition is equivalent to the following 
structured singular value // test 

sup itjxATtiP- K) (ju>)) < 1. 

The complex structured singular value ft / \ p is defined as 

:== mi,, {Zr(A: A € K dot (/ - .UA) = 0} 

unless no S 6 A makes 7 — M A singular, in which case 
:= 0. In this case a matrix M in (4.10) belongs to C 2 * 2 . 

C. D-K iteration 

Unfortunately, it is not known how to obtain a controller A 
achieving the structured singular value test (4.9) directly. But we 
can obtain the lower and upper bounds of ft. Our approach taken 
here is the so-called D-K iteration procedure. 

The D-K iteration involves a sequence of minimizations over 
either I\ or D while holding the other fixed, until a satisfactory 
controller is constructed. First, for D — 7 fixed, the controller Ki 
is synthesized using the well-known state-space Hr* optimization 
method. Let P\ = P denote the given open-loop interconnection 
structure in Fig. 4, and fi(P. K) be the closed-loop transfer function 
from the disturbances w to the errors c. 

Then, solving the following control problem 

| \F\ ( P \, K \) ||nt, < 7i, 71 =1.3. (4.11) 

The problem (4.11) yields the central controller 7v i as shown in 
(4.12) found at the bottom of the page. 


(4.9) 

(4.10) 



-5.22 x 10 8 (* 4- 12.4G)(* 4- 30.0)(/»+ 35.0)(w + 38.0)(* + 45.C9)(.» + 66.94) 

(s + ().10)(* + 31.6 - ./5 12j(* + 31,6 + j5'.12)(* + 39.77)(* + 315,2 - jZ2M){s + 315.2 + j329.6)<* + 734.7) 


(4.12) 
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Fig. 5. cf and // plot of the first D-/v iteration. 



FREQUENCY (rad/s) 

Fig. 6. <r and // plot of the second D-K iteration. 

Here we try to assess robust performance of this closed-loop 
system using //-analysis associated with the block structure (4.6). 
The maximum singular value and // upper bound of the closed-loop 
transfer function Ti{P \. K \) are plotted in Fig. 5. It is noteworthy 
to point out that the peak value of the upper bound fi plot is not 
less than one. This reveals that the closed-loop system with this H nCJ 
controller Ki does not achieve robust performance condition. 

Next, the above calculations of ft produce a scaling matrix at each 
frequency. In this design, we try to fit the curve using a first-order 
transfer function. 

Now. let denote the new open-loop interconnection structure 
absorbing the scaling matrix D. This time, from the following ffao 
control problem 

||^/(ft,A\.)|U,<7^ y* = 1.0 (4.13) 

we can calculate the controller K* as found in (4.14) as shown at 
the bottom of the page. 

The maximum singular value and // upper bound of this closed- 
loop system are plotted in Fig. 6. Since the value of // is less than 
one in Fig. 6,, robust performance condition is now achieved. 

V. Experimental Results 

The designed controllers A'i and K> are continuous-time systems. 
To implement these two controllers with the digital controller, we 
discretized them via the well known Tustin transform. The controllers 
hi and K% are discretized at the sampling period of 45//s and 60//s, 
respectively. 


! 



I -5___ - .. . ■* . - ...L. --- 

0 0,2 0.4 0.6 04 Ul 

TIME (•) 

Fig. 7, Response to step disturbance with Ki (— 17.15N). 



0 0.2 0.4 0.6 0.8 14 

TIME (s) 

Fig. 8. Response to step disturbance with Ka (—17.15N). 



Fig. 9. Response to step disturbance with Kj (—34.30N). 

We succeeded in the stable suspension of the iron ball using both 
of the controllers A'i and l\ > ■ In the Section IV, robust stability 
and robust performance objectives were considered as the control 
problems. The obtained H 0c controller A’t achieves robust stability 
condition, and // controller A'a achieves not only robust stability but 
also robust performance specification. Hence, we will evaluate robust 
performance as well as robust stability of the closed-loop systems 
with responses against various external disturbances. 

There the disturbances are added to the experimental system as 
an applied voltage in the electromagnet. It is noted that there are 
four types of disturbances. Taking account that the steady-state force 
of the electromagnet is equal to 17.15 N, we added the following 
disturbance forces to the floating iron ball 

downward 17.15 N, downward 34.30 N. 

These disturbances are large enough to evaluate the robustness of both 
these two controllers. Experimental results are shown in Figs. 7-10. 

First of all, these experimental results in Figs. 7—10 show that 
the iron ball is suspended. Responses in Fig. 9 are vibrating ex¬ 
tremely, however, their vibration get on the decrease. This shows 
the closed-loop systems with both the controllers A'i and Ki remain 
stable against these disturbances. Comparing Fig. 7 with Fig. 9, the 
responses with A'i deteriorate extremely against relatively large 
disturbances. While in Fig. 8 and Fig. 10, the responses with the 
controller A' 2 maintain good transient responses against these distur¬ 
bances. Now we can see the following observation. 


—8.01 x 10 9 (# + 10,54 )(s-F15.75)U-I-30.0)(« + 35.0)(a+38.0) 

Kl ~ (« + 0.10)(s + 19.59 - j'5.32)(<t + 19.59 + j’5.32){* -+- 38.48 - j2.70)(s + 38.48 + j'2.70) 
(s + 45.69)(s + 6G.94)(* +169.6) 

(.»+ 176.6)(.« + 420.1 - j272.8)(* + 420.1 + j272.8)(a + 8180)' 


x 


(4.14) 








m 
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Fig. 10. Response to step disturbance with K 2 (—34.30N). 

• The closed-loop system with the // controller Ji ’2 achieves 
robust performance, while the closed-loop system with the 
#oocontroller A', does not. 

VI. Conclusions 

In this paper, we experimentally evaluated a controller designed by 
/i-synthesis methodology with an electromagnetic suspension system. 
We have obtained a nominal mathematical model as well as a set of 
plant models in which the real system is assumed to reside. With this 
set of the models we designed the control system to achieve robust 
performance objective utilizing //-synthesis method. 

First, four types of different model structures were derived based 
on the several idealizing assumptions for the real system. Second, for 
every model, the nominal value as well as the possible maximum 
and minimum values of each model parameter was determined 
by measurements and/or experiments. Third, a nominal model was 
naturally chosen. This model has the simplest model structure of 
all four models and makes use of nominal parameter values. Then, 
model perturbations were defined to account for additive unstruc¬ 
tured uncertainties from such as neglected nonlinearities and model 
parameter errors. Fourth, we defined a family of plant models where 
the unstructured additive perturbation was employed. The method to 
model the plant as belonging to a family or set plays a key role for 
systematic robust control design. Fifth, we setup robust perfonnance 
objective as a struetbred singular value test. Next, for the design, 
the D-I\ iteration approach was employed. Finally, the experimental 
results showed that the closed-loop system with the //-controller 
achieves not only nominal performance and robust stability, but in 
addition robust performance. 
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Parameter-Dependent Lyapunov Functions and the 
Popov Criterion in Robust Analysts and Synthesis 

Wassim M. Haddad and Dennis S. Bernstein 


Abstract — Many practical applications of robust feedback contro 1 
involve constant real parameter uncertainty, whereas small gain or 
norm-bounding techniques guarantee robust stability against complex, 
frequency-dependent uncertainty, thus entailing undue conservatism. 
Since conventional Lyapunov bounding techniques guarantee stability 
with respect to time-varying perturbations, they possess a similar 
drawback. In this paper we develop a framework for parameter- 
dependent Lyapunov functions, a less conservative refinement of “fixed” 
Lyapunov functions. An immediate application of this framework is 
a reinterpretation of the classical Popov criterion as a parameter- 
dependent Lyapunov function. This result is then used for robust 
controller synthesis with foil-order and reduced-order controllers. 


1. Introduction 

The analysis and synthesis of robust feedback controllers entails 
a fundamental distinction between parametric and nonparametric 
uncertainty. Parametric uncertainty refers to plant uncertainty that is 
modeled as constant real parameters, whereas nonparametric uncer¬ 
tainty refers to uncertain transfer function gains modeled as complex 
frequency-dependent quantities. In the time domain, nonparametric 
uncertainty is manifested as time-varying uncertain real parameters. 

The distinction between parametric and nonparametric uncertainly 
is critical to the achievable performance of feedback control sys¬ 
tems. For example, in the problem of vibration suppression for 
flexible space structures, if stiffness matrix uncertainty is modeled 
as nonparametric uncertainty, then perturbations to the damping 
matrix will inadvertently be allowed. Predictions of stability and 
performance for given feedback gains will consequently be extremely 
conservative, thus limiting achievable performance [ I |. Alternatively, 
this problem can be viewed by considering the classical analysis of 
Hill's equation (e.g., the Mathieu equation) which shows that time- 
varying parameter variations can destabilize a system even when 
the parameter variations are confined to a region in which constant 
variations are nondestabilizing. Consequently, a feedback controller 
designed for time-varying parameter variations will unnecessarily 
sacrifice performance when the uncertain real parameters are actually 
constant. 
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The above distinction can also be illustrated by considering the 
central result of feedback control theory, namely, the small gain 
theorem, which guarantees robust stability by requiring that the 
loop gain (including desired weighting functions for loop shaping) 
be less than unity at all frequencies. The small gain theorem, 
however, does not make use of phase information in guaranteeing 
-stability. In fact, the small gain theorem allows the loop transfer 
function to possess arbitrary phase at all frequencies, although in 
many applications at least some knowledge of phase is available 
f2]. Thus, small gain techniques such as theory are generally 
conservative when phase information is available. More generally, 
since = 1 regardless of the phase angle 0 , it can be expected 
that any robustness theory based upon norm bounds will suffer 
from the same shortcoming. Of course, every real parameter can be 
viewed as a complex parameter with phase angle 0 = 0 degrees or 
<j> ~ 180 degrees. 

To some extent, phase information is accounted for by means of 
positivity theory [3]-[15] which is widely used to model passive 
systems such as flexible structures [I6|. (17]. In this theory, a 
positive real plant and strictly positive real uncertainty are both 
assumed to have phase less than 90 degrees so that the loop transfer 
function has less than 180 degrees of phase shift, hence guaranteeing 
robust stability in spite of gain uncertainty. Both gain and phase 
properties can be simultaneously accounted for by means of the circle 
criterion [15], [181-[22| which yields the small gain theorem and 
positivity theorem as special cases. It is important to note that since 
positivity theory and the circle criterion can be obtained from small 
gain conditions by means of suitable transformations, they can be 
viewed as equivalent results from a mathematical point of view. The 
engineering ramifications of the ability to include phase information, 
however, can be significant [1]. 

The above discussion is further illuminated by means ol Lyapunov 
function theory in [15]. Specifically, as pointed out in f 151, a serious 
defect of conventional or fixed Lyapunov bounding theory is the 
tact that stability is guaranteed even if the plant uncertainty A A 
is a function of t. This observation follows from the fact that the 
Lyapunov derivative V(.r(/)) =■ Lj (.£ (/))(.4 + AA(t)).r(t) need 
only be negative for each fixed value of t 115], [23]. Although this 
leature is desirable if A A is time varying, as discussed above, it 
leads to conservatism when A 4 is actually constant. This defect can 
be remedied, however, by utilizing an alternative approach, which 
is consistent with Lyapunov function bounding techniques, based 
upon parameter-dependent Lyapunov functions. The idea behind 
parameter-dependent Lyapunov functions is to allow the Lyapunov 
function to be a function of the uncertainty AA. In the usual case, 
V(j ) = ,r 1 Pj , P is a single, fixed matrix, whereas the parameter 
dependent Lyapunov function Va i(j’) - r 1 P[AA).r represents a 
family of Lyapunov functions. 

The concept of a parameter-dependent Lyapunov functions is 
not new to this paper. Specifically, a parameter-dependent Lya¬ 
punov function of the form T(.r) = ,r 1 P (\], ■ ■ ■, A, ).r, where 
P( At, ■ • •. A t ) 3 = ]Tj_ A A,P ( , is considered in (24|. In this case the 
matrices P, correspond to the vertices of a polytope of uncertain 
matrices with vertices Ai . ■ ■ ■. A r . More recently, 125] considers a 
Lyapunov function with matrix P{(T\ . • * ■. rr,) = Pa -f , tr ,P ,, 
where Pa corresponds to the nominal system and the P t are “first- 
order perturbations” of Po . Numerical techniques are used to deter¬ 
mine P ; and the range of robust stability. Both |24] and [251 discuss 
potential advantages of parameter-dependent Lyapunov functions 
wer fixed Lyapunov functions. 

The goal of the present paper is to develop robust analysis 
md synthesis techniques that exploit the fact that the classical 
’opov criterion [26] is based upon a parameter-dependent Lyapunov 


function, Indeed, recall that the Popov criterion is based upon the 
Lur’e-Postnikov Lyapunov function 

Y+{jt) = * l Px + N f 0 (rr)d 0 - (1.22) 

Jo 

where y = C'.v and o(-) is a scalar memoryless time-invariant 
nonlinearity in the sector [0. k], that is, 0 < 4>{y)y < by*. 
Specializing to the linear uncertainty case o(y) = Fy , where 
0 < F < k, yields 

V, (.-■) = x l Px + X r Ftriltr = x'Px + NF^ 

Jo 2 

= 1 (P + ^ NFC' 1 C] x = x' P(F)x. 

This form appears in [10, pp. 84-89] and was discussed in the context 
of robust analysis in [15]. 

For practical purposes the form of the parameter-dependent Lya¬ 
punov function IV (.r) is useful since the presence of F restricts the 
allowable time-varying uncertain parameters [27). That is, if F{t) 
were permitted, then terms involving Fit) would arise and potentially 
subvert the negative definiteness of IV (>»’). 

This paper has four specific goals: 1) to provide a general frame¬ 
work for parameter-dependent Lyapunov functions; 2) to obtain a 
generalized multivariable version of the Popov criterion for linear 
matrix uncertainty AA (the classical Popov criterion is limited 10 
scalar or diagonal nonlineanties) along with Ha robust performance 
bounds; 3) to provide explicit uncertainty bounds for the multivariable 
Popov criterion in terms of a single Riccati equation that can be used 
for robust controller synthesis; and 4) to develop robust controller 
synthesis techniques based upon the multivariable Popov criterion 
with applications to full-order and reduced-order controllers. 

Not at ton. 

R, R’ x \ R' —real numbers, / x .s real matrices, R T * 1 

C, C y s . C —complex numbers, r x s complex matrices, C' x 1 

E.tr.0, x- —expectation, trace, r x s zero matrix, Kronecker 

product 

I, . () 1 , ()* —r x r identity, transpose, complex conjugate trans¬ 

pose 

()~ 1 . ()“* —inverse transpose, complex conjugate inverse trans¬ 
pose 

V.N'.P' —r x r symmetric, nonnegative-definite, positive- 
definite matrices 

Zi < Z 2 .Z i < Zi -Z,-Zi E N'. Z2-Z1 eP'.Zi.Z, 6 S' 
llZIli -[tr[(1/2 jt)J~ x 

II. Robust Stabii ity and Plrformanch Problems: Analysis 

Let U C U"*" denote a set of perturbations A A of a given nom¬ 
inal dynamics matrix A € R" x 11 . Within the context of robustness 
analysis, it is assumed that A is asymptotically stable and 0 € IP We 
begin by considering the stability of .4 + AA for all AA E IP 

Robust Stability Problem: Determine whether the linear system 

H*) = (A + AA).r(f), t € [0.x) (2.1) 

is asymptotically stable for all A A € IP 

To consider ihe problem of robust performance, we introduce 
an external disturbance model involving white noise signals as in 
standard LQG (Ha) theory. The robust performance problem concerns 
the worst-case Ha norm, that is, the worst-case overM of the expected 
value of a quadratic form involving outputs »(f) = Ex(t ), where 
E € R ty *", when the system is subjected to a standard white noise 
disturbance w(t) e R'* with weighting D 6 R ,,Xf/ . 
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Rohm Performance Problem: For the disturbed linear system 

i{t) = (A -f AAU(t) + Du>(t ), t € [0, oc). (2.2) 
5(0 = E*(t) (2.3) 

where ?/(*)' is a zero-mean d-dimensional white noise signal with 
intensity Id , determine a performance bound >1 satisfying 

J(U)=: Slip liuiNupEJIMOHii} < (2.4) 

A4fM CX 

tn Section VI, (2.2) will denote a control system in closed-loop 
configuration subjected to external white noise disturbances and for 
which ;(f) denotes the .state and control regulation error. 

Next, we express the H 2 performance measure in terms of the 
observability Gramian for the pair (.4 -f AA, £). For convenience 
define the n x « nonnegative-definite matrices /? = E } E.Y 
DD 1 . 

Lemma 2.1: Suppose .4 4- A A is asymptotically stable for all 
AA E U‘ Then 

J{14) = sup trPa 4 V = •*«!> IIG 44 MII 2 (2.5) 

A \€U A 

where Pa 4 6 R ,J x n is the unique, nonnegative-definite solution to 
0 = (.4 -f AA) 1 Pa/ i + P^.4 (A 4- A 4) 4- Ft (2.6) 
and Gaa(s) = E[sl -(.4 4- ±A)]~ l D. 

HI. Robust Stability and Performance via 
Parameter-Dependent Lyapunov Functions 

The key step in obtaining robust stability and performance is to 
bound the uncertain terms A.4 1 Pa 4 4- Pa 1 A A in the Lyapunov 
equation (2.6) by means of a parameter-dependent bounding function 
f2(P, A.4) which guarantees robust stability by means of a family of 
Lyapunov functions. This procedure corresponds to the construction 
of a parameter-dependent Lyapunov function which constrains the 
Class of allowable time-varying uncertainties. The following result 
forms the basis tor all later developments. 

Theorem 3J. Lei Iky. NT —► S" and Po: U —► $" be such that 

AA r P + PSA < Sl 0 (P) - [(.4 + A.4) 1 P 0 (A.4) 

+Po(A.4)(A + A.4)]. A 4 €f(,P £ N" (3.1) 

and suppose there exists P E 1ST satisfying 

(>= A'P + PA + Uo(P) + R (3.2) 


Proof: Note that in (3.1), P denotes an arbitrary element of 
NT, whereas in (3.2) P denotes a specific solution of the modified 
Lyapunov equation (3.2). This minor abuse of notation considerably 
simplifies the presentation. Now, note that for all AA € R nx ", (3.2) 
is equivalent to 

Q = (A + SA) r P + P(A+SA) + (MP)-(SA l P+PSA) + R. 

(3.9) 

Adding and subtracting (A + A.4) 1 Pn(SA) + Po(SA)(A + A.4) 
to (3.9) yields 

0 =(A + A.4) 1 (P + Po( A.4)) 4- (P + P„( A.4))(.4 + A.4) 

+ !2 a (P) - [(.4 + A.4) 1 P 0 (A.4) + P 0 (A.4)(,4 + AA)] 

- (AA 1 P+ PAA)+ P. (3.10) 

Hence, by assumption, (3.10) has a solution P 6 N T1 for all 
A4 E R r,x ". If A A is restneted to the set U* then, by (3.1), J7n(P)- 
((A +AA) 1 P,(AA) + P)(AA)(.4 + AA)] - (AA’P + PAA) is 
nonnegative definite. Thus if condition (3.3) holds for all A.4 E U, 
then Theorem 3.6 of |28) implies (.4 -|- AA. [ft 4- I2(P, AA) - 
(A.4 1 P 4- PAA)] 1/2 ) is detectable for all A.4 E U , where 

Si(P.SA) = rt ) (P)-[(4 + A4) 1 P«(AA)-f P,(A 1)(A + A4)]. 

(3 11) 

It now follows from (3.10) and Lemma 12.2 of [28) that 4 4- A.4 
is asymptotically stable tor all A.4 E U. Conversely, il 4 4- A 1 is 
asymptotically stable for all A A E U , then (3 3) is immediate Now, 
subtracting (2.6) from (3.10) yields 

0 =(.4 4- A.4) 1 (P 4* Pi(A 4) - I\ \) 4- (P 4- Po(A.4) - 1\ \ ) 
x ( 4 4- A 4) 4“ I2o( P) “ [( 1 4* A 4) 1 7n( A 4) 4- Pu( A 4) 
x (4 + AA)]-(AA'P + PAA). AUM 

(3.12) 

or, equivalently, since 4 4- A.4 is asymptotically stable for all 
A.4 E U 

P + 7’o( AA) - Pi t = /" r M+A i)l, [il(P.SA) 

J 0 

- (A4'P+ PAA)]. 1 '"dr 
>(U.4eM (3.13) 

which implies (3.5). The performance bounds (3.6) and (3.8) are now 
an immediate consequence of (2.5), (3.5), and (3.7) □ 

Note that with f2(P, A.4) defined by (3.11) condition (3.1) can 
be written as 


and such that P + Pq( AA) is nonnegative definite for all A.4 E U. 
Then 

(.4 + A.4, E) is detectable. A.4 E U (3.3) 

if and only if 

.4 4- A A is asymptotically stable, A.4 E U . (3.4) 

In this case 


Pa ^ < P 4- Po(A.4), AA E U 
where P&a is given by (2.6). Therefore 

J(U) < tr PV + sup tr Po(AA)V. 

A 4FW 

If, in addition, there exists P 0 E S n such that 
Po(AA) < Po, A A€U 


then 


J(W)<tr((P+/ s o)V]. 


(3.5) 

(3.6) 


(3.7) 

(3.8) 


A 4 1 P + PAA < I7(P, AA). AA € U- P € N" (3.1)' 

where f2(P, A A) is a function of the uncertainty A A. For con¬ 
venience we shall say that /?(■••) is a parameter-dependent 17- 
bound, which is consistent with |29). One can recover the standard 
guaranteed cost bound or parameter-independent 17-bound by setting 
Pq(AA) = 0 so that J?(P, A A) s f2o (P) and therefore AA 1 P 4- 
PA A < J7o( P) for all A A E U . Finally, since we do not assume that 
Po(0) = 0, it follows that f2o(P) need not be nonnegative definite. 
If, however, Po(0) — 0, then it follows from (.3.1) with A A = 0 
that f7 0 (P) > 0 for all nonnegative-definite P. To apply Theorem 
3.1, we first specify functions Ik )(■) and Po( ) and an uncertainty set 
U such that (3.1)' holds. If the existence of a nonnegative-definite 
solution P to (3,2) can be determined analytically or numerically 
and the detectability condition (3.3) is satisfied, then robust stability 
is guaranteed and the performance bound (3.8) can be computed. 

Finally, we show that a parameter-dependent f2-bound establishing 
robuststability is equivalent to the existence of a parameter-dependent 
Lyapunov function which also establishes robust stability. To show 
this, assume there exists a positive-definite solution to (3.2), lei 
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Pq:U —► N T \ and define the parameter-dependent Lyapunov function 
V&a(* ) * (P + Po(SA)).i. Note that since P is positive definite 
and Pq ( XA ) is nonnegative definite, * (. 1 ) is positive definite The 
corresponding Lyapunov derivative is given by 

Y*a(j) = - i 1 [F>o(P)-{A4 r P + /’A4 4(A4A4)' 

x P 0 (A,4) 4 Po( A 4)(A 4 A4)l 4 (3 14) 

Thus, using (3.1) it follows lhat \ & i( i) < 0 so that 4 + A 4 is 
stable in the sense ol Lyapunov Asymptotic stability follows from 
the invariant set theorem 

IV Construction or 

Paramctek-Dependeni Lyapunov Functions 

We now assign explicit structure to the uncertainty set U and the 
parameter-dependent bounding function 17(-. ) Specifically, lei 

Z/={A4eR" x " SA^B u FCa.r £/■[ (4 1) 

where F satisfies 

F C F = {F £ R ,,,nXmo 0<F<M} (4 2) 

and wheie Du £ IR" X "°, C u £ R r " (l x? arc fixed matrices denoting 
the structure ot the uncertainty, F € R"° XT "° is an uncertain 
symmetric matiix, and U £ R ,T, °* T,,u is a given positive-definite 
matrix Note that F may be equal to F, although, foi generality, 
F may be a specified proper subset of F For example, F may 
consist of block-structuied matrices F — block-diag(/ f j f|,/( 4 
fj I( F,) Note that it F ~blotk-diag(F| P , F„, (J ) 
and M = block-diag( M \, U, ,), then 0 < F, < l/,, i = 
1 /no Finally, we assume that 0 E F and M 6 F 
Next, we provide an equivalent characterization of the set F 
Lemma 4 V Let r £ S'and \f £ P"‘° Then FM ] T < F 
it and only it 0 < / < M 

For 14 given by (4 1), the parameter-dependent bound J7( . ) 
satisfying (3 12) can now be given a concrete form Since the 
elements A 4 in 14 are parameterized by the elements f in F, we 
shall write PuiF ) in place of /ii(A4) Finally, we define the sets 
A* and A*„,i such that the product ol the transpose of every matrix 
in A', (resp , An i) and every matrix in F is symmetric ^ respectively, 
nonncgdtive definite) by 

a' =^^ eR. x "° rx = \ 'F.r ef) 

and 

a;,, = e a; >o.FeF\ 

Finally, Lemma 4 1 of [301 implies that there exists // E N" 0 such 
that PX < p for all F € F 
Proposition 4 I Let A £ A" s and 

nr 1 - ,YCoflo) + ( V -1 - ’UoBo) 1 >0 (4 3) 

Furthermore, let 14 be defined by (4 1) and define F2u( )and Pq( ) 
by 

(MP) =(C’o + VCn 4 + Bi P)'[(3/“‘ - XCaBo) 

+ (4/-‘ - \Coflo) 1 ]' I (^+ VC„4 4flo'P) (4 4) 

and 

Po(F) = ClrXCn (4 5) 

Bien (3.1) is satisfied 


Proof Since by (4.3) ( M~ l - XC 0 B„) 4 (M~* - NCoDo )’ 
is positive definite and by Lemma 4 J F - FM~ l F is nonnegative 
definite, it follows that 

0 <[(C 0 + XCoA 4 D] P ) - [Ur 1 - XCaBo) 

+ (M~‘ - XCaBo ) 1 ]FCo] r [( - XCqBq) 

4 ( \r ] - XC'u Co) 1 ] - '[(Co 4 A r C,.4 + Di P) 

- [( M~' - XCaBo ) 4 ( M 1 - XCaBo ) 1 ]FC U ] 

4 2Ca [F - i \r'r]C u 

=Si 0 (P) - Co 1 F(C 0 4- AT,,.4 4 Z?J P) 

-ic2 + 4 , rn l A’ 1 +PBo)FCo + c„'r[(u-' -acoBo) 

+ (at' - \r„Bn) r ]PO. + 2r n , [P-P''/''P]c n 

=iiu(P) - Cat B t l P — PDqFCo 

- r ( ; FXC'n 4 - 4 1 C(J A' 1 FCn 

- c,l fxCoDqFCq - c,! fb,I ci v' rc„ 

=Siu(P) - [( 4 + XA)'Pa(F) + Po[F)(A + AA)] 

- [A4 I P + PA4] 

which proves (3 I) with 14 given by (4 1) □ 

Next, using Theorem 3 1 and Proposition 4 1 we have the following 
immediate result 

Theorem 4 l Let V £ A „a and assume (4.3) is satisfied Further¬ 
more, suppose there exists a nonnegative-definite matrix P satisfying 

0 = 4 1 P + r 4 4 (C 0 + VCn » 4 C,,' P) 1 [(J)/' 1 - XCaBo) 

4 or 1 - \CaDo) 1 ]-'(C , o4 Un l4P n r P)4i? (46) 

Then 

(4 +A LE) is detectable, A 4 £ U (4 7) 

if and only if 

4 -|- A 4 is asymptotically stable. A 4 £ U (4.8) 

In this case, if p £ N ,r, ° satisfies F Y < // tor all F £ F, then 

J(U) < IrKP+Cj/zCoH ] (4.9) 

Proof The result is a direct specialization ot Theorem 3 1 using 
Pioposition 4 1 with 7n(A4) = Ctf FA Cn Since by assumption, 
r\ > 0 for all F £ F, u follows that P + Po(F) is nonnegative 
definite for all F £ F as required by Theorem 3 1 □ 

Note thal asymptotic stabihly in Theorem 4 1 is guaranteed by 
the parameler-dependenl Lyapunov function \j(i) — r 1 (P + 

(<!f\Co)i 

Remark 4 I The condition F Y = ) r f,F f J is analogous to 
the commuting assumption between the D -scales and A blocks in p- 
analysis which accounts for structure in the uncertainty F Note that 
thcie always exists such a matrix V even if F £ F is not diagonal. 
For example, if F = Fi/„, 0 , where Fj is a scalar uncertainty, 
then X can be an arbitraly symmetric matrix. Alternatively, ll F is 
nondiagonal, then one can choose A T = X 0 / mo , where Vn is a scalar 
Finally, F and xY may be block diagonal with commuting blocks 
situated on the diagonal. Chatactcrizatton of the optimal multiplier 
V lor robust controller analysis and synthesis is given in Section VI 
Remark 4 2 Standard loop-shifting techniques [31] can be used to 
consider uncertainties with upper and lower bounds of the form Aft < 
F < Ah, where F £ F and A/]. M 2 € S ,Mu In this case, Proposition 
4 1 holds with F,.4, and M replaced by F - A/i,-4 H- DqAI)Cq 9 
and M 2 - Mi * respectively. Similar modifications can be made to 
Theorem 4 1 
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Next, we use results from positivity theory to guarantee the e^s- 

tenee of a positive-definite solution to (4.6). Let G{s) ~ p—- j 

denote a state space realization of a transfer function (7 (h), tiiafis, 
(7(h) s .4)"* 1 B + D. The notation denotes a minimal 

realization: 

Lemma 4,2 . (Positive Real Lemma / 4 ], [9]): 




G(.) [H-B] 


is positive real if and only if there exist matrices P. L , and IT' with 
P positive definite such that 

0= A'P + PA + L l L, (4.10) 

0 = pn-c' +l t w. (4.M) 

Q = D+D 1 -\Y'\V. (4.12) 


Next, we show that if D-l-D 1 > 0 then (4.10)-(4.12) yield a single 
Riccali equation characterizing positive realness. For the statement 
of this result recall that a square transfer function G{*) is strongly 


positive real if it is strictly positive real 1151 and D + D 

» ... A 3 / I A I I /~U .\ 11,111 f' * 1 ^ 1 'T'l_ 


Lemma 43 [14]. Let C7(s) 

strongly positive real if and only if D + t) 
positive-definite matrices P and R such that 


f—— 1 - 
k 1-^J. 


> 0 . 


Then G{s) is 
> 0 and there exist 


«= A r r+PA+iC-n'D 1 (D+D l r '(C-n l r)+n. (4.i3) 


We now use Lemma 4.3 to obtain a sufficient condition for the 


existence of a solution to (4.6). 


Theorem 4.2. Let 


,-»/ , nun 
t'(’S) ~ 

.4 

-D 0 

.Co + AT n.4 

.1/“ - AT'oZJo 


Then £(*) is strongly positive real if and only if there exist positive 
definite matrices P and R satisfying (4.6). 

Finally, wc specialize Proposition 4.1 and Theorem 4.1 to the case 
in which .V = 0 and M — D ( 7 1 , where Do 4- D jj > 0. In this case 
we have the following result. 

Proposition 4.2: Let Do £ R"' 0 * 7 " 0 be such that Do + Do > 0. 
Furthermore, let U be defined by (4.1) with M = D 7 1 , let PolF) = 
0, and define Uo( ) by 

Do(P) = (C\\ 4* D 0 r P) r (Dn + Do ) 1 (Co -F I)£ P). (4.14) 

Then (3.1) is satisfied. 

Since Po(F) = 0, the case N = 0 corresponds to a parameter- 
independent 12-bound. Hence, it follows from Theorem 3.1 that if 
there exists a nonnegative-definite matrix P satisfying 

0 = .4 , P+ PA + (Co + D'iP)'(D< l + a! ) ■ 1 (Co + Bo 1 P) + fl 

(4.15) 

then (A -|- AA.E) is detectable for all AA £ U if and only if 
A -F A A is asymptotically stable for all AA £ U. Furthermore, 
it follows from Lemma 4.3 that the existence of a positive-definite 
matrix P satisfying (4.15) implies that 

I it ] 

is strongly positive real. Hence the parameter-independent 12-bound 
(4.14) guarantees robust stability in the presence of positive real (but 
otherwise unknown) plant uncertainty. The situation is analogous 
to Ho. bounded real theory, which also depends upon a parameter 
independent f 2-bound, 


V. Connections to the Popov Criterion 

In this section we demonstrate connections between the parameter- 
dependent Lyapunov function obtained in Section 4 and the classical 
multivariable Popov criterion. Traditionally, the Popov criterion is 
stated for component-decoupled time-invariant sector-bounded non- 
linearities We state the Popov criterion for this case and then 
specialize to the case of linear uncertainty. Hence let 21/ £ R n, ° xm ° 
be a given positive-definite matrix and define 


£={?>: R ,n ° - R’"° : r (*)[.V'*(») - y] < 0. y e R m ". 

and o(y) = [«>i(t/i). Oiiy-i ).• • • )] 1 }. 

(5.1) 

If M = diag( M \, ■ • ■, 2l/„ l( ,) is diagonal, then the sector condition 
characterizing & is implied by the scalar sector conditions 0 < 
0,(y,)y, < ALyf* !h 6 R, / = 

Theorem 5,1. (The Popov Criterion) [151: Suppose there exists a 
nonnegative-definite matrix X = iliag( AV ■ ■ •, A\, l() ) such that 
3/~ t -F (/ + Xs)G{s) is strongly positive real, where G(s) 

^—|—~j. Then, tor all the negative feedback inter- 

connection of G(s) and </>(■) is asymptotically stable with Lyapunov 
function 


. ^ r ,J ' 

T'/,(.!) = X 1 Pi 4- 2 y I f\(<r) \\clrr. (5.2) 

, -1 

Next, we specialize Theorem 5.1 to the case of constant linear 
parameter uncertainty. Specifically, consider the system ,r{t) = 
(A 4- AA)j(t), where A 4 £ U and U is defined by 

U ={_U: J 4 = -DFC. F = diag( F,, F_.. ■ ■ ■ . ), 

0 <r, < T/,, i = 1 .- ■ ■. mo). 

By setting o(y) = Fy = FCjt Theorem 5.1 guarantees that 44 J T 
is asymptotically stable for all A A £ U. 

It has thus been shown that in the special case that F and 
2 Vare diagonal nonnegative-definite matrices. Theorem 4.1 (with Bo 
replaced by -Do) specializes to the Popov criterion when applied 
to linear parameter uncertainty. This is not surprising since the 
Lyapunov function (5.2) that establishes robust stability has the lorm 

i ^ r'- 

\) (a ) =- .»■ Pr + 2\] F t (ry, = (Co,r), (5.3) 


or, equivalently 

Vi (x) = ,i 1 P.r + ,r‘ Cj FAT',,,r = .r 1 P.i + £ F, N..r ' C,J Co.r 

I ~ J 

(5.4) 

which is a specialization of the parameter-dependent Lyapunov 
function considered in Section IV to the case of diagonal uncertainty 
F. The results of Section IV, however, allowed nondiagonal uncertain 
matrices F, which cannot be addressed by means of the nonlinear 
theory. Finally, note that the uncertain parameters F are not allowed 
to be arbitrarily time-varying, which is consistent with the fact that 
the Popov criterion is restricted to time-invariant nonlinearities. 

VI. Robust Controller Synthesis via Parameter-Dependent 
Lyapunov Functions: Fixed-Order Dynamic Compensation 

In this section we consider robust stability and performance with 
dynamic output-feedback controllers. For generality, the compensatoi 
dimension may be less than the plant order n. Define h = n + n t , 
where n« < n. 
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Dynamic Robust Stability and Performance Problem Given the 
nth-order stabihzable and detectable plant with constant structured 
real-valued plant parameter variations 

j(f) = (A + A4)j(/) + Uii(f) + D,iii(#),f > 0, (6 1) 

y(t) = Cj(t) + Dju(t) (6 2) 


Dynamic Auxiliary Minimization Problem Determine A € -Vnd 
and controllable and observable ( 4,, D, . C,) that minimize 

J( 4 ,,D,.C,.X ) = ir(P + Cj/iC d)V (6.9) 

where P E N" satisfies 


where u(t) E R r,, ,u(f) E H <1 , and y(t) E R*, determine an n.th- 
order dynamic compensator 

i ( (1) = 4 i,(f) + £ tV (f), (63) 

u(t) = C 1,(1) (64) 

that satisfies the following design entena 
0 The closed-loop system (6 1 )-(6 4) is asymptotiLally stable for 
all A 4 E U and 
n) Th£ performance functional 

./( 4, D, C ) = 

1 

sup Inn sup -E{ / [i 1 ()i?i <(s) + i/ 1 () /?* II (s )]d s} (6 5) 
A i- >- * J o 

is minimized 

For each uncertain variation A 4 E U, the closed-loop system 
(6 l)-(6 4) can be written as 


0 =A* P + r 4 + (Co + \ Cu 4 + P„ r P)' [( M~ l - NCoDo) 

+ ((!/-' - \CdPo)'] '(Co + ACo4 + D„ l P) + J? 

(6 10 ) 

Necessary conditions for the dynamic auxiliary minimization prob¬ 
lem will provide hxed-ordei dynamic output feedback controllers with 
guaranteed robust stability and performance The following result is 
required lor the statement of the main theorem 
Lemma 6 J [32] I et Q P be // x n nonnegative-definite matrices 
and suppose that rank QP = n Then there exist n x n matnees 
G, T and an n, xn, invertible matrix U, unique except for a change 
of basis in R M( , such lhat 

qp = 6' \ir ir.* = /„, (6 id 

Furthermore, the u x n matrices r = 6 1 1 and t± = /„ - r are 
idempotent and have rank n and n - n, respectively 
To state the main result of this section let P Q E R" *" and define 
the notation 


i(f) = ( 4 + A l)i(/)-h Du (f) />() 


(6 6 ) 


where 

J(f) = 

and where the closed loop disturbance Du(t) has intensity 


‘it) ■ 

4 ~ 

4 

DC 

A l = 

A 4 

On y H 


, *i — 

B,( 

4 


0,1 X II 


1 — f)D 1 where D — 


D,D 2 


1 = 


0 

n,\ 2 D,' 


= 


D ] D} 1,, = D,nl = til/ = D~_T)\ The closed-loop 
system uncertainty A 4 has the form A 4 = II n PC u where 

Oo = . Co = [CoOm„x ] Finally, if 4 + A 4 is 

Ufi, X rri o _ 

asymptotically stable lor all A 4 E U for a given compensator 
(4 D . C ,), then it follows from Lemma 2 1 that the performance 
functional (6 5) is given by 


J( 1 D , C ) - sup tr P^^ 

A \(14 


where ^ satisfies the n x n Lyapunov equation 

0 = (4 +A^)'P A1 +P^ ,( 4 + A 4) + P 

* here 


E = [Ei CiC’,], i? = E r F = 


P. 

0 


0 

C,'lhC 


I ?n =( If' 1 - V oOo) + ( 4/- J - NCoPo)' 

C =C(i + N C u 1, 4/ = 4 -|- Built ) 1 C . 

/?., ^iQ + n'Cp' ^ 'Po' \c»fl. 

p, =p 1 p + p'r l) ' v 7 /?„ '<r+ n,!P) 
r^r 1 ), ‘c \ r -QE + Dpitp'nir 

4y = 4 f + Du /?„ 1 Do P — ( / + Po P() 1 \ f d Pn 

Iheoitm 6 / Let w // assume P 0 > 0, and assume V E Ami 
Furthermore, suppose there exist n x n nonnegativc-dehmte matnees 
P Q P Q satisfying 

0 = 4 *■ P •+ P4, J + P, +C 7 Po‘(’ + PPoPo"'Bn l P 


(6 7) 


(6 8 ) 


- P, r /?JP, 4- nP.'pJP.rj 
n -(4/ +p„p,7 , p ( ![p + p])g 
+ y< 4/. + p„p a - | p 0 '[p + p]) r 
+ 4, -Qry+r.yryrl. 

0 = 41 P+ P 4+ PCi) /?„ 1 D,l P 

+ PjflJ. , P. ~rlP'R„! p,t ± 

0 = i y y + y 4^ + yry - r^yryri 

rank Q =rank P = rank QP =■ n, 
and let 4be given by 


(6 12 ) 


(6 13) 

(6 14) 
(6 15) 
(6 16) 


Next, we apply Theorem 4 1 lo controller synthesis Specifically, 
e replace the Lyapunov equation (6 8) tor the dynamic problem 
' ith a Riccati equation that guarantees that the closed-loop system 
' robustly stable Thus, for the dynamic output feedback problem, 
heorem 4 1 holds with 4,7?, V replaced by .4, /?, \ f This leads to 
<ie tollowmg problem 


4 ( =r[4 g -yr]G y, .p f =rQC l vr\c, = -pt/p.g 1 

(6 17) 

Then ( 4 + A 4, E) is detectable for all A A E 11 if and only if 
4 + A 4 is asymptotically stable for all A A E U In this case the 
performance of the closed-loop system (6 7) satisfies the bound 

.1(A ,, D, ,C.)< tr[( P + P)\ , + PQSQ + C,, 1 j/C'oV,] (6 18) 
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Proof; The proof i& constructive in nature and is similar to the 
proofs given in [14] and [33]. Specifically, first we obtain necessary 
conditions for the Dynamic Auxiliary Minimization Problem and 
then show by construction that these conditions serve as sufficient 
conditions for robust stabilization and provide a worst-case 
performance bound. □ 

Theorem 6.1 provides constructive sufficient conditions that yield 
dynamic feedback gains A< , B< , C r for robust stability and perfor¬ 
mance. When solving (6.12M6.15) numerically, the matrices M and 
N and the structure matrices Do and Co appearing in the design 
equations can be adjusted to examine tradeoffs between performance 
and robustness. Finally, to further reduce conservatism, one can view 
the multiplier matrix A r as a free parameter and optimize the worst- 
case Ffa performance bound J with respect to iV. In particular, setting 
QJ/dN s 0 yields 

O^MCoVCj + [(M~' - NC 0 Do) + (M- 1 -NCoBo) 1 ]-' 

x (Co + AToi + BjP)QA l C„ T + [(AT" 1 - A’C„Bo) 

+ (A/" 1 - NCoBo) 1 ]~ J (Co + A’CoA + BjP)Q(C 0 

+ nc 0 A + bJ p) 1 [u\r ' - xco Bo) 

+ (M~‘ - NCoBo)’'}- 1 B u ‘C'o (6.19) 

where Q satisfies 

0 =<i + BoRo' A' Co A + Bo Ro ' C'o + Bo Ro ‘ Do P)Q 

+ 0M + BoRo'NCoA + B 0 R„'Co + BoRq'bJ P) 1 + V. 

(6.20) 

By using (6.19) within a numerical search algorithm, the optimal 
compensator and multiplier X can be determined simultaneously, 
thus avoiding the need to iterate between controller design and 
optimal multiplier evaluation. 

Remark 6.1: Several special cases can immediately be discerned 
from Theorem 6.1. For example, in'the full-order case, set n t = v 
so that r = G = f =? /„ and n =0. In this case the last term in 
each of <6.12>-(6.15) is zero and (6.15) is superfluous. Alternatively, 
letting Dq = 0,C'o = 0 and retaining the reduced-order constraint 
w t < n yields the result of [32|. 

VII. Numerical Algorithm and Illustrative Results 

In this section we describe a numerical algorithm for solving 
the Riccati equation (6.10) along with the expression (6.19) for the 
optimal multiplier X. We also present numerical results for controller 
synthesis via an illustrative example. 

To synthesize dynamic compensators, we let // = [M d j - M\ )X in 

(6.9) and determine (A, . D, , C, , N ) to minimize J(A t *B ,, C , .V) 
subject to (6.10) with P € N". To do this we form the Lagrangian 

C(A, . B,, C,. A\ A 0) =<r[(P + Cn 1 (Afj - A/, )ATo)r 
+ {(.4 + B 0 A/iCo) 1 P 
+ P( A + J3oA/i Co) 

+ [Co + NC„(A + Bo Ah Co) + Bj P] 1 
■ Rn‘[Cu + NCi)(A + Bo Ml Co) 

+ dU>\ + R)Q] (7.1) 

where 

Bo = [(A/,-A/,)-' -A'CoBo]+I(A/ 2 -Af,)-' -A-CoBo]' 1 (7.2) 

and Q € R" x " is a Lagrange multiplier. The partial derivatives of C 
are then used in the search procedure. Note that the shifted version of 

(6.10) discussed in Remark 4.2 is used in (7. 1 ) to address uncertainties 
with upper and lower bounds of the form Mi < F < M?. 



Fig. 1. Three-mass system. 



Fig. 2. Performance vs robustness trade-off foi LQti and Popov controller 


A quasi-Newton search algorithm was initialized with X = 0 
and the LQG gains. For given values of the robustness bounds M\ 
and A/ 2 , the search algorithm was used to find 4 , and A* 

satisfying the necessary conditions. Afler each iteration. \f i and Mi 
were increased and the current design was used as the initial step for 
the next iteration. Since the optimal compensator and multiplier are 
found simultaneously, there is no need to iterate between control lei 
design and optimal multiplier evaluation. 

Consider the three-mass, two-spring system shown in Fig. 1 with 
m i = nt* = nit = 1 and an uncertain spring stiffness As. A control 
force acts on mass 3 while the position of mass 1 is measured resulting 
in a noncolocated control problem. The nominal dynamics, with stale 
variables defined in Fig. 1. arc given by 
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D -2 = [0 l],antU*i — A’ 2 upi.i = 1. The actual spring stiffness < 
the second spring can be written as A^ = Aram.m + AA- so tlv 
the actual dynamics are given by A{Sk) = A + Bo&kCu, wheie 
Bo = [0000-11] 1 and Co = [01-1000]. Furthermore, let 


Ex 


1 1 0 0 0 0 
0 0 0 0 0 0 


0 

1 









TRANSACTIONS ON AUTOMATIC CONTROL* VOL. 40, NO 3. MARCH 1995 



Two full-order {n, = /*) Popov compensators were designed. Fig. 2 
compares performance versus robustness trade-offs of the Popov 
compensators (dashed) with the normalized LQG design (solid). 
Fig. 3 shows the magnitude and phase of both a Popov design and 
the LQG design. Note that the Popov design robustifled the LQG 
controller notch by increasing both the width and the depth of the 
notch. 
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Design of L«Q Regulators for Slate 
Constrained Continuous-Time Systems 

Carlos B. T. D6rea and Basflio E, A, Milani 


Abstract —A new methodology to the design of L-Q regulators for 
continuous-time systems subject to Hnear state constraints is proposed, 
consisting of two parts. In the first one, the positive Invariance of the 
polyhedron defined by the constraints In the state space Is imposed, 
guaranteeing thereby that the constraints will not be violated. Further¬ 
more, the admissible constrained controllers are parameterized via the 
determination of the elements which are fixed in the state feedback 
matrix. This parameterization enables In a second part the L-Q regulator 
to be obtained from the solution of a parameter optimization problem 
Subject to linear constraints, for which It is proposed a specialized feasible 
directions method. 


I. Introduction 

In most of engineering applications, linear systems subject to state 
and input constraints are frequently found, since such constraints 
are generally associated to physical limitations in the course of the 
variables or to the validation domain of the linearization of nonlinear 
models. The controllers to be designed for such systems should 
therefore be capable to attain their performance specifications without 
violating the constraints. 

Due to its known attractive properties, a classical control systems 
design specification which is widely used is the optimal linear- 
quadratic (L-Q) regulation, which aims to achieve a compromise 
between the speed of convergence to the zero state and the magnitude 
of the input amplitudes necessary to that, by minimizing a quadratic 
performance index [1]. The problem of L-Q regulation of systems 
under constraints has been treated to present in the framework of 
optimal control theory by means of variational calculus techniques, of 
difficult application because complex, generally resulting, moreover, 
in an open-loop “bang-bang" control law (see e.g., [2]). Other 
techniques are based on the choice of weighting matrices for which 
the solution of the unconstrained optimal L-Q regulator problem 
results in a control law that respects the constraints [3], [4], Such 
techniques imply clearly in a limitation to the designer, since the 
weighting matrices cannot be freely chosen in order to satisfy 
performance requirements. 

The forthcoming of new techniques to the design of controllers for 
linear constrained systems has been propelled lately by the develop¬ 
ment of the theory of positively invariant polyhedra, that are domains 
from which the state vector trajectories cannot escape (see e.g., (5], 
[61 and references therein). The basic idea behind the techniques 
derived from this theory is to determine a control law that imposes 
the positive invariance of the polyhedron defined in the state space 
by the linear constraints on the state vector. In order to solve this 
problem, Castclan and Hennet [7], by interpreting geometrically the 
positive invariance conditions, have proposed elaborate eigenstructure 
assignment techniques. 

The aim of this paper is to develop a new methodology to the 
design of L-Q regulators for continuous-time systems under linear 
state constraints, applying to this end results from the invariant poly¬ 
hedra theory and parameter optimization techniques. This objective 
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is achieved in two stages, In the first one, by an appropriate change 
in the state-space representation of the system and the utilization 
of results from the geometric theory of control systems [8], the 
conditions for positive invariance are transformed into a set of linear 
constraints on the elements of state feedback matrices, which are 
thereby parameterized. As a consequence of this parameterization, 
the problem of positive invariance of an unbounded polyhedron 
with simultaneous closed-loop stability is split into two distinct ones 
for lower order systems: a stabilization problem and a problem 
of positive invariance of a bounded polyhedron. In the second 
stage, the original problem of slate constrained L-Q regulation is 
reformulated as a parameter optimization problem subject to the 
constraints defined in the first stage. To solve this problem, it is 
proposed a specialized feasible directions method which is initialized 
with a point obtained from the solution of an auxiliary linear 
programming problem. 

This paper is organized as follows. Section II contains preliminar¬ 
ies. In Section 111 the positive invariance of the polyhedron defined 
by the constraints is achieved together with the parameterization of 
the controllers. Section IV presents the design of the constrained L-Q 
regulators. An example in Section V illustrates the application of the 
proposed design and some conclusions are found in Section VI. 

II. Preliminaries 

Consider the linear, time-invariant, continuous-time system 

.r(t) — A.r(t) + Du(t) (1) 

where .r(t) E ft" is the state vector, u(t) E ft"' is the input vector 
and the system is supposed to be stabilizable. Under a state feedback 
control law, »/(/) = F.r(f), it can be written as 

.i-(o = M + tfn<(f). (2) 

Definition / [5/' A nonempty set 12 is a positively invariant set 
of system (2) if and only if for any initial state r(0) E il the state 
vector trajectory remains in 12. 

Assume now that system (2) is subject to symmetrical linear 
constraints, | —p < G.r(t) < p), where G E r < n is a 

full rank matrix and the elements p, of vector p E 4?' arc such 
that p, > 0. These constraints define in the stale space the convex 
symmetrical polyhedron 

S[c;, p] = {.!■ Ef/-/)< Gj < p] (3) 

where the inequalities are componentwise. For /• < », S[G\ p] is 
clearly unbounded. 

Proposition 1: The symmetrical polyhedron (see also |6]) S[G\ p] 
is a positively invariant set of system (2) if and only if there exist 
matrices H and K E x ', with H tJ > 0 for i ^ j and K,, > U 
for any kj such that 

(tf- K)G = G(A + DF) (4) 

(H+K)p< 0. (5 

Proof: Let i?[T, be a general, not necessarily symmetrica* 
convex polyhedron defined by 

B[T,jj] = {,/• E ft n /T.r < lj} (6' 
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where T € R x ' and m € H f , >0 In [5] and [6] u is shown 

than 1?[7\ u;] is positively invariant for system (2) if and only if there 
exists a matrix 7 G W T *\ with J , > 0 for / ^ y such that 

/r = T( 4 + DF) (7) 

/u, < 0 ( 8 ) 

The polyhedron 5 [G p] can be written as a generic one (6) One 
only needs to consider 





( 10 ) 


with ff , M , > 0 for / ^ ) and h , I , >0 for any / ) 
Substituting (9) (10) in (7) (8) one arrives to (4) (5) and two other 
ledundant equations □ 


III PoSHlVL INVARIANCLOI S[Cr p] 

This section is concerned with the obtention o( a proportional state 
leedback law // = f i tor which the symmetrical polyhedron 5[G p] 
is positively invanant and the closed loop system (2) is asymptotically 
stable C onsider then the following change of basis m the system (1) 


i =Qi (ID 

where (j t ft is an orthogonal matrix such that 

G - GQ = [0 G ] G € ft * (12) 

The new representations of system (I) and polyhedion S [(» p] are 
i “ 4 i -I- Du (13) 

l = g^=[| M !'*] 4 6 j? x (14) 

D - Q B - B n l O. 6 S ' (IS) 

S[C. ,)] = {| e R /-P<C,I <(>) (16) 


Proposition 2 The polyhedron S [G p] is positively invariant for 
system ( 2 ) it and only if 

1) There exists a matrix F\ £ ft 1 x [ } such that 

+ 7?*Fi - 0 (17) 

2) Theie exist matrices F* € ft * 77 and A € ft y , with 

77 ,, > 0 for / / ) and h„ > 0 foi any / j such that 

(77 - 7v )&2 = Gj( Lj + BaFJ (18) 


Consider the state feedback matrix m the transformed 

system (13) 

F=FQ=qF, Fj] Fjer*" (20) 

Us substitution together with 6 (12), 4 (J4) and 7? (IS) in (4) yields 

[0 (77 - A )G ] = [GA 4., -f TJiFi) 0 2 ( \ 22 f 7? 2 7 2 )] 

( 21 ) 

To satisfy this equation it is necessary that G 2 ( 4 i + B 2 F\) = 0 
Since G^ (12) has linearly independent columns this condition holds 
if and only it (17) holds Condition 2 may be directly verified from 
(3) and (21) □ 

Considering (17) satisfied it can be noted from (18) (19) that the 
original pioblem of positive invariance ol S[G p] for the system 
represented by the pair (4/3) has been reduced to the positive 
invariance of the bounded polyhedron S[G^ p\ for a lower order 
system represented by the pair ( 4.* D J This reduction becomes 
significative mainly tor large dimensions systems subject to few 
constraints since redundant equalities are eliminated 

It is easy to verity that condition (17> is equivalent to the (4 7?)- 
mvanance ol kti(G) (he null space of the matrix G result firstly 
obtained by Castelan and Hennet [7] This interpietation enabled them 
to apply results from the so called geometric approach to control 
systems |8] in the analysis of stabdinability and contiollability of 
state constiained systems and to reformulate the design problem as 
an ugenstructure assignment problem 

In this paper geometuc thcor) results will be applied in a quite 
different manner Suppose that (17) holds In closed loop one has 


4/ =4 + 777 


In f 7?i 71 4| fc + Z?i F 
0 4 +77.7 


( 22 ) 


The eigenvalues ol 4 r are therefore those of submalnces 4/ (l = 
In + 7?i Fi and 4/ — 4 + B 2 I 2 The positive invariance 

ol S[G p] implies certain spectral pioperties for 4 1 mm [S] [6| 
In particular if the inequalities (19) are strictly satisfied 4r is 
asymptotically stable lhe problem ol positive invariance of S[G p\ 
with closed loop stability can theielore be summarized as follows 

• rind 7 1 such that (17) holds and the eigenvalues of 4j i + B\ Fi 
have negative real parts 

• Find 1 2 such that (18) holds and (19) holds with strict inequal¬ 
ities 

The stabilization of 4/ n which represents the map of 4f 
lestneted lo ku(G) is dependent upon the maximal controllability 
subspace contained inkti(G) [8] here denoted as R* This subspace 
defines what can be controlled in ku(G) by matrices F that satisfy 
(17) An efficient and numerically stable method for its computation 
has been presented in [9] (see also [10]) It consists in finding 
orthogonal transformation matrices which change the coordinate basis 
of the system in such a way that the structure of the desired subspace 
is made evident In particular, it is always possible to compute 
changes of basis |9] 

u = Zu i = Qh i (23) 

such that the system (13) is represented as follows 


(77 + A )/i < 0 


(19) 


t = 4i + 77 h 


(24) 



m 
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Q'hAQr 


An .4i2 Aia 

Ai 1 A'*2 4'2;3 

A;*i A.12 4.13 


B - Q'nBZ = 


B\\ B 12 

021 0 
0 :J1 o 


where 011 € ft rx6 has fell column rank 


/ = Z'jFQh 


Fm Fa Fu 
Fat Frt F>;t _ 


ill €«•** (25) 


6 u 6 r xt (26) 


Fn e:#* x< (27) 


Any stabilization procedure may be used to solve Problem 1. 
Regarding Problem 2, a simple and efficient approach to handle it 
is to formulate a linear program (LP) having conditions (33), <34) 
as constraints. A scheme similar to that proposed in [12], where the 
objective function is chosen so as to increase the rate of convergence 
of the state to the equilibrium, cannot be applied here because there 
are no bounds on the gains of F\s. This difficulty can be removed 
by choosing as objective function to be minimized, for instance, the 
sum of the absolute values of the elements of Fia. The solution of 
Problem 2 can therefore be obtained from the following LP 

h r 


min 

-i.HJi.W 


« = 1 J = 3 


G = GQr = [0 0 63]: Ga eW*' (28) 



i,y 


7 C 0 

■ * 

11 
(— t 

3 

0 

ker(G) = 

0 /„-(,+,) 


0 


0 0 


where I n denotes the identity matrix of order 1 ? 

S[G. fj] = {.f*€ A?"/ “ P < Gz'< f> } ■ (30) 

Let F be the set of matrices F for which ker(G) is (.4,0)- 
invariant. Since 03 1 has linearly independent columns, the set F 
is uniquely defined by 

F=(F/[i 3 , .4s a ] + An[A, F, 2 j=o} (31) 


and in closed loop, for F £ F 


At = .4 + BF = 


Af 41 

0 

0 


•4*,, -4/,, 

Afri Ar- n 

0 -4/-:,, 


(32) 


where the eigenvalues of Af u may be freely assigned while those 
of Af 22 cannot be modified by any F £ F. If one considers the 
output equation y{t) = Gx(t), the eigenvalues of Ai 22 are the so- 
called transmission zeros (invariant zeros for some authors) [8], [101. 
Therefore, if any of them has positive or null real part, the positive 
invariance of S[G. p] with simultaneous closed-loop asymptotical 
stability cannot be achieved. In this case, a possible solution is to 
add adequately constraints to the states so as to assign only the 
eigenvectors corresponding to the stable eigenvalues of Af 2 . 2 to the 
null space of the new matrix G [7], [11]. 

Assuming henceforth that the eigenvalues of Af 2 . 2 have negative 
real parts, it can be observed from the representation (24)-(31) that 
the original problem of positive invariance of S(G\ p\ with stability 
can be split into two distinct problems as follows: 

Problem 1: Given Fn and F 12 fixed by (31), find F 21 such that 
the eigenvalues of An -f B\\Fu 4- 012/21 have 
negative real parts. 

Problem 2: Find F\.\ € such that there exist H and A' 

which satisfy 

(H - A )C»3 = G.h(4;v 3 4* 0,s 1 Fi 3 ) (33) 

(0 -f A>< 0 . .■ (34) 

with Hr, > 0 for i ^ j and K tJ > 0 for any /, j. 


sub.to : (H — A )Ga = Cn(A 33 4* 0a 1 / 13 ) (35) 

(0 4- K)p < — rrp 
F\ a 4- IT > 0 


H u > 0 for i ±j\ K rj > 0: W tJ > 0. 

In view of the constraints involved, it can be easily verified that the 
minimization of the objective function yields IT,) = 0 for F\ 3 ,, > 0 
and Wjj = —Fn ■ f° r /is,, < 0- Therefore, at the optimum, 
4- 214 ’,j = |F,a 0 -|. 

The small positive number <x must be chosen so as to guarantee the 
strict satisfaction of inequalities (34) as well as to avoid an excessive 
proximity to the origin of the eigenvalues of (.433 4- 0;n/n). In 
fact, it can be easily shown [12] that —a defines an upper bound on 
the real part of such eigenvalues. 

Before continuing, it should be remarked that the preceding devel¬ 
opment has been motivated by the case r ■= rank(G) < j?. When 
v — n the polyhedron S[G, />] is bounded and the changes of basis 
(11), (23) are not required. In this case, a stabilizing F for which 
S[G\ p] is positively invariant may be directly obtained from the 
resolution of (35) using the corresponding matrices of system (2) and 
constraints (3), in their original basis. The same is applicable to the 
L-Q regulator design, to be treated in the next section. 

IV. Constrained L-q Regulator Design 

The resolution of Problems 1 and 2 defined in last section provides, 
if possible, a controller such that 5[G. p] is positively invariant and 
the closed-loop system is asymptotically stable. In a typical control 
system design, however, performance specifications are also included 
and among them the optimal L-Q regulation is widely used. Consider 
then the quadratic performance index 

J[u(t)) = E |y [.r(t)' S[Siu:(t) 4 - u{t) f S 2 u(t )]<//j (36) 

where E is the mathematical expectation operator; 5 1 and 5a arc 
constant real matrices; S 2 is positive definite; .t o = ,r(0) is a random 
vector with E{.r 0 ) = 0 and E{ro.ro} = .Yo-Yo. 

The objective of the state constrained L-Q regulator problem to be 
solved here is to find a state feedback law u — F.r such that S[G, p) is 
positively invariant and the performance index (36) is simultaneous!} 
minimized. Consider then the changes of basis (11), (23) which lead 
the system to the form (24)-(31). Considering also the state feedback 
law ft — F.r, the index (36) may be rewritten as 

(37) 


j(p) 3 = e< / + F t s2p]mdt 
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where Si ~ Sj {QQn), Sj = Z'S^Z, E{ i 0 } =0 and E{ 1 tJ j 0 } = 
(QQH)^Q^o(QQfiY = Y ( )\o The performance index ,7(F) is 
given by f 11] 

J(F) = T» ( \/) \ (1 7 ) 08) 

( 4 + BF)'T + T( 4 4- BF) + Si S, + F'S F - 0 09) 

We are now in position to formulate the state constrained L Q 
regulator design problem 

mm 7(F ) 

1 fi h 

mb to (H — A Kh = ^ ( 4 u -f ^ 11 Fi j) (40) 

(II+h)p < 0 

r e f h , > 0 f 01 / ^ / a y > 0 

Matrices f € F (11) have hxed and free elements Let o be a 
vector composed by the tree elements of F € F togethei with the 
elements of H and A The problem (41) can therefore be rewritten 
as the following standard linearly constrained minimisation problem 

111111 /(o) (41) 

sub to ( o < i (42) 

Without considering the constraints (42) nun 7(n) (41) coire 
sponds to a parameter optimization problem in L Q regulators subject 
to structural constraints For solution of this kind of problem Dorca 
and Milam |13] proposed a specialized hybrid descent optimization 
procedure composed by a sequence of steps of Modified Newton 
Newton s and Quasi Newton methods Fur obtaining a good trade 
off between the cost per step and the convergence rate the methods 
are chosen along the descent process considering their expected 
computational effort per step an evaluation of the convergence rate 
of the descent sequence and the proximity to the optimal point 

To solve the parameter optimization problem with linear constraints 
(41) (42) the method proposed in [ 13| is transformed into a feasible 
directions optimization method [14] described in the sequel 

Assuming 

VJ(a) gradient vector of /(a) 

77(a) hessian matrix of 7(o) 

/A(n) positive dehmte approximation to 7/(n) 

77 r /(n) matrix obtained from the diagonal scaling of H(o) 

fulfilling the secant condition of Quasi Newton methods, 

as defined in [11], the following active set based procedure ] 14] is 
proposed to solve the constrained minimization problem (41), (42), 
Parting from a feasible point 

1) Initialization 

• Define the tolerance f 

• Set 

l = 0 Hi = Oy 

2) Active set computations 

• Construct C\, the active constraints matrix composed 

by the lines C, of C (42) such that 

C iQ_i = 


3) Computation of the projection matrix 

• Compute the orthogonal matrix \ by performing the 
following QR factorization [14] 

<4 =W=[t, VjK 1 

where the matrix l \ has full row tank 

4) Test for convergence 

• Compute the reduced gradient vector 

'y(a/) ~ 1 

• It ||</(a,)|| > c go to step 5 Otherwise compute the 
Lagrange multipliers vector A by solving the system of 
equations 

l\ \-«i'V/(rt,) 

• If \ > 0 STOP and lake n ( as the solution 
Otherwise eliminate the low of C 1 correspondent to the 
most negative component of A and return to step 3 

5) Computation of a feasible search direction 

• Based on the directions proposed in [13] select the 
descent method and define a positive definite matrix 
I (o/) considering ihe following alternatives 


A(n/) 

( H {Of) modified Newton 
=■ < H{<_ i t ) Newton s 
[77/(0,) Quasi Newton 

• Compute 

W(n,) - 1 )\ . 

• Solve the 

linear system of equations 


U(n,)p ; - -(j{n,) 

• Compute 

dt - 1 i‘, 

6) Solution update 



Find a step size -) such that 


<j_i j. | ~ [±i — l lt ls feasible 
7(h h i) < 7(<jj) 

• Set 

/ — /-hi 

• Return to slep 2 

Remat k? 

1) The existence of -) m step 6 is assured by the positive 
definiteness of matnees £( 0 .,), il/(o,), m step 5 [15] Due 
to the non convexity of /(a) it is not possible to assure that 
the attained minimum point in step 4 is the global one To 
the author’s knowledge, a convex programming approach to 
problems with the structural constraints present in (41), is not 
available to date 

2) The optimal state feedback matrix F* in the original basis (2) 
is obtained from the optimal solution F(a*) by 


F* = ZF(c±)(QQr)' 



5*8 
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v. Numerical Example 

The following example has been borrowed from [5]. Consider the 
system for which 



■ 9.10 

0.47 

—6,33 


1.82 

3.61“ 

.(■ 

7.62 

0 

7,56 

D = 

1.24 

—3.77 


2.62 

-3.28 

9.91 


-4.91 

0 


The state constraints are defined by 


‘5.69 

1.97 

-1.68' 


■ 8.75 ■ 

_2.24 

- 1 .C 8 

5.59 

P = 

10.50 


The system is transformed into the representation (24)-(31) yielding 



’ 1.7936 

1.8674 

-4 = 


11.1206 


-1.6275 

1.1278 

f-1.1843 

-4.1421' 


B = 

3.4836 

G = 


[ 4.6808 0 


12.9823 

-2.1901 

6.0957 


0 6.2514 0.0017 
0 0.0055 6.2520 


The matrices F € £ are given by 



0.3477 

1.0678 


.V 

A’ 


V 

V 



Fig. 1. Trajectories in projection (C7.r)i x (G.rh for the constrained (..) 
and unconstrained (-) L-Q regulator. 


for existence of such controllers are directly verified from structural 
properties of the matrices of the system at each step of the design. A 
specialized feasible directions method has been proposed to solve the 
resulting constrained parameter optimization problem. The proposed 
design makes no assumptions, other than the usual ones, on the 
weighting matrices and is very simple when compared to other 
approaches available in the literature. 


where the elements X are free to assume any value. For such 
matrices, it can be readily verified that the closed-loop system will 
have -3.0399 as a fixed eigenvalue. 

Using 5 1 = h; S 2 = h\ A’oA'o -- ~I:\ and starting from the 
solution of the LP (34) with a = 0.1, the resolution of the parameter 
optimization problem (42) yields 

J(F') =.4.1255 


F* 


0.3477 -2.5595 -4.6649 
1.0678 -7.4796 -1.9137 


In the original basis, 


-0.2561 -1.0943 3.2597 

-8.7191 -0.3818 -1.0603 


The optimal solution of the unconstrained L-Q regulator problem 
is .V = 1.1228. However, for ;r(0) = [0.6161 5,4503 3.2695]', 
it can be seen in Fig. 1 that the state vector trajectory resulting from 
this solution violates the imposed constraints. 


VI. Conclusion 

In this paper a new approach has been proposed to the design 
of L-Q regulators for state constrained continuous-time systems. 
The design problem has been stated as a parameter optimization 
problem subject to linear constraints. The respect to the constraints 
has been guaranteed by achieving closed-loop positive invariance 
of the polyhedron defined by them. By appropriately changing the 
state-space representation of the system, the positive invariance of 
an unbounded polyhedron with simultaneous closed-loop stability 
has been achieved by solving two distinct problems for lower order 
systems; a stabilization problem and a problem of positive invariance 
of a bounded polyhedron, which can be solved by properly defining 
a linear programming problem. Moreover, the admissible constrained 
controllers have been parameterized by. means of the determination 
of the fixed elements in the state feedback matrix. The conditions 
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The Finite Inclusions Theorem 

Richard D Kaminsky and Theodore E Djaieris 


Abstract —This paper presents a novel, necessary and sufficient condi¬ 
tion for a polynomial to have all its roots in an arbitrary convex region 
of the complex plane. The condition may be described as a variant of 
Nyquist’s stability theorem; however, unlike this theorem It only requires 
knowledge of the polynomial's value at finitely many points along the 
region's boundary. A useful corollary, the Finite Inclusions Theorem 
(FIT), provides a simple sufficient condition for a family of polynomials 
to have Its roots in a given convex region Since FIT only requires 
knowledge of the family’s value set at finitely many points along the 
region's boundary, this corollary provides a new convenient tool for the 
analysis and synthesis of robust controllers for parametrically uncertain 
systems. 


I Introduction 

With regards to linear time invariant (HI) systems there are 
now two well established ways to model system uncertainty cithei 
parametucally or through 7f x norm hounded classes of transfer 
functions In the former case which wc arc mainly interested m 
here both robust stability and performance questions often reduce to 
showing that a certain parametric family of polynomials (or matrices) 
has its zeros (eigenvalues) in a given icgion of the complex plane 
As engineers we would like to have effluent numerical methods tor 
verifying this inclusion While many eood methods and interesting 
results have come to light our optimism in seeking a general solution 
to this problem especially one that will permit us to conveniently 
synthesize robust controlleis must be tempered by the fact that 
there are certain fundamental mathematical difficulties in relating a 
polynomial ^ coefficients to its zeros 

Many it not all—of the major results connecting a polynomial s 
coefficient with its zeros can be derived from the argument principle 
(aka Nyquist s theorem) Among them the tact that a degree 
n polynomial has precisely n zeios (counting multiplicities) the 
algebraic root clustering cntena of Roulh Hurwitz Schur Cohn 
and Juiy the Hermilc Biehler theorem and Kharitonov s theorem 
The purpose of this paper is lo present a novel variant of the 
argument principle which is useful in solving robust parametric 
control problems 

Recall that Nyquist s theorem (1 e the argument principle) states 
Ihe following Let P C C be an open simply connected set 
/ D —► C be a function analytic on P except lor finitely many 
poles, and I C P be a counterclockwise, closed rectifiable, Jordan 
curve on which / has ncilhei poles noi zeros Furthei let \ and 
\ be the number of zeros and poles of / inside T and let A be the 
net number of Counterclockwise encirclements about the ongm made 
7 the curve f(Y) Then A - % = A 
As a numerical method for determining pole/zero locations 
Nyquist's theorem is flawed because it requires evaluating ){~) at 
he infinitely many points Y In practice, of course we often choose 
initely many points («a) c I and from a plot of {/(sa)} make an 
ducated guess as to the number of counterclockwise encirclements 
' that f(T) makes about the origin Despite the room for error here 
fins procedure usually works quite well, yet it laises the question, 
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I ig 1 A closed Joidan curve with convex mlenoi 


1 Can \ ever be rigorously determined from some finite set of points 

The mam tesult of this papei is that when / is a polynomial 
of degree <n and the interior of l is convex, one can determine 
whether or not \ =- A - u from an appropriately chosen set of 
points \j( si <)} This result the hniie Nyquist theorem (Theorem 1), 
leads natuially to a very useful sufficient condition lor a family of 
polynomials lo have all Us zeros inside Y namely the finite inclusions 
theorem (Theorem 2) 

Section II derives the finite Nyquist theorem and Section 111 derives 
the finite inclusions theorem The case of Hurwttz stability was dealt 
with in 11 ] hor an application of these results to robust control 
analysis and synthesis problems see 12] the complete version of this 
paper 


II Thf* Hnh i Nyquist Thiorem 

Let us introduce some notation L el C R R+ and Z be the sets of 
complex real positive real and integer numbers respectively Let arg 
- be the argument of - £ C\ {()( and Arg the principle argument 
of ^ t C \ {0} with range (— zr tt] Let ,* denote the complex 
conjugate of 6 C ext I the exterior of a closed Jordan curve Y, 
int T the interior ol a closed Jordan curve Y coS the convex hull of 
S and exi Q the set of exlreme points of a convex set Q 
Nyquist s theorem states that an nth degree polynomial p has all Us 
zeros inside a counterclockwise closed Jordan cuive T if and only 
if the point ;>(s) revolves a net 27 t;i radians counterclockwise about 
the origin as s traverses Y An inconvenient leature of Nyquist’s 
theorem is that p(s) must be evaluated at the infinitely many points 
T The following novel theorem slates that when int Y is convex, 
p{s) need only be evaluated at finitely many points After this paper 
was accepted for publication it was brought to our attention that a 
similar result was stated in [4] 

Theoiem I-Finite Nyquist Theoiem Let p( s) = 
where n > 0 and a, G C\ and let Y C C be a closed Jordan 
curve such that int Y is convex (see Fig 1) Then, p is of degree n 
(i e , n N ^ 0) and has all its zeros in int Y if and only if there exist 
m > 1 angles 0* 6 R and a counterclockwise sequence of points 
s* € I\ 1 < A < w, such that 


|0A + 1 “ 0k | < 7T 

(la) 

1 

3 

A 

(lb) 


(1c) 

**rg s*) = (mod 2tt ) 

(Id) 
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Fig. 2. A closed Jordan curve with convex interior. 

Proof: The proof requires several preliminary definitions. First, 
Jet I * [0, 1) and define the total variation [3, p. 328] of a function 
/: I —>■ R to be 

n — 1 

V / = sup V |/(.r,+i) - /(j-j )|. 

«>> 

0<X|<X2<■■-■< , Tn<l J 

Intuitively, V / is the total “vertical distance” through which / 
oscillates on J. Note, V / is a nonnegative (possibly infinite) quantity, 
and by the elementary inequality |o + 4| < |«| + |3|, n, J e R , 
we have: 

Lemma 1: For arbitrary /, (j: J —► ft, V( / 4- g) < V / + V g. 

Second, let us parameterize the closed Jordan curve F (see Fig. 1) 
by a continuous function F: / —► C where F(0) = s i. For 
convenience both the curve and function will be denoted by T, and 

the left-hand limit liin ( _j - T(/) will be denoted by F(l“). The 

above adjective closed refers to the property F(0) = T(1' ) while the 
adjective Jordan refers to the property T( .i*) Hi/) (i.e., T does 
not intersect itself). Pfote, an intuitive—but difficult to prove—fact is 
that any closed Jordan curve F partitions the plane into two regions: 
a bounded, open region int F and an unbounded, open region exi Y. 

Last, let us define the notion of a phase function 6: / —► R which, 
simply put, describes the phase of the first degree polynomial s - : 
as s traverses F. Specifically, for £ F define 0 :1 —> R to be any 
continuous function such that 

0 (t) = arg(F(f) - -)(mod2 tt). (2) 

For z € F \ {r(0)} lake this same definition but at t G (0,1) 
where F (t ) — 2 = 0, only require 0 to be right continuous and 
lira,. 6(f) - Mt ) € [0, 2tt). This minor complication arises, 
of course, because (2)'s right-hand side is undefined at t =s t . For 
technical reasons 0 will be left undefined in the c = F(0) case. 

From Fig, 2 it should be apparent that when int F is convex at least 
One phase function 0 —and actually, infinitely many phase functions 
differing by multiples of —exists fot any given z ^ r(0). Further, 
0 will have certain properties that depend only on whether ; is inside, 
on, or outside T. 

Case J: z 6 int F. Here 0 increases continuously and monoton- 
ically by 2?r (consider in Fig. 2). Hence, V 0 = 2n and 

6(1" ) - 6(0) = 2tt. 

Case 2 : c € F \ {F(0)}. 0 increases continuously and mono- 
lonically by some amount 0o < tt except at the point t G(0, 
1), T (t ) - z = 0, where it makes a downward jump of height 
0o (consider z *= zo in Fig. 2). Hence, V 0 = 20o < 2tt and 
6(F ) - 6(0) 0. 


Case 3: 2 € ext F. 6 increases continuously and monotonically by 
some amount 0+ < ic over a portion of the interval 1 (consider 2 = 
and the counterclockwise B A portion of F in Fig. 2) while on 
the other portion it decreases continuously and monotonically by 0+ 
(consider the A B portion of T). Hence, V 6 = 20+ < 27r and 

6(F) - 6(0) = 0. 

Now, from these observations we can prove the “if* part of 
Theorem 1. Let p be of degree 0 < d < n and have (not necessarily 
distinct) zeros {h}; so 

n d 

p(g)==Oft 

>=0 A'= 1 

Substituting F(f) for h and letting o k be an arbitrary phase function 
associated with z k - , we see 

rf 

argp(r(/)) = arga ( / 4 ^ arg (F(/) - z k ) 

1 

= $(t KllJClfl 2 7T ) 

d 

where $(/) = Arg a ,/ -f Y ^Qk(t). 

A;-:. 1 

Lei 0 = t^ < t 2 < ■ ‘ • < tjn < 1 be such that F(U ) = 1 < 

k < m , and note (lc) implies {F(^ ) } H ] = 0; so, (Id) 
implies for all k and some Ca £ ) = 0k -1- 2 7rC a . Thus, 

$(]~) = $(0) + 27tA' ~ ^1 -f 27r(A T -H Ci ) where A" is the number 
of zeros z k . lying in int F. Now, observe 


-1 


= 27T?7 4 — 0,„ 

4 0k+ 1 ~ 0k 

k—\ 

(3) 

< + 9i - 0 ,„ 

#ii — 1 

'1 4 ^2 “ 0k | 

(4) 

< |27T#7 4 0\ — 0 m 

1 + 2jt(.V + Cl - Cm - »)| 


+ Yi - 

k- 1 

0 k 4 2 7T ( C A 4 - 1 Ck )| 

(5) 

= |<P(1") - $(l„, 

in- 1 

)i+ Y 

A —1 

(6) 

1 / 

< V4> < Y v 

1 . — 1 


< 27r d < 2mi 


O) 


where in going from (4) to (5) we used (la), (lb) and the fact 
n, A r , and Q are integers. Since the left-hand side of (3) and 
right-hand side of (7) are equal, we would have a contradiction il 
the inequality between (4) and (5) were strict. Hence, necessarily 
N + Ci ~ Cm - « = 0 and Ca + i - Ck = 0, 1 < k < w. Summing 
these equations shows N = .??. Thus, as we set out to prove, p is o 
degree u and has all its zeros in F. 

It now simply remains to prove the “only if’ part of Theorem 
Let p be of degree n and have all of its zeros in int T. If 11 = » 
trivially p(») = o 0 ^ 0, and Theorem 1 is satisfied for m = 

= Arg oo, and any »\ € F. So, suppose n >0. By Nyquisi 
theorem, the point p(s) will revolve 2 nn radians counterclockwis 
about the origin as s traverses F. Hence, if we choose m = 2 n 4 
and 0k = 7r(l - l/3n)(k - 1), 1 < k < m, by the continuity ‘ 
pU) we will be able to determine {»* } satisfying the theorem. A 
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interesting point here is that m = 2n.+1 is the smallest m for which 
the 1 theorem can be satisfied. Q.E.D- 

With little effort Theorem 1 and its proof can be generalized to 
certain, nonconvex, simply connected regions as follows. Suppose T 
is an arbitrary closed Jordan curve and u > 0 a constant such that for 
each z ^ f(0) there exists a phase function (p with V <j> < v (note, 6 
may not exist if i € T and in some neighborhood of T “oscillates 
too rapidly”). When int T is convex, we saw there always exists a d> 
with VoOs; 2k. When int T is not convex, however, we must 
take v >2 k. Now, to make the above proof go through, the right- 
hand sides of (lab) must be replaced by some k > 0 where w, v. and 
u satisfy the inequality vn - 2 tn < 2n-2h\ For then, noting that the 
right-hand side of (7) is vn, we must conclude A r + Ci - Cm - n = 0 
and (k-tx Ck = 0, 1 < k < w, to avoid making the gap between 
(4) and (5) so large as to yield a contradiction. As before, summing 
these equations shows N = w. 

The above inequalities imply n < (2k - 2 k.) /[v - 2k) and k < n 
(when n > 0). So, generalizing Theorem 1 in this way requires placing 
an upper bound on the degree n of p (perhaps requiring it = 0) 
and making the right-hand sides of (lab) strictly less than k. In 
our opinion, these conditions are rather ugly, and so, as matter of 
aesthetics, we chose to state Theorem 1 for only convex regions. 

Note, a more useful generalization of Theorem 1 is to unbounded 
convex regions. This is readily accomplished by approximating a 
given unbounded region by a suitably large bounded region as was 
done in |2]. Also, in certain cases where conjugate symmetry is 
present Theorem 1 can be specialized to require roughly half as many 
evaluations of p as would otherwise be needed 11], |2]. 

Ill. The Finite Inclusions Theorem 

A problem of fundamental importance in robust control theory 
is that of proving a family of linear differential equations has only 
solutions which decay and/or oscillate within a prescribed range of 
rates. Via the Laplace transform this problem can be reduced to 
proving a family of polynomials has all its zeros inside a prescribed 
region of the complex plane. With this problem in mind we now state 
the following, very useful corollary to Theorem I. 

Theorem 2-Finite Inclusions Theorem: Let />(*; q) = D''-o n j 
(q)s J . q € Q, where Q is an arbitrary set, ?» > 0, and t\j:Q C. 
Further, let F C C be a closed Jordan curve such that int F is convex 
(see Fig. 1). Then, for all q E Q< p(-:q) is of degree n and has all 
its zeros in int T if there exist m > 1 intervals (n*. M C R and a 
counterclockwise sequence of points .*k E F, 1 < k < m, such that 

1 - (ik . bk - «k+ 1 } < tt (8a) 

max \ t) ui ~ (<M + 2?m). (/>] 4- 27m)— a r „} < k (8b) 

Vi <*.<„, p(s k \ Q) C Sk = re ,0 \r > 0, >0 E («kJ>k )}. (8c) 

Proof: For each q E Q, (8c) implies for all A\ p(*kiq) ± 0 
and there exist 0k E such that arg p(si;q) = 0k (mod 2 tt). 

Further, from the inequality 

V \B -«\< sup \»-9\ = mnx{d-a.b-c) 

e(r ,ii) e^(a.b) 

O'&ir.d) 

we see conditions (8a), (8b) imply Vi<* <„, \0k+i - 0 k | < tt and 
2kv + 0i - 0 m | < 7 T. Hence, for each q € Q Theorem 1 implies 
■'(Sg). is of degree n and has all its zeros in int T. Q.E.D. 

For a convex region int F Theorem 2 roughly says that if we can 
<nd a sequence of open sectors Sk “spaced” no more than n radians 
pan and revolving a net 2kh radians about the origin and if we 
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can find a counterclockwise sequence of points € T such that 
the value seti*/»(«*;Q) = {p{#k\q)<q € C?} lie in these sectors, 
then each member of the family of polynomials p(-;g), q € Q, is of 
degree u and has all its zeros in int F. 

In general, it should be noted that Theorem 2 is only a sufficient 
condition for a family of polynomials to have its zeros in a given, 
convex region. When p(s:g) is continuous and affine in q € Q 
and Q is a compact, convex set, however, it is also necessary (i.e., 
the word “if’ in Theorem 2 may be replaced by “if and only if,”) 
To see this, suppose p(s\q) is continuous and affine in q, Q is 
a compact, convex set, and for all q E Q, p(‘\q) is a degree h 
polynomial with all its zeros in int F. First note, p(s: Q) is a compact, 
convex set that revolves n times counterclockwise about the origin 
without touching the origin as a traverses T (since for arbitrary 
q E Q> p(s\q) revolves in this fashion). By the above facts and 
the continuous deformation of p(s;Q) as s traverses F, we may 
define two continuous functions a Jr. 10,1] —* R, such that for all 
t E i, {re ,0 |r> 0 ,0 E [n(f),fc(f)]| is the smallest sector containing 
p(F(f );Q), and so, 0 < b[t) - a(t) < 7r, n( 1) = u(0) + 27m, and 
h( ]) — Ii(()) + 27tm. Since a continuous function defined on a compact 
set achieves its supremum and is uniformly continuous on that set, we 
see that r = sup b(t) - a(t) = nmxb(t) - a(t) < k and that a and b 
are uniformly continuous. Thus, for an arbitrary 0 < f < ( tt - e)/2 we 
can choose a sufficiently fine partition {/n} of / such that conditions 
(8a), (8b) are satisfied with <u = o(tk) — e and bk ~ b(tk ) + f, and 
by construction, (8c) is satisfied with .s* = F(4). 
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Further Results on Rational 
Approximations of C l Optimal Controllers 

Zi-Qin Wang, Mario Sznaier, and Franco Blanchini 

Abstract —The continuous-time persistent disturbance rejection prob¬ 
lem iC x optimal control) leads to nonrational compensators, even for 
8ISO systems [4], J7J; [Hj. As noted In [4], tbe difficulty of physically im¬ 
plementing these controllers suggest that the most significant applications 
of the continuous time C l theory is to furnish bounds for the achievable 
performance of the plant. Recently, two different rational approximations 
of the optimal C l controller were developed by Ohta et at, [6] and 
by Blanchini and Sznaier [11. In this paper we explore the connections 
between these two approximations. The main result of the paper shows 
that both approximations belong to the same .subset tfr of the set of 
rational approximations, and that the method proposed in [1] gives the 
best approximation, in the sense of providing the tightest upper bound of 
the approximation error, among the elements of this subset. Additionally, 
we exploit the structure of tbe dual to the C 1 optimal control problem 
to obtain rational approximations with approximation error smaller than 
a prespecified bound. 

1. Introduction 

A large number of control problems involve designing a controller 
capable of stabilizing a given linear time invariant system while 
minimizing the worst case response to some exogenous disturbances. 
This problem is relevant for instance for disturbance rejection, 
tracking and robustness to model uncertainty (see [2] and references 
therein). When the exogenous disturbances are modeled as bounded 
energy signals and performance is measured in terms of the energy 
of the output, this problem leads to the well known H ^ theory. The 
case where the signals involved are persistent bounded signals leads 
to the C l optimal control theory, formulated and further explored by 
Vidyasagar (7), [8] and solved by Dahleh and Pearson both in the 
discrete* [3], [51 and continuous-time [4] cases. 

The C l theory is appealing because it directly incorporates time- 
domain specifications. Moreover, it furnishes a complete solution to 
the robust performance problem (see [2] for a good tutorial and a 
list of relevant references). However, in contrast with the discrete 
time V theory, the solution for the continuous-time £ ] optimal 
control problem leads to nonrational compensators, even for SISO 
systems. As noted in [4], the difficulty of physically implementing 
these Controllers suggests that the most significant application of 
the continuous time C y theory is to provide performance bounds 
for the plant. Recently, two rational approximations to the optimal 
£'■ controller were developed independently [6], 11], Although these 
approximations are based upon different techniques ([6] follows an 
algebraic approach while [1] exploits the properties of the Euler ap¬ 
proximating set), they seem to be strongly connected 11]. Noteworthy, 
they yield closed-loop plants with the same pole structure. 

In this paper we explore the connection between these approaches. 
The main result of the paper shows that both belong to the same 
subset il T of the set of admissible rational approximations, and that 
the method proposed in [1] gives the best approximation (in the sense 
of providing the tightest upper bound of the error) among the elements 
of this set. Additionally, by exploiting the structure of the dual to 
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the £ 1 optimal control problem we furnish a procedure to compute 
rational approximations with error smaller than a prespecified bound 
e, and we show that the approximation error-*0 as O(r). 

The paper is organized as follows. In Section 11 we introduce 
the notation to be used and we restate the main results concerning 
the £ l problem and its rational approximations. Section III contains 
the majority of the theoretical results. Here we compare the two 
approximation methods and we show that, in a sense, the method 
proposed in [1] yields the best rational approximation. In Section IV 
we present a simple design example and we compare the optimal £ l 
Controller with its rational approximations. Finally, in Section V we 
summarize our results. 

II. Preliminaries 
A. Notation and Definitions 

denotes the set of nonnegative real numbers. £** (/?+) de¬ 
notes the space of measurable functions /(f) equipped with the 
norm: ||/||c* = ess ■ sup JV |/(/)]. £ l (/?+) denotes the space 
of Lebesgue integrate functions on R+ equipped with the norm 
||/||i = \f{t)\dt < x. Similarly, /j denotes the space of 

absolutely summable sequences h = j/*,} equipped with the norm 
\\h\\ i = |A, | < x. 1t£ l denotes the subspace of £' formed 

by matrices with real rational Laplace transform. A denotes the space 
whose elements have the form 

tx: 

l> = + 

kz- 0 

where //(f) E Ai(i?+). {//J} E /i and f, > 0. equipped with 
the norm \\h\\ A = \\h L \\ fjl 4- ||//||/,. Given a function /(f) E £' 
wc denote its Laplace transform by F[s) 6 : similarly, given 

h E A. we denote its Laplace transform by If(s). By a slight abuse 
of notation, we denote as ||F(.s)|| l = ||/(f)||i and ||/f(«)|| / \ = ||//||,a. 
Throughout the paper we use packed notation to represent stale-space 
realizations, i.e., 

CH*) = C(.sf - A)~ l B + D = 

Definition I: Consider the continuous lime system C»(.s ). Its Euler 
approximating system (EAS) is defined as the following discrete time 
system 



From this definition it is easily seen that we can obtain the EAS 
of G( fi) by the simple variable transformation * = (~ - 1 )/r, i.e., 

0«,^,. C (£Z2). 

On the other hand, for any given r we can relate a discrete time 
system to a continuous system by the inverse transformation ~ = 
1 -I- r.s. It is obvious that the discrete time system is, in fact, the EAS 
of the continuous time system obtained in this form. 

Definition 2: Consider a system of the form 

$(») = TAh) + TA*)Q(*) 

where 7 l>(a;) has all its zeros {,:t, - 2 , ■ * -, } in the open right-ha- 1 

plane and where, for simplicity, we assume that all the zeros an 
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Fig/1. The generalized plant. 

distinct. $(«) is said to be admissible if it is stable and satisfies the 
interpolation conditions 

$(;■*.) = Tiizit ). k = 1, ■ • ’, /i. 

B. The C l Optimal Control Problem 

Consider the system shown in Fig. 1, where 5 represents the 
system to be controlled; the scalar signals u £ and u represent an 
exogenous disturbance and the control action respectively; and where 
C and y represent the output subject to performance constraints and 
the measurements available to the controller, respectively. As usual 
we will assume, without loss of generality, that any weights have 
been absorbed in the plant 5. Then, the C l optimal control problem 
can be stated as: Given the system (5) find an internally stabilizing 
controller u{s) = K{s)y(s) such that the worst case (over the set of 
all uj(t) £ C* , ll^'Hc* < 1 ) maximum amplitude of the performance 
output ((f) is minimized. 

By using the YJBK parameterization of all stabilizing controllers 
[4], |9J, the problem can be cast into the following model matching 
form 



Remark!: It was shown in [4] that we need to satisfy constraint^ 
(4) only for all t where is finite and can be determined 

a priori. Even so, there are still infinite constraints, and therefore the 
dual problem is a semi-infinite linear programming problem. 

C. Rational Approximations to the Optimal C l Controller 

From (5) it follows that, unlike in the discrete-time case, the C\ 
optimal controller is irrational even if the plant is rational. Prompted 
by the difficulty in physically implementing a controller with a 
nonrational transfer function, two rational approximation methods 
have been recently developed by Ohta et al. [ 6 ] and by Blanchini 
and Sznaier [1]. In the sequel we briefly review these results. For 
brevity, we refer to the former as the OMK method and to the latter 
as the HAS method. 

Theorem 2 (Ohta et al., [(>]): Lei 


'.4, 

B, 

Ci 

D , 


T\{s) = -- and T 2 (h) 


be minimal realizations. Define 



I. = B 2 n:;' 

.-i = a , - i.c , 


Al = LDi + R,B , 


where rt, is the unique solution of the linear matrix equation 
Ai?, - /?, A\ = LC \. 


/io = . inf ||^(-‘«)||yi == inf ||T, (#) -+• 7i*(.s)Q(.s)|| A ( 2 ) 

h fi</iliilizinu, Q£ A 

where T,, T> are rational stable transfer functions. 

Next, we recall the main result of [4], showing that a solution to the 
£} optimal control problem can be found by solving a semi-infinite 
linear programming problem. 

Theorem / (Dahlch and Pearson 14]): Let T> (s) have n zeros 
in the open right-half plane and no zeros on the jw axis. Then 

fio~ inf ||r, (*) + Ta(*)Q(#)|| a 

Q£A 


= max 


Y/>, Re{Ti lm,T)(:,)} 


subject to 

I r(0| = 


Then, there exist finite sets {/ , f -.> • ■ ■ f,„} and {Pi Pa * - * P...} such 
that 


3/ = ^ P, exp (-At, )L 


( 6 ) 


and //o = |P«I- For r > 0 , define .Y(/,, t) = the small¬ 

est integer larger than or equal lo /,/r. i = 1 . 2 , • • ■. m. and 
X{t) = X{tm> t). Finally, denote by p(r) the minimizer of 
||P(r) - o11j subject to 


M — (t){I 4- r A) 




(7) 


(3) 


where o( r) = [o, (r. o m (r)]. and q ~ [Pi, • ■ •, 0 ,,,]. Consider 
the rational system 4>(.s r) with the following state-space realization 




(Mr) 

fft/, t- \ I 


Yn i Re{ r T ’ 1 } H- 4 - „ Im {< ' ' } 

< 1. 

TM-s. t- ) — 1 

\C:k(t) 

Dm ( r) 


Vf € rt+. (4) 


where the matrices A, D\, C\ j. Dm are defined as follows 


Furthermore, the following facts hold: i) the extremal functional r*(t) 
equals 1 at only finite points: f 1 , ii) an optimal solution 

$(*) = T)(s) + T 2 {*)Q{*) to the left side problem always exists; 
and iii) the optimal p has the following form 




(- 1 1 0 

0 -11 0 
0 0 -1 1 


... o' 


P — ,V)(f — t ,). t, € rt+. 

and satisfies the following conditions 

a) P»r*(f,) > 0; 

m 

b) 

rti 

c) = r,(sfc,. *• = 1, 


0 

Vo 


rn finite 


(5) 


I) -1 1 

0 0 - 1 / 


.V(r)by.\ T (r) 


D,( r): itsAthelementj 0 ' 4 if * i (*<'•• r). i = 1 , 

\o.(r), it k — t) 

C 3 (r) = (r-' 0 ••• 0), lby.V(r) 

n <T , _ /»• if jV(/i. r) / 0 
" \*i(r). ifJV(f,.r) = 0 . 

Then, as r, -♦ 0 , we have that $(», r) -♦ uniformly in the 

wide sense in the open half plane Re(*) > -<r for some <r > 0; and 
II^IIa, as well as its upper bound 7 = Y^L\ |p«(t)|* converge to /*o. 
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Remark 2: As we will show later, (6) and (7) are just another 
version of the interpolation condition. 

Next, we recall the main result of [1] showing that the C 1 norm 
of a stable transfer function is bounded above by the l l norm of its 
EAS. Moreover, this bound can be made arbitrarily tight by taking 
the parameter r in (1) small enough. This result is the basis for th$ 
approximation procedure proposed in [1]. 

Theorem s ( Blanchini and Sznaier [1]): Consider a continuous 
time system with rational Laplace transform 4(a) and its EAS, 
4 i? (z, t). If t) is asymptotically stable, then 4(a) is also 
asymptotically stable and such that 

ll*Wlli r)||i. 

Conversely, if 4(a) is asymptotically stable and such that 
f]4(tf)||i = yi,., then for all // > p r there exists r* > 0 such 
that for all 0 < r < r\ 4 £ (r, r) is asymptotically stable and such 
that H 4 r)||i < /<. 

Theorem 4 [I): Consider a strictly decreasing sequence r; —► 0, 
and define 


in = inf r,)||i 

•stfihiliziug K 


where 4^(z. r,) denotes the closed-loop transfer function. Then the 
sequence pi is nonincreasing and such that p, —► po, the optimal 
£' cost. 

Corollary: A suboptimal rational solution to the £ ] optimal 
control problem for continuous time systems, with cost arbitrarily 
close to the optimal cost, can be obtained by solving a discrete-time 
h optimal control problem for the corresponding EAS. Moreover, if 
K(z ) denotes the optimal l\ compensator for the EAS, the suboptimal 
£} compensator is given by A'(r.s -hi). 


III. Analysis of the Different Rational Approximations 

In this section we^ analyze the rational approximations generated 
by the OMK and EAS methods. The main result shows that both 
approximations belong to a certain subset fi T of the set of rational 
approximations, and that the EAS method generates the best approx¬ 
imation among the elements of this subset. Wc begin by showing 
that the two expressions for matrix M in Theorem 2 are just another 
version of the interpolation conditions. 


A. Characterization of All Rational Approximations 
Lemma 1: For a closed-loop system of the form 

*(«) = t, € n+ 


the following two conditions are equivalent 
v 

a) 4( Zk) = = T) ( Zk ), k = 1, ■ • ■, n. 

• i-i 
H 

b) M = y ^<j>i exp( - At i)L. 

i= i 

Moreover, we have ||$(«)IU = f = EJ=i M- 

Proof: b)=> a) can be proved following the proof procedure of 
[6, lemma 2] by simply replacing 4opt with 4. Similarly, the fact 
that b) is necessary for a) to hold can also be concluded from the 
proof. The expression for ||4(*)IU follows from direct calculations. 

In the next lemma we give a complete characterization of all 
rational approximations. 


Lemma 2: For any rational closed-loop system 

IX-M 1 + 

jf=l i=;l 

where Re(A,) > 0 and Nij integers, the following two conditions 
are equivalent 

m 

a) = + =r,(4 t ). * = 1,•••.«. 

J = li=l 

b) M = '£52^(1+ 

j- 1«=1 

Moreover, we have ||4(s)|U < 7 = J2%i Evil I fa A- 

Proof: a)£> b) can be proved using the same idea. The calcula¬ 
tions, through straightforward, are tedious, and are omitted here for 
space reasons. By direct calculation we have 



>=i 


B Comparison of the OMK and the EAS Rational Approximations 


Lemma 2 gives a characterization of all rational admissible closed- 
loop systems. All these closed-loop systems can be thought as 
candidate rational approximations of the £*-optimal closed-loop 
system. In the sequel we concentrate on a specific subset ih 
and we show that both the OMK and the EAS methods generate 
approximations that belong to this subset. For r > 0, define 


Hr 



(H) 


By direct calculation, the closed-loop system obtained by OMK 
methods is 

HI 

1=1 


Suppose that the L-optimal closed-loop system for the EAS is given 
by 

t = J 

then the closed-loop system obtained using the EAS method is 
4(«) = (r)(I -|- rny N ' . 


It follows that the approximations generated by both methods belon: 
to the set Q r * with a specific {AY} determined by each method. 

Remark s: In the OMK method, { A(f,, t)} depend directly m 
{/, }, and hence on the £ 3 optimal closed-loop system, Henc 
obtaining a rational approximation requires solving the C l optirn 
control problem first. However, as pointed out in Remark 1, solvir 
exactly this problem entails solving a semi-infinite linear program 
ming problem. The EAS method requires only solving a discre 1 
li optima] control problem, which is considerable easier, since on I 
finite-dimensional linear programming is involved. 
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Remark 4 Note that additional OMK-like rational approximations 
can be chosen among the elements of 12 r by simply modifying the 
rule for selecting r) For instance, N could be selected as the 
largest integer smaller than or equal to i l /r or t /r- 1-0 5 Clearly, 
the convergence property also holds for these approximations 
As it is shown in Theorem 6, the EAS method can be interpreted 
as approximating the original optimization problem as opposed to 
directly approximating its irrational solution This makes it quite 
unique Besides the computational advantages, we show in the sequel 
that the EAS method has two other important merits 
Theorem $ The rational approximation of the £’ optimal con 
trailer given by the EAS method is the best one in the sei il r in the 
sense that it leads to the smallest upper bound -> 

To prove Theorem 1 we need to prove first the following results 
Lemma 7 Consider the following discrete time systems 



Define 

r, = n., d / 

if = 4 i - f j ( i 
Mi = l / T)[f + 1/ ] Bu B\i 

where 7?i/ is the unique solution of the lineal matrix equation 
4, 'Hu 4 1 / H u + 1 1 ( ii = 0 

Then wc have 

Mi = tM 


where M is defined in Theorem 2 

Proof First we show that 1/ is always invertible Note that 


(/' > ’ = 


1 / 

-I, 

D ,'( , 



Sinct it is assumed that f (s) has only unstable zeros so docs 
7 f ( ) I his means all the zeros (poles) of T l ( ) ((7 1 ) l ) have 
magnitudes larger than 1 Invcrtibility of 1/ follows immediately 
Recall now that Ft i is the unique solution of the following lineai 
matrix equation 


4/?i - /?, 1 - K , =0 


We can verify that 4, 1 Ihi also satisfies the above equation 

1 l, 1 /?,, - Vtfu *i 

= ' '(/?!/ - 4/ 4,/ - 7/ Cu ) = 0 


Hence /? { ■= 4^ 1 /?iz and \fi = t\I 
Lemma 4 Consider the discrete time 1\ optimal control problem 
*or the EAS system 


lif = inf || T{ 4* T 1 Q\\\ 

Q si il I 

' closed-loop system 4>' (-) = £' i ° - V ,s admissible 
Wishes the interpolation conditions 

<b' (_/ ) = T! ( * ) a = 1 II 

v herc -{ are the zeros of 1} it and only it 

<1 

Mi = ^,4/ ' L, 

t I 


i e 

(9) 


Proof From the definition of EAS (9) is equivalent to 
/ 

*(j*) = X>'<l + r -»r' =7i(ei). * = 1, '*• 

= 1 

From Lemma 4 (10) is equivalent to 
] 

A/-£\p.(/ + 7-41 v I 

t- 1 

I 

tM -]To </ + r4, -tTC )~ N *ri 
-1 

\h =^2„»,n do 

i 

Fquivalence of (9) and (10) follows now from Lemma 2 
Note that Lemma 4 is true tor any discrete time systems since 
a discieie time systems can always be thought as an EAS of some 
continuous time systems 

Proof of Theorem 6 Consider a set of admissible closed-loop 
systems 



From Lemma 4 it follows that conditions (8) and (12) are identical 
Therefore for every ( ) in Hi there exists a corresponding 
4>(s) = 4* 1 (1 4 - rv) m il r and vice versa Since 

= J>i=-ii*'iii 

it lollows that the closed Imp system < F / obtained by solving the 
optimal / l control problem for the LAS yields the smallest •) among 
the elemenis of Ihe set Hj Hence the rational closed-loop system 
obtained by LAS methods also has the smallest upper bound among 
the set il 


C Tht LAS Mfthod Rcusittd 

Although the results of |6] and |l] show that the optimal C 1 
controllers can be approximated arbitraltly close with a lational 
controller these results did not provide a way of obtaining an 
approximation with error smaller than a prespecihed bound, rather, 
they icquired solving a sequence of problems and checking the 
approximation error until the desired precision was achieved In this 
section we indicate how to select the parameter r tor the EAS method 
in such a wav that the error of the resulting approximation is smaller 
than a prespeuhed bound Moreover we show that this approximation 
error convetges to 0 as fast as r 

Vuorem 6 Given any f > 0 we can find a r a prion for the 
EAS method such that 


in < /1 o (1 4- O 

Moreover the approximation errot converges to zero as O(r) 

Proof Consider the optimal l\ control problem for the EAS 

(14) 


hi = IIr/(-) + rj(-)c>< )||, 

Q sfnl I 


and its dual 
in ~ max 
subject to 

|/u - 0 | = 


ReR/u' )} + £<>,+ . lm{i/ )} (14) 

-1 

» u 

M<-' r‘}+!>+" mu' r‘> 


< i. 


( 10 ) 


0 , 1 2 


(15) 
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where Tf and T/• are the EAS of TV and Tj , respectively, and where 
zf denotes the zeros of . From the relationship between the EAS 
and its corresponding continuous system, the above dual problem is 
equivalent to 


ft e = max 


Re{Ti (sj )} 4- j^ot'i+n Im{Ti(^ r )} 


(16) 


subject to 


; Re{(1 -4- rzj)~ k } + ^ot i+n Im{(l 4* rzi) fc } 


»~i 


< 1, 


k = 0, 1, 2, • 


(17) 


which can further be thought as an approximation of the dual problem 
of C x -optimal control problem 


Note also that constraints (19) will be automatically satisfied for 
all t > t mAX where f« mx is finite and can be determined a priori [4]. 
Given any t < t max 4 assume that the constraints (17) are satisfied 
and consider any t < f fimx . Selecting k such that f* < t < t* 4- r 
we have that 

H II 

2><1 + rc,) -1 - 

i-l i-l 

< ||«||imp,x{|(] + rzi)~ k - 

< IlcrUi ^niax{|(l + - f~ 8 ‘'*|} 

+ max{|f ~’ <lk — e **1}) 

< |M|i ^inajt{t„ lox r -, |(l + rz,)~ 1 

+ max{l - = f (r). (22) 


fin = max 

"j 


^a,Rc{r 1 ( il )} + ^ O'l + n Im{T\(;,)} 


subject to 


|r(i, «)| i 


Re{<> ■*"'*} + y^n,+„ Im{c *•'} 


(18) 


< 1. 

VI e R+ (19) 


in the sense that the constraints (19) are firstly sampled at the 
time interval /* =■ kr and then the irrational terms e~~' tk are 
approximated by rational terms (1 + Tz t )~ k . 

For simplicity, in the sequel we will assume that all the zeros 
z, are real as in (4], although the proofs can be easily extended to 
encompass complex zeros as well. - 

An upper bound on ||a||i for all a satisfying the constraints (15) 
can be derived as follows. Define the following sets 



l-l 
n 

Y°i(l + TZi) 

i=l 

H 

Y«Al + TZi) 


< lVt 


,«} 


2 o} 


< 1VA* 

< 1 k = 0, 1, ■ , n 



( 20 ) 


From [I] it can be shown that, if r < r, then S c C S(r) C 5(r) 
and 5(r) C i?(r). Hence 

sup ||a||i < sup lid'll 1 < sup I|nr||i 
(»6.S(r) '*e.S'(r) 

< sup IMh < || J F~ 1 (r)|| QKi .i (21) 

where 

F(t) = 

/ i i i \ 

+ (1 + rsir 1 (1 + tz„ )~ l 

^l+ T3i)- n+I (1 + r; 2 )-’* +1 ... (1 + tz„ )-" +l > 

Ilf 1 ||sc.i denotes the induced norm of F~ l from to lh , and 
where -T is fixed. 


The first inequality is immediate. The second one follows from the 
triangle inequality. The last one can be proved as follows: If \a\ < 1 
and |/;| < 1, then 

la*' - 6*| = |a - 61 ]a fc_1 + a k ~ 2 b + • ■ ■ + 6*"' | < *| fl - 6| 
so we have 

Id + rc.r* - e- s '**| < *1(1 + 7--,)- 1 - e--‘ r | 

< Ur* '|(1 + TZi)" 1 - 

Note that both terms in the parenthesis can be made as small as 
one desires. So given any e > 0, we can choose a r such that the 
right-hand side of the inequality is less than or equal to r. For this 
value of r we have 


ii 


|r(f. «)| 





< 1 + <S 


1^1 I 

In particular the above inequality holds for n* 
problem of EAS. Since 


W 6 R+. 


which solves the dual 


(Tu r(t. o’)) = ^o*Ti(i,) = fii: 

i-l 

we have that 


//o 


= in ax 

r*0 


(Tt,r) 

Iklk 


max 

IkIIOG <!•+■' 


(Tu r) 
1 + e 


> 


1 4 f 


Finally, the fact that f(r) = O(t) follows from considering the 
Taylor expansion of equation (22). 


IV. An Example 


Consider the example introduced in (4] and further studied in 111 
and [6], The plant is 


P(s) 


« - 1 
a - 2’ 


The control objective is td minimize ||$||i = ||PC(1 + PC) l ||i 
The optimal closed-loop system is [4] 

^OPT(a) = 1.7071 - 4,1213f ; “ 0,M14a 
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Pig 2 Upper bound -) versus r EAS method—solid line OMK 
method—dotted line and OMK like method—dashed line 


uilh an optimal cost // 0 - 5 8264 For r = (M r i the rational 
dosed loop system obtained using the EAS method is 

$(<0 = 1 8001 - 5 1878(1 + 0 4)0" ‘ 

wilh ■> — ||$(<<)|| i = 7 2879 The OMK method yields 

$(s) = 2 3917- 5 0148(1 +0 45s) 1 

with = ||$(s)|| , = 7 4295 Finally, it we consider the OMK 
like approximation obtained b> selecting ’N (f,, r) = largest integer 
smaller than or equal to t./r t = 1 , w we obtain 

$(s) = 4 222 - G 122(1 + 0 45s) _l 

with ' = 11(s)11 A = 10 411 Fig 2 shows the upper bounds 
corresponding to different approximation methods veisus r For 
this example the 4 norm of the closed-loop system coincides with 
its upper bound in all cases (since there are only 2 interpolation 
constraints) It is interesting to note that while the bound obtained 
using the EAS method decreases monotonously with r (theoretically 
proved in Theorem 4), those corresponding to the OMK and OMK- 
like methods do not An estimated error bound f(r) curves tor this 
example is shown in Fig 4 It is very close to a straight line if a 
linear scale ta used for the r axis 

V Conclusions 

A recent research effort PH5]. [7], [8], has led to techniques for 
designing optimal compensators that minimize the worst case output 
amplitude with respect to all inputs of bounded amplitude In the 
discrete-time SISO case, minimizing the l l norm of the closed-loop 
impulse response yields a rational compensator Unfortunately, the 
solution to the continuous-time version of the problem is nonrational 
Prompted by the difficulty of physically implementing a system with a 
nonrational transfer function, rational approximations were recently 
developed |6], [I] 

In this paper we compare these approximations and we show that 
they are strongly connected Indeed, both approximations can be 
considered as elements of the same subset il r of the set of rational 
approximations 



In Section 1I1-B we show that the EAS method proposed in [1] 
yields the best approximation (m the sense of providing the tightest 
upper bound of the error) among the elements of this set 
Finally, in Section III C we exploit the structure of the dual 
problem lo provide a procedure that allows for selecting the parameter 
r for the EAS method to guarantee that the approximation error is 
smaller than a prespecified bound e Moreover, wc also show that 
this approximation error— >0 as (J(t) 

We believe that these results, combined with the features of the 
EAS method mentioned in (1|, namely, the tacts that i) it removes 
the ill posedness due to the presence of zeros on the imaginary axis; 
ii) it leads to computationally simple problems, m) it furnishes a 
monotomcally nonmereasmg bound, and iv) it is easily extendable to 
the MIMO case make this method an attractive tool for the design 
ol controllers foi continuous time systems 
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Supervisory Control of Timed Discrete-Event 
Systems under Partial Observation 

F. Lin and W. M. Wonham 


Abstract —This paper extends the authors* previous work on observ¬ 
ability of discrete-event systems by taking time into consideration. In 
a timed discrete-event system, events must occur within their respective 
tower and upper time bounds. A supervisor can disable, enable, or force 
some events to achieve a given control objective. We assume that the 
"supervisor does not observe all events, which Is often the case in practice. 
We generalize the concept of observability to timed discrete-event systems 
and show that it characterizes the existence condition for a supervisor. 
Wfe also generalize normality, a stronger version of observability, to 
timed discrete-event systems, which has nice properties that are absent 
in observability. We then derive conditions under which observability 
and normality are equivalent. We propose two methods to synthesize a 
supervisor, a direct approach and an indirect approach. An example is 
given to illustrate the results. 


I. Introduction 

Since the theory of supervisory control was introduced about 
10 years ago in [131, many ideas have been developed, notably 
the concepts of controllability [13], (14], observability |3|, [6], [8|, 
and normality [8], [9|, [10]. These concepts were, however, defined 
only for untimed discrete-event systems, in which the occurrence 
times of events were not modeled. As the theory advances and 
applications increase, there is a need to generalize these concepts 
to timed discrete-event systems, in which lower and upper time 
bounds of events are specified and events must occur within their 
respective lower and upper time bounds. In [l|, controllability is 
generalized to timed discrete-event systems and supervisory control 
is discussed under the assumption of full event observation. In 
practice, however, not all events are observable due to limitations on 
detection and communication. In such cases, observability becomes 
relevant, because a supervisor must control the system based on 
partial observation of events. This paper is devoted to supervisory 
control under partial observation. 

Wc begin with a brief review of the definitions on timed discrete- 
event systems and the definition of controllability of [1] in Section II. 
In Section III, we formalize supervisory control under partial obser¬ 
vation and discuss the constraints it must satisfy. We also generalize 
the definition of observability to take time into consideration. We 
then prove that a supervisor exists if and only if the language to 
be synthesized is controllable, observable, and L fn (G) -closed. If the 
language to be synthesized is not controllable and observable, then the 
problem of synthesizing an optimal supervisor becomes complicated, 
partly because, as to be shown in Section IV, the supremal control¬ 
lable and observable sublanguage of a given maximal legal language 
need not exist. Two methods will be proposed to solve the synthesis 
problem. In the first method, a stronger version of observability, 
called normality, is defined. The supremal controllable and normal 
sublanguage of a given language exists, and preserves the L m (G')- 
closedness of the language, and hence a supervisor can be synthesized 
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Wayne Slate University, Detroit, MI 48202 USA. 

W. M. Wonham is with the Department of Electrical and Computer 
Engineering. University of Toronto, Toronto, Ontario M5S 1A4, Canada. 

IEEE Log Number 9407004. 
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based on it. We also show that this stronger version of observability 
is actually equivalent to observability itself if all controllable (or 
prohibitible) events as well as tick arc observable. In the second 
method a supervisor is first synthesized assuming full observation 
and then modified when some events become unobservable. We show 
that, under some conditions, the second method is better than the first 
in the sense that the second method yields a supervisor generating a 
larger language and it can be implemented more efficiently, Finally, 
an example of two trains is given to illustrate the results of the 
paper. The development in this paper is parallel to the corresponding 
development by the authors on unlimed discrete-event systems [81. 

II. Preliminaries 

In this section, we briefly review the relevant results of [1] on 
timed discrete-event systems. Following the notation and definitions 
of [11, a timed discrete-event system is modeled by an automaton 

G= (S. Q. K ry ( „ y f(l ) 

where the events are partitioned into 1) prospective events H S| „ that 
have finite upper lime bounds, 2) remote events D I( . IM whose upper 
time bounds are infinite, and 3) tick of the global clock: 

£ = USninUfflfA-}. 

Remark 1 To exclude the physically unrealistic possibility that 
events in E - {tick} occur infinitely often during one unit of time, 
we require that G be activity loop free [1|, that is, 

(Vtf 6 QHV* € (E - { <J) # </• 

Remark 2 If G is constructed from the activity transition graph 
GVi and time bounds on events as shown in [ 11, then the advance of 
time will never slop. Under the assumption of activity loop Irecdom, 
this is equivalent to saying that G is type-J blocking free 1111, that is, 

(v.s e 

where £/.(*) = {tr £ S: str 6 L}. 

Remark J. Again, if G is constructed from G'<„< and time bounds 
on events, then the fact that no tick is possible after a string implies 
that some prospective events must be possible after that string, that is, 

(V.s € HG))tick $ i] Mff )(.s) => Eni».. H 5J/,«,)(*) ^ 0. 

In this paper, we assume that G is constructed from G’*, v and time 
bounds on events. 

Example J: Consider a single track linking two stations as de¬ 
picted in Fig. 1. To control the movement of trains, three traffic 
lights are installed along the track. Also installed are four detectors 
to observe the movement of trains. The track is divided accordingly 
into four sections: 5i, S 2 , $u and S i. 

Suppose there are two trains T\ and 7a scheduled to depart from 
Station 1 to Station 2. For T t , i = 1, 2 the designated events are 
as follows: 

a t[ : T t entering 5, <t i4 : T t exiting S, 

tr,A\ T x entering Si <r, fl : T, exiting 5a 

fr l2 : T, entering S,j <r,*: T, exiting 5.i 

< 7 , 5 ; T % entering S 4 <r , 0 : T, exiting 5i. 


0018-9286/95$04.Q0 © 1995 IEEE 
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*1 M *3 16 12 18 15 10 

fr° »fr°-fr o fr o fr o fro fro fro fro 

Fig 2 (Numbers denote subscnpts of event) 

The activity transition graphs G l ucl (for T,) is given in Fig 2 The 
lower time bound l« and upper time bound v„ for events <r are 

tr <T,1 (T,2 #Ti«j (T <T, (1 (7 iK <7i() 

0 0 0 0 J 12 1 

0 a cc n c oc x 1 2 1 1 

Hence the prospective events are 

£spe — 1 ( , H’ih, (T|0. <72ti <Tif ^io) 

and the remote events are 

Smn = {^11 , ffll, ^12 <? IS <721 <T i i (T 22 n l r } 

Simple time bounds are used here for the sole purpose of limiting 
the number of states They do, however, illustrate the advantages of 
the timed model (cf Section VI) Moie general tune bounds could 
be used at the expense of more computation 
From Gi, t G 2 %i{ and the time bounds we can construct the timed 
discrete event system G describing the concurrent movement of 7\ 
and Tj, which has 216 states 

To introduce control, let C £ lt „ be the set of prohibitive 
(or controllable) events, E| , C L ilitj U E h , be the set of forcible 
events, and = (U, M 11 US H ) --^,5 be the set of uncontrollable 
events The following definition fl | generalizes the definition in [IT) 
Definition 1 Let A C /(G) We define A to be controllable 
(with respect to 1(G)) if, for all s £ A 

1 ) SaMOSm =0=»E|(, )(s)n(L U {/ifA}) C E*(s) 
and 

2 ) L/ v s) n Si r / 0 => l /( o(n) n s c (s) 

In other words given history s £ A , ll there are no forcible events 
available then tick is not controllable if forcible events arc available, 
then tuk can be postponed In proofs we will often use the following 
equivalent definition for controllability 

1 ) (Vs e A)LnrjMHS, C Ek(s) and 

2) (Vs € A )Sh(s)D^ r , = 0 A //r A 6 L/ ( r)(-) => f/rA € 

I/v(s) 

III Observability 

In practice, a supervisor may not be able to delect the occurrences 
of all events When that happens, the existence of a supervisor is 
no longer guaranteed by controllability alone we need to introduce 
the concept of observability Observability lor untimed discrete-event 
systems was introduced m [11 [ 8 ] In this paper we generalize this 
oncept to timed discrete event systems 
To this end, we denote the observable events by £ C ^ and the 
unobservable events by =■ 1 - A projection P L* -+ 

«s defined inductively as follows 

Pf = f k f is the empty string 
I Ps. it <T € s 

P(w)aS \(J\) ff . it IT € L 

1 other words, if a sequence of events a occurred in G, a supervisor 
ui only observe P*» Therefore, a supervisor is now described by 
map 7 PL(G) 2 % The language generated by 7 L(G 7 ) 
^lled closed behavior), is defined inductively as follows 

n * e /(G, 1 ), 

2) If * 6 Z(G, 7 ) t G e -)(P')* Htr 6 rhen HtT € 
Z(G, 7 ), and 

No other strings belong to /(G, 7 ) 


The language marked by 7 (called marked behavior) is defined as 

Z«.(G, 7 ) = L(G 7 )ru m (G). 

The supervisor * is said to be nonblocking if 

I m (G 7 ) = Z(G, 7 ) 

We say a supervisor is admissible ll it satisfies the following 
two requirements First, no uncontrollable events can be disabled, 
and second, if tuk is physically possible and no forcible events can 
preempt it, then it cannot be disabled Formally, tor all k 6 Z(G, 7 ), 
wc require 

1 ) C i(Ps), and 

2) Eftr )(s) n MPs) D lr r = 0 a til k £ E/ ( r )(*) =>■ tn k £ 
Y(P-) 

The system supervised under an admissible supervisor cannot have 
type 1 blocking as noted in the following proposition 

Proposition I (112]) For all s £ L(G 7 ) 

)(«)n-)(Ps )^0 □ 

We show that for a supervisor to make a correct decision (1 e, to 
synthesize a language A ) based on partial observation, the following 
observability condition must be satisfied 

Definition 2 Let A C L(G) We define A to be observable (with 
respect to L(G)) if, lor all s s' £ E* 

Ps - Ps => consis(s s') 

where consist s') is true if and only if 

1) (Vfl- £ S)s<T £ A A s' £ A A so £ L(G) => s'(t £ iT, and 

2) (Vrx 6 E)s'(t 6 A A s t A A srr € /(G) => srr £ 7T 

In words two strings s and w' satisfy consist*, s') if and only 
il s and s' are consistent with respect to one-step continuations 
in A Observability requires that if two strings look the same to 
a supervisoi then they must be consistent This definition is a 
modification of observability as defined in [ 8 | 

This observability condition together with the controllability condi¬ 
tion guarantees the existence of a supervisor as shown in the following 
theorem 

Theorem I([I2]) Let A C /, (G) be a nonempty language 
There exists a nonblocking supervisor -) such that Z n (G, 7 ) — A 
if and only if the following three conditions are all satisfied 

1 ) A is controllable with respect to /(G) 

2 ) A is observable with respect to /(G), and 

1 ) A is / , (G)-closed 

If we are only interested tn the closed behavior, then we have the 
following corollary 

torollan 1 Let A C /(G) be a nonempty language There 
exists a supervisor ^ such that Z(G ^) = A if and only if the 
tallowing three conditions are all satisfied 

1) A is controllable with respect to /(G) 

2 ) A is observable with respect to /(G), and 

1) A is closed 

IV Normality 

The language describing a control objective is not very likely 
to be controllable and observable when a control objective is first 
specified because the objective is usually laid down independently 
of the consideration of controllability and observability of events 
When this happens, the original control objective is not achievable 
by a supervisor So we will try to achieve the largest possible part 
of the original control objective This means finding, if possible, the 
largest sublanguage of the original language that is controllable and 
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observable. Such a supervisor is “optimal’' in the,sense that a system 
generating a larger language can “run” faster when driven by the same 
Stochastic process generating event lifetimes, jf a certain “fairness” 
condition is satisfied. The reader is referred to [7] for this result. 

Let 0 E C Lm(G) be the language describing the original 
control objective, called the maximal legal language. We assume that 
E is X,„(G)-closed. 

The supremal controllable and observable sublanguage of E does 
not exist in general (see [12. example 2J) because the set of observable 
languages is not closed under union. To overcome this difficulty, we 
first modify the definition of normality introduced in [8J. 

Definition 3: Let K C L(G). We define K to be normal (with 
respect to 1(G)) if, for all * £ E\ 

* £ L[G) a Ps £ pT\ => a e T\. □ 

Since s £ T? => s £ L(G) APa €_PA' is always true, K is normal 
if and only if L(G) n P' ml Pl\ — K. Therefore, if K is normal, we 
can check whether a string s € L(G) is in K by checking whether its 
projection Ps is in PK. In other words, information on occurrences 
of unobservable events is not needed in deciding whether s £ A\ 
Hence, we expect that normality will be stronger than observability. 

Proposition 2 ([12]): If K is normal, the K is observable. 

The set of normal languages is algebraically better behaved than 
that of observable languages, in the sense that it is closed under 
arbitrary unions. 

Theorem 2 ([12]): The set 

CN{E) — {A' C E: A is controllable and normal} 

is nonempty and closed under arbitrary unions. Therefore, the supre¬ 
mal element of CA"(£), supCAT(E), exists and belongs to CN(E). 

Assume that all events are observable. Then normality is automat¬ 
ically satisfied. So we have the following corollary. 

Corollary 2: The set 

C(E) = {A' C E: A is controllable} 

is nonempty and closed under arbitrary unions. Therefore, the supre¬ 
mal element of G(£), supC(£), exists and belongs to C(E). 

Similarly, by assuming that all events are controllable, we have 
the following corollary. 

Corollary 3: The set 

A T (E) = {A* C E: K is normal} 

is nonempty and dosed under arbitrary unions. Therefore, the supre¬ 
mal element of A r (£), sup N(E), exists and belongs to N(E). 

From the above results, we propose a direct approach to synthesize 
a supervisor when E is not controllable and observable. The approach 
is to synthesize a supervisor for sup CN(E). For this to work, 
however, we need to know that supC'AT(E) is also L tn (G)~ closed. 

Proposition 3 (112]): If E is A m (G)-dosed, then supCA T (£) is 
also L Til (G)-dosed. 

Proposition 2 shows that normality is stronger than observability. 
But, how strong is it? The following proposition partially answers 
this question. It states that if tick and all controllable events are 
observable, then normality is equivalent to observability in the 
presence of controllability. 

Proposition 4 (l 12]): Assume Ehif> U {tick} C If A' is 
controllable and observable, then A' is controllable and normal. 

V, Modification for Partial Observation 

In the previous section, a direct approach was proposed to syn¬ 
thesize a supervisor if E is not controllable and observable, that is, 
a supervisor was synthesized for supCA r (£). Jn this section, we 


propose a different approach: we will first synthesize a supervisor for 
sup C(E) under the assumption of full observation and then modify 
the supervisor for partial observation. We show that, under certain 
conditions, this indirect approach leads to a supervisor with a larger 
dosed behavior. Furthermore, as shown in [4], when the resulting 
supervisor is implemented on-line, the computational complexity for 
updating control after observing a new event is linear with respect to 
the number of states in the supervisor for full observation. In other 
words, in the worst case, the computational complexity of the indirect 
approach at each step is of the order G( |A'|), in comparison with the 
total complexity of G(2 ,A 1 ) for the direct approach, where |A'| is the 
number of states in the refined generator of A*. 

We need the following preliminary results. 

Proposition 5([12]): The set 

VN(E) = {A’ C E:K is dosed, controllable, and normal} 

is nonempty and closed under arbitrary unions. Therefore, the supre¬ 
mal element of C\V(£) exists, belongs to (^iVfE), and is given 
by 

sup CN(E) = supC.V(E). □ 

By this proposition, we do not need to distinguish supGA'(J?) and 
sup CK(E). 

Corollary 4 The set 

T*(E) = {A" C E : K is dosed and controllable} 

is nonempty and closed under arbitrary unions. Therefore, the supre¬ 
mal element of C(E) exists, belongs to C(E), and is given by 

supG( E) =■ supf ’(7?). □ 

Corollary 5 m The set 

A (£) = {K C E: T\ is dosed and normal} 

is nonempty and dosed under arbitrary unions. Therefore, the supre¬ 
mal element of N(E) exists, belongs to V(E), and is given by 

sup A'( E) = sup X (T). □ 

Since supG(E) — sup C'(£) is dosed and controllable, we can 
synthesize u supervisor with full observation -/ : L(G) —♦ 2 such 
that 

L{G, = supG(E) 

where L(Cl, V) is defined in the same way for L(G, - ) except that 
Ps = ,N. 

Under partial observation, we modify V to a supervisor 
y: PL(G) —► 2^ as follows. Let [t] = P~ l [t) D A(G, V) be 
the set of strings in £(G, V) having the projection t and 

l(t) = U (V(.s) U (S “ !/(*,)(*)))• 

A similar modification was introduced in |4J for untimed discrete 
event systems. However, for a timed discrete-event system, the issuer 
of forcible events and nonblocking need to be dealt with differently 
To this end, we define coherence as follows. 

Definition 4: Let K C L(G), We define A to be coherent (will 
respect to L(G)) if, for all s € T\, 

tick £ £/,(r;)(*) => (Vs*, £ P" l Psf) T\) 

■(S*(*') nSr (lI = s*(*")nsi (ll ) 

A ($!,(,;)(*') n s, (1I = ) n Sr,,). 

In words, if K is coherent* then for every string s that can be follow 1 
by tick in £(G), the set of forcible events that are feasible (or leg^ i 
after any string having the same projection as * is the same. 
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Fig 3. (Hon/onlal events in same column, or vertical evenis in same row 
are same.) 

Proposition 6 ([121)’ If V is an admissible supervisor and 
L(G. “') is coherent, then - is also an admissible supervisor. 

This modified supervisor -) may block. To guarantee nonblocking, 
it is convenient to introduce the additional condition of livelock- 
freedom [5], First, let us recall the definition of livelock-freedom 
given in [51. 

Definition 5' Let K C S*. We define K to be livelock free if, 
lor all a £ Tv 

(3/i £ JV)(W £ E* )|/| > // A si £ TT => (3i/ £ ) 

■ (h < u < st K u £ K) 

where N denotes the set ol natural numbers and .s < u denotes that 
s is a prefix of u. 

In words, if A is livelock-free, then every infinite chain of strings 
in I\ visits the set of marker states infinitely often. 

Proposition 7 ([ 12]) If L„,(G) is livelock-tree and is admissi¬ 
ble. then -) is nonblocking. 

The implication of the above proposition is that, under the assump¬ 
tion of livelock-freedom, any admissible supervisor is nonblocking. 
Finally, we can prove that the modified supervisor geneiates only 
legal behavior that contains at least supC.Y( A). 

Theorem 3 ([12]), The language marked by the modified super¬ 
visor o is bounded by 

1 supCY(A) C C A. □ 

To conclude this section, we note that L{G. ) obtained this way 
may not be maximal (see Example 3 in fl2J) 

VI. Examplf 

We continue our discussion of the example in Section II. Our 
mtrol objective is to prevent two trains from colliding. So the 
'rresponding maximal legal language E can be obtained by deleting 
4es in G corresponding to two trains being in the same section at 
same time. 

Because of the locations of the traffic lights and detectors, the 
mtrollable and observable events are 

Slub = {^11 • <t|l» 0 \^- &21 • 02 *' rr ^ r * } 

E„ = {<T I I , 0-|2, <T n, (Tin, (T2 In 022- 02 f i- ^20 } - 



Fig. 4 (Horizontal events in same column, or vertical events in same row 
are the same.) 

Since the movement of a train can be initiated, we assume the 
following events are forcible: 

^for = {^11, 0\2- ^21, (T<i i, (T-22- ^2 r »}- 

To compare the result with that of untimed model, we use the direct 
method to synthesize a supervisor. We first calculate supC.Y(A) and 
then synthesize a supervisor based on sup CN(E). 

In [61, the same exattiple is discussed in the untimed 
framework. Only events Err; = {<rn, tfu, ^ 12 , tfis, 

ffio, 021 - 022 * 02 r >- } are used in the untimed model. 

Therefore, we project supC\Y( A) onto Errj and compare it (Fig. 3) 
with the controlled behavior of the untimed model (Fig. 4) from [6J. 
Clearly, the timed model allows more strings to be generated. This 
is due to the additional information provided by the timed model. 
For example, in the untimed model, the fact that a train has left Si 
cannoi be verified until it enters Sj (where a detector is located). 
However, in the timed model, this can be inferred by the passage of 
one unit of time as a consequence of the time bounds imposed. 

VII. Conclusion 

We have generalized supervisory control under partial observation 
to limed discrete-event systems. Observability and normality are gen¬ 
eralized to express the existence conditions of a supervisor for limed 
discrete even! systems. Two approaches for supervisor synthesis are 
proposed. Although the detailed derivations of the results are different 
from those for unlimed discrete-event systems, the two developments 
are parallel. Future research will focus on efficient implementations 
[2] and decentralized supervision {31, [8], [9], 
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Apology and Correction to “Process Control and 
Machine Learning: Rule-Based Incremental Control” 

Dominique Luzeaux and Bertrand Zavidovique 

Apology and Correction 

It has come to our attention that the name of the second author was 
omitted from the above paper appearing in the June, 1994, issue of 
Transactions on Automatic Controi , pp 1166-1171 Due to an 
apparent error in transcription, a former editorial assistant omitted the 
name of Bertrand Zavidovique, who was the second author on this 
paper Through a regrettable and remarkably coincidental sequence 
of oversights, the original error was not corrected either in the 
typesetting of the article or in the proofreading of the galleys by 
the first author The Transactions offers a sincere apology to Prof 
Zavidovique for this mistake and hereby provides publication of the 
title and authors as it should have appeared in June The paper in 
question appears in vol 39, no 6 
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Mathematical Control Theory: Deterministic Finite- 
Dimensional Systems —Eduardo D. Sontag (New York: 
Springer-Verlag, 1990). Reviewed by Stephen P. Boyd. 

The title of this book gives a very good description of its contents 
and style, although “Introduction to” could be added to the beginning. 
The style is mathematical: precise, clear statements (i.e., theorems) 
are asserted, then carefully proved. The book covers many of the 
key topics in control theory, except (as the subtitle has warned us) 
those involving stochastic processes or infinite-dimensional systems. 
The level is appropriate for a senior undergraduate majoring in 
mathematics or a graduate student in electrical engineering who has 
been exposed to the style of mathematics (in a first course on analysis 
or algebra). This book fills an important niche and can play a key 
role in increasing the awareness and appreciation of control theory 
among mathematicians. 

The first chapter is a nicely written extended discussion about what 
control theory is. Sonlag introduces some of the key concepts (e.g., 
system, feedback, state-space, and nonlinearity) with simple examples 
in an informal style. For an engineering graduate student, it will be a 
review of some undergraduate courses; for the mathematics student, 
it provides some cultural background for the rest of the book. This 
chapter is not in the same style as the remainder of the book so I 
might have numbered it Chapter 0. There are no theorems or proofs, 
and Sontug keeps the discussion at the level ol informal ideas, tor 
example, (p. 5) “Designs based on linearizations work locally for 
the original system/' Pity the mathematics student who then searches 
backward for the precise definition of the terms “design" and “work"! 

The book proper starts at Chapter 2, which covers the abstract idea 
of a system. The style jumps from the informality of Chapter 1 to 
a more in-depth style with a list of long mathematical definitions. 
Sontag writes that the chapter contains some “abstract nonsense" (p. 
ix), e.g., very general and abstract definitions of a dynamical system. 
On this topic he points out a very good analogy from the foundations 
of mathematics. A student of mathematics first thinks of a function 
m an informal way. as a “mapping” or “subroutine." The student 
then encounters the formal definition of a function from A to B as a 
subset of Ax B which satisfies certain properties. Not long thereafter 

The reviewer is with the Information Systems Laboratory, Department of 
klectrical Engineering, Stanford University, Stanford, CA 940305 USA. 

IEEE Log Number 9407567. 


the student reverts to the more informal model of a function as a 
“mapping" or “subroutine.” The difference is that now the student 
really knows what a function is. In a similar way. Chapter 2 introduces 
the student to the formal definition of a dynamical system. After 
this introduction, the student can revert to a less formal idea of a 
dynamical system (e.g., “the state summarizes the effects of the past 
inputs on the future outputs"). But now, these ideas rest on a firm 
footing. Sontag punctuates the heavy definitions with nice examples, 
which makes the medicine of a formal foundation easier to take 

Chapter 3 covers various topics involving reachability and control¬ 
lability. Sontag examines these topics in some depth for the important 
special case of linear, time-invariant (LTI) systems. The treatment 
might be too brief for the student who has had no previous exposure to 
LTI systems; this is no problem for the engineering graduate student, 
who will have had one or possibly two courses which use these ideas. 

Chapter 4 covers (state) feedback, with a bit on Lyapunov stability. 
In this chapter, the author gives a precise statement of the linearization 
principle described informally in Chapter I. Chapter 5 covers the 
dual notion of observability as well as an introduction to realization 
theory and minimality. Although Section 5.8, Abstract Realization 
Theory,* is marked as skippable, it shows the advantage of the 
abstract approach: generality. As an example, Sontag constructs a 
minimal realization of a parity check system. 

In Chapter 6 Sontag considers observers and dynamic feedback. 
He cannot give a detailed discussion of the design of observers (e.g., 
the trade-off between convergence rate and noise sensitivity) since 
he assumes no knowledge of stochastic processes, but he does give 
a deterministic version of the Kalman filter in the next (and final) 
chapter, which covers optimal control. Two sections in Chapter 6 are 
too brief for students who have not been introduced to the topics: 
frequency-domain considerations (Section 6.4) and parameterization 
of stabilizers (Section 6.5). These sections should have been marked 
as optional or even not covered (like stochastic processes). 

Chapter 7 covers optimal control, taking Bellman's dynamic pro¬ 
gramming approach. One of the advantages of this approach is that 
it applies to more abstract systems (e.g., finite state systems) so 
an example of this type would be useful before launching into the 
traditional case of a linear system with quadratic cost. This chapter 
ends with a nice deterministic development of the Kalman filter. Here 
the goal is to find the initial state that minimizes an integral quadratic 
measure of the difference between the output observed and the output 
predicted, given the initial state. 
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The appendixes cover very useful material, e.g,, singular value 
decomposition. It is a pity that some of these topics are not covered 
in the traditional mathematics curriculum. 

The book contains about 400 references. A list of references does 
not have great added value today because of computerized literature 
searches. Sontag, however, ends each chapter with a very useful Notes 
and Comments section with annotated pointers to the references. The 
references seem to be concentrated in Western journals (e.g., IEEE 
Transactions on Automatic Control, Systems and Control Letters ); 
only a handful are from the former Soviet Union, which has a 
Very strong tradition in mathematical control theory tracing back to 
Pontriagin and Lyapunov. I must admit that this comment applies 
equally as well to the books 1 have written. 

There are several uses for this book. It would work very well as 
the text for an undergraduate senior-level mathematics course. This 
course could replace, or at least complement, the traditional course on 
19th century calculus of variations that is often found as a senior-level 
mathematics elective. The course could also serve graduate students 
in engineering. 

The second use for the book is as a reference or supplementary 
text for engineering graduate students, particularly Ph.D. students. 
These students have already been exposed to many of the ideas in 
Sontag's book in engineering-oriented courses. Often in their first 
year of Ph.D. study they firm up their mathematics background by 
taking mathematics courses such as analysis, functional analysis, 
abstract algebra, and differential geometry. Sontag's book serves as 
an excellent bridge between the two disciplines: it covers topics 
traditionally treated in engineering courses, but in a mathematical 
style. 


One similar book is Vidyasagar [3], which is oriented more towards 
the advanced engineering student and 1 think more appropriate 
for an engineering course on advanced control theory. The book 
by Delchamps [1J could serve as a secondary text with Son tag’s 
book, since it covers more linear systems theory, also in a precise 
mathematical style. The book by Wonham [4] covers linear system 
and control theory in the most elegant mathematical style, It could 
serve as the text for a sequel to the introductory mathematical control 
theory course based on Sontag’s book, The recent text by Zabczyk 
[5], Mathematical Control Theoiy An Introduction , is similar to 
Sontag's book in style, level, and coverage. Zabczyk'& choice of 
more advanced topics differs from SontagV, Zabczyk includes more 
on optimal control as well as a fairly complete introduction to infinite- 
dimensional systems. 

Sontag clearly put much thought and effort into this book, and u 
shows, The book succeeds in conveying the important basic ideas 
of mathematical control theory, with appropriate level and style, to 
seniors m mathematics. 
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’taler, M., and A Sideris Feedback control of quantized constrained 
systems with applications to neuromorphic controllers design, T-AC Jul 
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Tsai, W.K. see Antonio, J K , T-AC Feb 94 385-391 
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of trajectories for systems with nonholonomic constraints; T-AC Jan 94. 
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2034-2046 

Wei Ren, and P R. Kumar. Stochastic adaptive prediction and model 
reference control, T-AC Oct 94 2047-2060 
Weiss, H., Q Wang, and J,L Speyer. System characterization of positive real 
conditions, T-AC Mar 94 540-544 
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A 

bsolute stability 

convex approach, absol. stabil. problem. Gapski, P.B., + , T-AC Sep 94 
1929-1932 

dyn. lin. time-delay systs., robust controller design. Mahmoud\ M.S., + 
T*AC May 94 995-999 

nonUn differential-delay systs., absol. stabil. Gil, A/./., T-AC Dec 94 
24.81-2484 

parametrically uncertain nonlin. feedback systs., absol. stabil. Marquez, 
H.J., + , T-AC Mar 94 664-668 

C generator excitation; cf. Power generation control, excitation 
ctuators 

uncertain SISO min. phase lin. syst., nonlin. universal servomechanism. 
Ryan, E.P., T-AC Apr 94 753-761 

adaptive control 

bilinear systs., adaptive stabilization, least sq. Xi Sun, + , T-AC Jan 94 
207-211 

book review; Adaptive Control Systems (Isermann, R., et al.; 1992). 

Kaneiiakopoulos, L, + , T-AC Aug 94 1776 
book review; Handbook of Intelligent Control; Neural, Fuzzy and 
Adaptive Approaches (White, D.A., and Sofge, D.A., Eds.; 1992). 
Samad, T., T-ACJul 94 1534-1535 

car reachable posns. computation. Soueres, P., +. T-AC Aug 94 1626-1630 
chaotic behavior in adaptive control systs. Gonzalez, G.A., + , T-AC Oct 
94 2145-2148 

complex discrete-time control systs., simul. automation. Ellis, R.D., + , 
T-AC Sep 94 1795-1801 

continuous descriptor sysls., adaptive observer. Uetake, Y., T-AC Oct 94 
2095-2100 

continuous direct adaptive control, saturation input constraint. Cishen 
Zhang, +, T-AC Aug 94 1718-1722 

continuous-time adaptive decoupling control design. Ortega, R., + , T-AC 
Aug 94 1639-1643 

continuous-time nonlin. systs. adaptive control by neural nets. Fu-Chuang 
Chen + , T-AC Jun 94 1306-1310 

corrections to “On the stability proof of adaptive schemes with static 
normalizing signal and parameter projection” (Jan 93 170-173). Ipannou, 
P., T-AC Apr 94 896 

cyclicly switched param.-adaptive control systs., MIMO design 
models/internal regulators. Morse , A S., + , T-AC Sep 94 1809-1818 
discrete-time adaptive nonlin. syst., least sq. estimator. Kaneiiakopoulos, 
l, T-AC Nov 94 2362-2365 

discrete time-varying systs., robust adaptive controller. Changyun Wen, 
T-AC May 94 987-991 

dyn. systs., robust adaptive control design. Khorasani, K, T-AC Aug 94 
1726-1732 

failure delect./isolation/accommodation syst. Chia-Chi Tsui, T-AC Nov 94 
2318-2321 

feedback lin. systs. adaptive tracking. Marino, /(.,+, T-AC Jun 94 
1314-1319 

first-order nonlin. syst. adaptive control. Brogliato, B., + , T-AC Aug 94 
1764-1768 

Gaussian stochastic control syst. tuning, van Schuppen, J.H., T-AC Nov 94 
2178-2190 

harmonic generation in adaptive feedforward cancellation schemes. 

Bodson, M..+, T-AC Sep 94 1939-1944 
interconnected systs., decentralized adaptive regulation. Changyun Wen, 
T-AC Oct 9* 2163-2166 

jump-Markov systs., stabilizing control law. Dufour, F., +, T-AC Nov 94 
2354-2357 

lin, systs., nonlin. design of adaptive controllers. Krstic, M., +, T-AC Apr 
94 738-752 

multiple models/switching approach. Narendra, K.S., + , T-AC Sep 94 
1861-1866 

nonlin. design of adaptive controllers. Krstic, M., +, T-AC Apr 94 738-752 
nonlin. systs., triangular struct., adaptive control. Seto, D., + , T-AC Jul 
94 1411-1428 

nonlin. uncertain systs. tracking control. Song, Y.D., + , T-AC Sep 94 
1866-1871 

nonminimum phase first-order continuous-time systs., adaptive 
stabilization. Lozano, R., + , T-AC Aug 94 1748-1751 
parab. systs. direct adaptive control. Keum Shik Hong, + , T-AC Oct 94 
2018-2033 

param.-adaptive control, cyclic switching strategy. Pait, F .A/., + , T-AC 
<h*n 94 1172-1183 , . ■ B ^ _ 

pole placement without excitation probing sigs. Lozano, R., + , T-AC Jan 
94 47-58 

predictive controller, robustness props. Clarke, D.W., + , T-AC May 94 
1052-1056 


rigid spacecraft adaptive attitude control. Egeland, O., +, T-AC Apr 94 
842,846 

robot adaptive control based on passivity. Yu Tang, + , T-AC Sep 94 
1871-1875 

robot conboiler, adaptive, Lyapunov stabil. Egeland, O , +, T-AC Aug 94 
1671-1673 

robot manipulators, globally convergent adaptive controller. Mahmoud, 

M S., + , T-AC Jan 94 148-151 

robust adaptive pole placement control. Weyer, E , + , T-AC Aug 94 
1665-1671 

robust adaptive regulation with min. prior knowledge, comment. Jt Feng 
Zhang, T-AC Mar 94 605 

robust approach. Gang Feng, T-AC Aug 94 1738-1742 
robust direct adaptive controllers, normalization tech. Gang Feng, + , 
T-AC Nov 94 2330-2334 

robustness and multitone instabilities. Bodson, M., T-AC Apr 94 864-870 
sing -free multivariable MRAC. Moctezuma, R.G., + , T-AC Sep 94 
1856-1860 

single-arm dyn., robust variable struct./and switching-Z adaptive control. 

Li-Wen Chen, + , T-AC Aug 94 1621-1626 
SISO LT1 discrete-time systs., param.-adaptive controller. Kreisselmeier , 
G., T-AC Sep 94 1819-1826 

slowly varying and LTI systs., ident. and uncertainty principles. Zanies, 

G. , + , T-AC Sep 94 1827-1838 

stabilization, adaptive, nonlin. time-varying controller. Miller, D.E., T-AC 
Jul 94 1347-1359 

stochastic adaptive control algms., unmodified, robustness. Radenkovic\ 
M S., + , T-AC Feb 94 396-400 

stochastic adaptive prediction/MRAC. Wei Ren, + , T-AC Oct 94 
2047-2060 

time-invariant lin. syst. adaptive control. Karason, S.P., +, T-AC Nov 94 
2325-2330 

time varying systs., discrete time adaptive controller, global stabil. 

Radenkovic, M.S., + , T-AC Nov 94 2357-2361 
transient bounds. Zhuquan Zang, + , T-AC Jan 94 171-175 
uncertain SISO min. phase lin. syst., nonlin. universal servomechanism. 
Ryan, E.P., T-AC Apr 94 753-761 
Adaptive control; cf Model reference adaptive control 
Adaptive estimation 

distributed param. systs., adaptive estim., persistence of excitation 
Demetriou, M.A., + , T-AC May 94 1117-1123 
generalized Chandrasekhar recursions from generalized Schur algm. 

Sayed, A.H., + , T-AC Nov 94 2265-2269 
ident., signed output error adaptive, convergence. Garnett, J., + , T-ACJul 
94 1387-1399 

Adaptive estimation; cf. Adaptive Kalman filtering; Adaptive observers 
Adaptive filters 

generalized Chandrasekhar recursions from generalized Schur algm. 

Sayed, A.H., + , T-AC Nov 94 2265-2269 
time-variant displacement struct, and interpolation problems. Sayed, A H 
+ , T-AC May 94 960-976 

Adaptive Kalman filtering 

extended Chandrasekhar recursions. Sayed, A.H., + , T-AC Mar 94 
619-623 

Adaptive observers 

continuous descriptor systs., adaptive observer. Uetake, Y„ T-AC Oct 94 
2095-2100 

Adaptive signal processing; cf. Adaptive filters 
Adaptive systems 

global optim., adaptive partitioned random search. Bo Tang, Z., T-AC Nov 
94 2235-2244 

least sq. estim. in white noise, convergence. Nassiri-Toussi, K., + , T-AC 
Feb 94 364-368 
Aerospace control 

attitude control problem, comment. Fjellstad, O.-E, + , T-AC Mar 94 
699-700 

Algebra 

control algms., automatic differentiation appl. Campbell, S.L.. + , T-AC 
May 94 1047-1052 

jump-Markov systs., stabilizing control law. Dufour, F., + , T-AC Nov 94 
2354-2357 

output feedback problem in lin. systs., bilinear formulation Syrmos, V.L., 
+, T-AC Feb 94 410-414 

sing, perturbed control systs. and nonlin. differential-alg. eqns Krishnan. 

H, +, T-AC May 94 1079-1084 

three-block generalized/std. Riccati cans, comparison. Datouach, M., 
T-AC Aug 94 1755-1758 
Algebra; cf. Polynomials; Set theory; Vectors 
AM 

intersample props., robustness, and sensitivity, quantit*t./quaJitat. anal. 
Feuer, A., + , T-AC May 94 1042-1047 
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Approximation methods 

delay systs approx , Lagucrre formula Lam, J T-AC Jul 94 1517*1521 
delta-operator formulated discrete-time approx Premaratne K + T-AC 
Mar 94 581-585 

discrete-time systs, mixed H 2 /H® control, exact convex optim based soln 
Sznater’M T-AC Dec 94 2511-2517 
Hw model reduction computational scheme Kavranoglu D T-AC Jul 94 
1447-14SI 

Hm-norm approx of systs by const matrices Kavranoglu D TV AC May 
94 1006-1009 

infinite horizon LQ control, soln approx Schochetman IE + T-ACMar 
94 596-601 

* interval systs order reduction, Routh-Pade approx Bartdyopadhyay B 
+ T-AC Dec 94 2454-2456 

model reduction, LF approx balancing, props Prakash R T-AC May 94 
U35-1141 

nonlm syst controllability distribs/ systs approx Ruiz AC + T-AC 
Jul 94 1473-1476 

rational transfer fn, optimal L« approx Kavranoglu D + T-AC Sep 94 
1899-1904 

textured iter algms for tridiagonal tin eqns Ttan-Shen Tang T-ACMar 
94 592-596 

Approximation methods; cf. Interpolation* Linear approximation, Minimax 
methods. Polynomial approximation, Stochastic approximation 
AftMA processes; cf. Autoregressive moving-average processes 
Array processing 

state-space models fast ident Young Man Cho + T-AC Oct 94 2004-2017 
Artificial intelligence, cf. Knowledge-based systems 
Asymptotic control 

observers for nonlin systs in steady state Hunt L R + T-AC Oct 94 
2113-2118 

Asymptotic stability 

approx feedback lin , stabil Kwanghee Nam + T-AC Nov 94 2311 -2314 
asymptotic stabil criteria for lin systs time delay Jumg-Huet Su + 
T-AC Jun 94 1341-1344 

common Lyapunov fh, stable LTI systs, commuting A-matnces 
Narendra KS + T-AC Dec 94 2469-2471 
corrections to u An improved Razumikhin-type theorem and its 
application” (Apr 94 839-841) Xu B T-AC Nov 94 2368 
delay-independent exponential stabil criteria tor time-varying discrete 
delay systs Wu JW + T-AC Apr 94 811-814 
discrete-time nonlin control systs design via smooth feedback Wei Lm 
+ T-AC Nov 94 2340-2346 

discrete-time syst exponential stabil, observer design Aitken VC + 
T-AC Sep 94 1959-1962 

dyn systs, robust adaptive control design Khorasam K T-AC Aug 94 
1726-1732 

extended horizon predictive control Fanjin Kong + T-AC Jul 94 
1467-1470 

flexible joint robots global regulation, approx diff Kelly R + T-AC 
Jun 94 1222-1224 

interval 1TI systs stabilization Kehui Wei T-AC Jan 94 22-32 
logical discrete event systs, I yapunov stabil Passmo KM + l-AC Feb 
94 269-279 

LQG controllers, stable, weighting and covariance matrices Halevi Y 
T-AC Oct 94 2104-2106 

multivariable systs, quantitat robustness measures Horng-Gtou Chen 
+ T-AC Apr 94 807-810 

nonlin regulator, num design kremelmeier G + T-AC Jan 94 33-46 
nonlin systs , asymptotic model matching Di Benedetto M D + T-AC 
Aug 94 1539-1550 

nonminimum phase first-order continuous-time systs, adaptive 
stabilization Lozano R + T-AC Aug 94 1748-1751 
observers for nonlin systs in steady state Hunt LR + T-AC Oct 94 
2113-2118 

pas real systs nonlin controllers Bernstein DS + T-AC Jul 94 
1513-1517 

Razumikhin-type theorem, improved, appls BugongXu + T-AC Apr 94 
839-841 

robot manipulators, compliant, force/posn regulation Chiavenm S + 
T-AC Mar 94 647-652 

robust strong stabilization via modified Popov controller synthesis Wang 
YW,+ T-AC Nov 94 2284-2287 

stabil oflui systs, delayed perturb Trmh H + T-AC Sep 94 1948-1951 
stick-slip avoidance, PD control Dupont P E T-AC May 94 1094-1097 
uncertain systs , asymptotic stabil region estim Han Ho Choi + 7 -AC 
Nov 94 2275-2278 
Attitude control 

comment and enror correction fjellslad O-E + T-AC Mar 94 699-700 
rigid spacecraft adaptive attitude control Egeland, O + T-AC Apr 94 
842-846 


Automata 

logical discrete event systs, Lyapunov stabil Passmo KM + T-AC Feb 
94 269-279 

timed discrete-event systs, supervisory control Brandin BA + T-AC 
Feb 94 329-342 
Automata; cf. Finite automata 
Autoregressive moving-average processes 

CARMA plants, stabilizing 1-0 receding horizon control Chtsci L + 
T-ACMar 94 614-618 

stochastic adaptive prediction/MRAC Wet Ren + T-AC Oct 94 
2047-2060 

Awards 

1993 George S Axelby Outstanding Paper Award given to I 
Kannellakopoulos, P V Kokotovic, and A S Morse Sam M K T-AC 
Feb 94 258 


B 

Bayes procedures 

sens fusion, decentralized detect optimal thresholds Irvmg WW + 

T AC Apr 94 835-838 

Bilinear systems 

bilinear systs , adaptive stabilization, least sq Xi Sun + 7 -AC Jan 94 
207-211 

bilinear systs, state space anal orthogonal series approach 
Paraskevopoulos P N + T-AC Apr 94 793-797 
finite-time optimal control of bilinear systs successive approx procedure 
Aganovic Z + T-AC Sep 94 1932-1935 
output feedback problem in lin systs, bilinear formulation Syrmos V l 
+ T-AC feb 94 410-414 

Biological control systems 

cat falling near-optimal nonholonomic motion planning Ternandes C 
+ T-AC Mar 94 450-463 

Biomechanics 

cat, falling near-optimal nonholonomic motion planning Fernandes C 
+ 1 -AC Mar 94 450-463 

Book reviews 

Adaptive Control Systems (lsermann, R, et al 1992) kanellakopoulos 
1 + T-AC Aug 94 MIS 

Applications of Lyapunov Methods in Stability (Halanay A and Rasvan 
V, 1993) Popov \ M T-AC Dec 94 2S26-2S27 
Applied Optimal Control and Lstimation' (I ewis I I 1992) kamtn 
EW T AC Aug 94 1773-1774 

Controlled and Conditioned Invariants in Linear System Iheory (Basilc 
G , and Marro, G , 1992) Schumacher JM T AC Ian 94 250-251 
Feedback Control Theory (Doyle, J etal 1992) Moore KL T-AC Jul 
94 1532-1534 

Handbook of Intelligent Control Neural 1 uzzy and Adaptive Approaches 
(White DA, and Sofge, DA Lds 1992) S amad T 7 -AC Jul 94 
1534-1535 

H w Control Problem A State Space Approach (Stoorvogcl A , 1992) 
Saben A T-AC Jan 94 252-254 

L mear Multivariable C ontrol Algebraic Analysis and Synthesis Methods 
(Vardulakis, A I G , 1991) Lewis FL T-AC Jul 94 1536 
Linear System Theory (Rugh W J 1993) Khalil Hk T-AC Dec 94 
2528-2529 

New Tools for Robustness of 1 inear Systems (Barmish BR 1994) T-AC 
Dec 94 2525-2526 

Nonlinear System Analysis, 2nd edn (Vidyasagar M 1993) Abed EH 
T-AC Jul 94 1535-1536 

Nonlinear Systems (Khalil H 1992) Grizzle JW T-AC Jan 94251-252 
Robot Dynamics and Control" (Spong, M W and Vidyasagar, M , 1989) 
PotaHR + T-AC Aug 94 1774-1776 

Boundary-value problems 

dyn syst simul, fast parallel recursive aggregation Tsai W K + T-AC 
Mar 94 534-540 

multipoint boundary value stochastic systs stationarity/reciprocity Ji 
Chen + T-AC May 94 1114-1116 

Brushless rotating machines 

brushless DC motors, feedback-1 in control charactn In-Joong Ha + 
T-AC Mar 94 673-677 


C 

Cascade systems 

cascaded nonlm systs, global robust stabilization lmura J-J + T-A 
May 94 1084-1089 

lin continuous-time systs, global stabilization, bounded control 
Sussmam HJ + T-AC Dec 94 2411-2425 
robust control design, cascade struct approach Bontvento C + T-A 
Apr 94 846-849 
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Stabilization of discrete-time nonlin systs, global Byrnes Cl + T-AC 
Jan 94 83-98 
Chaos 

adaptive control systs , chaotic behavior Gonzalez G A + T-AC Oct 94 
2145-2148 

Circuit noise; cf. Filter noise 
Circuits; cf. Sample-and-hold circuits 
C cpnmumcation channels 

optimal multicopy Aloha Wong EWM + T-AC Jun 94 1233-1236 
Communication network routing 

data network path formulated optimal routing time complexity Antonio 
JK 4 T-AC Feb 94 385-391 

determmistic/stochastic queuing networks anal Cheng-Shang Chang 
T-AC May 94 913-931 

distributed asynchronous routing, convergence rate Zhi-Quan Luo + 
T-AC May 94 1123-1129 

queuing networks anal, projection techs Moot Choo C huah T-AC Aug 94 
1588-1599 

Communication switching, cf. Packet switching 
Communication systems; cf. Data communication 
Communication system software 

distributed asynchronous routing convergence rate Zhi-Quan Luo + 
T-AC May 94 1123-1129 

Communication traffic 

deterministic/stochastic queuing networks anal Cheng-Shang Chang 
T-AC May 94 913-931 

optimal multicopy Aloha Wong F W M + T-ACJun 94 1233-1236 
queuing networks anal, projection techs Moot Choo Chuah T-AC Aug 94 
1588-1599 
Compensation 

CARMA plants, stabilizing 1-0 receding horizon control Chisci L + 
I-4C Mar 94 614-618 

continuous control sysl, digital equiv design methods comparison Hall 
SR T-AC Feb 94 420-421 

continuous time nonminimum phase lin systs gain margin improvement 
Wei )ong Yan + 1 -AC Nov 94 2347-2354 
diagonal decoupling dyn output feedback /const precompensator Eldem 
V T AC Mar 94 503-511 

discrete-time compensators, 17R Tadjine M + I-ACJun 94 1259-1262 
discrete-time uncertain systs ultimate boundedness control Lyapunov 
Blanchmi F T-AC Teh 94 428-431 

H * compensator design min order observers Hsu CS + T-AC Aug 94 
1679-1681 

Hcr-optim reduced order observer based controller Vtoorvogel A A + 
T-AC Teb 94 355-360 

input-output decoupling nonlm intcractor appl Dj Benedetto MD f- 
TALJun94 1246-1250 

nonlm Jeedback parametrically uncertain systs absol stabil Marquez 
HJ + T-AC Mar 94 664-668 

nonlin systs asymptotic model matching Di Benedetto MD + T-AC 
4ug 94 1539-1550 

nonlin systs asymptotic tracking necessary conditions Grizzle JW + 

T AC Sep 94 1782-1794 

nonlin uncertain systs tracking control Song YD + T-AC Sep 94 
1866-1871 

rational / 1 suboplimal compensators for continuous-time systs Blanchmi 
F + J-AC Jul 94 1487-1492 

rigid robotic manipulators, robust tracking control ManZhihong + T-AC 
Jan 94 154-159 

obust stabilization/perform , MRAC modeling error compensation Sun 
J + T-AC Mar 94 630-635 

robust strong stabilization via modified Popov controller synthesis Wang 
YW + T-AC Nov 94 2284-2287 

>ISO discrete-time systs preview tracking Halpern ME T-AC Mar 94 
589-592 

'mith predictor for controlling proc, integrator and long dead-time 
Astrom KJ + T-AC Feb 94 343-345 

late observer/feedback, compensators, comment Bender D J T-AC Feb 
94 447-448 

ncertain nonlm systs, struct invariance Castro-Lmares R + T-AC 
)ct 94 2154-2158 

‘br systs control Karl WC + T-AC Jan 94 222-226 
°'iplexity theory 

iscrete-time filters from high-order s-to-z mappings Schneider AM t + 
FAC Feb 94 435-441 

'n syst simul, fast parallel recursive aggregation Tsai WK + T-AC 
War 94 534-540 

^ uristics, rules of thumb, and the 80/20 proposition Yu-Chi Ho T-AC 
May 94 1025-1027 

1 e 8 rated probabilistic data assoc Musicki D + T-AC Jun 94 
, 237-1241 

1 tsq estim , efficientalgm Rafajlowicz E + T-ACJun 94 1241-1243 


robust anal, uncertainty value sets Eszter EG , *- T-AC Nov 94 
2315-2318 

single-machine scheduling, resource optimal control Cheng TCE +, 
T-ACJun 94 1243-1246 

worst case ident, H* model validation Guoxiang Gu> T*AC Aug 94 
1657-1661 

worst-case syst idem, time complexity Poolla K, + , T-AC May 94 
944-950 

H calc, computational complexity Braatz R.P + T-AC May 94 
1000-1002 

Computational geometry 

nonlm syst controllability distnbs / systs approx Ruiz A C + T-AC 
Jul 94 1473-1476 
Computation time 

data network path formulated optimal routing, time complexity Antonio 
JK + T-AC Feb 94 385-391 

path formulated optimal routing algm , time complexity Antonio JK 
+ T-AC Sep 94 1839-1844 
Computer communication; cf. Data communication 
Computer instructions 

quantized constrained systs, feedback control, appls to neuromorphic 
controllers design Sznaier M +■ T-AC Jul 94 1497-1502 
Computers; cf. Distributed computing, Parallel processing 
Continuous-time systems 

constrained continuous-time systs with cone props , pos invariant sets 
Tarbounech S + T-AC Feb 94 401-405 
continuous direct adaptive control, saturation inpul constraint Cishen 
7hang + T-AC Aug 94 1718-1722 

continuous-time adaptive decoupling control design Ortega R 4 T-AC 
Aug 94 1639-1643 

continuous time nonminimum phase lin systs , gam margin improvement 
Wei-Yong Yan + T-AC Nov 94 2347-2354 
continuous time syst s , transfer fns , model errors Schoukens J + T-AC 
Aug 94 1733-1737 

corrections to “On undershoot in SISO systems” (Mar 94 578-581) de la 
BaraS BAT T-AC Aug 94 1771 

corrections to “Positively invariant sets for constrained continuous-time 
systems with cone properties” (Feb 94 401-405) Tarbounech S T-AC 
Aug 94 1771 

deadbeat ripple-tree tracking Jetto L T-AC Aug 94 1759-1764 
delta-operator formulated discrete-time approx Premaratne K + T-AC 
Mar 94 581-585 

descriptor systs , lin feedback closed-loop Stackelberg strategy HuaXu 
+ T-AC May 94 1097-1102 

I in continuous-time systs, global stabilization, bounded controls 
Suwmann H J + T AC Dec 94 2411-2425 
nonminimum phase first-order continuous-time systs , adaptive 
stabilization Lozano R + T-AC Aug 94 1748-1751 
perturbed lin systs upper covariance bounds Bolzern P + T AC Mar 
94 623-626 

quadratic stabilization of continuous time systs Mahmoud MS + T-AC 
Oct 94 2135-2139 

rational L ] suboplimal compensators for continuous-time systs Blanchmi 
F + T-AC Jul 94 1487-1492 

SISO systs , step response, undershoot Leon de la Barra S B A T-AC 
Mar 94 578-581 

time delay estim in continuous LTl systs Tuch J + T-AC Apr 94 
823-827 

uncertain continuous-time implicit systs, regional pole placement, 
robustness Chun-Hsiung Fang + T-AC Nov 94 2303-2307 

Controllability 

adaptive control systs , transient bounds Zhuquan Zang + T-AC Jan 94 
171-175 

arbitrary supervisory control, finite state supervisors Sreenivas RS 
T-AC Apr 94 856-861 

cat falling, near-optimal nonholonomic motion planning, appl 
Fernandes C , + T-AC Mar 94 450-463 
discrete-time q-Markov cover generation, inverse Lyapunov method 
Sreeram V -4 T-AC Feb 94 381-385 
feedback control by multirale PWM Khayatian A + T-AC Jun 94 
1292-1297 

impulse differential lin systs , constrained controllability Benzaid Z + 
T-AC May 94 1064-1066 

infinite horizon LQ control, soln approx Sc hoc hetman IE + T-AC Mar 
94 596-601 

multirate sampled-data systs struct props Longhi S T-AC Mar 94 
692-696 

multiscale systs , Kalman filters, Riccati eqns Chou KC +, T-AC Mar 
94 479-492 

multivariable nonlm controller vibr damping Kanestrom RK,+, T-AC 
Sep 94 1925-1928 
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nonlin. syst. controllability distribs,/ systs. apprcw. Rut, A.C., + , T-AC 
Jul 94 1473-1476 

pos. realnetis conditions, charactn. assuming controllability. Weiss, H., + , 
T-AC Mat 94 540-544 

quantized constrained systs., feedback control, appls. to neuromorphlc 
controllers design Sznaier, M, + , T-AC Jut 94 1497-1502 
rigid spacecraft systs. controllability, central gravitational field. Lian, 
K,-Y„ + , T-AC Dec 94 2426-1441 

SfSO LTI discrete-time systs., par am.-adaptive controller. Kreisselmeier, 
G., T-AC Sep 94 1 819-1826 

state deadbeat control problem, general soln. Eldem, V, + , T-AC May 94 
1002-1006 

structural wave control, reduced-order model. Quart Wang, + , T-AC Aug 
94 1711-1713 

supervisory control, blocking and controllability of Petri nets. Gtua, A , 
+ , T-AC Apr 94 818-823 

switched-modc power converters, multirate modeling and control design 
Khayatian , A., + , T-AC Sep 94 1848-1852 
timed discrete-event systs., supervisory control. Brandin, B.A., +, T-AC 
Feb 94 329-342 

time-invariant I in. systs. controliability/obscrvability. Katning Wang, + , 
T-AC Jul 94 1443-1447 
Control systems 

matching principle for systs , restricted inputs Rutland, N K, T-AC Mar 
94 550-553 

Control systems; cf. Specific topic 
Convergence of numerical methods 

2D stochastic approx, convergence/diff. eqn limit Dye-Jyun Ma , + , 
T-AC Jul 94 1439-1442 

data network path formulated optimal routing, time complexity. Antonio, 
J.K., + , T-AC Feb 94 385-391 

Hod control syst ident, least sq methods, comment Uvstone, MM , + , 
T-AC Jul 94 1531 

noncooperative equilibria computation, relax, algms Uryas'ev, S, + , 
T-ACJun 94 1263-1267 

parab. systs. direct adaptive control. Keum Shtk Hong, + , T-AC Oct 94 
2018-2033 

path formulated optimal routing algm , time complexity Antonio, J K, 
+ , T-AC Sep 94 1839-1844 

recursive ident., nonlin. Wiener model. Wigren, T, T-AC Nov 94 
2191-2206 

sing -free multivariable MRAC. Moctezuma, RG, + , T-AC Sep 94 
1856-1860 

Cost-optimal control 

failure prone mfg syst hedging point policies. Jian-Qiang Hu, + , T-AC 
Sep 94 1875-1880 

filtering/optimal cost control, uncertain lin systs Petersen, I.R, +, T-AC 
Sep 94 1971-1977 
Covariance analysis 

perturbed lin systs , upper covariance bounds Bolzern, P, +, T-AC Mar 
94 623-626 
Covariance matrices 

lin. discrete-time systs., optimum lin recursive cstim Carazo, A H,+ , 
T-AC Aug 94 1636-1638 

LQQ controllers, stable, weighting and covariance matrices. Halevi, Y, 
T-AC Oct 94 2104-2106 

nonminimum phase first-order continuous-time systs., adaptive 
stabilization. Lozano, R, + , T-AC Aug 94 1748-1751 
time-varying params., bounded error ident. Bittanti, S, + , T-AC May 94 
1106-1110 


D 

Data communication 

distributed asynchronous routing, convergence rate. Zhi-Quan Luo, + , 
T-AC May 94 1123-1129 

network path formulated optimal routing, time complexity. Antonio, J.K., 
+ , T-AC Feb 94 385-391 
Data transmission; cf. Data communication 
DC motors 

brushless DC motors, feedback-lin. control charactn In-Joong Ha, + . 
T-AC Mar 94 673-677 

LQ optimal regulators, stabil. robustness Dohyoung Chung, +, T-AC Aug 
94 1698-1702 

ripple free sampled-data robust servomechanism controller, exponential 
hold. Yung-Chun Wu, + , T-ACJun 94 1287-1291 

Decision-making 

2D stochastic approx., convergence/d iff eqn. limit. Dye-Jyun Ma, + . 
T-AC Jul 94 1439-1442 

heuristics, rules of thumb, and the 80/20 proposition Yu-Chi Ho, T-AC 
May 94 1025-1027 


risk-averse decentralized discrete-time LEQG games. Srikant, R, T-AC 
Apr 94 861-864 

Decision-making; cf. Bayes procedures; Signal detection 
Decoupling of systems 

continuous-time adaptive decoupling control design. Ortega , R. t +, T-AC 
Aug 94 1639-1643 

diagonal decoupling, dyn output feedback /const, precompensator. Eldem, 
V„ T-AC Mar 94 503-511 

disturbance decoupling, constrained Sylvester eqns. Syrmos, VL, T-AC 
Apr 94 797-803 

input-output decoupling, nonlin. interactor appl. Dt Benedetto, M.D, + 
T-ACJun 94 1246-1250 

input-output lin., state equivalence and decoupling. Jn-Joong Ha, +, T-AC 
Nov 94 2269-2274 

row-by-row stable decoupling, static state feedback, struct soln Martinez 
Garcia, JC.+, T-AC Dec 942457-2460 
Delay estimation 

continuous LTI systs. Tuch, J., + , T-AC Apr 94 823-827 
Delay systems 

contact stabil. of simple posn. controllers, effect of time delay and discrete 
control. Fiala, J., +, T-AC Apr 94 870-873 
corrections to “An improved Razumikhin-type theorem and its 
application” (Apr 94 839-841). Xu, B„ T-AC Nov 94 2368 
DEDS, observability, delay Bose, S, ■+ , T-AC Apr 94 803-806 
delay-independent exponential stabil criteria for time-varying discrete 
delay systs Wu, JW,+, T-AC Apr 94 811-814 
delay systs. approx., Laguerre formula Lam, J, T-AC Jul 94 1517-1521 
distributed delay lin systs., feedback stabilisation Feng Zheng, + , T-AC 
Aug 94 1714-1718 

D-stabil., robust, for I in uncertain discrete delay systs Te-Jen Su, +, T-AC 
Feb 94 425-428 

dyn. lin time-delay systs , robust controller design Mahmoud, MS, + 
T-AC May 94 995-999 

Hao-optimal discrete-time fixed-point and fixed-lag smoothing, game 
theory approach Theodor, Y, + , T-AC Sep 94 1944-1948 
lin syst approx., time-scaling factor, Laguerre models Wang, L, +, T-AC 
Jul 94 1463-1467 

nonlin. differential-delay systs, absol stabil Gil, Ml, T-AC Dec 94 
2481-2484 

quadratic stabilization of continuous time systs Mahmoud, MS, + , T-AC 
Oct 94 2135-2139 

Razumikhin-type theorem, improved, appls BugongXu, f , T-AC Apr 94 
839-841 

robust stabilization of syst, control delays Kojima, A . + T-AC Aug 94 
1694-1698 

robust stabil, quasipolynomial convex directions/testing sets Kharitonov 
VL.+, T-AC Dec 942388-2397 

stabil of lin systs , delayed perturb Trinh, H, +, T-AC Sep 94 1948-1951 
state delayed systs , memoryless H* controllers Joon Hwa Lee, + , 7 -A( 
Jan 94 159-162 

time delay estim in continuous LTI systs Tuch, J, + , T-AC Apr 94 
823-827 

time lag nonlin systs , vibr. control. Lehman, B,+, T-AC May 94 898-912 
uncertain lin systs , delay depend , robust stabil BugongXu, T-AC Nov 
94 2365 

uncertain nonlin. interconnected systs, time-varying state delay 
stabilizing control Mahmoud, MS, T-AC Dec 94 2484-2488 
Descriptor systems; cf. Singular systems 
Detection; cf. Fault diagnosis; Signal detection 
Diagnosis; cf. Fault diagnosis 
Difference equations 

three-block generalized/std Riccati eqns comparison Darouach, M, 1 
T-AC Aug 94 1755-1758 

Differentiability 

control algms., automatic differentiation appl Campbell, SL , + , T- 1C 
May 94 1047-1052 

flexible joint robots, global regulation, approx, diff Kelly, R, + , T- ’( 
Jun 94 1222-1224 

recursive prediction error algm log likelihood fn. derivatives. Hoof r 
M,A., T-AC Mar 94 662-664 
Differential equations 

2D stochastic approx., convergence/d iff. eqn limit Dye-Jyun Ma, 
T-AC Jul 94 1439-1442 

control algms., automatic differentiation appl. Campbell, SL, + , 7 ( 
Afoy 94 1047-1052 

finite-time optimal control of bilinear systs., successive approx. procctL e 
Aganovic, Z, + , T-AC Sep 94 1932-1935 
impulse differential lin. systs., constrained controllability. Benzaid, 2 
T-AC May 94 1064-1066 

lin. multivariable systs,, fund notion of equivalence. Pugh, AC,+,l l( 
May 94 114M145 
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Lyapunov stabil. theory of nonsmooth systs ShevttzSep 94 
1910-1914 

polynomial differential eqns., domain of attraction estim, Levin, A, T-AC 
Dec 94 2471-2475 

time lag nonlin. systs., vibr control Lehman, B, + , T-AC May 94 898-912 
time-varying functional diff. eqns., delay independent stabil 
conditions/decay estim Lehman, B, + , T-AC Aug 94 1673-1676 
Differential equations; cf. Nonlinear differential equations; Partial 
differential equations; Riccati equations, differential; Stochastic 
differential equations 
Differential geometry 

freq. response arcs, stable polynomial, convexity Keqin Gu, T-AC Nov 94 
2262-2265 
Diffusion processes 

almost sure sample stabil of nonlin stochastic dyn systs Zhi Yu Zhang, 
+ . T-AC Mar 94 560-565 

Digital control 

continuous control syst., digital equiv design methods comparison Hall, 
SR, T-AC Feb 94 420-421 
Digital filters; cf. Recursive digital filters 
Digital image processing; cf. Image processing 
Discrete-event systems 

arbitrary supervisory control, finite slate supervisors Sreemvas, RS, 
T-AC Apr 94 856-861 

corrections to “Control of vector discrete-event systems I - The base 
model” (Aug 93 1214-1227) Li, Y, + , T-AC Aug 94 1771 
corrections to “Lyapunov stability of a class of discrete-event systems” 
(Feb 94 269-279) Pass mo, KM, + , T-AC Jul 94 1531 
DEDS, observability, delay Bose, S, + , T-AC Apr 94 803-806 
DEDS, steady state fn , nondiffcrentiability Shapiro, A , + , T-AC Aug 94 
1707-1711 

discrete event systs, decentralized control, normality theorem Lin, F, +. 
T-AC May 94 1089-1093 

logical discrete event systs , Lyapunov stabil Passino, KM, +, T-AC Feb 
94 269-279 

supervisory control, blocking and controllability of Petri nets Giua, A , 
+ , T-AC Apr 94 818-823 

timed discrete-event systs , supervisory control Brandm, B A , + . T-AC 
Feb 94 329-342 

variable lookahead supervisory control, state inform Hadj-Alouane, N B, 
+ , T-AC Dec 94 2398-2410 

vector discrete-event syst controller synthesis Yong LI, +, T-AC Mar 94 
512-531 

Discrete-time filters 

discrete-time filters from high-order s-to-7 mappings Schneider , AM + , 
t-ACbeb 94 435-441 
Discrete-time systems 

adaptive control systs , chaotic behavior Gonzalez, G A ,*■, T-AC Oct 94 
2145-2148 

adaptive nonlin syst, least-sq estimator Kanellakopoulos, 1, T-AC Nov 
94 2362-2365 

adaptive predictive controller, robustness anal Clarke, D W, + , T-AC 
May 94 1052-1056 

asynchronous sampled data syst control, design Voulgaris, P, T-AC Jul 
94 1451-1455 

block multirate input-output model for sampled-data control systs 
Jakubowski, AM,* , T-AC May 94 1145-1147 
certainty equivalence control, dyn games James, MR, T-AC Nov 94 
2321-2324 

compensators, LTR Tadjme, M, + , T-AC Jun 94 1259-1262 
complex discrete-time control systs., simul automation Ellis, RD, + , 
T-AC Sep 94 1795-1801 

contact stabil of simple posn controllers, effect of time delay and discrete 
control FialaJ, +, T-AC Apr 94 870-873 
continuous control syst, digital equiv design methods comparison Hall, 
SR, T-AC Feb 94 420-421 

continuous time nonminimum phase lin systs , gain margin improvement 
Wei-Yong Yan, + , T-AC Nov 94 2347-2354 
corrections to “Robust stability and performance via fixed-order dynamic 
compensation’ The discrete-time case” (May 93 776-782) Haddad, 
WM , +, T-AC Aug 94 1772 

deadbeat npple-free tracking Jetto, L, T-AC Aug 94 1759-1764 
DEDS, steady stale fn, nondifferentiability Shapiro, A, +, T-AC Aug 94 
1707-1711 

delay-independent exponential stabil criteria for time-varying discrete 
delay systs Wu, J W, + . T-AC Apr 94 811-814 
delta-operator formulated discrete-time approx. Premaratne, K., + , T-AC 
Mar 94 581-585 

lelta-operator formulated real polynomials, tabular method for 
determining root distrib Premaratne, K, + , T-AC Feb 94 352-355 
>stabil„ robust, for lin. uncertain discrete delay systs. Te-Jen Su, +, T-AC 
Feb 94 425-428 


dual-rate control syst., sampled-data, stabil. robustness. Tongwen Chen, 
T-AC Jan 94 164-167 

dyn. syst. simul., fast parallel recursive aggregation. Tsai, WX, + , T-AC 
Mar 94 534-540 

exponential stabil., observer design. Attken, V.C, +* , T-AC Sep 94 
1959-1962 

extreme-point robust stabil., discrete-time polynomials Perez, E, + , 
T-AC Jul 94 1470-1472 

feedback control by multirate PWM Khavatian, A., + , T-AC Jun 94 
1292-1297 

fin space approach. Yamamoto, } , T-AC Apr 94 703-713 
generalized q-Markov cover models Sreeram, V. t T-AC Dec 94 2502-2505 
Hammerstein syst., polynomial ident Lang Zi-Qiang, T-AC Mar 94 
569-573 

EL control of discrete-time uncertain systs Geromel, JC, + , T-AC May 
94 1072-1075 

H® control problem, strictly proper meas feedback Stoorvogel, A A , + , 
T-AC Sep 94 1936-1939 

H®-optimal discrete-time fixed-point and fixed-lag smoothing, game 
theory approach Theodor, Y, + , T-AC Sep 94 1944-1948 
Hu type problem Hayakawa, Y, + , T-AC Nov 94 2278-2284 
mtersample props, robustness, and sensitivity, quantitat /qualitat anal. 

Feuer, A , + , T-AC May 94 1042-1047 
Kalman filtering for uncertain discrete-time systs LihuaXie, +, T-AC Jun 
94 1310-1314 

lin discrete syst pole assignment. Benzaouia, A , T-AC Oct 94 2091-2095 
lin discrete time syst, perturb., comment Eslami, M, T-AC Aug 94 
1768-1769 

lin discrete-time systs , optimum lin recursive estim. Carazo, AH,*-, 
T-AC Aug 94 1636-1638 

lin systs., pos invariant sets De Santis, E . T-AC Jan 94 245-249 
LTI discrete-time systs , robust stabil anal Karan, M, + , T-AC May 94 
991-995 

Markovian jump lin systs , lin. min MSE estim Costa, OL V, T-AC Aug 
94 1685-1689 

MIMO stochastic discrete syst state estim Liu Danyang, * , T-AC Oct 94 
2087-2091 

mixed Hj/Hoo control, exact convex optim based soln Sznaier, M, T-AC 
Dec 94 2511-2517 

model matching, suboptimal perfect, with noise, MRACS Mutoh, Y., + , 
T-AC Feb 94 422-425 

multichannel gain margin improvement, sampled-data hold fns Chang 
Yang, + , T-AC Mar 94 657-661 

multirate sampled-data systs , H 2 -optimal design Qut, L , * , T-AC Dec 
94 2506-2511 

multirate sampled-data systs struct props Longhi, S, T-AC Mar 94 
692-696 

multivariable nonminimum phase discrete-time systs LTR procedure. 

Leon de la Barra S, BA, T-AC Mar 94 574-577 
nonlin control systs. design via smooth state feedback Wei Lin, + , T-AC 
Nov 94 2340-2346 

nonlin discrete-lime systs, input-to-state stabil condition and global 
stabilization Kazakos, D , + , T-AC Oct 94 2111-2113 
nonlin syst control, stabil property Jie Huang, + , T-AC Nov 94 
2307-2311 

nonlin systs., stabil, time delayed feedback Xueshan Yang, + , T-AC Mar 
94 585-589 

nonlin syst state estim , conditionally min algm Pankov, A R , + , T-AC 
Aug 94 1617-1620 

optimal dyn output feedback for nonzero set point regulation, 
discrete-time case Haddad, WM, + , T-AC Sep 94 1921-1925 
optimal filtering, stochastic discrete-time systs , unknown inputs Borisov, 
A V, +, T-AC Dec 94 2461-2464 

partially obs discrete-time nonlin systs, risk-sensitive control and dyn. 

games James, MR., + , T-AC Apr 94 780-792 
periodically time-varying lin discrete-time plants, decentralized control 
Khargonekar, P P , + , T-AC Apr 94 877-882 
periodic discrete-time Riccati cqn, num soln Hench, JJ,+, T-AC Jun 
94 1197-1210 

pos of dyn systs, nonpositivc coefT matrices d'Alessandro, P, +, T-AC 
Jan 94 131-134 

q-Markov cover generation, inverse Lyapunov method Sreeram, V., + , 
T-AC Feb 94 381-385 

q-Markov cover models Sreeram, V, +, T-AC May 94 1102-1105 
range tracking loops, large deviation anal. Dembo, A , + , T-AC Feb 94 
360-364 

rational L ] suboptimal compensators for continuous-lime systs Blanchim, 
F„ + , T-AC Jul 94 1487-1492 

ripple free sampled-data robust servomechanism controller, exponential 
hold. Yung-Chun Wu, + , T-AC Jun 94 1287-1291 
risk-averse decentralized discrete-time LEQG games Srikant , R, T-AC 
Apr 94 861-864 


f ( heck author entry for coauthors 


f Check author entry for subsequent corrections/comments 



JEEE T-AC 1994 INDEX-* 16 


robust adaptive pole placement control. Weyer, E., + , T-AC Aug 94 
1665-1671 

robust control design, cascade struct, approach. Bontvento, C, + , T-AC 
Apr 94 846-849 

robust servomech. control, exponential hold, optimal algm. Yung-Chun 
Wu> + , T-AC Jan 94 112-117 

sampled data control systs., H«type problem. Hayakawa, Y., +, T-AC Nov 
94 2278-2284 

sampled-data systs., wordlength constraint, stabil./perform. Fiailho, LI, 
+ , T-AC Dec 94 2476-2481 

S1SO discrete-time systs. preview tracking. Halpern, M.E., T-AC Mar 94 
589-592 

SISO LTI discrete-time systs., param.-adaptive controller. Kretsselmeier, 
G., T-AC Sep 94 1819-1826 

stabil. domains, assigned root location polynomials, convexity props. Test, 
A., +, T-AC Mar 94 668-672 

stabilization of discrete-time nontin systs., global. Byrnes, Cl, + , T-AC 
Jan 94 83-98 

stochastic control, lin.-exponential-Gausslan. Chih-HaiFan, +, T-AC Oct 
94 1986-2003 

stochastic lin. discrete systs., state estim. algm. Ahmed, M.S., T-AC Aug 
94 1652-1656 

time varying systs. discrete time adaptive controller, global stabil 
Radenkovic, MS, + , T-AC A lav 94 2357-2361 
time-varying systs, robust adaptive controller. Changyun Wen , f~/4C Afoy 
94 987-991 

uncertain discrete-time systs., variable struct, control design. 

Myszkorowskt, + , T-AC Nov 94 2366-2367 
uncertain systs., sampled-data controller design Dolphus, R.M, T-AC May 
94 1036-1042 

uncertain systs., ultimate boundedness control, Lyapunov. Blanchini, F, 
T-AC Feb 94 428-433 

unstable plants, sampled-d&ta observers, generalized holds. Haddad, 
W.M., + , T-AC Jan 94 229-234 

VSS control design for uncertain discrete-time systs Wen-June Wang, + , 
T-AC Jan 94 99-102 

Distributed computing 

distributed asynchronous routing, convergence rate Zhi-Quan Luo, + , 
T-AC May 94 1123-1129 

random proc. failures and distributed aigm. Papavassilopoulos, G.P, 
T-AC May 94 1032-1036 

Distributed control 

decentralized stable factors and parameterization of decentralized 
controllers. Date, R, +, T-AC Feb 94 347-351 
eigenstnicture assignment by decentralized output feedback 19~. 

Guang-Ren Duan, T-AC May 94 1009-1014 
large scale interconnected systs. with delays, decentralized stabilization. 
Zhongzhi Hu. T-AC Jan 94 180-182 

multivariable syst. decentralized control, closed-loop props. Campo, PJ, 
+ , T-AC May 94 932-943 

periodically time-varying lin. discrete-time plants, decentralized control 
Kharganekar, P P, + , T-AC Apr 94 877-882 
prod, control methods, distributed, stabil. and perform. Shanjhia, A., T-AC 
Apr 94 725-737 

risk-averse decentralized discrete-time LEQG games. Srikant, R, T-AC 
Apr 94 8 61-864 
Distributed decision-making 

detect, networks with multiple event struct., optim. Pete, A , +, T-AC Aug 
94 1702-1707 

Distributed estimation 

multiresolutional distributed filtering. Lang Hong, T-AC Apr 94 853-856 
Distributed-parameter systems 

adaptive estim., persistence of excitation Demetriou, M.A., + , T-AC May 
941117-1123 

evol. eqns., parab. partial DE, exponential stabil. Keum-Shik Hong, + , 
T-AC Jul 94 1432-1436 

multibody flexible systs., elastic mode estim. Karray, F, + , T-AC May 
94 1016-1020 

multi-DOF nonlin. damping model, spectral dens. Weijian Zhang, T-AC 
Feb 94 406-410 

parab. systs, direct adaptive control. Keum Shik Hong, + , T-AC Oct 94 
2018-2033 

SISO-distributed plants, optimal mixed sensitivity. Flamm, D,S., +, T-AC 
Jun94 1150-1165 
Duality 

guaranteed param. estim problem with uncertain stats., Kalman-Bucy 
filter accuracy. Matasov, A J, T-AC Mar 94 635-639 
lin. min.-phase plants, stabilizing controllers parameterization. Gloria, 
JJ. t +, T*AC Feb 94 433-434 

LP, timed marked graphs eval. Yamada, T, +, T-AC Mar 94 696-698 
multirate sampled-data systs struct, props. Longhi , S, T-AC Mar 94 
692*696 


observers, lin., parameterization and design. Ding, X, + , T-AC Aug 94 
1648-1652 

uncertain systs. stabilizing control design, quasiconvex optim. Keqtn Gu, 
T-AC Jan 94 127-131 
Dynamic programming 

finite queues, priority-discarding policies. Petr, D.W., T-AC May 94 
1020-1024 

partially obs. discrete-time nonlin. systs., risk-sensitive control and dyn 
games. James, MR., +, T-AC Apr 94 780-792 

prod, control, multiple time scales approach. Jiang, J., + , T-AC Nov 94 
2292-2297 

risk-averse decentralized discrete-time LEQG games. Srikant, R, T-AC 
Apr 94 861-864 


E 

Eigenstructure assignment 

eigenstructurc assignment by decentralized output feedback. Guang-Ren 
Duan, T-AC May 94 1009-1014 

fault detect, filters, eigenstructurc assignment. Jaehong Park, +. T-AC Jul 
94 1521-1524 

observer design, eigenvalue assignment, residual generation. Magm , J.-F , 
+ , T-AC Feb 94 441-447 

output feedback problem in lin. systs., bilinear formulation. Syrmos, V L, 
+ , T-AC Feb 94 410-414 

Eigenstructure assignment; cf. Pole assignment; Zero assignment 
Eigenvalues/eigenfunctions 

2D general discrete state-space models, eigenvalues calc. Zou Yun, + , 
T-AC Jul 94 1436-1439 

continuous alg Riccati eqn. eigenvalues Komaroff, N„ T-AC Mar 94 
532-534 

distributed delay lin. systs , feedback stabilisation Feng Zheng, + , T-AC 
Aug 94 1714-1718 

eigenstructure assignment by decentralized output feedback Guang-Ren 
Duan, T-AC May 94 1009-1014 

fault detect filters, eigenstructure assignment Jaehong Park. +, T-AC Jul 
94 1521-1524 

interval LTI systs., stabilization. Kehui Wei, T-AC Jan 94 22-32 
large sparse Lyapunov eqns., approx soln Gudmundsson, T, + , T-AC 
May 94 1110-1114 

nonnegative feedback control, oscills. stabilisation Zaslavsky, B, T-AC 
Jun 94 1273-1276 

observer design, eigenvalue assignment, residual generation Magm, J -F 
+ , T-AC Feb 94 441-447 

output feedback problem in lin systs., bilinear formulation Syrmos. V L 
4 . T-AC Feb 94 410-414 

prod control methods, distributed, stabil and perform Sharlfma,A. T-AC 
Apr 94 725-737 

Riccati eqn , discrete-time alg., closed loop eigenvalues Lm-Zhang Lu 
4- ,T-ACAug94mi-mS 

spectral syst. finite-dimens. approx Erickson, M.A , 4- , T-AC Sep 94 
1904-1909 

structurally uncertain systs., optimal Lyapunov fns Olas, A , T-AC Jan 94 
167-171 

tridiagonal symmetric interval matrices, eigenvalues. Commercon, J ( 
T-AC Feb 94 377-379 

uncertain continuous-time implicit systs., regional pole placement. 

robustness. Chun-Hsiung Fang, + , T-AC Nov 94 2303-2307 
unknown input observers design Darouach, M, T-AC Mar 94 698-699 
Error analysis 

bounded-error tracking of time-varying params Piet-I.ahanier, H, 4 
T-AC Aug 94 1661-1664 

MRAC perform, anal./improvement, new tracking error criteria Dattam 
A., + , T-AC Dec 94 2370-2387 

random param. tracking, robust algm. Juditsky, A., + , T-AC Jun t} 4 
1211-1221 

recursive prediction error algm. log likelihood fn derivatives. Hook / 
M.A., T-AC Mar 94 662-664 

time-varying params., bounded error ident. Bittanti, S, + , T-AC May 94 
1106-1110 
Estimation 

book review; Applied Optimal Control and Estimation" (Lewis, F 
1992). Kamen, E.W., T-AC Aug 94 1773-1774 
model struct, selection test, instrumental variable, statist, props b " 
Nghia Duong, +, T-AC Jan 94 211-215 
nonlin. syst. H« control, output-feedback based. Lu, W.-M., +, T-AC I 1 
94 2517-2524 

polynomial differential eqns., domain of attraction estim. Levin , A., T ' c 
Dec 94 2471-2475 
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Estimation; cf. Delay estimation, Filtering, Innovations methods (stochastic 
processes), Maximum-likelihood estimation, Mean-square-error 
methods, Nonparametric estimation, Parameter estimation, Recursive 
estimation, Smoothing Methods 


F 

Failure analysis 

mfg syst, failure prone hedging point Jian-Qiang Hu + T-AC Sep 94 
1875-1880 

Failure analysis; cf. Fault diagnosis. Reliability 
Fault diagnosis 

failure detect/isolation/accommodation syst Chta-ChiTsui T-AC Nov 94 
2318-2321 

filters fault detect type eigcnstructurc assignment Jaehong Park + 
7-AC Jul 94 1521-1524 

Fault tolerance 

failure detect /isolation/accommodation syst Chia-Chi Tsui T-AC Nov 94 
2318-2321 
Feedback systems 

2D syst model ol Roesser, dccomp Afacan T + T-AC Nov 94 
2261-2262 

approx feedback lin , stabil Kwanghee Nam + T-AC Nov 94 231 1-2314 
attitude control problem, comment Fjellstad O h + T-AC Mar 94 
699-700 

book review, Feedback Control lheory (Doyle J et al, 1992) Moore 
KL 7 AC Jul 94 1532-1534 

brushless DC motors, feedback-1 in control charactn In Joong Ha + 
7-AC Mar 94 673-677 

deterministic nonlin syst ident Shir Kuan Lin 7 AC Sep 94 1886-1893 
floating platform modeling and control Damen A AH 4 T-AC May 94 
1075-1078 

Gaussian stochastic control syst tuning van Schuppen J H T AC Nov 94 
2178-2190 

H, multiob|cctive robust control infinite-horizon Iheodor Y 4 T AC 
Oct 94 2130-2134 

jump-Markov systs stabilizing control law Dufour F ♦ T AC Nov 94 
2354 2357 

MRAC transient perform improvement by feedback Datta A 4 T AC 
Sep 94 1977 1980 

Nyquist envelope ot interval plant family Hollot CV + I-AC 7 eb 94 
391-396 

observers for nonlin systs in steady state Hunt L R 4 7-AC Oct 94 
2113-2118 

quadratic stabilization of continuous time systs Mahmoud MS + T AC 
Oct 94 2135-2139 

rigid robotic manipulators robust tracking control Man7hihong 4 T AC 
Jan 94 154-159 

slowly varying and I TI systs ident and uncertainty principles lames 
G * / AC Sep 94 1827-1838 

spectral syst (mitc-dimens approx Frickson MA 4 T AC Sep 94 
1904 1909 

switched-modc power converters multirate modeling and control design 
hhayatian 4 + T-AC Sep 94 1848-1852 
tip mass/cablc syst stabilization Morgul O + T-AC Oct 94 2140-2145 
Feedback systems, cf. Output feedback State feedback 
Feedforward neural networks 

continuous-time nonlin systs adaptive control by neural nets Fu-Chuang 
Chen 4 T AC Jun 94 1306-1310 
1 eedforward systems 

floating platform modeling and control Damen A A H + T-AC May 94 
1075-1078 

Filtering 

fault detect filters, cigenstructure assignment Jaehong Park + T-AC Jul 
94 1521-1524 

flexible |Oint robots global regulation approx diff Kelly R + T-AC 
Jun 94 1222-1224 

Kalman filtering for uncertain discrete-time systs LihuaXie + T AC Jun 
94 1310-1314 

I in discrete-time systs optimum lin recursive estim Carazo A H 4 
T-AC Aug 94 1636-1638 

M1MO stochastic discrete syst state estim LiuDanyang 4 7-AC Oct 94 
2087-2091 

modified EKF Ahmed N U + T-AC Jun 94 1322-1326 
multiresolutional distributed filtering Lang Hong T-AC Apr 94 853-856 
optimal filtering, stochastic discrete-time systs , unknown inputs Bomov 
A V 4 T-AC Dec 94 2461 -2464 

stabil domains, assigned root location polynomials, convexity props Tesi 
A + T-AC Mar 94 668-672 

stochastic adaptive prediction/MRAC Wet Ren + T-AC Oct 94 
2047-2060 


stochastic lin discrete systs , state estim algm Ahmed MS T-AC Aug 
94 1652-1656 

three-block generalized/std Riccati eqns comparison Darouach M + 
T-AC Aug 94 1755-1758 

time-variani displacement struct and interpolation problems Sayed A H 
+ T-AC May 94 960-976 
Filtering; cf. Estimation, Nonlinear filtering 
Filter noise 

modified EltF Ahmed NU + T AC Jun 94 1322-1326 
Filters; cf. Adaptive filters, Discrete-time filters 
Finite automata 

discrete event systs , decentralized control, normality theorem Lm f + 
T-AC May 94 \ 089-1093 
Finite wordlength effects 

samplcd-daia systs , wordlength constraint stabil /perform Ftailho JJ 
+ T-AC Dec 94 2476-2481 
Flexible manufacturing systems 

I MS, FCFS scheduling policy Setdman TI T-AC Oct 94 2166-2171 
mfg scheduling syst regulator stabilization tech Humes C Jr T-AC 
Jan 94 191-196 
Flexible structures 

flexible joint robots with uncertain params and disturbances, tracking 
control Tomei P T-AC May 94 1067-1072 
ident state-space freq domain approach Bayard D S 7-AC Sep 94 
1880-1885 

inullibody flexible systs elastic mode estim Karray F + 7-AC May 
94 1016-1020 

nonlin uncertain systs tracking control Song YD + T-AC Sep 94 
1866-1871 

Flight control; cf. Space vehicle control 
Flow control 

zero sum Markov games Altman F T-4C Apr 94 814-818 
Flow control, cf. Packet switching 
Force control 

constrained robots force/motion control Grabbe MT + 7-AC Jan 94 
179 

motion/force control robust with nonholonomic constraints Chun Yi Su 
4 T AC Mar 94 609-614 

robotic arms, repositioning control by learning Lucibello P T-AC Aug 
94 1690-1694 

robot manipulators compliant force/posn regulation Chtavertm S 4 
7-AC Mar 94 647-652 

robot variable struct control schemes Bin Yao + T-AC Teb 94 371-376 
stick-slip avoidance PD control Dupont PE I-AC May 94 1094-1097 
tip mass/cable syst stabilization Morgul O 4 T-AC Oct 94 2140-2145 
Forecasting, cf 1 oad forecasting 
Formal languages 

discrete event systs , decentralized control normality theorem lin F 4 
T AC May 94 1089-1093 

supervisory control blocking and controllability of Petri nets Giua A 
4 T AC Apr 94 818-823 

timed discrete-event systs supervisory control Brandm BA + T-AC 
Teb 94 329-342 

Fourier transforms 

continuous coprime factors Treil S T-AC Jun 94 1262-1263 

Frequency-domain analysis 

delay systs approx , Laguerre formula Lam J T-AC Jul 94 1517-1521 
H°o control syst ident least sq methods comment Livstone MM 4 
T-AC Jul 94 1531 

ident family of norms Massoumma M A 4- 7-AC May 94 1027-1031 
ident state-space freq domain approach Bayard DS T-AC Sep 94 
1880-1885 

intersampic props, robustness and sensitivity quantltat/qualitat anal 
Feuer A + T-AC May 94 1042-1047 
interval lin control syst robust parametric design Keel LH + T-AC 
Jul 94 1524-1530 

lin syst approx , time-scaling factor, Laguerre models Wang L + T-AC 
Jul 94 1463-1467 

multirate sampled-data systs, H 2 -optimal design Qui / + 7-AC Dec 
94 2506-2511 

multivariable nonminimum phase discrete-time systs LTR procedure 
Leon de la Barra S BA 7-AC Mar 94 574-577 
nonlin feedback parametrically uncertain systs , absol stabil Marquez 
HJ + T-AC Mar 94 664-668 
F uzzy control 

book review, Handbook of Intelligent Control Neural, Fuzzy and 
Adaptive Approaches (White, DA, and Sofge DA, Fds, 1992) 
Samad T T-AC Jul 94 1534-1535 

supervisory controller, fuzzy control systs Li-Xiu Wang T-AC Sep 94 
1845-1847 
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G 

Game theory 

certainty equivalence control, dyn games. James, M.R., T-AC Nov 94 
2321-2324 

descriptor systs,, lin feedback closed-loop Stacketberg strategy. Hua Xu, 
■f, T-AC May 94 1097-1102 

M«“Optimal discrete-time fixed-point and fixed-lag smoothing, game 
theory approach, Theodor, Y., + , T-AC Sep 94 1944-1948 
H^-optimal control for sing, perturbed systs., imperfect state meas. Ztgang 
Pan, + , T-AC Feb 94 280-299 

lin,-quadratic zero-sum differential games for generalized state space 
systs. Hua Xu, + , T-AC Jan 94 143-147 
mixed Hj/H« control, Nash game approach. Lmebeer, D.J.N., + , T-AC 
Jan 94 69-82 

noncooperative equilibria computation, relax, algms. Uryas'ev, S., + , 
T-ACJun94 1263-1267 

partially obs. discrete-time nonlin. systs., risk-sensitive control and dyn. 
games. James, M R., +, T-AC Apr 94 780-792 

Gaussian noise 

multipoint boundary value stochastic systs., stationarity/rcciprocity. Jie 
Chen, +, T-AC May 94 1114-1116 

Gaussian processes; cf. Gaussian noise; Linear-quadratic-Gaussian control 
Geometry 

output feedback problem in lin. systs., bilinear formulation Syrmos, V.L, 
+ , T-AC Feb 94 410-414 

uncertain nonlin. systs., struct, invariance Castro-Linares, R., + , T-AC 
Oct 94 2154-2158 

Geometry; cf. Computational geometry; Differential geometry 
Gradient methods 

path formulated optimal routing algm. , time complexity. Antonio, J.K, 
+ , T-AC Sep 94 1839-1844 
Graph theory 

complex discrete-time control systs., simul. automation. Ellis, RD, + , 
T-AC Sep 94 1795-1801 

continuous coprime factors. Treil, S„ T-AC Jun 94 1262-1263 
data network path formulated optimal routing, time complexity Antonio, 
J.K., + ,T-ACFeb94WS-m 

LP, timed marked graphs eval. Yamada, 7\ + , T-AC Mar 94 696-698 
packet switching, flow control, rate-based, monotoniclty/concavity 
Budka, K C, T-AC Mar 94 544-548 

path formulated optimal routing algm. , time complexity. Antonio, J.K, 
+ , T-AC Sep 94 1839-1844 

stochastic dyn. job shops/prod, planning. Sethi, S., + , T-AC Oct 94 
2061-2076 

stochastic timed-event graphs, superposn. props./perform, bounds 
Xiao-Lan Xie, T-AC Jul 94 1376-1386 
Graph theory; cf. Trees, graphs 


H 

Her mi flan matrices 

polynomial J-spcctral factorization. Kwakernaak H, + , T-AC Feb 94 
315-328 

spectral rad. bounds in terms of Hermitian parts. Rachid, A , T-AC Jan 94 
196-198 

Hilbert spaces 

constrained optim. in Hilbert space. Shimizu, K,+, T-AC May 94 982-986 

finite-dimens controllers designed for infinite-dimens, systs., state space 
Morns, KA., T-AC Oct 94 2100-2104 

H® optimization 

2 1 4-d.o.f. LQG control, rel., H 2 control. Grimble, M.J., T-AC Jan 94 
122-127 

book review; Ha Control Problem: A State Space Approach (Stoorvogel, 
A,; 1992). Saberi, A , T-AC Jan 94 252-254 

discrete-time Riccati eqn., H* control appl Stoorvogel, AA,+, T-AC 
Mar 94 686-691 

discrete-time systs., mixed H 2 /H« control, exact convex optim. based soln. 
Sznaier, M., T-AC Dec 94 2511-2517 

dissipative H 2 /H* controller synthesis. Haddad, W.M, + , T-AC Apr 94 
827-831 

dual-rate control syst., samplcd-data, stafoil. robustness. Tongwen Chen, 
T-AC Jan 94 164-167 

floating platform, modeling and control. Damen, A.A.H., +, T-AC May 94 
1075-1078 

m compensator design, min. order observers. Hsu, C.S., +, T-AC Aug 94 
1679-1681 

Ha control of discrete-time uncertain systs. Geromel J.C, + , T-AC May 
94 1072-1075 

Hatrcontrol, state availability rel., pole/zero cancellations. Miyamoto, S„ 
+ , T-AC Feb 94 379-381 


multiobjective robust control, infinite-horizon. Theodor, Y., + , T-AC 
Oct 94 2130-2134 

Ha-norm approx, of systs. by const, matrices. Kavranoglu,, D, T-AC May 
94 1006-1009 

Ha-optim., reduced order observer based controller. Stoorvogel, A.A., + , 
T-AC Feb 94 355-360 

H® optim., time-domain constraints. Rotstein, H, +, T-AC Apr 94762-779 

H®-optimal control for sing, perturbed systs., imperfect state meas. Zigang 
Pan, 4, T-AC Feb 94 280-299 

LTI syst. pos. real control. Weiqlan Sun, +, T-AC Oct 94 2034-2046 

mixed Hj/H. control, Nash game approach. Ltmebeer, DJ.N , + , T-AC 
Jan 94 69-82 

mixed // 2 /tf® perform, objectives Kemin Zhou, + , T-AC Aug 94 
1564-1574 

mixed H 2 IH* perform, objectives, optimal control. Doyle, J, +, T-AC Aug 
94 1575-1587 

nonlin. syst. H* control, output-feedback based. Lu, W,-M ., +, T-AC Dec 
94 2517-2524 

rational transfer fh., optimal L® approx. Kavranoglu, D, + , T-AC Sep 94 
1899-1904 

risk-averse decentralized discrete-time LEQG games. Srikant, R, T-AC 
Apr 94 861-864 

sampled data control systs., H® type problem. Hayakawa, Y., +, T-AC Nov 
94 2278-2284 

SISO-distributed plants, optimal mixed sensitivity Flamm, D S, + , T-AC 
Jun 94 1150-1165 

suite delayed systs., memoryless H” controllers Joon Hwa Lee, + . T-AC 
Jan 94 159-162 

worst case ident., model validation. Guoxiang Gu, T-AC Aug 94 
1657-1661 

Hurwitz stability; cf. Routh methods 
Hysteresis nonlinearities 

nonminimum phase first-order continuous-time sysls, adaptive 
stabilization. Lozano, R, + , T-AC Aug 94 1748-1751 

uncertain S1SO min phase lin syst., nonlin universal servomechanism 
Ryan, E.P., T-AC Apr 94 753-761 


I 

Identification 

discrete time Hammerstein syst. ident. Lang Zi-Qiang, T-AC Mar 94 
569-573 

signed output error adaptive ident., convergence Garnett, J, + , T-AC Jul 
94 1387-1399 

Identification; cf. Parameter identification, System identification 

Image processing 

multiscale recursive estim,, data fusion/regularization Chou, KC . + , 
T-AC Mar 94 464-478 

Information theory 

dyn. shock-error models , online estim Knshnamurthy, V , T-AC May 94 
1129-1135 

Innovations methods (stochastic processes) 

lin. discrete-time systs., optimum lin. recursive estim. Carazo, A H, + 
T-AC Aug 94 1636-1638 

Input-output stability 

interval plants, closed-loop hyperstability. Foo, YK, + , T-AC Jan 94 
151-154 

nonlin. discrete-time systs., input-to-state stabil. condition and global 
stabilization Kazakos, D., + , T-AC Oct 94 2111-2113 

robust direct adaptive controllers, normalization tech. Gang Feng, + , 
T-AC Nov 94 2330-2334 

scmi-cancellablc fraction transfer fns. in syst. theory. Bourles, H., T-/U ' 
Oct 94 2148-2153 

Integer programming 

vector discrete-event syst. controller synthesis. Yong Li, + , T-AC Mar “4 
512-531 

Integration (math.) 

delta-operator formulated discrete-time approx. Premaratne, K, + , T-A ' 
Mar 94 581-585 

Integration (math.); cf. Numerical integration 

Intelligent control 

book review; Handbook of Intelligent Control: Neural, Fuzzy » J 
Adaptive Approaches (White, D.A., and Sofge, D.A., Eds, 19< ) 
Samad, T., T-AC Jul 94 1534-1535 

continuous-time nonlin. systs. adaptive control by neural nets. Fu-Chw ’ 
Chen, +, T-AC Jun 94 1306-1310 

rule-based incremental control, machine learning. Luzeaux, D, T-AC ” 
94 1166-1171 

Interconnected systems 

discrete-time systs., stabil. robustness anal. Karan, M., 4-, T-AC May l 
991-995 
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interconnected systs , decentralized adaptive regulation Changyun Wen 
T-AC Oct 94 2163-2166 

large scale interconnected systs with delays, decentralized stabilization 
ZhongzhiHu T-ACJan94 180-182 

nonlin discrete-time systs , mput-to-state stabil condition and global 
stabilization Kazakos , D , +, T*AC Oct 94 2111-2113 
uncertain nonlin interconnected systs, time-varying state delay, 
stabilizing control Mahmoud MS T-AC Dec 94 2484-2488 
ntcrconnected systems; cf. Cascade systems, Large-scale systems 
nttrference; cf. Noise 
interpolation 

four-block problem, recursive Schur-based soln Constantmescu T + 
T-ACJul 94 1476-1481 

polynomial J-spectral factorization Kwakernaak H + T-AC Feb 94 
315-328 

SISO systs , robust stabilization Olbrot AW + T-AC Mar 94 652-657 
time-variant displacement struct and interpolation problems Sayed A H 
+ T-AC May 94 960-976 
Interpolation; cf. Spline functions 
Inverse problems 

discrete-time q-Markov cover generation inverse Lyapunov method 
Sreeram V + T-AC Feb 94 381-385 
lin discrete syst pole assignment Bertzaouta A T-AC Oct 94 2091-2095 
Inverse problems, linear; cf. Matrix inversion 
Irreducible realizations, cf Minimal realizations 
Iterative methods 

data network path formulated optimal routing, time complexity Antonio 
IK h T-AC Feb 94 385-391 

dyn syst simul, fast parallel recursive aggregation Tsai WK + T-AC 
Mar 94 534-540 

fault detect filters eigenstructure assignment Jaehong Park + T-ACJul 
94 1521-1524 

interval lin control syst robust parametric design Keel 1 II + T-AC 
Jul 94 1524-1530 

least absol values estim , computational aspects Fiodorov ED T-AC 
Mar 94 626-630 

matrix sign In calc iterated traction expansion Koc C K + T-AC Aug 
94 1644-1647 

path formulated optimal routing algm time complexity Antonio J A 
+ I-AC Sep 94 1839-1844 

pole placement by output feedback lee TH + T-AC Mar 94 565-568 
queuing networks anal projection techs Mooi Choo Chuah T-AC Aug 94 
1588-1599 

Riccaticqn discrete alg iter matrix bounds Komaroff N T-AC Aug 94 
1676-1678 

state-constrained optimal control generalised dual quasi-Newton algm 
Shimizu k + T AC May 94 982-986 
textured iter algms for tridiagonal lin eqns Tian-Shen Tang T-AC Mar 
94 592 596 


J 

lump parameter systems 

discrete-time Markovian jump I in systs, lin min MSF estim Costa 
01 V T-AC Aug 94 1685-1689 

lump lin systs coupled Riccati eqns Abou Kandil H + T-AC Aug 94 
1631-1636 

lump-Markov systs stabilizing control law Dufour F + T-AC Nov 94 
2354-2357 


K 

kalman filtering 

lutonomous vision-based mobile robot Baumgartner E T + T-AC Mar 
94 493-502 

lyn shock-error models , online estim Kmhnamurtky V T-AC May 94 
1129-1135 

uaranteed estim problem, Kalman-Bucy filter appl Golovan A + 
T-AC Jun 94 1282-1286 

uaranteed param estim problem with uncertain stats, Kalman-Bucy 
filter accuracy Matasov A I T-AC Mar 94 635-639 
in sing perturbed systs , Kalman filtering Gajic Z + T-AC Sep 94 
1952-1955 

11MO stochastic discrete syst state estim LiuDanyang + T-AC Oct 94 
2087-2091 

i odified EKF Ahmed NV + T-AC Jun 94 1322-1326 
ultiscale recursive estim, data fusion/rcgularization Chou KC + 
r AC Mar 94 464-478 

1 ultiscale systs , Kalman filters, Riccati eqns Chou KC +, T-AC Mar 
>4 479-492 


optimal cost control/filtcring, uncertain lin systs Petersen, lit, + T-AC 
Sep 94 1971-1977 

three-block general ized/std Riccati eqns comparison Darouach, M t +, 
T-AC Aug 94 1755-1758 

uncertain discrete time systs appl LihuaXie , + T-AC Jun 94 1310-1314 
Kalman filtering; cf. Adaptive Kalman filtering 
Knowledge-based systems 

rule-based incremental control, machine learning Luzeaux D, T-AC Jun 
94 1166-1171 


L 

Laguerre processes 

Im syst approx, time-scaling factor, Laguenre models Wang I + T-AC 
Jul 94 1463-1467 

Land navigation 

autonomous vision-based mobile robot Baumgartner E T + T-AC Mar 
94 493-502 

Laplace transforms 

transfer fn param ident in freq domain Pmtelon R + T-AC Nov 94 
2245-2260 

Large-scale integration; cf. Very-large-scale integration 

Large-scale systems 

complex discrete-time control systs, simul automation Ellis RD + 

I-AC Sep 94 1795-1801 

H* -optimal control for sing perturbed systs , imperfect state meas Zigang 
Pan + T-AC Feb 94 280-299 

large scale interconnected systs with delays, decentralized stabilization 
Zhongzhi Hu T-AC Ian 94 180-182 

sing perturbed control systs and nonlin differential-alg eqns Krishnan 
H + T-AC May 94 1079-1084 

Large-scale systems; cf. Interconnected systems, Reduced-order systems 

Learning control systems 

P-type learning control Saab S S T-AC Nov 94 2298-2302 

rule-based incremental control, machine learning Luzeaux D T-AC Jun 
94 1166-1171 

Learning systems 

autonomous vision-based mobile robot Baumgartner E T + T-AC Mar 
94 493-502 

continuous-time nonlin systs adaptive control by neural nets Fu-Chuang 
Chen + T-AC Jun 94 1306-1310 

I/O map inverse pass learning update-spline-smoothing Heiss M T-AC 
Feb 94 259-268 

robotic arms, repositioning control by learning Lucibello P T-AC Aug 
94 1690-1694 

rule-based incremental control, machine learning Luzeaux D T-AC Jun 
94 1166-1171 

Least-mean-square methods 

Hoo control syst ident least sq methods comment Ltvstone MM + 
T-ACJul 94 1531 

Least-squares methods 

adaptive pole placement without excitation probing sigs I ozano R + 
T-AC Jan 94 47-58 

bilinear systs , adaptive stabilization, least sq Xi Sun + T-AC Jan 94 
207-211 

convergence of least sq estim in white noise Nassm-Toussi K + 7 -AC 
Feb 94 364-368 

discrete-time adaptive nonlin syst least sq estimator Kanellakopoulos 
I T-AC Nov 94 2362-2365 

efficient algm for least sq estim Rafajlowicz E + T-AC Jun 94 
1241-1243 

extended Chandrasekhar recursions Sayed AH + 7-AC Mar 94 
619-623 

first-order nonlin syst adaptive control Brogliato B + T-AC Aug 94 
1764-1768 

guaranteed estim problem, Kalman-Bucy filter appl Golovan A, + 
T-AC Jun 94 1282-1286 

least absol values estim , computational aspects Fiodorov E D T-AC 
Mar 94 626-630 

multibody flexible systs , elastic mode estim Karray F + T-AC May 
94 1016-1020 

nonmmimum phase first-order continuous-time systs, adaptive 
stabilization Lozano R + T-AC Aug 94 1748-1751 

sing -free multivariable MRAC Mociezuma RG + 7-AC Sep 94 
1856-1860 

stochastic adaptive prediction/MRAC Wei Ren + T-AC Oct 94 
2047-2060 

time-varying params, bounded error ident Bittanti S + T-AC May 94 
1106-1110 

Lie algebras 

rigid spacecraft systs controllability, central gravitational field Lian 
K-Y + T-AC Dec 94 2426-2441 
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robot kinematics, computational aspects of pcoduct-of-exponentials 
formula. Park, FC, T-AC Mar 94 643-647 
Unear algebra; cf. Matrices 
Linear approximation 

approx, feedback KwangheeNam, + , T-ACNov94 2311-2314 

brushless DC motors, feedback-1 In control charactn. In-Joong Ha, + , 
T-AC Mar 94 673-677 

feedback lin. systs. adaptive tracking. Marina, R., + , T-AC Jun 94 
1314-1319 

input-output iin , state equivalence and decoupling. In-Joang Ha, +, T-AC 
Nov 94 2269-2274 

M1MO nonlin. syst. I/O pseudolinearization. Lawrence, D.A., + , T-AC 
Nov 94 2207-2218 

parametrically uncertain nonlin. systs.. robust stabilisation. Sehoenwald, 
D A., +. T-AC Aug 94 1751-1755 
stochastic dyn. systs.. exact lin. Socha, L., T-AC Sep 94 1980-1984 
uncertain nonlin. systs., struct, invariance. Castro-Linares, R. f + , T-AC 
Oct 94 2154-2158 
Linear programming 

closed Jackson queueing network decentralized control. Dye-Jyun Ma, +, 
T-ACM 94 1460-1463 

model validation, time-domain approach. Paolla, K., + , T-AC May 94 
951-959 

prod, control methods, distributed, stabil. and perform. Shartfhia, A., T-AC 
Apr 94 725-737 

queueing networks and scheduling policies, perform, bounds. T-AC Aug 
94 1600-1611 

queuing networks anal., projection techs. Mooi Choo Chuah, T-AC Aug 94 
1588-1599 

timed marked graphs, perform, eval. Yamada, T, + , T-AC Mar 94 696-698 
vector discrete-event syst. controller synthesis. Yong Lt, + , T-AC Mar 94 
512-531 

Unear programming; cf. Integer programming 
Linear-quadratic control 

biased/unbiased controllers LTR design. Turan, L., + , T-AC Mar 94 
601-605 

descriptor systs.. lin. feedback closed-loop Stackelberg strategy. Hua Xu, 
+ , T-AC May 94 1097-1102 

exponential lin. quadratic optimal control, discounting. Hopkins, W £., Jr, 
T-AC Jan 94 175-178 

infinite horizon LQ control, soln approx Schochetman, I E,+, T-AC Mar 
94 596-601 

infinite time-varying LQ-problem, approx soln. Engwerda, JC, T-AC 
Jan 94 235-238 

lin. quadratic control, convex prog., num. method. Peres, P LD, +, T-AC 
Jan 94 198-202 

LQoptimal regulators, stabil. robustness. Dohyoung Chung, + , T-AC Aug 
94 1698-1702 ■ 

optimal cost control/filtering, uncertain lin systs Petersen, I.R., + , T-AC 
Sep 94 1971-1977 

Riccati eqn., discrete-time alg., closed loop eigenvalues. Lin-Zhang Lu, 
+ ,T-ACAug94\m-mS 

robust lin. quadratic designs, real param. uncertainty. Douglas, J., + , 
T-AC Jan 94 107-111 
Linear-quadratic-Gaussian control 

2 l /2-d.o.f. LQG control, rel, H z control. Grimble, M.J, T-AC Jan 94 
122-127 

asynchronous sampled data syst control, design Voulgaris, P., T-AC Jul 
94 1451-1455 

discrete-time compensators, LTR. Tadjine, A/,+, T-AC Jun 94 1259-1262 
infinite horizon optimal control of stochastic systs. Runolfsson, T., T-AC 
Aug 94 1551-1563 

LQG controllers, stable, weighting and covariance matrices. Halevi, Y, 
T-AC Oct 94 2104-2106 

risk-averse decentralized discrete-time LEQG games. Srikant , R., T-AC 
Apr 94 861-864 
Linear systems 

adaptive stochastic control algms., unmodified, robustness. Radenkovic, 
M.S., + , T-AC Feb 94 396-400 

block multirate input-output model for sampled-data control systs. 
Jakubomki, A M., + , T-AC May 94 1145-1147 
book review; Controlled and Conditioned Invariants in Linear System 
Theory (Basile, G. f and Marro, G ; 1992). Schumacher , JM., T-AC Jan 
94250-251 

book review; Linear Multivariable Control: Algebraic Analysis and 
Synthesis Methods (Vardulakis, A.I.G.; 1991). Lewis, F.L ., T-AC Jul 94 
1536 

book review; Linear System Theory (Rijgh. W.J.; 1993). Khalil, H.K, 
T-AC Dec 94 2528-2529 

book review; New Tools for Robustness of Linear Systems (Barmish, B.R.; 
1994). KAC Dec 94 2525-2526 


CARMA plants, stabilizing 1-0 receding horizon control. Chisel, L„ + 
T-AC Mar 94 614-618 

certainty equivalence control, dyn. games. James, M.R., T-AC Nov 9* 
2321-2324 

comments on “Stability margin evaluation for uncertain linear systems 
(by C. Gong and S. Thompson, Jun 94 548-550). Su, J.-H., T-AC Dec 94 
2523-2524 

comments on “System zeros determination from an unreduced matn> 
fraction description” (by K.S. Yeung and C.-M. Kwan, Nov 9> 
1695-1697), Ferreira, P.M.G., T-AC Nov 94 2367 
comments, with reply, on “Vector norms as Lyapunov functions for linear 
systems’ 1 (by H. Kiendl et al ., Jun 92 839-842). Hmamed, A., T-AC Dec 
94 2522-2523 

common Lyapunov fh., stable LT1 systs., commuting A-matriccj. 
Narendra, K.S, + , T-AC Dec 94 2469-2471 
constrained continuous-time systs. with cone props., pos. invariant sets 
Tarbouriech, S., + , T-AC Feb 94 401-405 
continuous coprime factors. Treil, S., T-AC Jun 94 1262-1263 
continuous time nonminimum phase lin. systs., gain margin improvement 
Wei-Yong Yan, + , T-AC Nov 94 2347-2354 
continuous-time systs., global stabilization, bounded controls. Sussmann, 
H.J., + , T-AC Dec 94 2411-2425 

corrections to “Positively invariant sets for constrained continuous-time 
systems with cone properties” (Feb 94 401-405). Tarbouriech, S, T-AC 
Aug 94 1771 

DD discrete time systs., pole assignment. Benzaouia, A , T-AC Oct 94 
2091-2095 

decentralized stable factors and parameterization of decentralized 
controllers Date, R., + , T-AC Feb 94 347-351 
diagonal decoupling, dyn output feedback/const, precompensator Eldem 
V, T-AC Mar 94 503-511 

discrete-time lin. systs., pos invariant sets De Santis, E. T-AC Jan 94 
245-249 

discrete-time Markovian jump lin. systs., lin. min. MSE estim. Costa 
O.L V., T-AC Aug 94 1685-1689 

discrete-time q-Markov cover models Sreeram, V, + , T-AC May 94 
1102-1105 

discrete-time systs., optimum lin recursive estim. Carazo, A H , + , T-AC 
Aug 94 1636-1638 

discrete-time systs., perturb., comment. Eslamt, M., T-AC Aug 94 
1768-1769 

discrete-time uncertain systs., ultimate boundedness control, Lyapunov 
Blanchini, F, T-AC Feb 94 428-433 

distributed delay lin. systs , feedback stabilisation Feng Zheng, + , T-AC 
Aug 94 1714-1718 

D-stabil, robust, for lin uncertain discrete delay systs Te-JenSu, +, T-At 
Feb 94 425-428 

dyn. syst simul, fast parallel recursive aggregation Tsai, W.K , + , T-AC 
Mar 94 534-540 

finite-dimens. model validation, output error, test horizon Hoai Nghm 
Duong, + , T-AC Jan 94 102-106 

full-order/low-order observers, LTI plant, unified ingular syst based 
theory. Cobb, J D., T-AC Dec 94 2497-2502 
H°°-optimal control for sing perturbed systs , imperfect state meas Zigang 
Pan, + , T-AC Feb 94 280-299 

impulse differential lin. systs., constrained controllability. Benzaid, Z, + 
T-AC May 94 1064-1066 

input-output lin., state equivalence and decoupling Jn-Joong Ha, +, T-AC 
Nov 94 2269-2274 

interval LTI systs., stabilization. Kehut Wei, T-AC Jan 94 22-32 
jump lin. systs., coupled Riccati eqns Abou-Kandil, H, + , T-AC Aug 91 
1631-1636 

lin. min.-phase plants, stabilizing controllers parameterization Gloria 
J.J., + , T-AC Feb 94 433-434 

LTI discrete-time systs., robust stabil. anal. Karan, M„ + , T-AC Ma 1 94 
991-995 

min. lin. plants parameterization. Davis, L.D., + , T-AC Apr 94 849-^2 
multivariable systs., fund, notion of equivalence. Pugh, A.C, + , i & 
May 94 1141-1145 

nonlin. design of adaptive controllers. Krstic, M., +, T-AC Apr 94 738 7^2 
output feedback problem in lin. systs., bilinear formulation Syrmos, l 
+ , T-AC Feb 94 410-414 

param. hypothesis testing, stochastic lin. systs., stat. sampling. Dv an 
T.E., +, T-AC Jan 94 118-122 

perturbed lin. systs., upper covariance bounds. Bolzern, P., + , T-At !ar 
94 623-626 

pos. of dyn. systs., nonpositive coeff. matrices, d Alessandro, P, + > ^ 
Jan 94 131-134 

P-type learning control. Saab, S.S., T-AC Nov 94 2298-2302 
S1SO LTI discrete-time systs., param -adaptive controller. Krelsselv ier \ 
G., T-AC Sep 94 1819-1826 
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Slowly varying and LT1 systs, ident and uncertainty principles Zames 
G + T-AC Sep 94 1827-1838 

stabil of uncertain! in systs , saturating actuators Jm-HoonKm + T-AC 
Jan 94 202-207 

state delayed systs , memoryless H® controllers Joon Hwa Lee + T-AC 
Jan 94 159-162 

stochastic control, lin -exponential-Gau&sian Chth-Hai Fan + T-AC Oct 
94 1986*2003 

switching surface design for multivariable VSS Ju-Jang Lee + T-AC 
Feb 94 414-419 

time-invariant lin systs .robust controller synthesis Rantzer A + T-AC 
Sep 94 1802-1808 

time-invariant systs, pos real control Weiqian Sun 4 7 -AC Oct 94 

2034-2046 

uncertain lin systs , delay depend , robust stabil Bugong Xu T-AC Nov 
94 2365 

uncertain lin systs multivariable stabil margin eval Gong C + T-AC 
Mar 94 548-550 

uncertain syst robust stabil anal, nonquadratic 1 yapunov tns 
Zelentwvsky A L T-AC Jan 94 135-138 
uncertain syst robust stabil, guardian map approach Shuoh Rern + 
T-AC Jan 94 162-164 

unknown input lin systs, full-order observer design Darouach M + 
T-AC Mar 94 606-609 

unknown input observers design Darouach M T-AC Mar 94 698-699 
Wiener systs nonparametric ident Greblicki W T-AC Oct 94 2077-2086 
worst-case ident anal , BIBO robustness Partington JR + 7 -AC Oct 
94 2171-2176 

LMS, cf. 1 east-mean-square methods 
Load forecasting 

alternating renewal elec load models El-Fenk S + T AC Jun 94 

1184-1196 

Load frequency control; cf Power generation control load frequency 
Load modeling 

alternating renewal elec load models El-Ferik S + T-4C Jun 94 

1184-1196 

LQ control; cf I mear-quadratic control 

LQG control; cf Linear-quadratic-Gaussian control 

1 yapunov matrix equations 

discrete-lime q-Markov cover generation, inverse 1 yapunov method 
Sreeram \ 4 T-AC Feb 94 381-385 
large sparse I yapunov eqns approx soln Gudmundsson T + 7 -AC 
May 94 1110 1114 

multivariable systs quantitat robustness measures Horng-Giou Chen 
+ T-AC Apr 94 807-810 

sing perturbed syst block-diagonalization Taylor expansion Derbel N 
4 7 AC Jul 94 1429-1431 
I yapunov methods 

asynchronous systs with Poisson transits stabil Leland RP T-AC Jan 
94 182-185 

attitude control problem comment hjellstad O -E + T-AC Mar 94 
6Q9-700 

book review Applications ol l yapunov Methods in Stability (Halanay 
A , and Rasvan V 1993) Popov V M T-AC Dec 94 2526-2527 
comments with reply on ‘ Vector norms as Lyapunov functions tor linear 
systems (by H Kiendlelal Jun 92 839-842) fimamed A T-AC Dec 
94 2522-2523 

common l yapunov fn stable L11 systs, commuting A-matrices 
Narendta KS + T-AC Dec 94 2469-2471 

corrections to An improved Raarumikhin-typc theorem and its 
application” (Apr 94 839-841) Xu B T-AC Nov 94 2368 
corrections to “Lyapunov stability of a class oi discrete-event systems” 
(Feb 94 269-279) Pass mo K M 4- T-AC Jul 94 1531 
discrete-time adaptive nonlm syst least sq estimator Kanellakopoulos 
I 1-AC Nov 94 2362-2365 

discrete-time syst exponential stabil, observer design Aitken VC + 

7 AC Sep 94 1959-1962 

liscretc-time systs , generalized q-Markov cover models Sreeram V 
T-AC Dec 94 2502-2505 

'iscrete-time systs perturb comment Eslami M T-AC Aug 94 
1768-1769 

hscrete-timc uncertain systs, ultimate boundedness control, Lyapunov 
Blanchim F T-AC Feb 94 428-433 

mite-time optimal control of bilinear systs, successive approx procedure 
4ganovtc Z + T-AC Sep 94 1932-1935 

iterval matrices, necessary and sufficient conditions for Hurwitz/Schur 
Mabil Earning Wang + T-AC Jun 94 1251-1255 
1 gical discrete eventsysts , L yapunov stabil Passmo KM + T-AC Feb 
94 269-279 

1 G controllers, stable, weighting and covariance matrices Halevi Y 
*-AC Oct 94 2104-2106 


manipulators with parametric uncertainty, robust control Keun-Mo Koo 
+ , T-AC Jun 94 1230-1233 

multivariable systs , quantitat robustness measures Horng-Giou Chen 
+ T-AC Apr 94 807-810 

nonlm design of adaptive controllers Krstic M + T-AC Apr 94738-752 
nonnegative feedback control, oscills stabilisation Zaslavsky B T-AC 
Jun 94 1273-1276 

output feedback stabilization of nonlm systs Tsimas J + T-AC Apr 94 
806 

parab systs direct adaptive control Keum Shtk Hong + T-AC Oct 94 
2018-2033 

parametrically uncertain nonlm systs , robust stabilisation Schoenwald, 
DA,+ T-AC Aug 94 1751-1755 

perturbed lin systs, upper covariance bounds Bolzern P f T-AC Mar 
94623-626 

Razumikhin-type theorem improved, appls Bugong Xu + T-AC Apr 94 
839-841 

robot controller adaptive, Lyapunov stabil Egeland O + T-AC Aug 94 
1671-1673 

robot manipulators, compliant, force/posn regulation Chiavenm S + 
T-AC Mar 94 647-652 

robust strong stabilization via modified Popov controller synthesis Wang 
YW + T-AC Nov 94 2284-2287 

second-order dyn systs , dissipative controller designs Morns KA 4 
T-AC May 94 1056-1063 

stabil theory of nonsmooth systs Shevitz D 4 T-AC Sep 94 1910-1914 
structurally uncertain systs optimal l yapunov fns Olas A T-AC Jan 94 
167-171 

uncertain nonlin interconnected systs, time-varying state delay, 
stabilizing control Mahmoud MS T-AC Dec 94 2484-2488 
uncertain syst robust stabil anal, nonquadralic Lyapunov fns 
Zelentsovsky A 1 T-AC Jan 94 135-138 
uncertain systs , asymptotic stabil region cstim Han Ho Choi 4- T-AC 
Nov 94 2275-2278 


M 

Machine vision; cf. Robots, vision systems 

Manipulators 

book review Robot Dynamics and Control" (Spong, MW, and 
Vidyasagar, M , 1989) Pota H R 4 1-AC Aug 94 1774-1776 
constrained robots forcc/motion control Grabbe MT 4- T-AC Jan 94 
179 

contact stabil of simple posn controllers effect of time delay and discrete 
control Flala J + 7 -AC Apr 94 870-873 
deterministic nonlin syst ident Shir-KuanLin I-AC Sep 94 1886-1893 

Manipulators, dynamics 

compliant manipulators stable forcc/posn regulation Chiavermi S + 
T-AC Mar 94 647-652 

flexible joint robots with uncertain params and disturbances, tracking 
control lomei P T-AC May 94 1067-1072 
globally convergent adaptive controller Mahmoud M S + T-AC Jan 94 
148-151 

mobile manipulator locomotion/manipulation coord Yamamoto Y 4- 
T-AC Jun 94 1326-1332 

rigid robotic manipulators, robust tracking control ManZhihong 4- T-AC 
Jan 94 154-159 

robot variable struct control schemes BtnYao + T-AC Feb 94 371-376 
robust M1MO terminal sliding-mode control, rigid robotic manipulators 
Zhthong M 4- T AC Dec 94 2464-2469 
single-arm dyn , robust variable struct /and switching-^ adaptive control 
Lt-WenChen + T-AC Aug 94 1621-1626 

Manipulators, kinematics 

flexible joint robots with uncertain params and disturbances, tracking 
control Tomei P T-AC May 94 1067-1072 
kinematics, product-of-exponentials formula computational aspects 
Park h C T-AC Mar 94 643-647 

Manipulators, motion planning 

mobile manipulator locomotion/manipulation coord Yamamoto Y + 
T-AC Jun 94 1326-1332 

Manufacturing planning 

stochastic dyn job shops/prod planning Sethi S + T-AC Oct 94 
2061-2076 

Manufacturing scheduling 

distributed prod control methods Sharifma, A T-AC Apr 94 725-737 
failure-prone mfg systs, structural props of optimal prod controllers 
Jian-QiangHu 4-, T-AC Mar 94 640-643 
FMS, FCFS scheduling policy Seidman T J T-AC Oct 94 2166-2171 
mfg scheduling syst, regulator stabilization tech Humes C Jr T-AC 
Jan 94 191-196 

mfg systs with prod rate-depend failure rates, prod control 
Liberopoulos G 4- T-AC Apr 94 889-895 


h C / c k author entry for coauthors 
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mfg. syst. with unreliable machines, stabil, anal. Han-Fu Chen , 4*, T~AC 
Mar 94 681-666 

queueing networks and scheduling policies, perform, bounds. T-AC Aug 
94 1600-1611 

Manufacturing scheduling; cf. Production control 
Mi^kov processes 

2D stochastic approx., convergCnce/dlfT eqn. limit. Dye-Jyun Ma, + , 
T-ACJul 94 1439*1442 

alternating renewal elcc. load models. El-Ferik, S., + , T-AC Jun 94 
1184-1196 

continuous-time multivariable syst. reduction. Krqjewskt, W,, + . T-AC 
Oct 94 2126-2129 

deterministic/stochastic queuing networks anal. Cheng-Shang Chang, 
T-AC May 94 913-931 

discrete-time Markovian jump lin. systs,, lin. min. MSE estim. Costa, 
OX V„ T-AC Aug 94 1685-1689 

discrete-time q-Markov cover generation, inverse Lyapunov method. 

Sreeram, K, 4 , T-AC Feb 94 381-385 
discrete-time q-Markov cover models. Sreeram, V., + , T-AC May 94 
1102-1105 

discrete-time systs,, generalized q-Markov cover models. Sreeram, V., 
T-AC Dec 94 2502-2505 

flow control, zero sum Markov gomes. Altman, E, T-AC Apr 94 814-818 
jurtip lin, systs., coupled Riccati eqns. Ahou-Kandil, //.,+, T-AC Aug 94 
1631-1636 

jump-Markov systs , stabilizing control law. Dttfour, F, +, T-AC Nov 94 
2354-2357 

mfg. systs. with prod, rate-depend. failure rales, prod, control. 

Liberopoulos, G, + , T-AC Apr 94 889-895 
queueing networks and scheduling policies, perform, bounds. T-AC Aug 
94 1600-1611 

Mathematical programming; cf. Dynamic programming; Linear 
programming 
Mathematics; cf. Algebra 
Matrices 

bilinear systs., state space anal., orthogonal series approach. 

Paraskevopoulos, P.N., 4 , T-AC Apr 94 793-797 
book review; Linear Multivariable Control: Algebraic Analysis and 
Synthesis Methods (Vardulakis, A.LG , 1991) lewis, F L, T-ACJul 94 
1536 

comments on 'Stabilization via static output feedback”. Pimpalkhare, 
A.A., 4 , T-AC May 94 1148 

comments on "System zeros determination from an unreduced matrix 
fraction description” (by K.S Yeung and C -M. Kwan. Nov 93 
1695-1697). Ferreira, PM.G , T-AC Nov 94 2367 
common Lyapunov fn., stable LT1 systs., commuting A-matrices 
Narendra, K.S., 4 , T-AC Dec 94 2469-2471 
constrained continupus-time systs with cone props., pos. invariant sets. 

Tarbouriech, S 4 , T-AC Feb 94 401-405 
continuous alg. Riccati eqn. eigenvalues. Komaroff, N, T-AC Mar 94 
532*534 

corrections to "Positively invariant sets for constrained continuous-time 
systems with cone properties” (Feb 94 401-405). Tarbouriech, S, T-AC 
Aug 94 1771 

descriptor systs control by output feedback. Lovass-Nagy, V, + , T-AC 
Jul 94 1507-1509 

diagonal decoupling, dyn. output feedback /const, precompensator. Eldem, 
V., T-AC Mar 94 503-511 

discrete-time lin, systs., pos. invariant sets. De Santis, E, T-AC Jan 94 
245-249 

discrete-time Riccati eqn,, Hod control appl. Stoorvogel, A. A , + , T-AC 
Mar 94 686-691 

discrete-time systs., perturb., comment Eslami, M. t T-AC Aug 94 
1768-1769 

distributed algms., random proc. failures. Papavassilapoulos, G P., T-AC 
May 94 1032-1036 

distributed delay Lin. systs., feedback stabilisation. Feng Zheng, + , T-AC 
Aug 94 1714-1718 

disturbance decoupled observer design. Hou, M, 4 , T-AC Jun 94 
1338-1341 

disturbance decoupling, constrained Sylvester eqns. Syrmos, V.L., T-AC 
94 797-803 

dyn. syst simuL, fast parallel recursive aggregation. Tsai, WK, 4 , T-AC 
Mar 94 534-540 

eigenstructure assignment by decentralized output feedback. Guang-Ren 
Duaru T-AC May 94 1009-1014 

extended Chandrasekhar recursions. Saved, A H., 4 , T-AC Mar 94 
619-623 

extended horizon liftings for nonminimum-phase systs. Bayard, DS„ 
T«AC Jun 94 1333-1338 

fault detect, filters, eigenstructure assignment Jaehong Park, 4, T-ACJul 
94 1521-1524 


four-block problem, recursive Schur-bascd soln. Constantinescu, T., + , 
T-ACJul 94 1476-1481 

H« control of systs. under norm bounded uncertainties. Gu, K , T-AC Jun 
94 1320-1322 

interval LTI systs., stabilization. Kehui Wei , T-AC Jan 94 22-32 
interval matrices, necessary and sufficient conditions for Hurwitz/Schur 
stabil. Raining Wang, 4 , T-AC Jun 94 1251-1255 
interval matrices stabil. conditions. Sezer, M.E., +, T-AC Feb 94 368-371 
jump lin. systs., coupled Riccati eqns. Abou-Kandtl, H., 4 , T-AC Aug 94 
1631-1636 

Kalman filtering for uncertain discrete-time systs. LihuaXte, 4, T-AC Jun 
94 1310-1314 

lin. discrete syst. pole assignment. Benzaouia, A., T-AC Oct 94 2091-2095 
MIMO nonlin. syst. I/O pseudolinearization. Lawrence, D.A., 4 , T-AC 
Nov 94 2207-2218 

min. lin. plants parameterization. Davis, L.D., + , T-AC Apr 94 849-852 
min. stochastic realizations, parameterization. Ferrante, A., T-AC Oct 94 
2122-2126 

mobile manipulator locomotion/manipulation coord. Yamamoto, Y, 4 , 
T-AC Jun 94 1326-1332 

multipoint boundary value stochastic systs., stationarity/reciprocity Jie 
Chen, + . T-AC May 94 1114-1116 

multiscale systs., Kalman filters, Riccati eqns Chou, KC, 4, T-AC Mar 
94 479-492 

nonlin. systs, vibr. control by AP-forcing Balestrino, A , 4 , T-AC Jun 94 
1255-1258 

periodic discrete-time Riccati eqn., num soln. Hench, JJ., 4 , T-AC Jun 
94 1197-1210 

pole placement by output feedback Lee, TH, 4 , T-AC Mar 94 565-568 
pos. of dyn. systs., nonpositive coeff. matrices. d\Alessandro, P., + , T-AC 
Jan 94 131-134 

rational transfer fn., optimal L® approx. Kavranoglu, D, +, T-AC Sep 94 
1899-1904 

Riccati eqn., discrete alg., iter matrix bounds Komaroff, N, T-AC Aug 94 
1676-1678 

Riccati eqn , discrete-time alg., closed loop eigenvalues Lin-Zhang Lu, 
+ , T-AC Aug 94 1682-1685 

robot controller, adaptive, Lyapunov stabil Egeland, O, +, T-AC Aug 94 
1671-1673 

robust nonsingularity problem, SSV Ghwan-Lu Tseng, + , T-AC Oct 94 
2118-2122 

sign fn calc , iterated fraction expansion Koc, C K + , T-AC Aug 94 
1644-1647 

sing-free multivariable MRAC Moctezuma, RG, + , T-AC Sep 94 
1856-1860 

sing, root distrib. problems, displacement struct approach Pal, D, ■» 
T-AC Jan 94 238-245 

state deadbeat control problem, general soln. Eldem, V, 4 , T-AC May 94 
1002-1006 

textured iter, algms. for tridiagonal lin eqns. Tian-Shen Tang, T-AC Mar 
94 592-596 

time-invariant lin. systs. control lability/observability Raining Wang, 4 
T-ACJul 94 1443-1447 

time-variant displacement struct and interpolation problems. Sayed, A H 
+ , T-AC May 94 960-976 

tridiagonal symmetric interval matrices, eigenvalues. Commercon, JC 
T-AC Feb 94 377-379 

uncertain systs., sampled-data controller design. Dolphus, R M, T-AC May 
94 1036-1042 

unknown input observers design Darouach . M, T-AC Mar 94 698-699 

Matrices; cf. Covariance matrices; Hcrmitian matrices; Lyapunov matrix 
equations; Polynomial matrices, Toeplitz matrices; Transfer function 
matrices 

Matrix decomposition/factorization 

constrained Riccati eqn. factorizations. Weiss, M., T-AC Mar 94 677-6KI 
descriptor systs. regularization by output feedback. Bunse-Gerstner, i 
4 , T-AC Aug 94 1742-1748 

Hao-norm approx, of systs. by const, matrices Kavranoglu, D, T-AC S w 
94 1006-1009 

least sq. estim., efficient algm. Rafqjlowicz, E„ 4, T-AC Jun 94 1241-1 41 
polynomial J-spectra) factorization Kwakernaak, H, 4 , T-AC Fet 94 
315-328 

Matrix inversion 

structurally uncertain systs,, optimal Lyapunov fns. Olas, A , T-AC Jo 94 
167-171 

Matrix multiplication 

trace of matrix product, inequalities. Fang, Y, 4, T-AC Dec 94 2489-^ 90 

Maximum-likelihood estimation 

dyn. shock-error models, online estim. Krlshnamurthy . V., T-AC Ma 94 
1129-1135 
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M«*n-*«l**are-crror methods 

discrete-time Markovian jump hn systs, tin mm MSE estim Costa 
OLV , T-4CAug94 1685-1689 
Mechanical factors 

stick*slip avoidance, PD control Dupont PE, T-ACMay 94 1094-1097 
Mechanical factors; cf. Biomechanics 
Mechanical systems 

motion/force control, robust, with nonholonomic constraints Chun-Yi Su 
+ , T-AC Mar 94 609-614 

structural wave control, reduced-order model Quan Wang + T-AC Aug 
W1711-1713 

Mechanical variables control 

superarticulatedmech systs control Seto D + l-ACDec 94 2442-2453 
vibr systs control Karl WC + T-AC Jan 94 222-226 
Mechanical variables control; cf. Force control, Motion control, Position 
control, Velocity control 
Memoryless systems 

dyn iin time-delay systs, robust controller design Mahmoud MS + 
T-AC May 94 995-999 

state delayed systs , memoryless 11“ controllers Joon Hwa Lee + T-AC 
Jan 94 159-162 

Wiener systs nonparamctric ident Crebhcki W T-AC Oct 94 2077-2086 

Minimal realizations 

full-order/low-order observers, LTI plant, unified ingular syst based 
theory Cobb JD T-AC Dec 94 2497-2502 
min stochastic realizations parameterization Ferrante A T-AC Oct 94 
2122-2126 

Minimax control 

certainty equivalence control, dyn games James MR 7-AC Nov 94 
2321-2324 

Minimax methods 

optimal filtering, stochastic discrete-time systs , unknown inputs Borisov 
A V + l-AC Dec 94 2461-2464 

Minimization methods 

adaptive predictive controller robustness anal Clarke DW + T-AC 
May 94 1052-1056 

least absol values cstun computational aspects hodorov LD T-AC 
Mar 94 626-630 

1m quadratic control convex prog num method Peres PLD + 1-AC 
Jan 94 198-202 

mixed pcrlorm objectives optimal control Doyle J + T-AC Aug 
94 1575-1587 

sampled-ddld robust control, exponential hold optimal algm Yung-Chun 
Wu h 1 At Jan 94 112-117 

sampled-data systs wordlength constraint, stabil/perlorm Fiailho IJ 
+ 7 AC Dec 94 2476-2481 

stochastic Iin discrete systs state estim algm Ahmed MS T-AC Aug 
94 1652-1656 

uncertain syst robust slabil anal nonquadratic Lyapunov fns 
/ elentso\skv A l /-AC Jun 94 135-138 
uncertain systs stabilizing control design quasiconvex optim Keqm Gu 
T-4C Jan 94 127-131 

Minimization mrthods; cf. Optimization methods 
Minimum-energy control 

sampled data control sysLs Flue type problem Hayakawa Y + I-AC Nov 
94 2278-2284 

Mobile robotics, kinematics 

kinematics product-of-exponcntials formula computational aspects 
Park I C T-4C Mar 94 643-647 

Mobile robots, motion planning 

autonomous vision-based mobile robot Baumgartner k T + 1-AC Mar 
94 493-502 

car reachable po^ns computation Soueres P + 1-AC Aug 94 1626-1630 
caL. falling, near-optimal nonholonomic motion planning, appl 
Fernandes C + 1-AC Mar 94 450-463 

traiectory stabilization for systs, nonholonomic constraints Walsh G + 
1-AC Jan 94 216-222 
M ideling 

2D syst model of Rocsser, deconip Afacan T + T-AC Nov 94 
2261-2262 

A time-domain approach, model validation Poolla K, + T-AC May 94 
951-959 

ontinuous-time multivariable syst reduction Krajewski, W + T-AC 
Oct 94 2126-2129 

Hmte-dimens model validation, output error, test horizon Hoat Nghia 
Duong + 1-A C Jan 94 102-106 
Mv del reduction; cf. Reduced-order systems 
^Irdel reference adaptive control 

ieslgn for plants, unknown dead-zones Gang Tao + T-AC Jan 94 59-68 
uultiple models/switching approach Narendra KS + 1-AC Sep 94 
1861-1866 


multivariable direct MRAC, identifiable parameterization* and param 
convergence de Mathelin, M + T-AC Aug 94 1612-1617 
multivariable Iin systs, hysteresis switching MRAC Weller, S&, + 
T-ACJul 94 1360-1375 

parab systs direct adaptive control Keum Shik Hong, + T-AC Oct 94 
2018-2033 

perform anal /improvement, new tracking error criteria Dattam A + , 
T-AC Dec 9$ 2370-2387 

robust MRAC SISO systs Zhihua Qu + T-AC Nov 94 2219-2234 
robust stabilizaiion/perform, MRAC modeling error compensation Sun 
J + T-AC Mar 94 630-635 

sing -free multivariable MRAC Moctezuma RG + T-AC Sep 94 
1856-1860 

stochastic adaptive predlclion/MRAC Wet Ren + 1-AC Oct 94 
2047-2060 

suboplimal perfect model matching, with noise, MRACS Mutoh Y + 
T-AC Feb 94 422-425 

transient perform improvement by feedback Datta A + f-AC Sep 94 
1977-1980 

variable struct MRAC, I/O based, anal /design Liu Hsu 4 T-AC Jan 94 
4-21 

Motion control 

constrained robots, force/motion control Grabbe Ml + T-AC Jan 94 
179 

robot variable struct control schemes Bin Yao + J-4C beb 94 371-376 
Motion control; cf. Manipulators, Mobile robots 
Motion measurement; cf. I racking 
Motion planning 

cat, falling, near-optimal nonholonomic motion planning, appl 
Fernandes C + T-4C Mar 94 450-463 
Motors; cf. DC motors 

Moving-average processes; cf. Autoregressive moving-average processes 
MRAC; cf. Model reference adaptive control 
Multidimensional systems 

evol eqns, parab partial DE, exponential stabil Keum-Shik Hong + 
1-AC Jul 94 1432-1436 

fmite-dimens controllers designed for infinile-dimens systs state space 
Morris K A T-AC Oct 94 2100-2104 
model validation, output error, test horizon Hoat Nghia Duong + l-AC 
Jan 94 102-106 

neutral systs , stabil and stabilizability logemann H + 1-AC Jan 94 
138-143 

nonnegative feedback control oscills stabilisation Zaslavsky B T-AC 
Jun 94 1273-1276 

parab systs direct adaptive control Keum Shik Hong + 1-AC Oct 94 
2018-2033 

robust adaptive pole placement control Weyer b + T-AC Aug 94 
1665-1671 

spectrdl syst fmite-dimens approx Erickson M A + 1-AC Sep 94 
1904-1909 

stabilization, adaptive, nonlm time-varying controller Miller DE 1-AC 
Jul 94 1347-1359 

stochastic dyn systs exact Iin Socha L T-AC Sep 94 1980-1984 
worst-case ident anal , BIBO robustness Partington JR + f-AC Oct 
9-/ 2171-2176 

Multiinput-multioutput systems; cf. Multivariable systems 
Multilinear systems; cf. Bilinear systems 
Multiplication; cf. Matrix multiplication 
Multisensor systems 

decentralized detect, optimal thresholds Jrving W W + T-AC Apr 94 
835-838 

delect networks with multiple event struct, optim Pete A + T-AC Aug 
94 1702-1707 

multiresolutional distributed filtering Lang Hong T-AC Apr 94 853-856 
multiscale recursive estim, data fusion/regularization Chou KC + 
1-AC Mar 94 464-478 
Multivariable systems 

book review, Linear Multivariable Control Algebraic Analysis and 
Synthesis Methods (Vardulakis, A I G , 1991) Lems FL T-ACJul 94 
1536 

comments on “Stability margin evaluation for uncertain linear systems” 
(byC Gong and S I hompson, Jun 94 548-550) Su J-H T-AC Dec 94 
2523-2524 

continuous-time adaptive decoupling control design Ortega R + T-AC 
Aug 94 1639-1643 

continuous-time multivariable syst reduction Krajewski W + T-AC 
Oct 94 2126-2129 

cyclicly switched param-adaptive control systs, MIMO design 
modcls/intemal regulators Morse, A S + 7-4C Sep 94 1809-1818 
decentralized control, closed loop props Campo PJ+ T-AC May 94 
932-943 
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diagonal decoupling, dyn. output feedback /const, precompensator. Eldem, 
K T-AC Mar 94 503-511 

direct MRAC algm. de Mathelin, M, + , T-AC Aug 94 1611-1617 
eigenstructure assignment by decentralized output Redback. Guang-Ren 
bum, T-AC May 94 1009-1014 

floating platform, modeling and control. Damen, AA.H., +, T-AC May 94 
1075-1078 

Hoo mpdel reduction computational scheme. Kavranoglu, D. t T-AC Jul 94 
1447-1451 

hysteresis switching MRAC of lin. multivariable systs. Weller, S.R., + , 
T-AC M 94 1360-1375 

ident., state-space freq domain approach. Bayard, D.S., T-AC Sep 94 
1880-1885 

large scale interconnected systs. with delays, decentralized stabilization. 
Zhongzhi Hu, T-AC Jan 94 180-182 

lin. multivariable systs., fund, notion of equivalence. Pugh, A.C., +, T-AC 
May 94 1141-1145 

M1MO lin. systs. with sens./actuator failures, stabil. Gundes, A.N., T-AC 
Jun 94 1224-1230 

M1MO nonlin. sysl. I/O pseudolinearization. Lawrence, DA., + , T-AC 
Nov 94 2207-2218 

MIMO stochastic discrete syst. state estim. Liu Danyang, + , T-AC Oct 94 
2087-2091 

multichannel gain margin improvement, sampled-data hold fns. Chang 
Yang + , T-AC Mar 94 657-661 

multivariable nonminimum phase discrete-time systs. LTR procedure. 

Leon de la Barra SB.A., T-AC Mar 94 574-577 
multivariable systs., quantitat. robustness measures. Horng-Gtou Chen, 
+ , T-AC Apr 94 807-810 

nonlin. act. vibr. damping. Kanestrom, R.K., + , T-AC Sep 94 1925-1928 
nonlin. systs., asymptotic tracking, necessary conditions. Grizzle, JW,+, 
T-AC Sep 94 1782-1794 

output feedback problem in lin. systs., bilinear formulation. Syrmos, V.L, 
+ , T-AC Feb 94 410-414 

robust anal, uncertainty value sets Eszter, EG., + , T-AC Nov 94 
2315-2318 

robust MIMO terminal sliding-mode control, rigid robotic manipulators. 

Zhihong, M, + , T-AC Dec 94 2464-2469 
sing.-free multivariable MRAC. Moctezuma, R.G, + , T-AC Sep 94 
1856-1860 

std. multivariable // 2 -optimal control problem, polynomial soln Hunt, 
K.J., 4 , T-AC Jul 94 1502-1507 

switching surface design for multivariable VSS Ju-Jang Lee, + . T-AC 
Feb 94 414-419 

uncertain lin. systs., multivariable, stabil. margin eval. Gong, C., + , T-AC 
Mar 94 548-550 

uncertain nontin. systs., struct, invariance. Castro-Linares, /?.,+, T-AC 
Oct 94 2154-2158 1 


N 

Navigation; cf. Land navigation 
Networks; cf. Petri nets 
Neural network applications 

binary hypothesis testing, struct adaptive nets Papadakis, I.N.M., + , 
T-AC Sep 94 1967-1971 

quantized constrained systs, feedback control, appls to neuromorphic 
controllers design Sznaier, M , + , T-AC Jul 94 1497-1502 
Neurocontrollers 

book review; Handbook of Intelligent Control: Neural, Fuzzy and 
Adaptive Approaches (White, DA., and Sofge, D.A., Eds.; 1992). 
Samad, T.. T-AC Jul 94 1534-1535 

continuous-time nonlin. systs adaptive control by neural nets. Fu-Chuang 
Chen, + , T-AC Jun 94 1306-1310 

quantized constrained systs, feedback control, appls. to neuromorphic 
controllers design Sznaier, M, +, T-AC Jul 94 1497-1502 

Newton’s method 

dyn. shock-error models , online estim. Knshnamurthy, V, T-AC May 94 
1129-1135 

state-constrained optimal control, generalised dual quasi-Newton algm. 
Shimizu, K, + , T-AC May 94 982-986 
Noise 

continuous time systs., transfer fns., model errors. Schoukens, J, +, T-AC 
Aug94\m-\W 

failure detect/isolation/accornmodation syst. Chia-Chi Tsui, T-AC Nov 94 
2318-2321 

model matching, suboptimal perfect, wHh noise, MRACS. Mutoh, Y., +, 
T-AC Feb 94 422-425 

worst-case syst. ident., time complexity. Pooila, K., + , T-AC May 94 
944-950 


Noise; cf. Filter noise; Gaussian noise; White noise 

Nonlinear difference equations; cf.Riccati equations, discrete-time 

Nonlinear differential equations 

adaptive control, nonlin. systs., triangular struct. Seto, D , +, T-AC Jul 94 
1411-1428 

almost sure sample stabil. of nonlin. stochastic dyn. systs. Zhi Yu Zhang, 

+ , T-AC Mar 94 560-565 

sing, perturbed control systs. and nonlin. differential-alg. eqns. Krishnan, 
H, + , T-AC May 94 1079-1084 

Nonlinear differential equations; cf. Riccati equations, differential 
Nonlinear equations; cf. Newton’s method; Riccati equations, algebraic 
Nonlinear filtering 

nonlin. syst. state estim., conditionally min. algm. Pankov, A.R., +, T-AC 
Aug 94 1617-1620 

Nonlinearities; cf. Hysteresis nonlinearities 
Nonlinear systems 

adaptive control systs., transient bounds. Zhuquan Zang, +, T-AC Jan 94 
171-175 

almost sure sample stabil. of nonlin. stochastic dyn. systs. Zhi Yu Zhang, 

+ , T-AC Mar 94 560-565 

asymptotic model matching. Di Benedetto , M.D., + , T-AC Aug 94 
1539-1550 

asymptotic tracking, necessary conditions. Grizzle, J.W, + , T-AC Sep 94 
1782-1794 

bilinear systs., state space anal., orthogonal series approach 
Paraskevopoulos, P.N., +, T-AC Apr 94 793-797 
book review; Nonlinear System Analysis, 2nd edn. (Vidyasagar, M, 
1993). Abed, E H., T-AC Jul 94 1535-1536 
book review; Nonlinear Systems (Khalil, H.; 1992). Grizzle, J W, T-AC 
Jan 94 251-252 

brushless DC motors, feedback-1 in. control charactn. In-Joong Ha, + , 
T-AC Mar 94 673-677 

cascaded nonlin. systs., global robust stabilization Imura, J -I, + , T-AC 
May 94 1084-1089 

certainty equivalence control, dyn. games. James, MR, T-AC Nov 94 
2321-2324 

constrained nonlin syst receding horizon control Shin-Yeu Lin, T-AC Sep 
94 1893-1899 

control algms., automatic differentiation appl Campbell, SL, + , T-AC 
May 94 1047-1052 

delay-independent exponential stabil. criteria for time-varying discrete 
delay systs. Wu, J W, +, T-AC Apr 94 811-814 
detect, networks with multiple event struct, optim Pete, A ,+, T-AC Aug 
94 1702-1707 

deterministic nonlin. syst ident. Shir-Kuan Lm, T-AC Sep 94 1886-1893 
differential-delay systs., absol stabil Gil, MI, T-AC Dec 94 2481-2484 
discrete-time adaptive nonlin. syst., least sq. estimator. Kanellakopoulos 
l, T-AC Nov 94 2362-2365 

discrete time Hammerstein syst. ident. Lang Zi-Qlang, T-AC Mar 94 
569-573 

discrete-time nonlin. control systs , design via smooth feedback Wei Lm 
+ , T-AC Nov 94 2340-2346 

discrete-time nonlin. syst. control, stabil. property. Jie Huang, + , T-AC 
Nov 94 2307-2311 

discrete time nonlin. systs., stabil., time delayed feedback. Xueshan Yang 
+ , T-AC Mar 94 585-589 

discrete-time systs., input-lo-state stabil. condition, global stabilization 
Kazakos, D., + , T-AC Oct 94 2111-2113 
first-order nonlin. syst. adaptive control. Brogliato, B., + . T-AC Aug 94 , 
1764-1768 

fuzzy control systs., supervisory controller. Li-Xiu Wang, T-AC Sep 94 
1845-1847 

H* control, output-feedback based. Lu, W. -M.,+, T-AC Dec 94 2517-2524 
input-output decoupling, nonlin interactor appl. Di Benedetto, M.D, t 
T-AC Jun 94 1246-1250 

input-output lin., state equivalence and decoupling In-Joong Ha, + ,T 1C 
Nov 94 2269-2274 

internally stable nonlin. systs. with disturbances, par&mcterizat’m 
Hammer, J, T-AC Feb 94 300-314 

interval plants, closed-loop hyperstability. Foo, Y.K, + , T-AC Jan 94 
151-154 

lin. systs., nonlin design of adaptive controllers. Krstic, M, + , T-AC I p r 
94 738-752 

MIMO nonlin. syst. I/O pseudolinearization. Lawrence , D.A., + , 7 
Nov 94 2207-2218 

MRAC, robust, SISO systs. Zhihua Qu, +, T-AC Nov 94 2219-2234 
multi-DOF nonlin. damping model, spectral dens. Weijian Zhang, J A( 
Feb 94 406-410 

nonlin. servomechanism robust control. Jie Huang, + , T-AC Ju 94 
1510-1513 

nonnegative feedback control, oscills stabilisation. Zaslavsky, B., > 4C 
Jun 94 1273-1276 
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num design Kreisselmeier G + T-AC Jan 94 33-46 

outfit feedback stabilization of nonlin sysls Tsimas J + T-AC Apr 94 

parametrically uncertain nonlin feedback systs , absol stabil Marquez 
HJ,+ T-AC Mar 94 664-668 

parametrically uncertain nonlin systs, robust stabilisation Schoenwald 
DA , + T-AC Aug 94 \75\A755 

partially obs discrete-time nonlin systs, risk-sensitive control and dyn 
games James MR + T-AC Apr 94 780-792 
pos real systs nonlin controllers Bernstein DS + T-AC Jul 94 
1513-1517 

P-type learning control Saab S S T-AC Nov 94 2298-2302 
recursive ident, nonlin Wiener model Wtgren T T-AC Nox 94 
2191-2206 

robots, trajectory stabilization for systs, nonholonomic constraints 
Walsh G + T-AC Jan 94 216-222 

robust control design, cascade struct approach Bomvento C + T-AC 
Apr 94 846-849 

stabilization, adaptive, nonlin time-varying controller Miller DE T-AC 
Jul 94 1347-1359 

stabilization otdiscrete-time nonlin systs global Byrnes Cl + T-AC 
Jan 94 83-98 

state estim, conditionally min nonlin filtering Pankov AR + T-AC 
Aug 94 1617-1620 

stochastic dyn systs , exact lin Socha L T-AC Sep 94 1980-1984 
switched-mode power converters multirate modeling and control design 
KhayaUan A + T-AC Sep 94 1848-1852 
time lag nonlin systs ,vibr control Lehman B + T-ACMay 94898-912 
time-varying functional diff eqns, delay independent stabil conditions/ 
decay estim Iehman B + T AC Aug 94 1 673 1676 
uncertain nonlin interconnected systs time-varying state delay, 
stabilizing control Mahmoud M S T-AC Dec 94 2484-2488 
uncertain nonlin systs struct invariance Castro Linares R + T-AC 
Oct 94 2154-2158 

uncertain nonlin systs tracking control Song YD + T-AC Sep 94 
1866-1871 

uncertain S1SO min phase lin syst nonlin universal servomechanism 
Ryan EP T AC Apr 94 753-761 

vibr control by AP-forcing Balestrmo A + T AC Jun94 1255-1258 
Wiener systs nonparamctnc ident Grebhcki W T-AC Oct 94 2077-2086 
Nonlinear systems, cf Bilinear systems 
Nonparametric estimation 

Wiener systs nonparametric ident Grebhcki W T AC Oct 94 2077-2086 

Numerical integration 

Routh table prop quadratic integral eval appl Beghi A + T-AC Dec 
94 2494-2496 
Numerical methods 

dyn syst simul fast parallel recursive aggregation Tsai W K + T AC 
Mar 94 534-540 

greatest common divisor of polynomials comput matrix pencil based 
num method Karcanms N + T-AC May 94 977-981 
lin quadratic control convex prog num method Peres PhD + T-AC 
Jan 94 198-202 

Numerical methods, cf Approximation methods Convergence of numerical 
methods, Newton s method Optimization methods. Relaxation 
methods 

Nyquist stability 

closed loop syst freq response plots comment Gesvng R T-AC Aug 94 
1770-1771 

interval lin control syst robust parametric design Keel L H + T-AC 
Jul 94 1524-1530 

interval plant family Nyquist envelope Hollot C V + T-AC Feb 94 
391-396 

parametrically uncertain systs , anal and synthesis Kaminsky R D + 

T AC Apr 94 874-876 


O 

lbs rvabillty 

wtinuous descriptor systs adaptive observer Uetake Y T-AC Oct 94 
W5-2100 

1 I DS, observability delay Bose S h T AC Apr 94 803-806 
l dback control by multirate PWM Khayatian A + T-AC Jun 94 
1292-1297 

( lussian stochastic control syst tuning vanSchuppen JH T-AC Nov 94 
178-2190 

1 Itiscale systs, Kalman filters, Riccati eqns Chou KC + T-AC Mar 
1 479-492 

nlm observer, state feedback, asymptotic stabil Hunt LR + T-AC 
<>ct 94 2113-2118 

P realness conditions charactn assuming controllability Weiss H + 
AC Mar 94 540-544 


SISO LTI discrete-time systs , par am -adaptive controller Kreisselmeier 
G T-AC Sep 94 1819-1826 

switched-mode power converters, multirate modeling and control design 
Khayatian, A + T-AC Sep 94 1848-1852 
time-mvariant lin systs controllability/observability Raining Wang + 
T-AC Jul 94 1443-1447 
Observers 

asynchronous sampled data syst control design Voulgans P T-AC Jul 
94 1451-1455 

DEDS, observability, delay Bose V + T-AC Apr 94 803-806 
discrete-time syst exponential stabil observer design Aitken VC + 
T-AC Sep 94 1959-1962 

disturbance decoupled observer design Hou M + 7 -AC Jun 94 

1338-1341 

full-order/low-order observers, LTI plant, unified ingular syst based 
theory Cobb JD T-AC Dec 94 2497-2502 
H« compensator design min order observers Hsu CS + T-AC Aug 94 
1679-1681 

H»-optim , reduced order observer based controller Stoorvogel A A + 
T-AC Feb 94 355-360 

lin observers, parameterization and design Ding X + T-AC Aug 94 
1648-1652 

multibody flexible systs elastic mode estim Karray h + T-AC May 
94 1016-1020 

observer design, eigenvalue assignment, residual generation Magm J-F 
+ T AC Feb 94 441-447 

observers for nonlin systs in steady state Hunt LR + T-AC Oct 94 
2113-2118 

partially obs discrete-time nonlin systs , risk-sensitive control and dyn 
games James MR + T-AC Apr 94 1^0-192 
robots, robust control, lin estim state feedback Berghuis H + T-AC 
Oct 94 2159-2162 

slowly varying and LTI systs , ident and uncertainty principles lames 
G + T-AC Sep 94 1827-1838 

State observer/feedback compensators, comment Bender DJ T-AC Feb 
94 447-448 

switched-modc power converters, mullirate modeling and control design 
Khayatian A + T AC Sep 94 1848-1852 
unknown input lin systs full-order observer design Darouach M + 
T-AC Mar 94 606-609 

unknown input observers design Darouach M T-AC Mar 94 698-699 
unstable plants sampled-data observers generalized holds Haddad 
WM + T-AC Jan 94 229 234 
Observers, cf Adaptive observers 
Optimal control 

bilinear systs , adaptive stabilization least sq Xi Sun -» T-AC Jan 94 
207-211 

book review Applied Optimal Control and Estimation 1 (Lewis F L 
1992) Kamen EW 1 AC Aug 94 1773-1774 
certainty equivalence control dyn games James MR T-AC Nov 94 
2321-2324 

closed Jackson queueing network decentralized control Dye Jyun Ma + 
T-AC Jul 94 1460-1463 

descriptor sySts lin feedback closed-loop Stackclberg strategy Hua Xu 
H T AC May 94 1097-1102 

detect networks with multiple event struct ,optim Pete A + T-AC Aug 
94 1702-1707 

discrete-time Riccati eqn H® control appl Stoorvogel A A + T-AC 
Mar 94 686-691 

discrete-time systs mixed H 2 /H® control exact convex optim based soln 
Sznaier M T-AC Dec 94 2511 -2517 
dissipative H 2 /FU controller synthesis Haddad WM + T-AC Apr 94 
827-831 

failure-prone mfg systs , structural props of optimal prod controllers 
Jian-Qiang Hu + T-AC Mar 94 640-643 
finite-time optimal control of bilinear systs successive approx procedure 
Aganovic Z + T-AC Sep 94 1932-1935 

floating platform, modeling and control Damen A AH + T-AC May 94 
1075-1078 

tour-block problem, recursive Schur-based soln Constantmescu T + 
T-AC Jul 94 1476-1481 

Hqo control syst ident, least sq methods, comment Livstone MM + 
T-AC Jul 94 1531 

H« model reduction computational scheme Kavranoglu D T-AC Jul 94 
1447-1451 

H:-optimal zeros placement Kabamba P T + T-AC Jun 94 1298-1301 
H® control of discrete-time uncertain systs Geromet JC + T-AC May 
941012-1075 

H® control of syst 5 under norm bounded uncertainties Gu K T-AC Jun 
94 1320-1322 

FL-optimal discrete-time fixed-point and fixed-lag smoothing, game 
theory approach Theodor Y + T-AC Sep 94 1944-1948 


k author entry for coauthors 


f Check author entry for subsequent corrections/comments 





H*°-optlmal ccmtrolfor sing, perturbed systs., imperfect statemcas, Zlgang 
Pan, + , T-AC Feb 94 280-299 

infinite horizon LQ control, so to. approx. Schochetman, IE., +, T-AC Mar 
H 596^601 f 

L 3 optimal model order reduction problem, input normal form homotopy. 
Ge, K, +, T-AC Jun 94 1302-1305 

mixed Hz/Hgo control, Nash game approach. Limebeer, D.J.N., + , T-AC 
Jan9469-B2 

M/M/I queue with bal. budget, optima] flow control. Chakravorti, B,, 
T-AC Sep 94 1918*1921 

multirate sampled-data systs., Hz-optimal design. Qui, L, + , T-AC Dec 
94 2506-25 U 

optimal dyn. output feedback for nonzero set point regulation, 
discrete-time case. Haddad\ W.M., + , T-AC Sep 94 1921-1925 
optimal filtering, stochastic discrete-time systs., unknown inputs. Borisov, 
A. V„ +, T-AC Dec 94 2461-2464 

periodic discrete-time Riccati eqn„ num. soln. Hench, J.J., +., T-AC Jun 
94 1197-1210 

prod, control, multiple time scales approach Jiang, J., + , T-AC Nov 94 
2292-2297 

sampled-data robust control, exponential hold, optimal algm. Yung-Chun 
Wu t + , T-AC Jan 94 112-117 

SISO-distributed plants, optimal mixed sensitivity. Flamm, D S., +, T-AC 
Jun 94 1150-1165 

state-constrained optimal control, generalised dual quasi-Newton algm. 

Shimizu, K, + , T-AC May 94 982-986 
std. multivariable /^-optimal control problem, polynomial soln. Hunt, 
K.J., +, T-AC Jul 94 1502-1507 

stochastic dyn. job shops/prod. planning. Sethi, S., + , T-AC Oct 94 
2061-2076 

transfer matrices preconditioning. Rotstein, B., T-AC Nov 94 2287-2292 
vector discrete-event syst. controller synthesis. YongLi, + , T-AC Mar 94 
512-531 

Optimal control; cf. Cost-optimal control; H“ optimization; Linear- 
quadratic control; Minimax control; Minimum-energy control; 
Quantized control; Stochastic optimal control; Suboptimal control; 
Time-optimal control 
Optimization methods 

constrained option, in Hilbert space. Shimizu, K..+, T-AC May 94 982-986 
data network distributed asynchronous routing, convergence. Zhi-Quan 
Luo, + , T-AC May 94 1123-1129 

data network path formulated optimal routing, time complexity. Antonio, 
J.K., +. T-AC Feb 94 385-391 

detect, networks with multiple event struct., optim. Pete, A., +, T-AC Aug 
94 1702-1707 

extended horizon liftings for nonminimum-phase systs. Bayard, D.S, 
T-AC Jun 94 1333-1338 

failure prone mfg. Syst. hedging point policies. Jian-QiangHu, +, T-AC 
Sep 94 1675-1880 

global optim., adaptive partitioned random search. Bo Tang, Z., T-AC Nov 
94 2235-2244 

heuristics, rules of thumb, and the 80/20 proposition. Yu-Chi Ho, T-AC 
May 94 1025-1027 

Heo control syst. ident., least sq. methods, comment. Livstone, M.M., + , 
T-AC Jut 94 1531 

ident., family of norms. Massoumnia, M.-A ., +, T-AC May 94 1027-1031 
Kalman filtering for uncertain discrete-time systs. Lihua Xie, +, T-AC Jun 
94 1310-1314 

lin. syst approx., time-scaling factor, Laguerre models. Wang, L., +, T-AC 
Jul 94 1463-1467 

L 1 optimal model order reduction problem, input normal form homotopy. 
Ge, Y., +., T-AC Jun 94 1302-1305 

MIMO stochastic discrete syst. state estim. Liu Danyang, +, T-AC Oct 94 
2087-2091 

min. lin. plants parameterization. Davis, L.D., + , T-AC Apr 94 849-852 
muitiresolutional distributed filtering. Lang Hong, T-AC Apr 94 853-856 
path formulated optimal routing algm , time complexity. Antonio, J.K ., 
+ , T-ACSep 94 1839-1844 

perturbed Lin. systs., upper covariance bounds. Boizem, P. t +, T-AC Mar 
94 6 23-626 

rational transfer fn. ( optimal L» approx Kavranoglu, D, + , T-AC Sep 94 
1899-1904 

regenerative syst. optim., infinitesimal perturb, anal. Chong, EXP., + , 
T-AC Jul 94 1400-1410 

sens, fiision, decentralized detect optimal thresholds Irving, W.W., + , 
T-AC Apr 94 835-838 

single-machine scheduling, resource optimal control. Cheng, TCE., + , 
T-AC Jun 94 1243-1246 

$ISO discrete-time systs. preview tracking. Halpern, M E., T-AC Mar 94 
589-592 

S1SO systs., robust stabilization. Olbrot, A.W., +, T-AC Mar 94 652-657 


time-invariant lin. systs., robust controller synthesis. Rantzer,A., + , T-A( 
Sep 94 1802-1808 

uncertain syst. pole assignment, polynomial approach. Figueroa, J.L, + 
T-AC Apr 94 831-835 

Optimization methods; cf. Approximation methods; Gradient methods, 
Least-mean-square methods; Least-squares methods; Minima'* 
methods; Minimization methods 
Output feedback 

closed-loop vibr. control, Youla parameterization. Kabamba, P.T., + 
T-AC Jul 94 1455-1459 

comments on 1 "Stabilization via static output feedback'*. Ptmpalkhan 
A, A., +, T-AC May 94 1148 

continuous-time nonlin. systs. adaptive control by neural nets. Fu-Chuan 
Chen, + , T-AC Jun 94 1306-1310 

descriptor systs. control by output feedback. Lovass-Nagy, V.. + , T-AC 
Jul 94 1507-1509 

descriptor systs. regularization by output feedback. Bunse-Gerstner, A, 
+ , T-AC Aug 94 1742-1748 

diagonal decoupling, dyn. output feedback/const, precompensator. Eldem, 
V., T-AC Mar 94 503-511 

discrete time nonlin. systs., stabil., time delayed feedback. Xueshan Yang, 
+ , T-AC Mar 94 585-589 

eigenstructurc assignment by decentralized output feedback. Guang-Ren 
Duan, T-AC May 94 1009-1014 

feedback control by multirate PWM. Khayatian, A., + , T-AC Jun 94 
1292-1297 

Ha-control, state availability rel., pole/zero cancellations. Miyamoto, S., 
+ , T-AC Feb 94 379-381 

MIMO lin. systs. with sens./actuator failures, stabil. Gundes, A.N., T-AC 
Jun 94 1224-1230 

mobile manipulator locomotion/manipulation coord. Yamamoto, Y., + , 
T-AC Jun 94 1326-1332 

nonlin. syst. H® control, output-feedback based. Lu, W.-M, + . T-AC Dec 
94 2517-2524 

output feedback problem in lin. systs., bilinear formulation. Syrmos, V.L ., 
+ , T-AC Feb 94 410-414 

output feedback stabilization of nonlin. systs. Tsinias, J, +, T-AC Apr 94 
806 

pole placement by output feedback. Lee, T.H , +, T-AC Mar 94 565-568 

pos. realness conditions, charactn. assuming controllability. Weiss, H., +, 
T-AC Mar 94 540-544 

stabilization of discrete-time nonlin. systs., feedback equivalence. Byrnes, 
C.1..+, T-AC Jan 94 83-98 

time-invariant lin. syst. adaptive control. Karason, S.P., + , T-AC Nov 94 
2325-2330 

uncertain nonlin. interconnected systs., time-varying state delay, 
stabilizing control. Mahmoud, MS., T-AC Dec 94 2484-2488 

uncertain systs. stabilizing control design, quasiconvex optim. Keqin Gu, 
T-AC Jan 94 127-131 


Packet radio 

optimal multicopy Aloha. Wong, E.W.M., + , T-AC Jun 94 1233-1236 

Packet switching 

flow control, rate-based, monotonicity/concavity. Budka, K.C., T-ACMar 
94 544-548 

optimal multicopy Aloha. Wong, E.W.M., + , T-AC Jun 94 1233-1236 

Parallel processing 

constrained nonlin. syst receding horizon control. Shin-Yeu Lin, T-AC Sep 
94 1893-1899 

Parameter estimation 

adaptive control by multiple models/switching. Narendra, K.S., + , T AG 
Sep 94 1861-1866 

adaptive control systs., chaotic behavior. Goraalez, G.A., +, T-AC On 94 
2145-2148 

adaptive pole placement without excitation probing sigs. Lozano, R f . 
7MC Jan 94 47-58 

autonomous vision-based mobile robot. Baumgartner, E.T., +, T-AC Mm 
94 493-502 

bounded-error tracking of time-varying params. Piet-Lahanier, H., ■ 
T-AC Aug 94 1661-1664 

discrete-time adaptive nonlin. syst., least sq. estimator. KanellakopoMas, 
L, T-AC Nov 94 2362-2365 

extended Chandrasekhar recursions. Sayed, A.H., + , T-AC Ma 94 
619-623 

first-order nonlin. syst. adaptive control. Brogliato, B., + , T-AC Au a 9* 
1764-1768 

guaranteed param. estim. problem with uncertain stats., Kalman-F icy 
filter accuracy. Matasov, A.L , T-AC Mar 94 635-639 

least absol. values estim., Computational aspects. Fiodorov, E D., ? At 
. Mar 94 626-630 
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least sq. estim. in white noise, convergence. Nassiri-Toussi, K. t +, T-AC 
Feb 94 364-368 

lin. syst. approx., time-scaling factor, Laguerre models Wang, L , +, T-AC 
Jul 94 1463-1467 

manipulators with parametric uncertainty, robust control Keun-Mo Koo, 
+ , T-AC Jun 94 1230-1233 

multibody flexible systs, clastic mode estim. Karray, F, + , T-AC May 
94 1016-1020 

multivariable direct MRAC, identifiable parameterization and param. 
convergence, de Maihelm, M, +, T-AC Aug 94 1612-1617 
nonminimum phase first-order continuous-time systs., adaptive 
stabilization Lozano, R, +, T-AC Aug 94 1748-1751 
random param tracking, robust algm Juditsky, A , + , T-AC Jun 94 
1211-1221 

stochastic lin syst. param hypothesis testing, slat sampling Duncan, 
TE.+, T-AC Jan 94 118-122 

time delay estim. in continuous LT1 systs Tuch, 1, + , T-AC Apr 94 
823-827 

Parameter identification 

continuous lime systs , transfer fns, model errors Schoukens, J, + T-AC 
Aug 94 1733-1737 

deterministic nonlin syst ident Shir-Kuan Lin, T-AC Sep 94 1886-1893 
time-varying params, bounded error ident Btttanti, S, +, T-AC May 94 
1106-1110 

transfer fn. param ident. in freq domain Pintelon, R . + . T-AC Nov 94 
2245-2260 

Parameter identification; cf. Power system identification 
Parameter-space methods 

cyclicly switched param -adaptive control systs, M1MO design models/ 
internal regulators Morse, A S, + T-AC Sep 94 1809-1818 
Hoo control of discrete-time uncertain systs Geromel, JC, +, T-AC May 
94 1072-1075 

param -adaptive control, cyclic switching strategy Pail, FM, + , T-AC 
Jun 94 1172-1183 

Parameter uncertainty; cf. Uncertain systems 
Partial differential equations 

evol eqns, parab partial DE, exponential stabil Keum-Shk Hong, + , 
T-AC Jul 94 1432-1436 

parab systs direct adaptive control Keum Shik Hong, + , T-AC Oct 94 
2018-2033 

spectral syst fimte-dimens approx Erickson, MA, H , T-AC Sep 94 
1904-1909 
Passivity 

rigid spacecraft adaptive attitude control Egeland, O . -t , T-AC Apr 94 
842-846 

robot adaptive control based on passivity Km Tang, + , T-AC Sep 94 
1871-1875 

second-order dyn systs, dissipative controller designs Morns, KA, + , 
l-AC May 94 1056-1063 

Periodic control 

multiralc samplcd-dala systs struct props Longhi, S T-AC Mar 94 
692-696 

periodically time-varying lin discrete-time plants, decentralized control 
Khargonekar, P P , + , T-AC Apr 94 877-882 
Perturbation methods 

discrete-time systs, perturb, comment Eslami, M, T-AC Aug 94 
1768-1769 

regenerative syst optim , infinitesimal perturb anal Chong, EKP , + , 
T-AC Jul 94 1400-1410 
Petri nets , 

event graphs, stochastic, cycle time Proth, J-M, + , T-AC Jul 94 
1482-1486 

logical discrete event systs, I yapunov stabil Passino, K M , +, T-AC Feb 
94 269-279 

supervisory control, blocking and controllability of Petri nets. Giua, A , 
* , T-AC Apr 94 818-823 

PI inning; cf. Manipulators, motion planning; Manufacturing planning, 
Mobile robots, motion planning 
Pi ssson processes 

asynchronous systs with Poisson transits, stabil Leland, R P, T-AC Jan 
94 182-185 
Pole assignment 

adaptive pole placement without excitation probing sigs Lozano, R, + . 
T-AC Jan 94 41-5% 

disturbance decoupling, constrained Sylvester eqns Syrmos, V.L, T-AC 
Apr 94 797-803 

3-stabil, robust, for lin. uncertain discrete delay systs Te-JenSu, + , T-AC 
Feb 94 425-428 

Russian stochastic control syst tuning van Schuppen, J.H, T-AC Nov 94 
2178-2190 

n discrete syst pole assignment Benzaouta, A , T-AC Oct 94 2091-2095 
(ile placement by output feedback Lee, TH, +, T-AC Mar 94 565-568 


robust adaptive pole placement control. Weyer, E, + , T-AC Aug 94 
1665-1671 

uncertain continuous-time implicit systs., regional pale placement, 
robustness. Chun-Hsiung Fang, +, T-AC Nov 942303-2307 
uncertain syst pole assignment, polynomial approach. Figueroa, JL,+, 
T-AC Apr 94 831-835 

variable struct, generator voltage regulator design, pole assignment tech 
Aggoune, ME,+, T-AC Oct 94 2106-2110 

Poles and zeros 

adaptive pole placement without excitation probing sigs Lozano , R, + . 
T-AC Jan 94 47-58 

comments on “System zeros determination from an unreduced matrix 
fraction description” (by K.S Yeung and C-M. Kwan, Nov 93 
1695-1697) Ferreira, PMG, T-AC Nov 94 2367 
corrections to “On undershoot in S1SO systems” (Mar 94 578-581) de la 
BaraS, B A L, T-AC Aug 94 1771 

D-stabil, robust, for lin uncertain discrete delay systs Te-JenSu, +, T-AC 
Feb 94 425-428 

dyn lin. systs, pos real lemma generalization Scherer, R,+, T-AC Apr 
94 882-886 

extended horizon liftings for nonminimum-phase systs Bayard[ DS, 
T-AC Jun 94 1333-1338 

H 2 -optimal zeros placement Kabamba, PT, 4 , T-AC Jun 94 1298-1301 
Ha-control, state availability rel, pole/zero cancellations Miyamoto, S, 

+ , T-AC Feb 94 379-381 

iter algm tor pole placement Lee, TH, + , T-AC Mar 94 565-568 
left-invertible generalized state space systs, disturbance rejection 
Paraskevopoulos, P N , + , T-AC Jan 94 185-190 
lin syst approx , time-scaling factor, Laguerre models Wang, L , +, T-AC 
Jul 94 1463-1467 

rational L ] suboptimal compensators for continuous-time systs Blanchim, 
F, H . T-AC Jul 94 1487-1492 

sampled data control systs and tracking, fn space approach Yamamoto, 
Y, T-AC Apr 94 703-713 

scmi-cancellable fraction transfer fns in syst theory Bourles, H, T-AC 
Oct 94 2148-2153 

SISO systs , step response, undershoot Leon de la Barra S, BA , T-AC 
Mar 94 578-581 

state observer/feedback, compensators, comment Bender, D J ,T-AC Feb 
94 447-448 

Poles and zeros; cf. Zero assignment 
Polynomial approximation 

discrete time Hammerstein syst ident Lang Zi-Qiang, T-AC Mar 94 
569-573 

Polynomial approximation; cf. Spline functions 
Polynomial matrices 

deadbeat ripplc-free tracking Jetto, L , T-AC Aug 94 1759-1764 
delta-operator formulated real polynomials, tabular method for 
determining root distnb Prcmaratne, K, + , T-AC Feb 94 352-355 
greatest common divisor of polynomials comput, matrix pencil based 
num method Karcamas, N, + , 1-AC May 94 977-981 
J-spectral factorization Kwakernaak, H, +, T-AC Feb 94 315-328 
lin multivariable systs, fund notion of equivalence Pugh, AC , + , T-AC 
May 94 1141-1145 

std multivariable // 2 -optimal control problem, polynomial soln Hunt, 
KJ,+, T-AC Jul 94 1502-1507 

Sylvester eqns, generalised, consistency Wimmer, H.K, T-AC May 94 
1014-1016 
Polynomials 

1 -D and 2-D digital recursive filters, stabil, Schur polynomials Barret, 
M, + , T-AC Nov 94 2335-2339 

extreme-point robust stabil, discrete-time polynomials Perez, F, + , 
T-AC Jul 94 1470-1472 

freq response arcs, stable polynomial, convexity Keqin Gu, T-AC Nov 94 
2262-2265 

Hurwitz and Schur polynomials, argument conditions Bose, NK, T-AC 
Feb 94 345-346 

Hurwitz polynomials props, polynomial families stabil anal Duan, G -R, 
+ . T-AC Dec 94 2490-2494 

interval lin. control syst, robust parametric design. Keel, LH., + , T-AC 
Jul 94 1524-1530 

parametrically uncertain systs , anal, and synthesis. Kaminsky, RD, +, 
T-AC Apr 94 874-876 

polynomial differential eqns., domain of attraction estim Levin, A , T-AC 
Dec 94 2471-2475 

robust adaptive pole placement control. Weyer, E, + , T-AC Aug 94 
1665-1671 

Robust stabil of polynomials, multilinearly depend coeff. perturb Tian 
Yu-Ptng, + , T-AC Mar 94 554-558 

Routh tabic prop., quadratic integral eval appl Beghi, A, +, T-AC Dec 
94 2494-2496 
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sing, root distrib. problems, displacement struct, approach. Pal, D., + , 
T-AC Jan 94 238*245 

stabil. domains, assigned root location polynomials, convexity props. Tesi, 
A., +, T-AC Mar 94 668*672 

stabilisation, uncertainties/numerator-dcnominator coupling. Chockalingam, 
G.,^, T-AC Sep 94 1955-1958 

strictly Harwitz conditions, stabil. anal. appl. Polyak, B.T., +, T-AC May 
94 1147-1M8 

time-delay syst. robust stabil., quasipolynomial convex directions/tosting 
sets. Kharitonov, V,L., +, T-AC Dec 94 2388-2397 
time-invariant lin systs., stabil. domains inversion. Walter , E, + , T-AC 
Apr 94 886*889 

uncertain syst. pole assignment, polynomial approach. Figueroa, J.L., + , 
T-AC Apr 831-835 

Position control 

autonomous vision-based mobile robot. Baumgartner, EJ. t +, T-AC Mar 
Pd 493-502 

contact stabil. of simple posn. controllers, effect of time delay and discrete 
control. Fiala, J. t + , T-AC Apr 94 870-873 
mobile manipulator locomotion/manipulation coord. Yamamoto, Y„ + , 
T-ACJm 94 1326-1332 

motion/force control, robust, with nonholonomic constraints. Chun-Yi Su, 
+ , T-AC Mar 94 609-614 

robotic arms, repositioning control by learning, lucibello. P., T-AC Aug 
94 1690-1694 

robot manipulators, compliant, force/posn. regulation. Chiaverini, S, + , 
T-AC Mar 94 647-652 

stick-slip avoidance, PD control. Dupont, P E., T-AC May 94 1094-1097 

Positive real functions 

dyn. lin. systs., pos. real lemma generalization Scherer, R., + , T-AC Apr 
94 882-886 

pos. realness conditions, charactn. assuming controllability. Weiss, H., + , 
T-AC Mar 94 540-544 

second-order dyn. systs., dissipative controller designs. Moms, KA, + , 
T-AC May 94 1056-1063 

Power demand; cf. Load forecasting 

Power generation control, excitation 

variable struct, generator voltage regulator design, pole assignment tech. 
Aggoune, M E, + . T-AC Oct 94 2106-2110 

Power generation control, load frequency 

alternating renewal elec, load models. El-Fertk, S, + , T-AC Jun 94 
1184-1196 

Power generation excitation systems: cf. Power generation control, 
excitation 

Power system identification 

alternating renewal elect load models El-Ferik, S, + , T-AC Jun 94 
1184-1196 

Power system modeling; cf. Load modeling 

Prediction methods 

lin. discrete-time systs., optimum lin. recursive estim. Caraio, AH,+ , 
T-AC Aug 94 1636-1638 

recursive prediction error algm. log likelihood fn. derivatives Hooker, 
M,A., T-AC Mar 94 662-664 

Predictive control 

adaptive predictive controller, robustness anal. Clarke, DW, + , T-AC 
May 94 1052-1056 

extended horizon predictive control Fanjin Kong, + , T-AC Jul 94 
1467-1470 

SiSO discrete-time systs. preview tracking. Halpern, ME., T-AC Mar 94 
589-592 

Smith predictor for controlling proc., integrator and long dead-time. 
Astrom, KJ., + , T-AC Feb 94 343-345 
stochastic adaptive prediction/MRAC. Wei Ren, + , T-AC Oct 94 
2047-2060 

Probability 

closed Jackson queueing network decentralized control. Dye-Jyun Ma, +, 
T-AC Jul 94 1460-1463 

distributed algms., random proc. failures. Papavassitopoulos, G.P, T-AC 
May 94 1032-1036 

integrated probabilistic data assoc. Musicki, D, + , T-AC Jun 94 
1237-1241 

mfg. systs. with prod, rate-depend. failure rates, prod, control. 
Liberopaulos, G., +, T-AC Apr 94 889-895 
mfg. syst. with unreliable machines, stabil. anal. Han-Fu Chen, +, T-AC 
Mar 94 681-686 

multi-DOF nonlin. damping model, spectral dens. Wei/ian Zhang, T-AC 
Feb 94 406-410 

random param. tracking, robust algm. JudUsky, A„ + , T-AC Jun 94 
1211-1221 

recursive prediction error algm. log likelihood fh. derivatives. Hooker , 
MA. t T-AC Mar 94 662-664 


sens, fusion, decentralized detect, optimal thresholds. Irving, W.W., + , 
T-AC Apr 94 835-838 
Production control 

distributed prod, control methods. Sharjfhia, A., T-AC Apr 94 725-737 
failure prone mfg. syst. hedging point policies. Jian-Qlang Hu, + , T-AC 
Sep 94 1875-1880 

failure-prone mfg. systs., structural props, of optimal prod, controllers. 
Jian-Qiang Hu, + , T-AC Mar 94 640-643 
machine capacity allocation. Nain, P., +, T-AC Sep 94 1853-1855 
mfg. systs. with prod, rate-depend. failure rates. Uberopoulos, G.. + , 
T-AC Apr 94 889-895 

multiple time scale approach. Jiang, J., +, T-AC Nov 94 2292-2297 
single-machine scheduling, resource optimal control. Cheng, TCE., + , 
T-AC Jun 94 1243-1246 

stochastic dyn. job shops/prod, planning. Sethi, S., + , T-AC Oct 94 
2061-2076 

Production control; cf. Manufacturing 
Production systems; cf. Manufacturing scheduling 
Proportional control 

left-invertible generalized state space systs, disturbance rejection. 

Paraskevopoulos, P N., + , T-AC Jan 94 185-190 
robot manipulators, compliant, force/posn. regulation Chiaverini, S, + , 
T-AC Mar 94 647-652 

stick-slip avoidance, PD control. Dupont, P E., T-AC May 94 1094-1097 
Protocols 

optimal multicopy Aloha. Wong, E.W.M., + , T-AC Jun 94 1233-1236 

Pulse-width modulation 

feedback control by multirate PWM. Khayatian, A, + , T-AC Jun 94 
1292-1297 

Pulse-width modulation, power converters 

switched-mode power converters, multirate modeling and control design 
Khayatian, A„ +, T-AC Sep 94 1848-1852 


Quantized control 

quantized constrained systs., feedback control, appls. to neuromorphic 
controllers design. Sznaier, M, + , T-AC Jul 94 1497-1502 

Queuing analysis 

closed Jackson queueing network decentralized control Dye-Jyun Ma, +, 
T-AC Jul 94 1460-1463 

DD scheduling policies, queueing networks, perform bounds T-AC Aug 
94 1600-1611 

DEDS, steady state fn., nondifferentiability. Shapiro, A , + . T-AC Aug 94 
1707-1711 

deterministic/stochastic queuing networks anal Cheng-Shang Chang, 
T-AC May 94 913-931 

finite queues, priority-discarding policies Petr, D W, T-AC May 94 
1020-1024 

FMS, FCFS scheduling policy. Seidman, TI, T-AC Oct 94 2166-2171 

machine capacity allocation. Nain, P, + , T-AC Sep 94 1853-1855 

M/M/1 queue with bal. budget, optimal flow control. Chakravorti, B, 
T-AC Sep 94 1918-1921 

packet switching, flow control, rate-based, monotonicity/concavity 
Budka, K C., T-AC Mar 94 544-548 

routing, limited state inform, in queueing systs., blocking Sparaggis, P D, 
+ , T-AC Jul 94 1492-1497 

traffic models, queuing networks anal., projection techs. Moot Choo 
Chuah, T-AC Aug 94 1588-1599 


Radar data processing 

integrated probabilistic data assoc. Musicki, D, + , T-AC Jun 94 
1237-1241 

Radar signal analysis 

integrated probabilistic data assoc. Musicki, D, + , T-AC Jun 94 
1237-1241 

range tracking loops, large deviation anal. Dembo, A ,, + , T-AC Feb 94 
360-364 

Radar tracking 

integrated probabilistic data assoc, Musicki, D., + , T-AC Jun 94 
1237-1241 

Radio communication; cf. Packet radio 
Random...; cf. Stochastic... 

RationaJ functions 

freq. response arcs, stable polynomial, convexity. Keqin Gu, T-AC Nov 94 
2262-2265 
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Realization theory; cf. Minimal realizations 
Real-time systems 

constrained nonlin. syst receding horizon control. Shin- Yeu Lin, T-AC Sep 
94 1893-1899 
Recursive digital filters 

1-D and 2-D digital recursive filters, stabil., Schur polynomials. Barret, 
M,, +, T-AC Nov 94 2335-2339 

optimal filtering, stochastic discrete-time systs., unknown inputs. Borisov, 
A. V„ + , T-AC Dec 94 2461-2464 

Recursive estimation 

lin. discrete-time systs., optimum lin. recursive estim. Carazo, A.H., + , 
T-AC Aug 94 1636-1638 

multiscalc recursive estim., data fusion/regularization. Chou, K.C., + , 
T-AC Mar 94 464-478 

recursive prediction error algm. log likelihood fn. derivatives. Hooker, 
M.A., T-AC Mar 94 662-664 

stochastic lin. discrete systs., state estim. algm. Ahmed, M.S., T-AC Aug 
94 1652-1656 

time-varying params., bounded error ident. Bittanti, 5, + , /'-AC May 94 
1106-1110 

Reduced-order systems 

discrete-time q-Markov cover models. Sreeram, V, + . T-AC May 94 
1102-1105 

discrete-time systs., generalized q-Markov cover models. Sreeram, V, 
T-AC Dec 94 2502-2505 

distributed delay lin. systs., feedback stabilisation. Feng Zheng, + , T-AC 
Aug 94 1714-1718 

full-order/low-order observers, LT1 plant, unified ingular syst. based 
theory. Cobb, JD , T-AC Dec 94 2497-2502 
H* compensator design, min order observers Hsu, CS, + , T-AC Aug 94 
1679-1681 

Hcr-optim , reduced order observer based controller. Stoorvogel, A.A., -» , 
T-AC Feb 94 355-360 

interval systs. order reduction, Routh-Pade approx. Bandyopadhyay, B, 
+ , T-AC Dec 94 2454-2456 

L 2 optimal model order reduction problem, input normal form homotopy. 
Ge , Y., + , T-ACJun 94 1302-1305 

model reduction, LF approx, balancing, props. Prakash, R., T-AC May 94 
1135-1141 

structural wave control, reduced-order model. Quan Wang, + . T-AC Aug 
94 1711-1713 

unstable plants, samplcd-data observers, generalized holds. Haddad, 
W.M, +, T-AC Jan 94 229-234 
vibr. systs. control Karl, WC , + , T-AC Jan 94 222-226 
Regenerative stochastic processes 

DF,DS, steady state fn , nondiffcrentiability Shapiro, A , + , T-AC Aug 94 
1707-1711 

regenerative syst optim., infinitesimal perturb anal. Chong, E.K P , + , 
T-AC Jul 94 1400-1410 

Regulators 

discrete-time control problem, strictly proper meas feedback. 

Stoorvogel, A A, + , T-AC Sep 94 1936-1939 
finite-dimens. controllers designed for infinite-dimens, systs., state space. 

Morris, KA., T-AC Oct 94 2100-2104 
mfg. scheduling syst., regulator stabilization tech. Humes , C., Jr., T-AC 
Jan 94 191-196 

nonlin. regulator, num. design. Kreisselmeier, G, + , T-AC Jan 94 33-46 

Relaxation methods 

noncooperativc equilibria computation, relax algms Uryas'ev, S, + . 
T-AC Jun 94 1263-1267 

Reliability 

failure-prone mfg. systs., structural props of optimal prod, controllers. 
Jian-Qiang Hu, + , T AC Mar 94 640-643 
Reliability; cf. Failure analysis; Fault tolerance 
Reliability theory 

mfg. syst. with unreliable machines, stabil. anal. Han-Fu Chen, + , T-AC 
Mar 94 681-686 
Resource management 

machine capacity allocation. Nain, P., +, T-AC Sep 94 1853-1855 
single-machine scheduling, resource optimal control. Cheng, T.C.E, + , 
T-ACJun 94 1243-1246 
Reviews; cf. Book reviews 
Riccati equations 

four-block problem, recursive Schur-based soln. Constantinescu, T., + , 
T-AC Jul 94 1476-1481 

jump lin. systs., coupled Riccati eqns. Abou-Kandil, H., + , T-AC Aug 94 
1631-1636 

mixed H 2 /H® control, Nash game approach. Lime beer, DJ.N., + . T-AC 
Jan 94 69-82 

model-following systs., time/freq. domain design equivalence. Yen-Ttng 
Hsu. + , T-AC Aug 94 1722-1726 


perturbed lin. systs., upper covariance bounds. Bolzern, P„ +, T-AC Mar 
94 623-626 

Riccati equations, algebraic 

comments on “Stability margin evaluation for uncertain linear systems" 
(by C, Gong and S. Thompson, Jun 94 548-550). Su, J.-H., T-AC Dec 94 
2523-2524 

constrained Riccati eqn. factorizations. Weiss, M, T-AC Mar 94 677-681 
continuous alg. Riccati eqn. eigenvalues. Komaroff, N., T-AC Mar 94 
532-534 

discrete-time Riccati eqn., H«, control appl. Stoorvogel, A.A., ■+ , T-AC 
Mar 94 686-691 

lin. sing, perturbed systs., Kalman filtering. Gqjic, Z, + , T-AC Sep 94 
1952-1955 

LTI syst. pos. real control. Weiqian Sun, + , T-AC Oct 94 2034-2046 
min. lin. plants parameterization. Davis, L.D., +, T-AC Apr 94 849-852 
multiscale systs., Kalman filters, Riccati eqns. Chou, K.C ., + , T-AC Mar 
94 479-492 

polynomial J-spectral factorization. Kwakernaak, H., + , T-AC Feb 94 
315-328 

quadratic stabilization of continuous time systs. Mahmoud, MS, +, T-AC 
Oct 94 2135-2139 

Riccati eqn., discrete alg., iter, matrix bounds. Komaroff, N., T-AC Aug 94 
1676-1678 

Riccati eqn., discrete-time alg., closed loop eigenvalues. Lin-Zhang Lu, 

+ , T-AC Aug 94 \6&2-]6&5 

samplcd-data robust control, exponential hold, optimal algm. Yung-Chun 
Wu, +, T-AC Jan 94 112-117 

sing, perturbed syst. block-diagonalizalion, Taylor expansion. Derbel, N, 

+ , T-AC Jul 94 1429-1431 

state delayed systs., memoryless H* controllers. Joon Hwa Lee, ■+ , T-AC 
Jan 94 159-162 

three-block generalized/std. Riccati eqns comparison. Darouach, M, + , 
T-AC Aug 94 1755-1758 

uncertain lin. systs., multivariable, stabil. margin eval. Gong, C., +, T-AC 
Mar 94 548-550 
Riccati equations, differential 

exponential lin. quadratic optimal control, discounting. Hopkins, W.E., Jr„ 
T-AC Jan 94 175-178 
Riccati equations, discrete-time 

Hcd control appl. Stoorvogel, A.A , + , T-AC Mar 94 686-691 
periodic discrete-time Riccati eqn , num. soln Hench, JJ,+, T-ACJun 
94 1197-1210 

Riccati eqn., discrete alg., iter, matrix bounds. Komaroff, N, T-AC Aug 94 
1676-1678 

Riccati eqn., discrete-time alg., closed loop eigenvalues. Lin-Zhang Lu, 
+ , T-AC Aug 94 1682-1685 

samplcd-data robust control, exponential hold, optimal algm. Yung-Chun 
Wu, + , T-AC Jan 94 112-117 

uncertain systs., sampled-data controller design Dolphus, R.M., T-AC May 
94 1036-1042 

Robots 

adaptive control based passivity Yu Tang, + , T-AC Sep 94 1871-1875 
adaptive controller, Lyapunov stabil. Egeland\ O., + , T-AC Aug 94 
1671-1673 

arms, repositioning control, robust finite dimens, learning algms. 

Luctbello, P., T-AC Aug 94 1690-1694 
flexible joint robots, global regulation, approx. difT. Kelly, R., + , T-AC 
Jun 94 1222-1224 

manipulators with parametric uncertainty, robust control. Keun-Mo Koo, 
+ , T-ACJun 94 1230-1233 

robust control, lin. estim. state feedback. Berghuis, H., + , T-AC Oct 94 
2159-2162 

variable struct, control schemes. Bin Yao, + , T-AC Feb 94 371-376 
Robots; cf. Manipulators...; Mobile robots... 

Robots, sensing systems; cf. Robots, vision systems 
Robots, vision systems 

autonomous vision-based mobile robot Baumgartner, E.T., + , T-AC Mar 
94 493-502 
Robustness 

adaptive control algms., robust approach. Gang Feng, T-AC Aug 94 
1738-1742 

adaptive control systs. robustness. Bodson, M., T-AC Apr 94 864-870 
adaptive predictive controller, robustness anal. Clarke, D.W., + , T-AC 
May 94 1052-1056 

book review; New Tools for Robustness of Linear Systems (Barmish, B.R.; 
1994). T-AC Dec 94 2525-2526 

cascaded nonlin. systs., global robust stabilization. Jmura, J.-L, + , T-AC 
May 94 1084-1089 

continuous direct adaptive control, saturation input constraint. Cishen 
Zhang, + , T-AC Aug 94 1718-1722 
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compensation: The discrete-time ^casc" (May 93 776-782). Haddad, 
WM, +, T-AC Aug 94 1772 

discrete time-varying systs . robust adaptive controller. Changyun Wen, 
T-AC May 94 987-991 

D-stabil., robust, for Hn. uncertain discrete delay systs. Te-Jen Su, +, T-AC 
Feb 94425-428 

dual-rate control syst., sampled-data, stabll. robustness. Tongwen Chen, 
T-AC Jan 94 164-167 

dyn. tin/time-delay systs., robust controller design. Mahmoud, M.S., + , 
T+AC May 94 995-999 

dyn. systs., robust adaptive control design. Khorasani, K., T-AC Aug 94 
1726-1732 

extreme-point robust stabil,, discrete-time polynomials. Perez, F., + , 
T-AC Jul 94 1470-1472 

flexible joint robots with uncertain params. and disturbances, tracking 
control. Tomei, PT-AC May 94 1067-1072 
floating platform, modeling and control. Damen, A.A.H., +, T-AC May 94 
1075-1078 

H® multiobjective robust control, infinite-horizon. Theodor, Y„ + , T-AC 
Oct Pd 2130-2134 

intersample props., robustness, and sensitivity, quantitat./qualitat anal. 

Feuer, A., + , T-AC May 94 1042-1047 
interval lin. control syst. robust parametric design. Keel, L.H., + , T-AC 
Jul 94 1524-1530 

interval plants, closed-loop hyperstability. Foo, YK, + , T-AC Jan 94 
151-154 

lin. quadratic designs, real param. uncertainty, robust. Douglas , J, + , 
T-AC Jan 94 107-111 

LQ optimal regulators, stabil. robustness. Dohyoung Chung, +, T-AC Aug 
94 1698-1702 

L z optimal model order reduction problem, input normal form homotopy 
Ge, Y., + , T-ACJun 94 1302-1305 

LT1 discrete-time systs., robust stabil. anal. Karan, M., + , T-AC May 94 
991-995 

MIMO terminal sliding-mode control, rigid robotic manipulators. 

Zhihong, M, + , T-AC Dec 94 2464-2469 
mixed //2/7f« perform, objectives Kemin Zhou, + , T-AC Aug 94 
1564-1574 

model ref. control and modeling error compensation for robust 
perform./stabilization. Sun, J , + , T-AC Mar 94 630-635 
motion/force control, robust, with nonholonomic constraints. Chun-Yi Su, 
+ , T-AC Mar 94 609-614 

MRAC, robust, SISO systs. Zhihua Qu, + , T-AC Nov 94 2219-2234 
multivariable direct MRAC, identifiable parametcrizations and param. 

convergence, de Mathelm, M, + . T-AC Aug 94 1612-1617 
multivariable systs., quantitat. robustness measures Horng-Giou Chen, 
+ , T-AC Apr 94 807-810 

nonlin servomechanism robust control. Jie Huang, + , T-AC Jul 94 
1510-1513 

Nyquist envelope of interval plant family. Hollot, C. V., + , T-AC Feb 94 
391-396 

optimal cost control/filtcring, uncertain lin systs. Petersen, JR, + . T-AC 
Sep 94 1971-1977 

parametrically uncertain nonlin. feedback systs., absol. stabil. Marquez, 
HJ, + . T-AC Mar 94 664-668 

parametrically uncertain nonlin. systs , robust stabilisation. Schoenwald, 
D.A., + . T-AC Aug 94 1751-1755 

polynomial, strictly Hurwitz conditions, stabil. anal. appl. Polyak, B.T, 
+ , T-AC May 94 1147-1148 

polynomials with multilinearly depend, coeff. perturb., robust stabil. Tian 
Yu-Ping, + , T-AC Mar 94 554-558 

proximate time-optimal controller, robustness. Pao, LY,+, T-AC Sep 94 
1963-1966 

P-type learning control. Saab, S.S., T-AC Nov 94 2298-2302 
rigid robotic manipulators, robust tracking control. Man Zhihong, +, T-AC 
Jan 94 154-159 

ripple free samplcd-data robust servomechanism controller, exponential 
hold Yung-Chun Wu, + . T-ACJun 94 1287-1291 
robots, robust control, lin. estim. state feedback, Berghuls, H., +, T-AC 
Oct 94 2159-2162 

robust adaptive regulation with min. prior knowledge, comment. Ji Feng 
Zhang, T-AC Mar 94 605 

robust anal., uncertainty value sets. Enter, E.G., + , T-AC Nov 94 
2315-2318 

robust control design, cascade struct, approach. Bonivento, C., + , T-AC 
Apr 94 846-849 

robust direct adaptive controllers, normalization tech. Gang Feng, + , 
T-AC Nov 94 2330-2334 

robust nonsingularity problem, SSV. Ghwan-Lu Tseng, + , T-AC Oct 94 
2118-2122 


robust stabilization of syst., control delays. Kojima , A., +, T-AC Aug 94 
1694-1698 

robust stabil,, time-varying struct, uncertainty. Shamma 1 , J S., T-AC Apr 94 
714-724 

robust strong stabilization via modified Popov controller synthesis. Wang, 
Y ; W„ +, T-AC Nov 94 2284-2287 

sampled-data robust control, exponential hold, optimal algm. Yung-Chun 
Wu, + , T-AC Jan 94 112-117 

second-order dyn. systs., dissipative controller designs. Morris, K.A., +, 
T-AC May 94 1056-1063 

single-arm dyn., robust variable struct./and switching-X adaptive control. 
Li-Wen Chen, + , T-AC Aug 94 1621-1626 

SISO systs., robust stabilization. Olbroi, A.W., + , T-AC Mar 94 652-657 
SISO systs. with affine param. uncertainties, robust perform. Kogan, J., 
T-AC Jan 94 721-229 

spectral syst. finite-dimens, approx. Erickson, M.A., + , T-AC Sep 94 
1904-1909 

p. calc,, computational complexity. Braatz, RP,, + , T-AC May 94 
1000-1002 

stabil. of uncertain lin. systs., saturating actuators. Jin-Hoon Kim, +, T-AC 
Jan 94 202-207 

stochastic adaptive control algms., unmodified, robustness. Radenkovic, 
MS, + , T-AC Feb 94 396-400 

structurally uncertain systs., optimal Lyapunov fns. Olas, A., T-AC Jan 94 
167-171 

switching surface design for multivariable VSS. Ju-Jang Lee, + , T-AC 
Feb 94 414-419 

time-delay syst. robust stabil., quasipolynomial convex directions/testing 
sets. Kharitonov, V.L., + , T-AC Dec 94 2388-2397 
time-invariant lin. systs., robust controller synthesis. Rantzer , A , +, T-AC 
Sep 94 1802-1808 

time-variant displacement struct, and interpolation problems Sayed, A.H, 
+ , T-AC May 94 960-976 

uncertain continuous-time implicit systs., regional pole placement, 
robustness. Chun-Hsiung Fang, + , T-AC Nov 94 2303-2307 
uncertain lin. systs, delay depend., robust stabil. Bugong Xu, T-AC Nov 
94 2365 

uncertain syst. pole assignment, polynomial approach. Figueroa, JL, + , 
T-AC Apr 94 831-835 

uncertain syst robust stabil anal., nonquadratic Lyapunov fns. 

Zelentsovsky, A.L , T-AC Jan 94 135-138 
uncertain syst. robust stabil., guardian map approach Shuoh Rem, + . 
T-AC Jan 94 162-164 

uncertain transfer fns., value sets comp. Gutman, P -O., + , T-AC Jun 94 
1268-1273 

VSS control design for uncertain discrete-time systs. Wen-June Wang, +, 
T-AC Jan 94 99-102 

worst-case ident. anal., BIBO robustness. Partington, JR., + , T-AC Oct 
94 2171-2176 

Roots; cf. Polynomial matrices, Polynomials 
Rotating machines; cf. Brushless rotating machines 
Routh methods 

delta-operator formulated real polynomials, tabular method for 
determining root distrib. Premaratne, K, + , T-AC Feb 94 352-355 
Hurwitz polynomials props., polynomial families stabil. anal. Duan, G.-R , 
+ , T-AC Dec 94 2490-2494 

interval systs order reduction, Routh-Pade approx. Bandyopadhyay, B, 
+ , T-AC Dec 94 2454-2456 

Routh table prop., quadratic integral eval. appl. Beghi, A., + , T-AC Dec 
94 2494-2496 

strictly Hurwitz conditions, stabil. anal. appl. Polyak, BT. t + . T-AC May 
94 1147-1148 

time-invariant lin systs , stabil. domains inversion. Walter, E., + , T-AC 
Apr 94 886-889 


Sample-and-hold circuits 

intersample props., robustness, and sensitivity, quantitaL/qualitat anal. 
Feuer, A,+, T-AC May 94 1042-1047 
Sampled-data systems; cf. Discrete-time systems 
Sampling methods 

stochastic lin. syst. param. hypothesis testing, stat. sampling. Duncan, 
T.E., + . T-AC Jan 94 118-122 

Scheduling 

FMS, FCFS scheduling policy. Seidman, T.I., T-AC Oct 94 2166-2171 
machine capacity allocation. Naln, P„ +, T-AC Sep 94 1853-1855 
queueing networks and scheduling policies, perform, bounds. T-AC Aug 
94 1600-1611 
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Scheduling; cf, Manufacturing scheduling; Production control 
Search methods 

global Optim., adaptive partitioned random search. Bo Tang, Z, T-AC INov 
94 2235-2244 
Sensitivity 

biased/unbiased controllers LTR design. Turan, L, + , T-AC Mar 94 
601-605 

continuous time nonminimum phase lin. systs., gain margin improvement. 

Wei-Yong Yan, + , T-AC Nov 94 2347-2354 
intersample props., robustness, and sensitivity, quantitat/quaiitat. anal. 

fewer, A., ■+ , T-AC May 94 1042-1047 
Nyquist envelope of interval plant family Hollot, C V„ +, T-AC Feb 94 
301-396 

observer design, eigenvalue assignment, residual generation. Magm, J -F, 
+ , T-AC Feb 94 441-447 

SISO-distribuled plants, optimal mixed sensitivity. Flamm, D S., +, T-AC 
Jun 94 1150-1165 
Sensitivity; cf. Robustness 
Sensors; Cf. Multisensor Systems 
Servosys terns 

Hoc-control, state availability rel„ pole/zero cancellations Miyamoto, S, 
+ , T-AC Feb 94 379-381 

nonlin. servomechanism robust control. Jie Huang, + , T-AC Jul 94 
1510-1513 

ripple free sampled-dala robust servomechanism controller, exponential 
hold Yung-Chun Wu, +, T-AC Jun 94 1287-1291 
samplcd-data robust control, exponential hold, optimal algm. Yung-Chun 
Wu, +, T-AC Jan 94 112-117 

uncertain SISO min phase lin syst, nonlin universal servomechanism. 
Ryan, E P , T-AC Apr 94 753-761 
Set theory 

constrained continuous-lime systs with cone props, pos. invariant sets. 

Tarbouriech, S, + , T-AC Feb 94 401-405 
corrections to “Positively invariant sets for constrained continuous-time 
systems with cone properties” (Feb 94 401-405). Tarbouriech, S, T-AC 
Aug 94 1771 

discrete-time lin systs., pos. invariant sets. De Santis, E, T-AC Jan 94 
245-249 

discicte-time uncertain systs , ultimate boundedness control, Lyapunov 
Blarchim, F, T-AC Feb 94 428-433 

matching principle for systs , restricted inputs Rutland, N.K, T-AC Mar 
94 550-553 

output regulation, necessary condition. Luctbello, P , T-AC Mar 94 
558-559 

robust anal , uncertainty value sets Eszter, EG, + , T-AC Nov 94 
2315-2318 

time-invariant lin systs., slabil. domains inversion. Walter, E, + , T-AC 
Apr 94 886-889 
Signal analysis 

multiscalc recursive estim, data fusion/regularization. Chou, KC, + , 
T-AC Mar 94 464-478 
Signal analysis; cf. Radar signal analysis 
Signal detection 

sens fusion, decentralized detect optimal thresholds. Irving, W.W., + , 
T-AC Apr 94 835-838 
Signal estimation; cf. Estimation 
Signal processing 

recursive ident., nonlin. Wiener model Wigren, T, T-AC Nov 94 
2191-2206 

Signal processing; cf. Array processing; Estimation; Filtering; Image 
processing 

Signal representations 

multiscalc ftcursive estim., data fusion/regularization. Chou, K.C., + , 
T-AC Mar 94 464-478 

multiscalc systs., Kalman filters, Riccati eqns, Chou, K.C, + , T-AC Mar 
94 479-492 
Signal resolution 

multiscalc recursive estim., data fusion/regularization. Chou, K C, + , 
T-AC Mar 94 464-478 

Signal sampling/reconstruction; cf. Sample-and-hold circuits 
Singularly perturbed systems 

discrete-time q-Markov cover models. Sreeram, V ., + , T-AC May 94 
« 1102-1105 

H®-optimal control for sing, perturbed systs., imperfect state meas, Zigang 
Pan, + , T-AC Feb 94 280-299 

lin. sing perturbed systs., Kalman filtering. Gajic, Z, + , T-AC Sep 94 
1952-1955 

sing, perturbed control systs. and nonlin. differential-alg. eqns. Krishnan, 
H, + , T-AC May 94 1079-1084 

sing, perturbed syst. block-diagonalization, Taylor expansion. Derbel, N, 
+ , T-AC Jul 94 1429-1431 


Singular systems 

closed-loop vibr. control, Youla parameterization. Kabamba, PX, + , 
T-AC Jul 94 1455-1459 

continuous descriptor systs., adaptive observer. Uetake, Y, T-AC Oct 94 
2095-2100 

descriptor systs, lin. feedback closed-loop Stackelberg strategy. HuaXu, 
+ , T-AC May 94 1097-1102 

full-order/low-order observers, LTI plant, unified ingular syst. based 
theory. Cobb, J.H, T-AC Dec 94 2497-2502 

robust adaptive pole placement control. Wever, £., + , T-AC Aug 94 
1665-1671 

time-invariant lin. syst. adaptive control. Karason, SP, +, T-AC Nov 94 
2325-2330 

Sliding-mode control; cf. Variable-structure systems 
Smoothing methods 

learning, inverse pass., I/O map, update-spline-smoothing. Heiss, M, 
T-AC Feb 94 259-268 
Software packages 

complex discrctc-time control systs., simul. automation. Ellis, RD., + , 
T-AC Sep 94 1795-1801 
Space vehicle control 

rigid spacecraft adaptive attitude control Egeland, O , + . T-AC Apr 94 
842-846 

rigid spacecraft systs controllability, central gravitational field. Lian, 
K-Y., + , T-AC Dec 94 2426-2441 
Space vehicle tracking 

rigid spacecraft adaptive attitude control. Egeland, O., + , T-AC Apr 94 
842-846 

Sparse matrices 

large sparse Lyapunov eqns, approx, soln. Gudmundsson, T, + , T-AC 
May 94 1110-1114 
Spectral factorization 

constrained Riccati eqn factorizations Weiss, M, T-AC Mar 94 677-681 

polynomial J-spectral factorization. Kwakernaak, H, + , T-AC Feb 94 
315-328 

Spline functions 

learning, inverse pass, I/O map, update-spline-smoothing. Heiss, M, 
T-AC Feb 94 259-268 
Stability 

1-D and 2-D digital recursive filters, stabil., Schur polynomials. Barret, 
M, + . T-AC Nov 94 2335-2339 

adaptive control by multiple models/switching. Narendra , K S., +, T-AC 
Sep 94 1861-1866 

adaptive control systs , transient bounds ZhuquanZang, +, T-AC Jan 94 
171-175 

adaptive pole placement without excitation probing sigs. Lozano, R., + , 
T-AC Jan W 47-58 

adaptive stabilization, nonlin. time-varying controller Miller, DE, T-AC 
Jul 94 1347-1359 

almost sure sample stabil of nonlin. stochastic dyn. systs Zhi Yu Zhang, 
+ , T-AC Mar 94 560-565 

asynchronous sampled data syst. control, design Voulgarts, P., T-AC Jul 
94 1451-1455 

asynchronous systs. with Poisson transits., stabil. Leland, R P., T-AC Jan 
94 182-185 

bilinear systs., adaptive stabilization, least sq. Xi Sun, + , T-AC Jan 94 
207-211 

CARMA plants, stabilizing 1-0 receding horizon control. Chisci, L, + , 
T-AC Mar 94 614-618 

closed-loop vibr. control, Youla parameterization. Kabamba, PX., + , 
T-AC Jul 94 1455-1459 

comments on “Stability margin evaluation for uncertain linear systems” 
(by C Gong and S. Thompson, Jun 94 548-550). Su, J -H., T-AC Dec 94 
2523-2524 

constrained nonlin. syst receding horizon control. Shin-Yeu Lin, T-AC Sep 
94 1893-1899 

constrained robots, force/motion control. Grabbe, MX., ■+ , T-AC Jan 94 
179 

contact stabil. of simple posn. controllers, effect of time delay and discrete 
control. Fiala, J, +, T-AC Apr 94 870-873 

continuous control syst., digital cquiv. design methods comparison. Hall, 
S.R, T-AC Feb 94 420-421 

continuous descriptor systs., adaptive observer. Uetake, Y, T-AC Oct 94 
2095-2100 

continuous-time multivariable syst. reduction Krajewski, W, + , T-AC 
Oct 94 2126-2129 

corrections to '"On the stability proof of adaptive schemes with static 
normalizing signal and parameter projection”(Jan 93 170-173). loamou, 
P , T-AC Apr 94 896 

cyclicly switched param.-adaptive control systs., M1MO design 
models/intemal regulators. Morse, AS, + , T-AC Sep 94 1809-1818 
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delta-operator formulated real polynomials, tabular method for 
determining root distrib. Premaratne , K, 4 , T-AC Feb 94 352-355 
descriptor systs. control by output feedback. LovdsfrNagy, V., + , T-AC 
Jul94 1 507-1509 

deterministic/stochastic queuing networks anal. Cheng-Shang Chang, 
TnAC May 94 913-931 

diagonal decoupling, dyn. output feedback /const, precompensator. Eidem, 
V„ T-AC Mar 94 503-511 

discrete-time adaptive nonlin. syst, least sq. estimator. Kanellakopoulos, 
L, T-AC Nov 94 2362-2365 

discrete-time filters from high-order s-to-z mappings. Schneider, AM, +, 
T-AC Feb 94 435-441 

discrete-time nonlin. syst. control, stabil. property. Jie Huang, + , T-AC 
Nov 94 2307-23II 

discrete time nonlin. systs., stabil., time delayed feedback. Xueshan Yang, 
+ , T-AC Mar 94 585-589 

discrete-time q-Markov cover models. Sreeram, V., + , T-AC May 94 
1102-1105 

discrete-time Riccati eqn., H* control appl. Stoorvogel, A.A, + , T-AC 
Mar 94 686-691 

dissipative H 2 /H*, controller synthesis. Haddad, WM + , T-AC Apr 94 
827-831 

distributed delay lin. systs., feedback stabilisation. Feng Zheng, + , T-AC 
Aug 94 1714-1718 

D-stabil., robust, for lin uncertain discrete delay systs. Te-Jen Su, +, T-AC 
Feb 94 425-428 

evol. eqns.v parab. partial DE, exponential stabil. Keum-Shik Hong, + , 
T-ACJul 94 1432-1436 

finite-dimens, controllers designed for infinite-dimens. systs., state space 
Morris, KA , T-AC Oct 94 2100-2104 
freq. response arcs, stable polynomial, convexity. Keqm Gu, T-AC Nov 94 
2262-2265 

fuzzy control systs., supervisory controller. Li-Xiu Wang, T-AC Sep 94 
1845-1847 

interconnected systs., decentralized adaptive regulation Changyun Wen, 
T-AC Oct 94 2163-2166 

internally stable nonlin. systs. with disturbances, parameterization. 

Hammer , J, T-AC Feb 94 300-314 
interval LTI systs., stabilization. Kehui Wet, T-AC Jan 94 22-32 
interval matrices, necessary and sufficient conditions for Hurwitz/Schur 
stabil. Earning Wang, + , T-AC Jun 94 1251-1255 
interval matrices stabil. conditions. Sezer, ME., +, T-AC Feb 94 368-371 
jump-Markov systs., stabilizing control law. Du/our, F., 4, T-AC Nov 94 
2354-2357 

targe scale interconnected systs. with delays, decentralized stabilization 
Zhongzhi Hu, T-AC Jan 94 180-182 

lin. quadratic control, convex prog, num method. Peres, P.L.D , + , T-AC 
Jan 94 198-202 

lin. uncertain syst. robust stabil, guardian map approach Shuoh Rern, + , 
T-AC Jan 94 162-164 

LTI syst. pos. real control. Weiqian Sun, + , T-AC Oct 94 2034-2046 
Lyapunov stabil theory of nonsmoolh systs. Shevitz, D, + , T-AC Sep 94 
1910-1914 

manipulators with parametric uncertainty, robust control. Keun-Mo Koo, 
+ , T-AC Jun 94 1230-1233 

mfg. scheduling syst., regulator stabilization tech. Humes, C, Jr, T-AC 
Jan 94 191-196 

mfg. syst. with unreliable machines, stabil. anal Han-Fu Chen, + , T-AC 
Mar 94 681-686 

MIMO lin. systs. with sens ./actuator failures, stabil Gundes, A N., T-AC 
Jun 94 1224-1230 

min.-phase lin. plants, stabilizing controllers parameterization. Gloria, 
J.J., + , T-AC Feb 94433-434 

mixed H 2 /H* control, Nash game approach. Limebeer, D.J.N, + , T-AC 
Jan 94 69-82 

multiratc sampled-data systs. struct props. Longhi, S„ T-AC Mar 94 
692-696 

multivariable nonlin. controller, vibr. damping. Kanestrom, R.K , +, T-AC 
Sep 94 1925-1928 

multivariable syst. decentralized control, closed-loop props, Campo, P.J., 
+ , T-AC May 94 932-943 

neutral systs., stabil. and stabilizability. Logemam, H., + , T-AC Jan 94 
138-143 

nonlin. feedback parametrically uncertain systs,, absol. stabil. Marquez, 
HI, + , T-AC Mar 94 664-668 

nonlin. servomechanism robust control. Jie Huang, + , T-AC Jul 94 
1510-1513 

ponlin. systs., triangular struct., adoptive control. Seto, D., + , T-AC Jul 
94 1411-1428 

nonlin. uncertain systs. tracking control. Song, Y.D., + , T-AC Sep 94 
1866-1871 


nonnegative feedback control, oscllls. stabilisation. Zaslavsky, B„ T-AC 
Jun 94 1273-1276 

output feedback problem in lin, systs., bilinear formulation. Syrmos, V.L, 

+ , T-AC Feb 94 410-414 

output feedback stabilization of nonlin. systs. Tsinias, J„ +, T-AC Apr 94 ^ 

output regulation, necessary condition. Lucibello, P., T-AC Mar 94 
558-559 

parab. systs. direct adaptive control. Keum ShikHong, + , T-AC Oct 94 
2018-2033 

param.-adaptive control, cyclic switching strategy. Pait, FM. t + , T-AC 
Jun 94 1172-1183 

partially obs. discrete-time nonlin. systs., risk-sensitive control and dyn. 

games. James, MR,, 4 , T-AC Apr 94 780-792 
periodically time-varying lin. discrete-time plants, decentralized control. 

Khargonekar, P.P., +, T-AC Apr 94 877-882 
pos. real systs. nonlin. controllers. Bernstein, DS., + , T-AC Jul 94 
1513-1517 

prod, control methods, distributed, stabil. and perform. Sharifnia, A . T-AC 
Apr 94 725-737 

quadratic stabilization of continuous time systs. Mahmoud, MS,, 4, T-AC 
Oct 94 2135-2139 

rational /J suboptimal compensators for continuous-time systs. Blanchmi, , 

F. 4, T-AC Jul 94 1487-1492 

rational transfer fn, optimal L« approx. Kavranoglu, D, +, T-AC Sep 94 
1899-1904 

robot adaptive control based on passivity. Yu Tang, + , T-AC Sep 94 
1871-1875 

robot controller, adaptive, Lyapunov stabil. Egeland, O, + . T-AC Aug 94 
1671-1673 

robots, trajectory stabilization for systs., nonholonomic constraints 
Walsh, G, + , T-AC Jan 94 216-222 

robust adaptive pole placement control. Weyer, E, + , T-AC Aug 94 
1665-1671 

row-by-row stable decoupling, static state feedback, struct, soln Martinez 
Garcia, JC, 4 , T-AC Dec 94 2457-2460 
sampled-data systs, wordlength constraint, stabil./perform Fiailho, IJ, 

4, T-AC Dec 94 2476-2481 

SISO LTI discrete-time systs , param.-adaptive controller. Kreisselmeier. 

G, T-AC Sep 94 1819-1826 

stabilization of discrete-time nonlin systs , global Byrnes, Cl, + , T-AC 
Jan 94 83-98 

stabil. of uncertain lin. systs., saturating actuators. Jm-Hoon Ktm, + , T-AC 
Jan 94 202-207 

static output feedback for stabilization. Pimpalkhare, A A , + , T-AC May 
94 1148 

stochastic adaptive control algms., unmodified, robustness Radenkovic, 
MS, + , T-AC Feb 94 396-400 

stochastic adaptive prediction/MRAC Wei Ren, + , T-AC Oct 94 
2047-2060 

structurally uncertain systs , optimal Lyapunov fns Olas, A., T-AC Jan 94 
167-171 

switching surface design for multivariable VSS Ju-Jang Lee , + . T-AC 
Feb 94 414-419 

thrcc-block generalized/std Riccati eqns comparison Darouach, M, + , 
T-AC Aug 94 1755-1758 

time-invariant lin syst adaptive control Karason, S P, 4, T-AC Nov 94 
2325-2330 

time-invariant lin systs, stabil domains inversion. Walter, E, + , T-AC 
Apr 94 886-889 

time lag nonlin. systs., vibr. control. Lehman, B, +, T-AC May 94 898-912 
time-varying functional diff. eqns., delay independent stabil. conditions/ 
decay estim Lehman, B., 4 , T-AC Aug 94 1673-1676 
time varying systs, discrete time adaptive controller, global stabil 
Radenkovic, MS, 4 , T-AC Nov 94 2357-2361 
tip mass/cable syst., stabilization. Morgul, 0„ 4 , T-AC Oct 94 2140-2145 
transfer matrices preconditioning. Rotstein, B, T-AC Nov 94 2287-2292 
uncertain discrete-lime systs., variable struct, control design 
Myszkorowski, P, 4, T-AC Nov 94 2366-2367 
uncertain lin. systs., multivariable, stabil. margin cval. Gong, C„ +, T-AC 
Mar 94 548-550 

uncertain SISO min. phase lin. syst., nonlin universal servomechanism 
Ryan, E.P, T-AC Apr 94 753-761 

uncertain syst. robust stabil. anal., nonquadratic Lyapunov fn£ 
Zelentsovsky, A L., T-AC Jan 94 135-138 
uncertain systs., sampled-data controller design Dolphus, RM, T-AC May 
94 1036-1042 

Uncertain systs. stabilizing control design, quasiconvex optim. Keqin Gu, 
T-AC Jan 94 127-131 

variable struct. MRAC, I/O based, anal./design. Liu Hsu, 4, T-AC Jan 94 
4-21 

vibr. systs. control. Karl, WC„ +, T-AC Jan 94 222-226 
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Stability; cf. Absolute stability, Asymptotic stability, Input-output stability, 
Lyapunov methods, Nyquist stability. Robustness, Routh methods 
State estimation 

biased/unbiased controllers LTR design 7 uran L + T-AC Mar 94 
d01«*60S 

discrete-time Markovian jump I in systs , hn min MSE estim Costa 
OLV, T-AC Aug 94 1685-1689 

disturbance decoupled observer design Hou M + T-AC Jun 94 

1338-1341 

full-order/low-order observers, 1 H plant, unified ingular syst based 
theory Cobb J D, T-AC Dec 94 2497-2502 
guaranteed estim problem, Kalman-Bucy filter appl Golovan A + 
T-AC Jun 94 1282-1286 

I in discrete-time systs, optimum I in recursive estim Carazo AH + 
T-AC Aug 94 1636-1638 

MIMO stochastic discrete syst state estim LiuDanyang + 7 -AC Oct 94 
2087-2091 

nonlin syst state estim , conditionally min algm Pankov A R + T-AC 
Aug 94 1617-1620 

optimal cost control/filtenng, uncertain lin systs Petersen 1R + 7 -AC 
Sep 94 1971-1977 

optimal filtering, stochastic discrete-time systs, unknown inputs Borisov 
A V + T-AC Dec 94 2461-2464 

risk-sensitive estim and differential game Banavar RN + T AC Sep 
94 1944-1918 

stochastic lin discrete systs, state estim algm Ahmed MS T-AC Aug 
94 1652-1656 

State estimation; cf. Kalman filtering Observers 
Slate feedback 

adaptive tracking, feedback lin systs Marino R + T-AC Jun 94 
1314-1319 

CARMA plants stabilizing 1-0 receding horizon control Chisa / + 
T-ACMar 94 614-618 

cascaded nonlin systs global robusl stabilization Imura J I + T-AC 
May 94 1084-1089 

certainty equivalence control dyn games James MR 7 -AC Nov 94 
2321-2324 

constrained continuous-time sysls with cone props pos invariant sets 
l arbour lech S ■+ T-AC Peh 94 401-405 
corrections to Positively invariant sets for constrained continuous lime 
systems with cone properties’ (I eb 94 401-405) Tarbouriech S T-AC 
Aug 94 1771 

discrete-time nonlin control systs design via smooth feedback Wei Lin 
+ T AC Nov P4 2340-2346 

disturbance decoupling constrained Sylvester eqns Syrmos \ L T-AC 
Apr 94 797-803 

failure detect /isolation/accommodation syst Chia C hi Tsui T AC Nov 94 
2318-2321 

feedback control by multirate PWM hhayatian A + 7 AC Jun 94 

1292-1297 

llr control of discrete-time uncertain systs Gtromel J C + T-AC May 
94 1072-1075 

H*,-control state availability rel polc/zero cancellations Miyamoto S 
f T AC Feb 94 379-381 

H a -optim reduced order observer based controller Stoorvogel A A 4 
7 -AC Feb 94 355-360 

H x -optimal control for sing perturbed systs imperfect state meas / igang 
Pan + 7-AC Feb 94 280-299 

input-output decoupling nonlin interactor appl Ih Benedetto MD + 
T-AC Jun 94 1246-1250 

input-output lin state equivalence and decoupling In-Joong Ha + 7 -4C 
Nov 94 2269-2274 

interval LTf systs , stabilization Kehui Wei T-AC Jan 94 22-32 
left-invertible generalized state space systs, disturbance rejection 
Paraskevopoulos P N + T-AC Jan 94 185-190 
MIMO nonlin syst I/O pscudolineanzalion Lawrence DA + 7 -AC 
Nov 94 2207-2218 

mixed Hj/Hoo control, Nash game approach Limebeer DJN + T-AC 
Jan 94 69-82 

optimal cost control/filtenng, uncertain lin systs Petersen IR + T-AC 
Sep 94 1971-1977 

robots, robust control I in estim state feedback Berghuis H + 7 -AC 
Oct 94 2159-2162 

row-by-row stable decoupling, static state feedback, struct soln Martinez 
Garcia JC + T-AC Dec 94 2457-2460 
stabilization of discrete-time nonlin systs, feedback equivalence Byrnes 
Cl + T-AC Jan 94 MW 

state deadbeat control problem general soln Eldem V + T-AC May 94 
1002-1006 

state observer/feedback compensators, comment Bender D J T-AC Feb 
94 447-448 


uncertain nonlin systs, struct invariance Castro-Lmares, R, + , T-AC 
Oct 94 2154-2158 

vector discrete-event syst controller synthesis Yong Li + 7-AC Mar 94 
512-531 

State-space methods 

2D general discrete state-space models eigenvalues calc Zou Yun + 
T-AC Jut 94 1436-1439 

bilinear systs, state space anal, orthogonal series approach 
Paraskevopoulos PN + , T-AC Apr 94 793-797 
block multi rate input-output model for sampled-data control systs 
Jakubowski AM * T-AC May 94 1145-1147 
comments on “Stability margin evaluation for uncertain linear systems 1 
(by C Gong and S Thompson, Jun 94 548-550) Su J -H T-AC Dei 94 
2523-2524 

continuous descriptor systs, adaptive observer Vetake Y T-AC Oct 94 
2095-2100 

dyn shock-error models, online estim Knshnamurtky V T-AC May 94 
1129-1135 

extended Chandrasekhar recursions Saved AH ■+ T-AC Mar 94 
619-623 

fast ident of state-space models Young Man Cho + T-AC Oct 94 
2004-2017 

fmite-dimens controllers designed for infinite-dimens systs , state space 
Morris KA T-AC Oct 94 2100-2104 
four-block problem, recursive Schur-based soln Constantinescu T + 

T AC Jut 94 1476-1481 

generalized Chandrasekhar recursions from generalized Schur algm 
Sctyed AH + T-AC Nov 94 2265-2269 
H®-control, state availability rel, pole/zero cancellations Miyamoto S 
+ T-AC Feb 94 379-381 

H^optim , time-domain constraints Rotsteln H + T-AC Apr 941 62-779 
ident, state-space freq domain approach Bayard D S T-AC Sep 94 
1880-1885 

left-invertible generalized state space systs disturbance rejection 
Paraskevopoulos P N + T AC Jan 94 185-190 
lin -quadratic zero-sum differential games tor generalized state space 
systs HuaXu + T-AC Jan 94 143-147 
lin uncertain syst robust stabil guardian map approach ShuohRern 4 
T AC Jan 94 162-164 

1 n syst pos real control Weiqian Sun + I-AC Oct 94 2034-2046 
mixed 7 / 2 / H perform objectives Kemm /hou + f-AC Aug 94 
1564-1574 

mixed 7 / 2 ///* perform objectives optimal control Doyle J + T-AC Aug 
94 1575-1587 

multichannel gam margin improvement, sampled-data hold fns Chang 
Yang + T-AC Mar 94 657-661 

multiscale recursive estim data fusion/regularization Chou KC + 

T AC Mar 94 464-478 

multiscale systs Kalman filters Riccati eqns Chou KC + T-ACMar 
94 479-492 

nonlin syst H B control output-feedback based Lu W-M + T-AC Dec 
94 2517-2524 

output regulation, necessary condition luubello P T-AC Mar 94 
558-559 

pos realness conditions charactn assuming controllability Weiss H + 

I-AC Mar 94 540-544 

rational transfer fn , optimal 1 * approx Kavranoglu D + T-AC Sep 94 
1899-1904 

recursive prediction error algm log likelihood fn derivatives Hooker 
MA T-AC Mar 94 662-664 

uncertain hn systs , multivariable, stabil margin eval Gong C + T-AC 
Mar 94 548-550 
Statistics 

alternating renewal elec load models Ll-Fertk S + 1-AC Jun 94 
1184-1196 

discrete time Hammerstein syst ident Lang Zi-Qiang T-AC Mar 94 
569-573 ' " 

dyn syst simul, fast parallel recursive aggregation Tsai WK a T-AC 
Mar 94 534-540 

guaranteed param estim problem with uncertain stats, Kalman-Bucy 
filter accuracy Matasov A J T-AC Mar 94 635-639 
Kautz models for syst ident Wahlberg B 7-AC Jun 94 1276-1282 
least sq estim in white noise, convergence Nasstri-Toussi A + T-AC 
Feb 943 64-368 

MIMO stochastic discrete syst state estim Liu Danyang + T-AC Oct 94 
2087-2091 

model struct selection test, instrumental variable, statist props Hoai 
Nghia Duong + T-AC Jan 94 211-215 
multi-DOF nonlin damping model, spectral dens Weijian Zhang 7-AC 
feb 94 406-410 

multipoint boundary value stochastic systs, stationarity/reciprocity Jie 
Chen + T-ACMay 94 1114-1116 
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stochastic tin. discrete systs., state estim. aJgm. Ahmed, MS ., T-AC Aug 
94 1652-1656 

Stochastic approximation t 

20 stochastic approx., convergence/diff. cqn. limit. Dye-Jyun Ma, + , 
T-AC Jul94 1439-1442 

recursive idem., non!in. Wiener model. Wigren, T., T-AC Nov 94 
2191-2206 

Stochastic differential equations 

almost sure sample stabil. of nonlin stochastic dyn. systs. Zhi Yu Zhang, 
+ , T-AC Mar 94 560-565 

infinite horizon optimal control of stochastic systs. Runoffs son, T, T-AC 
Aug 94 1551-1563 
Stochastic games 

flow control zero sum Markov games. Altman, E., T-AC Apr 94 814-818 
infinite horizon optimal control of stochastic systs. Runolfsson, T., T-AC 
Aug 94 1551-1563 

risk-averse decentralized discrete-time LEQG games. Srtkant, R, T-AC 
Apr 94 861-864 
Stochastic optimal control 

infinite horizon optimal control of stochastic systs. Runolfsson, T, T-AC 
Aug 94 1551-1563 

I in exponential Gaussian control problem Chih-Hai Fan, + , T-AC Oct 
94 1986-2003 

partially obs. discrete-time nonlin systs., risk-sensitive control and dyn 
games. James, M.R., + . T-AC Apr 94 780-792 
Stochastic optimal control, linear systems; cf. Linear-quadratic-Gaussian 
control 

Stochastic processes 

distributed algms, random proc. failures Papavassllopoulos, G.P, T-AC 
May 94 1032-1036 

dyn. job shops/prod, planning. Sethi, S., + , T-AC Oct 94 2061-2076 
event graphs, stochastic, cycle time Proth, J -M„ + , T-AC Jul 94 
1482-1486 

machine capacity allocation. Nam, P., +, T-AC Sep 94 1853-1855 
muHiscale recursive cstim., data fusion/regularization. Chou, K.C, + , 
T-AC Mar 94 464-478 

packet switching, flow control, rate-based, monotonicity/concavity. 
Rudka, K C, T-AC Mar 94 544-548 

routing, limited state inform, in queueing systs , blocking. Sparaggis, P D , 
+ , T-AC Jul 94 1492-1497 

timed-event stochastic graphs, superposn. props./perform, bounds. 
Xiao-tan Xie, T-AC Jul 94 1376-1386 
Stochastic processes; cf. Autoregressive moving-average processes; 
Innovations methods (stochastic processes), Laguerre processes; 
Markov processes; Poisson processes, Regenerative stochastic 
processes, Time-varying stochastic processes, Wiener processes 
Stochastic systems 

adaptive stochastic control algms , unmodified, robustness Radenkovic, 
MS, + , T-AC Feh 94 396-400 

adaptive stochastic prediction/MRAC. Wei Ren, + , T-AC Oct 94 
2047-2060 

almost sure sample stabil of nonlin. stochastic dyn. systs. Zhi Yu Zhang, 
+ , T-AC Mar 94 560-565 

bilinear systs , adaptive stabilization, least sq. Xi Sun, + , T-AC Jan 94 
207-211 

exact lin. of stochastic dyn. systs Socha, L, T-AC Sep 94 1980-1984 
Gaussian stochastic control syst. tuning vanSchuppen, J H, T-AC Nov 94 
2178-2190 

lin. discrete-lime systs., state eslim. Ahmed, MS, T-AC Aug 94 1652-1656 
MIMO stochastic discrete syst. state estim. Liu Danyang, +, T-AC Oct 94 
2087-2091 

min. stochastic realizations, parameterization Ferrante, A , T-AC Oct 94 
2122-2126 

multipoint boundary value stochastic systs, stationarity/reciprocity. Jte 
Chen, +, T-AC May 94 1114-1116 

nonlin. syst. stale estim., conditionally min. algm. Pankov, A.R, + , T-AC 
Aug 941617-1620 

optimal filtering, stochastic discrete-time systs., unknown inputs. Borisov, 
A.V., +, T-AC Dec 94 2461-2464 

param hypothesis testing, stochastic lin systs., stat. sampling. Duncan, 
T.E., + . T-AC Jan 94 118-122 

risk-averse decentralized discrete-time LEQG games. Srtkant, R., T-AC 
Apr 94 861-864 

Stochastic systems; cf. Stochastic optimal control 
Suboptimal control 

cat, falling, near-optimal nonholonomic motion planning, appl. 

Fernandes, C., + , T-AC Mar 94 450-463 
model matching, suboptimal perfect, with noise, MRACS. Mutoh, Y., +, 
T-AC Feb 94 422-425 

rational L x suboptimal compensators for continuous-time systs. Blanchtni, 
F.,+ , T-AC Jut 94 1487-1492 


Switched systems 

common Lyapunov fh. t stable LT1 systs., commuting A-matrices. 
Narendra, K S. t + , T-AC Dec 94 2469-2471 

Synchronization 

timed discrete-event systs., supervisory control. Brandin, BA., + , T-AC 
Feh 94 329-342 

Synchronous generator excitation; cf. Power generation control, excitation 

System identification 

cyclicly switched param.-adaptive control systs., MIMO design 
modcIs/inlernaJ regulators. Morse, AS ., + , T-AC Sep 94 1809-1818 
distributed param. systs., adaptive estim., persistence of excitation. 

Demetriou, MA., + , T-AC May 94 1117-1123 
finite-dimens, model validation, output error, test horizon. Hoai Nghia 
Duong, + , T-AC Jan 94 102-J 06 

floating platform, modeling and control. Damen, A.A.H., + , T-AC May 94 
1075-1078 

Hoo control syst. ident, least sq. methods, comment. Livstone, MM , + , 
T-AC Jul 94 1531 

Kautz models for syst. ident. Wahlberg, B, T-AC Jun 94 1276-1282 
least sq. estim,, efficient algm. Rafajlowtcz, E , + , T-AC Jun 94 1241-1243 
norms for syst ident. Massoumnia, M-A., + . T-AC May 94 1027-1031 
param.-adaptive control, cyclic switching strategy. Pait, FM, + , T-AC 
Jun 94 1172-1183 

recursive ident., nonlin. Wiener model. Wigren, T., T-AC Nov 94 
2191-2206 

robust adaptive pole placement control Weyer, E, ■+ , T-AC Aug 94 
1665-1671 

slowly varying and LTI systs , ident. and uncertainty principles Zames. 
G, +. T-AC Sep 94 1827-1838 

state-space freq. domain method. Bayard, DS., T-AC Sep 94 1880-1885 
state-space models fast ident. Young Man Cho, + , T-AC Oct 94 2004-2017 
struct, selection test, instrumental variable, statist, props. Hoai Nghia 
Duong. + . T-AC Jan 94 211-215 

time-varying params, bounded error ident Bittanti, S, + . T-AC May 94 
1106-1110 

Wiener systs nonparametric ident. Grebltcki, W, T-AC Oct 94 2077-2086 
worst-case ident anal., BIBO robustness Partington, J R, + , T-AC Oct 
94 2171-2176 

worst case ident., H“ model validation Guoxiang Gu, T-AC Aug 94 
1657-1661 

worst-case syst. ident., time complexity. Poolla, K, + . T-AC Mav 94 
944-950 

System identification; cf. Parameter identification. Power system 
identification 

System reliability; cf. Reliability 


Time-domain analysis 

model validation, time-domain approach. Poolla, K, + , 7 -AC May 94 
951-959 

Time-optimal control 

robustness of proximate time-optimal controller Pao, L Y, + , T-AC Sep 
94 1963-1966 

Time series; cf. Autoregressive moving-average processes 

Time-varying stochastic processes 

random param tracking, robust algm. Juditsky, A, + , T-AC Jun 94 
1211-1221 

Time-varying systems 

bounded error ident Bittanti, S, + , T-AC May 94 1106-1110 

bounded-error tracking of time-varying params Piet-Lahanier, H, + , 
T-AC Aug 94 1661-1664 

delay-independent exponential stabil. criteria for time-varying discrete 
delay systs. Wu, JW.,+, T-AC Apr 94 811-814 

discrete-time adaptive controller, global stabil. Radenkovic, MS, + , 
T-AC Nov 94 2357-2361 

discrete time-varying systs., robust adaptive controller. Changyun Wen, 
T-AC May 94 987-991 

exponential lin. quadratic optimal control, discounting. Hopkins, W.E, Jr, 
T-AC Jan 94 175-178 

functional diff. eqns., delay independent stabil. conditions, decay estim. 
Lehman, B., +, T-AC Aug 94 1673-1676 

impulse differential lin. systs., constrained controllability. Benzaid, Z., +, 
T-AC May 94 1064-1066 

infinite time-varying LQ-problem, approx, soln. Engwerda, JC, T-AC 
Jan 94 235-238 

MIMO stochastic discrete syst. state estim. Liu Danyang, +, T-AC Oct 94 
2087-2091 

nonlin. systs, vibr. control by AP-forcing. Balestrmo, A., + , T-AC Jun 94 
1255-1258 

P-type learning control. Saab, SS., T-AC Nov 94 2298-2302 


+ Check author entry for coauthors 
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quadratic stabilization of continuous time systs. Mahmoud, M.S,+, T-AC 
Oct 94 2135-2139 

random param. tracking* robust algm Juditsky, A, + , T-AC Jun 94 
J211-122! 

robust stabil., time-varying struct, uncertainty. Shamma, J.S , T-AC Apr 94 
714-724 

slowly varying and LTI systs., ident and uncertainty principles lames, 
G, + , T-AC Sep 94 \%21-\m 

stabilization, adaptive, nonlin. time-varying controller Miller, 0 E, T-AC 
Jul 94 1347-1359 

uncertain nonlin interconnected systs, time-varying state delay, 
stabilizing control Mahmoud, M S . 7\4C Dec 94 2484-2488 
Time-varying systems; cf. Jump parameter systems, Switched systems 
Toeplitz matrices 

generalized Chandrasekhar recursions from generalized Schur algm 
Sayed, A H f +, T-AC Nov 94 2265-2269 
state-space models fast ident Young Man Cho, +, T-AC Oct 94 2004-2017 
Topology 

output regulation, necessary condition Lucibello, P , T-AC Mar 94 
558-559 

Tracking 

constrained robots, force/motion control Grabbe, MT, + , T-AC Jan 94 
179 

deadbeat ripple-free tracking Jetto, l , T-AC Aug 94 1759-1764 
discrete-time nonlin syst control, stabil property Jie Huang, + , 7-dC 
Nov 94 2307-2311 

feedback lin. systs adaptive tracking Marino, R , + , Jun 94 
1314-1319 

flexible joint robots with uncertain params and disturbances, tracking 
control Tomei, P . T-AC May 94 1067-1072 
MIMO nonlin syst I/O pscudolinearization Lawrence, DA,+, T-AC 
Nov 94 2207-2218 

MRAC perform anal /improvement, new tracking error criteria Datlam, 
A v, T-AC Dec 94 2370-2387 

nonlin systs , asymptotic tracking, necessary conditions Grizzle, J W, 4 , 
T-AC Sep 94 1782-1794 

nonlin uncertain systs tracking control Song, YD, + , T-AC Sep 94 
1866-1871 

output regulation, necessary condition Lucibello, P, T-AC Mar 94 
558-559 

prod control methods, distributed, stabil and perform Sharifma A , T AC 
Apr 94 725-737 

rigid robotic manipulators, robust tracking control Man Zhihong, +, T-AC 
Jan 94 154-159 

ripple tree sampled-data robust servomechanism controller, exponential 
hold Yung-Chun Wu, + T-AC Jun 94 1287-1291 
robot adaptive control based on passivity Yu Tang, + , T-AC Sep 94 
1871-1875 

robust stabilization/perform , MRAC modeling error compensation Sun, 
J, + T- iC Mar 94 630-635 

sampled data control systs and tracking, In space approach Yamamoto, 
Y, T-AC Apr 94 703-713 

single-arm dyn , robust variable struct /and switching-^ adaptive control 
Li-Wen Chen, + , T-AC Aug 94 1621-1626 
time-varying params, bounded error ident Bittanti S, + , T-AC May 94 
1106-1110 

Tracking; cf. Radar tracking. Space vehicle tracking 
Tracking loops 

range tracking loops, large deviation anal Dembo, A , + , T-AC Feb 94 
360-364 

Transducers; cf. Multisensor systems 
Transfer function matrices 

dyn lin systs, pos real lemma generalization Scherer, R,+, T-AC Apr 
94 882-886 

H®-norm approx of systs by const matrices Kavranoglu, D, T-AC May 
94 1006-1009 

H® optim , time-domain constraints Rotstem, H, +, T-AC Apr 94 762-779 
LTI syst pos real control Weiqian Sun, + , T-AC Oct 94 2034-2046 
multivariable lin systs, hysteresis switching MRAC Welter, SR, + , 
T-AC Jul 94 1360-1375 

neutral systs, stabil and stabilizability Logemann, H, + , T-AC Jan 94 
138-143 

transfer matrices preconditioning. Rotstem, B, T-AC Nov 94 2287-2292 
Transfer functions 

block multirate input-output model for sampled-data control systs 
Jakubowski, A M, T-AC May 94 1145-1147 
closed loop syst freq response plots, comment. Gessmg, R., T-AC Aug 94 
1770-1771 

closed-loop vibr control. Youla parameterization Kabamba, PT, + , 
T-AC Jul 94 1455-1459 

continuous coprime factors. Treil, S, T-AC Jun 94 1262-1263 


continuous time systs., transfer fns., model errors. Schoukem, J„ +, T-AC 
Aug 94 1733-1737 

decentralized stable factors and parameterization of decentralized 
controllers. Date, R., + , T-AC Feb 94 347-351 
delay systs- approx , Laguerrc formula. Lam,J, T-AC Jul 94 1517-1521 
delta-operator formulated discrete-time approx. Premaratne, K, +, T-AC 
Mar 94 581-585 

discrete-time filters from high-order s-to-z mappings Schneider, AM, +, 
T-AC Feb 94 435-441 

Hod model reduction computational scheme Kavranoglu, D„ T-AC Jul 94 
1447-1451 

H 2 -optimal zeros placement Kabamba, PT, + , T-AC Jun 94 1298-1301 
H® control of discrete-time uncertain systs Geromel, JC, + , T-AC May 
94 1072-1075 

lin. syst approx , time-scaling factor, Laguerrc models Wang , L,+. T-AC 
Jul 94 1463-1467 

MIMO lin systs with sens./actuator failures, stabil Gundes, A N , T-AC 
Jun 94 1224-1230 

mixed HifH ® perform objectives Kemtn Zhou, + , T-AC Aug 94 
1564-1574 

param ident in freq domain Pmtelon, R, +, T-AC Nov 94 2245-2260 
rationalx transfer fn , optimal L* approx Kavranoglu, D, + , T-AC Sep 
94 1899-1904 

robust stabilization/perform , MRAC modeling error compensation. Sun, 
J,+, T-AC Mar 94 630-635 

sampled data control systs and tracking, fn. space approach Yamamoto 
Y , T-AC Apr 94 703-713 

semi-cancellable fraction transfer fns in syst theory Bourles, H, T-AC 
Oct 94 2148-2153 

SISO-distributed plants, optimal mixed sensitivity Flamm, DS, +, T-AC 
Jun 94 1150-1165 

S1SO systs , robust stabilization Olbrot, A W, + , T-AC Mar 94 652-657 
SISO systs with affine param uncertainties, robust perform Kogan, J, 
T-AC Jan 94 221-229 

stabilisation, uncertainties/numerator-denominator coupling Chockalmgam, 
G,+, T-AC Sep 94 1955-1958 

slate delayed systs , memoryless H“ controllers Joon Hwa Lee, + , T-AC 
Jan 94 159-162 

state observer/feedback, compensators, comment Bender, DJ, T-AC Feb 
94 447-448 

state-space models fast ident Young Man Cho, +, T-AC Oct 94 2004-2017 
uncertain syst pole assignment, polynomial approach Figueroa, JL , +, 
J-AC Apr 94 831-835 

uncertain transfer fns , value sets comp Gutman, P -O -t . T-AC Jun 94 
1268-1273 

Transforms; cf. Fourier transtorms, Laplace transforms. Wavelet transforms, 
Z transforms 

Trees, graphs 

multiscale systs, Kalman fillers, Riccati eqns Chou, KC, + , T-AC Mar 
94 479-492 

robust anal, uncertainty value sets Eszter, EG, + , T-AC Nov 94 
2315-2318 

Tuning 

Gaussian stochastic control syst tuning van Schuppen, J H, T-AC Nov 94 
2178-2190 

Smith predictor for controlling proc., integrator and long dead-time 
Astrom, KJ, + , T-AC Feb 94 343-345 

Two-dimensional systems; cf. Multidimensional systems 


11 

Uncertain systems 

cascaded nonlin systs , global robust stabilization, lmura, J-l, + , T-AC 
May 94 1084-1089 

comments on “Stability margin evaluation for uncertain linear systems” 
(byC Gong and S Thompson, Jun 94 548-550) Su,J-H, T-AC Dec 94 
2523-2524 

continuous-time adaptive decoupling control design Ortega, R, +, T-AC 
Aug 94 1639-1643 

design for plants, unknown dead-zones. Gang Tao, +, T-AC Jan 94 59-68 

discrete-time lin. systs., pos. invariant sets. De Santis , E, T-AC Jan 94 
245-249 

discrete-time uncertain systs, ultimate boundedness control, Lyapunov 
Blanchmt, F„ T-AC Feb 94 428-433 

D-stabil., robust, for lin uncertain discrete delay systs Te-JenSu, •* , T-AC 
Feb 94 425-428 

failure detect./iso I at ion/accommodation syst Chia-Chi Tsui, T-AC Nov 94 
2318-2321 

H* control of discrete-time uncertain systs Geromel, JC, + , T-AC May 
94 1072-1075 

H® control of systs. under norm bounded uncertainties. Gu, K, T-AC Jun 
94 1320-1322 


+ Check author entry for coauthors 
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interval lin. control sytt-robust parametric thsipi. Ktel, 

M 94 1524-1530 • 

tin. quadratic designs, real param. uncertainty, robust DougiaS, , 
T-AC Jan 94 107011 

MRAC, robust, SISO systs ZMhua Qu, +■, T-AC Nov 94 2219-2254 
nontin. interconnected systs., time-varying state delay, stabilizing control. 
Mahmoud. MS , T-AC Dec 94 2484-2488 

nonlin. uncertain systs. tracking control. Song, Y.D., + , T-AC Sep 94 
1866-1871 

optimal cost control/filtering, uncertain lin. systs. Petersen, LA, +, T-AC 
Sep 94 1971-1977 

optimal Altering, stochastic discrete-time systs., unknown inputs. Borisov, 
A.V.y +. T-AC Dec 94 2461-2464 

parametrically uncertain nonlin. feedback systs., absol. stabil. Marquez, 
HJ, + , T-AC Mar 94 664-668 

parametrically uncertain systs., anal, and synthesis. Kaminsky, R.D„ +, 
T-AC Apr 94 874-876 

quadratic stabilization of continuous time systs. Mahmoud, M.S„ +, T-AC 
Oct 94 2135-2139 

robust anal., uncertainty value sets. Eszter\ E.G., + , T-AC Nov 94 
2315-2318 

robust stabil., time-varying struct, uncertainty. Shamma, J.S., T-AC Apr 94 
714-724 

SISO systs. with affine param. uncertainties, robust perform. Kogan, J., 
T-AC Jan 94 227-229 

slowly varying and LT1 systs., ident. and uncertainty principles Zames, 
G„ +., T-AC Sep 94 1827-1838 

stabilisation, unccrtainties/numerator-denominalor coupling. Chockalingm, 
G., +, T-AC Sep 94 1955-1958 

stabil. of uncertain lin. systs., saturating actuators. Jin-Hoon Kim, +, T-AC 
Jan 94 202-207 

structurally uncertain systs , optimal Lyapunov fns. Olas, A,, T-AC Jan 94 
167-171 

time-invariant lin. systs. controllability/observability. Kaining Wang, + , 
T-ACJul 94 1443-1447 

uncertain continuous-time implicit systs., regional pole placement, 
robustness. Chun-Hsimg Fang, + T-AC Nov 94 2303-2307 
uncertain discrete-time systs., variable struct, control design. 

Myszkorowski, P., + , T-AC Nov 94 2366-2367 
uncertain lin. systs., delay depend., robust stabil. BugongXu, T-AC Nov 
94 2365 

uncertain lin. systs,, multivariable, stabil. margin eval. Gong, C . + , T-AC 
Mar 94 548-550 

uncertain nonlin. systs,, struct, invariance. Castro-Linares, R., + , T-AC 
Oct 94 2154-2158 

uncertain SISO min. phase lin. syst., nonlin. universal servomechanism. 
Ryan, E.P., T-AC Apr 94 753-761 

uncertain syst. pole assignment, polynomial approach. Figueroa, J.L., 
T-AC Apr 94 831-835 

uncertain syst. robust stabil. anal., nonquadratic Lyapunov fns. 

Zelentsovsky, A.L, T-AC Jan 94 135-138 
uncertain syst, robust stabil., guardian map approach. Shuoh Rern, + , 
T-AC Jan 94 162-164 

uncertain systs., asymptotic stabil. region estim. Man Ho Choi, + , T-AC 
Nov 94 2275-2278 

uncertain systs., sampled-data controller design. Dolphus, R.M., T-AC May 
94 1036-1042 

uncertain systs. stabilizing control design, quasiconvex optim. Keqtn Gu, 
T-AC Jan 94 127-131 

uncertain transfer fns., value sets comp. Gutman, P.-O., + . T-ACJun 94 
1268-1273 

unknown input observers design. Darouach, M., T-AC Mar 94 698-699 
VSS control design for uncertain discrete-time systs. Wen-June Wang, +, 
T-ACJan94 99-m 

u calc., computational complexity. Braa/z, R.P., + , T-AC May 94 
1000-1002 


V Jr, ' 

Variable-structure systems 

constrained robots, force/motion control. Grabbe, M.T., + , T-AC Jan 94 
179 


discrete-time uncertain systs., ultimate boundedness control, Lyapunov. 
Blanchini, ft, T-ACFeb 94 428433 

generator voltage regulator, pole assignment tech. Aggoune, M.E., + . 
T-AC Oct 94 2\Q6-2\\0 

MRAC, variable struct. I/O based, anal./design. Liu Hsu, +, T-AC Jan 94 
4-21 

rigid robotic manipulators, robust tracking control A/nn Zhihong, +, T-AC 
Jan 94 154-159 

robot variable struct, control schemes. Bin Yao, +, T-AC Feb 94 371-376 
robust control design, cascade struct, approach. Bonivento, C. f + T-AC 
Apr 94 846-849 

robust MIMO terminal sliding-mode control, rigid robotic manipulators. 

Zhihong, M., +, T-AC Dec 94 2464-2469 
single-arm dyn., robust variable struct./and switching-^ adaptive control. 

Li-WenChen, + , T-AC Aug 94 1621-1626 
switching surface design for multivariable VSS. Ju-Jang Lee, + , T-AC 
Feb 94 414-419 

uncertain discrete-time systs., variable struct, control design. 

Myszkorowski, P., + , T-AC Nov 94 2366-2367 
uncertain discrete-time systs., VSS control design. Wen-June Wang, + , 
T-AC Jan 94 99-102 

uncertain systs., asymptotic stabil. region estim. Han Ho Choi, + , T-AC 
Nov 94 2275-2278 
Vectors 

comments, with reply, on “Vector norms as Lyapunov functions for linear 
systems” (by H. Kiendl et al., Jun 92 839-842). Hmamed, A., T-AC Dec 
94 2522-2523 
Velocity control 

robust control design, cascade struct, approach. Bonivento, G, + . T-AC 
Apr 94 846-849 
Very-large-scale integration 

constrained nonlin. syst receding horizon control. Shin-Yeu Lin, T-AC Sep 
94 1893-1899 
Vibration control 

closed-loop vibr. control, Youla parameterization. Kabamba, P.T., + , 
T-ACJul 94 1455-1459 

nonlin. uncertain systs. tracking control. Song, Y.D., + , T-AC Sep 94 
1866-1871 

second-order dyn. systs., dissipative controller designs. Morris, K.A., + , 
T-AC May 94 1056-1063 

structural wave control, reduced-order model. Quan Wang, + , T-AC Aug 
94 1711-1713 

time log nonlin. systs., vibr. control. Lehman, B,, +, T-AC May 94 898-912 
Vision systems (noo-biological); cf. Robots, vision systems 
VLSI; cf. Very-large-scale integration 


W 


Wavelet transforms 

multiresolutional distributed filtering. Lang Hong, T-AC Apr 94 853-856 
White noise 

least sq. estim. in white noise, convergence. Nassiri-Toussi, K„ + , T-AC 
Feb 94 364-368 

model struct, selection test, instrumental variable, statist, props. Hoai 
Nghia Duong, + , T-AC Jan 94 211-215 
Wiener processes 

recursive ident., nonlin. Wiener model. Wigren, T., T-AC Nov 94 
2191-2206 

Wiener systs. nonparametric ident. Greblicki, W., T-AC Oct 94 2077-2086 


Z 

Zero assignment 

Hi-optimal zeros placement. Kabamba, P.T., + , T-AC Jun 94 1298-1301 

Z transforms 

transfer fn. param. ident. in freq. domain. Pintelon, R., + , T-AC Nov 94 
2245-2260 


* Chert coauthors 
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Scanning the Issue* 


Adjoint and Hamiltonian Input-Output Differential 
Equations, Crouch, Lamnabhi-Lagarrigue , and van der 
Schaft . , 

In 1887, Helmholtz established conditions under which a set of 
second-order nonlinear differential equations are, or can be trans¬ 
formed into, Lagrangian (or Hamiltonian) form. This paper gener¬ 
alizes the classical Helmholtz conditions to nonlinear input-output 
differential systems, i.e., to nonlinear control systems.lThe adjoint 
variational system and the meaning of self-adjointness for a general 
input-output system are defined in the course of the analysis. The 
results employ recent developments in the theory of variational and 
Hamiltonian control systems. A main advantage of the results of this 
paper over previous related results is their verifiability directly in 
terms of the given input-output model. Several examples are given 
illustrating the results and associated computations. 

Language Convergence in Controlled Discrete-Event 
Systems, Willner and Heymann. 

This paper addresses the issue of how to supervise a discrete- 
event system modeled by a finite automaton so that after a bounded, 
but unknown, number of transitions, the behavior of the controlled 
system will conform to some specified behavior. Several issues 
within this problem area are addressed. Necessary and sufficient 
conditions for finite convergence are provided, and algorithms for 
implementing these tests are developed. These are then applied to 
achieving language convergence through control. This is discussed 
within the context ol finite string behavior as well as asymptotic 
behavior. The main contribution is a detailed analysis of existence and 
computation ol supervisors to achieve eventually correct supervisory 
control. 

Concurrent Vector Discrete-Event Systems, Li and Wonham . 

In earlier papers published in these transactions, the authors 
have addressed the control of vector discrete-event systems (VDES). 
VDES are a class of discrete-event models where the state is 
represented by a vector of integers and state transitions are manifested 
through integer vector addition. This paper extends VDES control 
theory by incorporating simultaneity of events into the model and 
by considering nondeterminism of controllers. The extended VDES 
model with simultaneous (or concurrent) events is termed concurrent 
VDES (CVDES). The main pan of the paper deals with nondetermin- 
istic control of CVDES. The nondeterminism comes from the fact that 
the control action issued by the controller at each step is "randomly" 
(i.e., according to some hidden mechanism) chosen among a set of 
possible control actions. The authors show how to exploit the power 
of nondcterministic control in the presence of concurrency. 

Structure of Model Uncertainty for a Weakly Corrupted 
Plant, Zhou and Kimura. 

Robust control paradigms are typically based on a plant model 
set described by a nominal model and a bound on the distance, 
in a certain metric, between this nominal model and all other 
models in the set. For paradigms to be usable, a procedure must be 
available to determine an appropriate model set from available plant 
information. Several philosophies have been put forth that assign 
different meanings to "appropriateness.' 1 Consistent with the control 
motivation, the philosophy embraced in the present paper is that of 

*This section is written by the Transactions Editorial Board. 


"worst case deterministic" modeling, by which the model set is to be 
the smallest set of plants which includes all those which could have 
produced the available plant information, under given assumptions on 
the noise, and which satisfy given a priori conditions. Specifically, 
the first task undertaken by the authors is to characterize the set 
of all linear, lime-invariant, stable transfer function matrices, with 
the norm below a prescribed threshold, which are consistent 
with given experimental data known to be weakly affected by noise. 
The key result here is a parameterization of This set as the linear 
fractional transformation of a fixed transfer function matrix and a 
structured, norm bounded, free transfer function matrix. Next, with 
the robust control problem in mind, the authors consider determining 
the smallest superset of the set just characterized that consists of a 
"nominal" transfer function matrix of prescribed order, and of an H n0 
"ball" around it. They show that this problem can be expressed as a 
mixed-?u synthesis problem. 

Explicit Formulas for Optimally Robust Controllers for 
Delay Systems, Dym, Georgiou , and Smith. 

H ^,control for distributed parameter systems, initiated in the mid 
1980’s, focused first on developing formulas to provide further insight 
into limitations on performance due to infinite dimensional elements 
(such as time-delays) and second, on developing a direct approach 
to design which does not rely on first finding a finite dimensional 
approximation of the plant. Such an approach has the potential to shed 
light on the structure of ideal compensators for distributed parameter 
systems. 

In this paper, the authors consider single-input/single-output sys¬ 
tems whose transfer functions consist of a strictly proper rational 
function times a delay. They present a closed-form expression for 
the controller which is optimally robust with respect to perturbations 
of the nominal plant measured in the gap metric. The derivations 
involve a synthesis of state-space techniques and the use of a certain 
algebra of "pseudo-derivation" operators, which amounts to a specific 
class of distributed delays. The controller transfer function has a 
simple expression involving state-space matrices computed from the 
rational part of the plant together with a distributed delay element. 
The expression makes it quite straightforward to carry out directly 
the H^ loop-shaping procedure of Glover-McFarlane for the class of 
plants considered and rational weights. An example design is worked 
out to illustrate the technique and a comparison is made with a Smith 
predictor. 

Stochastic System Identification with Noisy Input»Output 
Measurements Using Polyspectra, Tugnait and Ye. 

In this paper, parameter estimation in linear S1SO systems (with 
prescribed orders) is considered. Here inputs and outputs are assumed 
to be contaminated by noise, and identifiability is obtained by addi¬ 
tional assumptions on the probability distributions of the signal and 
noise processes, respectively. Based on a simple relation between the 
true transfer function and some (population) bispectra or integrated 
polyspectra, first estimators of the transfer function are derived from 
the corresponding sample counterpoints. From these, two classes of 
estimators for the transfers function, one called linear (which, strictly 
speaking is also nonlinear) and one called nonlinear, are derived. 
Consistency is analyzed, and bias and variance of the estimators are 
compared (also with other, already existing, estimators) by means of 
a Moute-Carlo simulation. 
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Minimum Bias Priors for Estimating Parameters of models that depend linearly on the state but possibly nonhnearly on 

Additive Terms In State*Space Models, Hochwald and the parameters, where both the state and observation are corrupted 

Nthorai, by additive noise, A prior density for the parameters is introduced 

We treat the problem of estimating parameters of additive terms, that, when combined with the likelihood function to form a posterior 
sometimes called bias terms, in state-space models. We consider density, minimizes the bias of the posterior mean. 
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Adjoint and Hamiltonian 
Input-Output Differential Equations 

Peter E. Crouch, Senior Member , IEEE, Francoise Lamnabhi-Lagarrigue, and Arjan J. van der Schaft 


Abstract — Rased on recent developments in the theory of 
variational and Hamiltonian control systems by Crouch and 
van der Schaft, this paper answers two questions: given an 
input-output differential equation description of a nonlinear 
system, what is the adjoint variational system in input-output 
differential form and what are the conditions for the system to 
be Hamiltonian, i.e., such that the variational and the adjoint 
variational systems coincide? This resulting set of conditions is 
then used to generalize classical conditions such as the well- 
known Helmholtz conditions for the inverse problem in classical 
mechanics. 


I. Introduction 

T HE work we arc describing in this paper has its roots 
in a very old problem in classical mechanics, where 
one asks which Newtonian systems correspond to Lagrangian, 
or variational systems; the so-called inverse problem. There 
are many variants of this problem, see Santilli [14], but the 
simplest one can be stated as follows. 

II q e 11 n is a configuration variable, which satisfies the 
Newtonian system 


F(q. q, q) - 0 (I) 

for a smooth R" valued mapping F, such that OF/dq is a 
nonsingular matrix in a suitable open domain, then under what 
conditions does there exist a function L of r/, ry, so that for 
some ordering of the variables of q 

d »i/ ol t , 

= a) 

The conditions under which this property holds are known 
as the classical Helmholtz conditions, see Santilli [14] where 
generalizations to functions F depending on arbitrary finite 
jets of q are also considered. 

The condition on the rank of OF/Oq in the system (I) above 
enables those systems satisfying (2) to be written also as a 
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Hamiltonian set of equations 


<7 = 


dn T 


dp 

0H T 

dq 


(<7, P ) 


(0. P) 


(3) 


where H(p, q) is the Hamiltonian function of the system, and 
(p, q) arc coordinates on the symplectic phase space R 2n . A 
result of Brockelt and Rahimi [1] is also of interest in this 
context and concerns the linear system E 


t = At + B9i: .r(0) = 0 

!) - Cx (4) 


in which x e R r \ u, y C R"\ and the so-called adjoint system 
E a (see also [12]) 

P = - A T p - C T v „; p( 0) = 0 
!/« = B 1 p. (5) 


It was shown with the minimality of both systems, together 
with the “self-adjointness” condition that the input-output 
maps of £ and E n coincide, that this is equivalent to the 
fact that the system £ has another internal representation as a 
linear “Hamiltonian Contror system 


P = 

q = 

y = 


OH 7 

dq 

dH T 


(P' <1- «) 


dp 

OH 7 ' 

On 


(P< < 7 . «) 


(P . 0 . «) 


( 6 ) 


where for a linear Hamiltonian system 


//(/>, </, u) = ^{p 7 , q J )F 


+ iP T , q r )Gu 


for some matrices F and G. 

The term “self-adjointness” is also used to describe the con¬ 
ditions which ensure that a Newtonian system does correspond 
to a Lagrangian, or Hamiltonian, system. This is explained by 
performing integration by parts to give an expression 



= f\lF*(t,LOdt 

Jo 


+ Q{q 1 0. 0, <}v, qv, ijv, t Olo (7) 
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for some functions F" and Q. The adjoint variational system to 
the variational system +§£ 9® - 0 corresponding 

to (1) is then 

*"(€, t 0 = o- 


Self-adjointness of F is simply the statement that 




Work by Crouch and van der Schaft [5], [6], made an 
extensive investigation and generalization of the result by 
Brockett and Rahimi [1] to nonlinear control systems, based 
on earlier work by van der Schaft [IS]. In particular, a state 
space form of the variational and adjoint variational systems 
was introduced, generalizing the relationship between E and 
E° to nonlinear systems. The concept of self-adjointness was 
correspondingly generalized, and under suitable hypotheses it 
was shown that self-adjointness is necessary and sufficient for 
Hamiltonian realizations of input-output maps. Moreover, the 
self-adjointness condition was successfully interpreted in terms 
of the Volterra series and Fliess series. See also Jakubczyk [12] 
for a generalization to control systems with control entering 
in a nonaffine manner. 

This work, although implicitly generalizing the classical 
Helmholtz condition, fails to give conditions in terms of the 
differential equation representation of the input-output map, 
generalizing the system representation given by (1), and here 
represented by an equation of the form 

*W> = 0 (8) 


where F is a smooth vector valued function of its arguments. 
Furthermore, the self-adjointness conditions of [5] and [6] are 
difficult to check in practice, since in principle one needs to 
compute the state trajectories of the nonlinear system under 
consideration. The present paper gives the full generalization 
of the classical Helmholtz conditions to control systems de¬ 
scribed by (8). The resulting conditions are completely in terms 
of the mapping F and its partial derivatives, They are worked 
out in detail for control systems 

F(y, y (1) , ?/ 2 \w, « (1) ) = 0 (9) 


so that they have a Hamiltonian representation by a system 
of the form (6). To the knowledge of the authors, the only 
previous results in this direction are those given by two of the 
current authors [4], when dealing with the scalar input-scalar 
output version of the system (9). It is interesting to note that 
many problems associated with the preceding analysis coincide 
with those met in the study of time-varying linear systems; see, 
e.g., {2], [9], £10], [11], and [13], 


II. The Adjoint Variational System 

We consider analytic (i.e., C **’), complete, state-space sys¬ 
tems which may be written in the form 

2, i = w), y = h(x), u <= JT*, y 6 FF (10) 


where x = denote local coordinates for 

some state space manifold M and corresponding analytic 
input-output differential representations 

\/o F(y, y, ,y {N) ,u, u,--',v ( ' lf ~ 1) ) = 0€ll p . ( 11 ) 

If system (10) is minimal in the sense of Crouch and van der 
Schaft [5, ch. 3 for the input-affine case and ch. 6 for the gen¬ 
eral case], the corresponding representation (11) will also be 
called a minimal representation. Note that we do not insist hem 
on the relationship between state-space representations (10) 
and input-output differential representations (11); we assume 
that all conditions are met for obtaining one representation 
from the other representation. 

A variational system £" about a given trajectory 
( x(t ), u(t), y(t)) of 2 g is defined in the usual way (see 
15]) as 

v(t) = F(t)v(t) + G(t)u v (t), u v € R Tn , veR n 
y v (t) = H(t)v{t), y v e R p (12) 

with F(t) := §f (x(t), «(/)), G(t) := (f), «(/)). H(t) = 

!£(x(f)), and u tn y x denoting, respectively, the variational 
state, variational input, and variational output. Note that (12) 
results from differentiation of a one-parameter family of 
solutions to (10), cf. [5]. The adjoint variational system E”, 
along the same trajectory (j-( f), u(t), y(t )), is defined as (see 
[5]) 

m = - F T (t)p(t ) - H T (t)v a (t), a u eR», pe R n 
= G T {t)p(1), y a € R ni (13) 


with p, u a , y a denoting the adjoint variational state, input, 
and output, respectively. The fundamental connection between 
variational and adjoint variational systems (along the same 
trajectory of E*) is [5, Lemma 2.1] 

yl{t)>iv{t) - ul(t)y v {i) = ^jjP 7 (fMO- (14) 


Now let us translate this to input-output differential represen¬ 
tations E^o given by (11). Clearly, the variational systems 
EJ^ (along solutions 11 (f), y(t) of E,/ r> ) are defined by the 
system of equations 


dF t . OF . , . 

+ o^ Vv{i) + ■ 




OF 


1 ) 


—,^-"(1) = 0 (15) 


where the solution w(0>X/(0 is substituted in 
fu’if * ‘ ’■ Again, (15) results from differentiation of a one- 
parameter family of solutions (u(t, e), y(/, c)) to (11). Com¬ 
paring EV to E*' there is a potential problem since (see [20]) 
the set of solutions u v , y X} to (15) may be strictly larger than 
the set of solutions u v , y v generated by (12). This has to do 
with the form of the input-output differential representation 
(11) (note that this representation is far from unique); we will 
later on make an assumption on (11) which will eliminate this 
potential problem. 

The next logical question is how to define the adjoint 
variational system E^ 0 . This is not immediate from the 
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definition of Relation (14) should provide the clue to 
a proper definition, although the right-hand side of (14) is 
expressed in the variational and adjoint variational state. 
Direct calculation based on EJ and provides the following 
alternative to (14) 

J u I(T)H{T)$(T,a)G(s)u v (a)dsdt 

(16) 

with $(t, a) being the transition matrix of F(t) (and the 
variational and adjoint variational states initialized at 0 at 
time -oo). Although the right-hand side of (16) is an integral 
expression in u a , u v , this partly motivates the following 
definition (another motivation is provided by the classical 
definition of adjoint variational systems for sets of differential 
equations, cf. 114]). 

Definition I (see also [5]): Consider the variational system 
(15) along a solution (u(/), y(t)) of E t / 0 . The adjoint 
variational system consists of the following set of 
input-output pairs (M a (-), ?/«(■)) : th ere exists a function 

^(?/t ?/? ‘ ‘ ' j i v ^ j/v i ■ ‘ ' i v i a " j Va i IJa i ' ' ’ j ^flj 

u a , ■ • ■) such that for all i E R 

V?(f)Uv(0 - vlitM) = ( 17 ) 

fo r all input-output pairs f i/ t ,(■)) which are solutions of 

yv 

l ls‘ 

To justify this definition we first have to prove that Defini¬ 
tion 1 uniquely characterizes the adjoint variational system. 
Proposition 1: Suppose that there exists a function 

Q{U> ii, •■•,!/„, //,.,■ u, 

i/a 7 i/a ’ " " ’ j ‘ ‘ ‘) 

such that for all solutions (u t ,(/), //„(/)) to E ? ^ 0 and all w a (f) 

.v o T (/)u„(0-« n I '(/)i/,.(/) = ^Q (18) 

for all t € /l, then y a {t) = y a { 0> f and () = Q modulo 
a constant. 

Proof. Subtracting (17) from (18) yields 

(i/o - Ua) T "v = ^(Q - Q) 
for all u„. Hence 

f [v.(0 - y.(0] T «,(/)rff = IQ - QWi- (19) 
Ji i 

Now take a fixed function u„ on [/i, t 2 ] and corresponding 
fixed functions y„, y a on [t-i, t 2 ] and arbitrary u v . It follows 
that the left-hand side of (19) only depends on u„(tj) and 
i£ l ,(£ 2 ) (since the right-hand side does), thus implying that 
both sides of (19) are zero, and y a = y a and Q = Q modulo 
a constant. ■ 

The next thing we have to do is to show that Defini¬ 
tion 1 is consistent with the definition of the adjoint varia¬ 
tional system Comparing (17) to (14) we see that this 


means that p T v has to be expressible as a function Q of 
j/„. jf„, ■ ■ •, u,„ u v , • ■ • ,y a , Va, • • ■, «a. u 0 , • • ■ (and of course 
y, y, ■ • », u, «. • • ■)• To do so we make fundamental use of 
some results obtained by Ilchmann et al. f 10] on time-varying 
linear systems and Coron (3] and Sontag [17] on the relation 
between nonlinear state space systems L, and the variational 
systems EJ; see also [3], [18], and |7], Indeed, in [3], [17] the 
following is shown. Consider the minimal state space system 
Let 7 be an open interval of A, and denote by A"*) 

the set of smooth input functions u : I —» il m , equipped with 
the Whitney topology. Then the set of all u in C°°( 7; ft"*) 
such that all corresponding solutions (x(t), v(t)) of E, defined 
on 7 have the property that the variational systems (12) along 
(j-(r), t/(/)) satisfy 

dim span j - F(t)^j G(i)ur, w € ft m , * > 01 = n 

J (20) 

for all I € 7, is a dense subset of ft m ); see 13, Corol¬ 

lary 1.8]. (Note that the definition of the strong accessibility 
algebra used in [3] is the usual definition in the case of input- 
affine systems E,, while for general systems x =■ /(x, u) it 
corresponds exactly to the definition given in [5, ch. 6].) 

Furthermore, sec [3, Corollary 1.15], the set of all u in 
(7°"(/; ft” 1 ) such that all corresponding solutions (x((), u(t)) 
of E* defined on 1 have the property that the variational 
systems (12) along {r(t), u(l)) satisfy 

dim span/ + F 1 (/)^ H r {1)w\ w e R p , i > 0 ) = n 

(21) 

for all I e 7, is also a dense subset of C°°(J; R m ). Properties 
(20) and (21) express well-known controllability, respectively, 
observability, properties of the time-varying linear systems 
given by the variational systems E”, and thus the above 
statements imply, loosely speaking, that in casa E* is minimal 
then its variational systems are controllable and observable for 
a dense subset of input functions. 

To use now the fundamental results obtained in Ilchmann et 
al [10] and Ilchmann [11] we will now restrict to analytic 
(C“) input functions on the time-interval I. Since (10) is 
assumed to be analytic this will mean that the variational and 
adjoint variational systems (12), respectively (13), are analytic, 
i.e., the entries of F(t), G(i), and 77(f) are analytic functions 
on 7, Let 34 denote the meromorphic functions on 7, and 
denote by M\D) the set of polynomials 

Et# 

>=o 

in D with coefficients from 34 ( D will represent the dif¬ 
ferentiation operator 6 End (Ad), the algebra of A-linear 
maps from 34 to 34). Considering also the multiplication in 
End (34). we arrive at the skew-polynomial ring M[D] (see 
110], [11]) with multiplication rule 

DU 9) = fD(g) + DU)g = (fD + D{f))g, 

f,g€ End (34). (22) 
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The set of m x n matrices over M[D) will be denoted by 
M\D\ m * n . A useful property (see [10]) is that a left inverse 
of a square matrix in M\D) n * n is also a right inverse, and 
vice versa. 

Let us now denote the variational and adjoint variational 
system in shorthand notation by 


( / |“ F ) , '" GUt ' = 0 


1-77 + F T 
, lit 


u„ - Hu = 0 
^jp + H t u h = 0 
y a - G T p = 0. 


Throughout we will restrict to the dense subset of analytic 
input functions for which (23) satisfies the controllability and 
observability properties (20) and (21). Then by [10, Lemma 
3.3 and Theorem 6.4] there exists an invertible matrix 

U(dt) ^(St) c U[ni(n4m))((n+m) 

vii) ml 11 


such that 


u dt F ’ G \[va) za) 


= [In, 0] 


and an invertible matrix 


m Qd)], M\D)( t '+ p)x ^ 

m ml 11 


such that 


dt 


m) 


J [ s ift) 

*(i)l 


We will now apply the following “integration by parts” pro¬ 
cedure to (25), (26), This will be a key tool in all of our 
subsequent developments. 

Consider two matrices M(^), TV(^) over M[I)], of 
appropriate dimensions such that 




with C some constant matrix, say of dimension k x /. (Note 
that M(^) acts also on the entries of JV(^), cf. (22); the 
same of course applies to the expressions (25) and (26).) 

Let now £(/,) be an /-vector of analytic functions, and 
consider the /r-vector of analytic functions 


J d \ ( d \ 

\d1) \di / 


Now premultiply (28) by a fc-row vector of analytic functions 
V T (t)i i.e„ 


r/ r (/)Af 


(*Ms> 


and apply integration by parts to the integral 


(with [(), ( 2 ] C 1 ) in order to shift the differentiations on £(t) 
to differentiations on q{t). It follows that 

= (, T (t)N^^^M^Pjri(t)dt + remainders (30) 

for certain matrices N(£). M(-jj) over M[D) (in fact, M 
and N are dimensioned as M T , respectively N T .) Futhermore, 
from (27) and (30) it follows that the differential operator 
N(i'i)M(i) equals the constant linear mapping C T on all 
functions r/(/j of support within {i i, 1?). However, this means 
that equals C T , and thus the remainders in (30) 

are necessarily zero. 

Applying this procedure to (25) and (26) yields 

k(4) z(4) J[ J~M 

mi) "(i) « II" 

respectively, for invertible matrices ^ ^]'[q /j] obtained 

from \ ^ j by partial integration. 

The action of the differential opcratoi on (23) thus 

results in the equivalenl system of equations 

-PGu l -f Sij t - v 

-QGu t + Rij, = 0 (33) 

and, similarly, a combination of (31) and (24) yields 
VH l u tt - Vy (l = p 

WH T U a - Zifa -- 0 (34) 

It thus follows that i;(/) = -/*($)(»(/) MO 4- (/) 

and p{t) - IJ(£)H 7 (t)u a {t) - V{£)t/ (l (t), implying that 
0 )°(0 appearing in the right-hand side of (14) can be 
expressed as a function Q in //„, u ,, u n , tj u and their time- 
derivatives (and implicitly of //, i/, ■ , u , i/, • • ). This shows 
that Definition 1 is consistent with the definition of the adjoint 
variational system E". Furthermore, as additional informa¬ 
tion we obtain from (33), (34) that input-output differential 
representations of E^’ and E“ are given by 

-Q^PjG(t)u„(t) + R(Pj </„(/) = 0 (35) 

respectively 

^(^)^ T( ° Ua(0 ~ z { ji ) ya[i) = °- (36) 

Now compare (35) to (15), and write (15) in a more convenient 
notation as 

Hie requirement, as alluded to before, that the set of solutions 
u v , y v generated by E" equals the solution set of E)’/ 0 can 
thus be rephrased as the following. 
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Assumption1: There exists an invertible matrix E(4r) C 
M[DY*v such that 

From now on we will suppose throughout that Assumption 1 
holds. 

Our next objective is to give a procedure to compute an 
input-output differential representation of the adjoint varia¬ 
tional system directly in terms of the input-output differential 
representation EJ/ () of the variational system (without going 
through the state space representation). Thus, let us consider 
EJ’^ as given by (15), denoted in shorthand notation by (37). 
Premultiply (37) by a /^-dimensional row-vector C (!) and 
again apply partial integration to 


=s r {,r( ' ) Ki) fc( ' ) -' v (s)”- 


(0 


dt (39) 


U) shift all differentiations in //,,((), it, (t) to differentiation in 
((f). This results in 


n 


£(') 


= / \'/, 7 (0 
.h, 

+ R(a. i),ii,■■■ ,v, 


d) 



for certain matrices I ), N in M[D], and remainders B. 

Comparing this to (17) (with Q playing the role of the 
remainders li) motivates the following definition of the adjoint 
variational system by 


„ a (1) = D(j j )((l) 

Uu(1) = N(f i )((l). (41) 


Note that ( 41 ) is an image representation, in contrast with 
the kernel representation (37). (For a linear time-invariant 
system (41) corresponds to a right factorisation, while (37) 
corresponds to a left factorization of the transfer matrix.) 
Indeed, the (analytic) input-output behavior of ( 41 ) consists 
of all analytic time-functions ?/„(/), u a (t) satisfying ( 41 ) for 
some analytic function (,(/). 

Theorem J: The equations (41) are an image representation 
of the adjoint variational system defined in Definition 1 . 

Proof: We only have to show that Ii in (40) can also be 
expressed as a function of y, /y, ■ • •, ii, ii, - ■ ■, y v , y v , ■ ■ ■, u v , 
it,', , y u , y a , • • •, u a , u a , • • •• 

From (25) we obtain 


i£-f -a o 

-H 0 I p 


U 0 
V 0 
0 Z pJ 


K I p 


(42) 


for some K in A4[D]. Hence, from (32) and (42) we obtain 


P S 

Q li 


I&-F -G 
-H 0 


-I„ -PC S 
0 -QG R 


= \P S 

L Q 


In 

-K 


0 

hi 


*PJ 

0 

0 


0 

0 

hi 


( 43 ) 


Clearly the right-hand side of (43) is an invertible matrix, and 
thus “postmultiplication” of (43) by this inverse yields 

1441 

for some matrices ,4(^), B{j- t ) in M[D ]. Now recall that 
an input-output differential representation of is given by 
(35), while also (38) holds. Thus there also exist matrices 

Mi )’ ) in M \ D \ such ,hat 

KsM*)'"(*)*(»)■''■ (45) 

Applying the partial integration procedure (sec (27)—(30)J to 
(45) yields 

'‘(sMwHGfM®) - '' <46) 

for certain matrices A , B in M[l)\, and with D, N as given 
by (41). Thus, by (41) 

A tit ^) ,y ” (/) 


showing that ^ can be expressed into t/„, u (l and their time- 
derivatives. ■ 

Remark I The above notions seem to be also useful for 
analyzing the controllability properties of an input-output 
differential system (11). Consider the variational and adjoint 
variational systems of (11) given by (37), respectively (41), 
where the entries of the matrix differential operators D ) N t 
and I), N are seen as functions of ,t/. y, * ■ -, u, u, * ■ •. Then 
one may construct matrix differential operators D a , N Q (with 
entries depending on </. </,-••, w, ii. - ■) of maximal rank such 
that D (I N - N a D = 0, implying that 

for all trajectories j/ a . y a generated by (41). Integration by 
parts applied to the kernel representation (47) yields the 
“adjoint of the adjoint system,” given in image representation 

V v (l) = Da(^)0(t) 

y v (t) = N a (J^r)(i) ( 48 ) 



m 
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whore we have suggestively denoted die inputs end outputs by 
ttt, «nd y v , since we went to compare (48) to the variational 
system (37). Indeed, it is easily checked that all trajectories 
«v, y v generated by (48) satisfy (37), while equality of the 
behavior of (48) and (37) seems to correspond to some form 
of controllability of the original nonlinear system (11) [(48) 
defines the "controllable" part of the system]. This is an area 
for future research. 

In the next section we will use the representation of the 
adjoint variational system E“/ e as given by (41) in order to 
give a convenient characterization of Hamiltonian systems. 

III. Characterization of Hamiltonian Systems from 
the Input-Output Differential Representation 

We will derive m this section a complete characterization 
of Hamiltonian systems using the representation of the adjoint 
variational systems as in (41). In Secuon IV we will use an 
alternative approach based on the adjoint variational system 
found in [5]. Note that for systems described as 

£./o F{y , V , V, u, u) = 0 € R, ueR, y e R (49) 

the conditions under which (49) represents a Hamiltonian 
system have been found already in an earlier paper by Crouch 
and Lamnabhi [4], i.e., 

i) = 0 

ill SE(ASK _ (UL\ _ OFd_oF_ ( 5 °) 

' &u ydt Dy dy J dy dt du 


and 

Xy 0 : Ay v + By v + Cy v + Du u + Eu v = 0, 

u v ,y v € R m (53) 


where the (i, fc)th elements of the m x m matrices 
j4, B, C, D, E are given by 


Atk = 
F>ik = 


dy *’ 
db\ 
dv k ’ 


R 

Blk ~ dy k ' 
Etk ~ dii k ' 



(54) 


To compute the adjoint variational system E“/ 0 two con¬ 
secutive partial integrations yield (we are using summation 
convention) 


r f 2 

0 = / + B lk y k + C lk y k + D lk u k + E,kU k ]dt 

Jti 

= r lVA tk yt - (FB.^yt + (CC, k ) (2) y k 

Jt, 

+ Z'D lk u k v -(S'E lk ) {1 '> u k v ]dt 

+1 VR,ku k v + vc lk y k f CE lk u k - (re* ) (1 V |{; 

(55) 


Thus the adjoint variational system is 

K = S l A lk -{CB lk )W + (CC tk )W 
\y k = -?D tk + (CE tL ) li) 

A = i, 


E“ 


/«* 


, m (56) 


for every solution (i/, y , y , u, u) satisfying (49). 

Using the characterization of the adjoint variational system 
given in Section II we now obtain similar conditions for a 
genera] mput-output differential representation (II). 

Theorem 2 Consider a minimal input-output differential 
representation Z t / 0 given by (11) with p ~ m, and its 
variational systems given by (37) satisfying Assumption 
1. Compute the adjoint variational system given by (41). 
Then T, v f 0 is an mput-output representation of a Hamiltonian 
system if and only if 

b (sM 5 ) _W (bMs)"° ,S1) 

along every analytic solution (y(t), u(t)) of E,/ 0 . 

Proof- Observe that (51) is equivalent to EV = E“^ 0 , 
i,e., the input-output behavior defined by (37) is tne same as 
the input-output behavior defined by (41). In the terminology 
of [5], [6] this means that every variational system E” along an 
analytic solution of E,/ 0 is self-adjoint. In [5, ch. 4] it is shown 
that E a is Hamiltonian if and only if every variational system 
along trajectories resulting from piecewise constant inputs are 
self-adjoint. We finally note that by the Approximation Lemma 
[19, Lemma 1J the approximation of piecewise constant input 
functions by analytic input functions will result in state trajec¬ 
tories converging to the state trajectories corresponding to the 
piecewise constant input functions. ■ 

We will now work out in detail the self-adjointness condition 
(51) in case N - 2, i.e., we consider 

E./« : F{y,y,y,u,u)^ 0 eR m , 


while Q is given by the terms between the brackets | | The 
self-adjointness condition (51) now becomes 

m 

£ - A jk D lk ? + A jk (E, k n {1) ~ 

fc=1 

+ B Jk (E lk ?) (2) - (D lk V) W + ( ’,*(£,tO (J) 

+ D jk A lk F - D jk (B lk e) {l) + O lL (C t *^*) (2) 

+ Ejk(A lk ?) {1) - E jk (B, k t')W + E jk (C tk ?)W = 0 

(57) 

along every solution (u(f), y(f)) of This gives the 

following conditions. 

Terms with 

rn 

'£ l [C jk E, k + E jk C, k ] = 0 

fe=i 

Terms with (£’)( 2 ) 

[B 3 kE tk - C lk D lk + 'iC jk E, k + D ]k C, k 

k=l 

- E lk B,k + 3 E jk C lk ] = 0. 

Terms with (CY 1 ' 1 

rn 

y * [AjkE-tk “ 4" 2 BjkEik — 

l 

+ SCjk&ik ” DjkBik + 2DjkCtk + EjkAtk 

- 2E 3 kB % k + SEjkC t k] = 0. 


m y € R m (52) 
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Terms with C- 


and 


T! I AjkDtk + Afk^ik ~jk + BjkEfit — Cjt-Dfk 

fc*J 

+ C]kE$ + DjkAtk - DjkB,k + DjkC,k 
+ EjkAtk — EfkB t k + EjkCj^] = 0. 
t,j - l,-,m. 

Hence, we have the following theorem. 

Theorem 3: Consider a minimal system (52). Then it is 
Hamiltonian if and only if the following conditions 

Ci) CE T + EC T = 0, 

c 2 ) be t -cd t +3CE t +dc t -eb 1 +3Ec t =o, 

£.,) AE t - BD T + 2 BE T - 2 Ct) T + 3 CE r 

-DB T + 2 DS T + EA T - 2 EB T + 3EC T = 0, 
£ 4 ) - AD 7 + AE T - BD T + BE 7 - CD 7 ' 

+CE^ T + DA T - DB t + DC T 
+EA T - EB T + ECW =0 

(58) 

hold along every solution (u(t). y(l)) of (52). 


0J’\ (1) d 2 F »v d 2 F .. d 2 F . &F . 
du ) dudydydu* + dydu^ du 2 U 


and collecting terms first with y^ and then v we obtain 


I 


du By 2 tfudy By ~~ 

of <Ff _ a^yr _ n 

Ou OuOy Ftp i)y 


( 60 ) 


which means that the elements of the matrix given in (59) only 
depend on y, y (the state of the system), and thus (59) defines a 
symplectic form uj on the state space y)\y € 12 , y € /l} 
with coordinate expression 


u 




) 


Case 2’ Let us assume that the input-output representation 
F t , / = 1, ■, m, have the following particular form 


Elaboration of the Conditions C\, £ 2 , 

£j, and £4 m Special Cases 

Case 1: Let us first consider the case w = L i.e., (47). If 
we assume C 1 — ^ ^ 0 , then £] yields 



which is the condition (50)-i). Then £2 reduces to 
tfFdF dEd£ _ 0 
dy du dn dy 

which is automatically satisfied. Furthermore £3 amounts to 

-2 —((—) (1, =o 

dy du du dy dy \ du ) du \ dy ) 
or 

dy du dy \ du ) du \dy ) 

which is the condition (50)-ii), and it is easily checked that 
£4 is precisely the time-derivative of (50)-ii). 

Let us now derive a more explicit expression for the 
remainder Q in this particular case. Since E = 0 , from (55) 

Q = CBy v + £Cy v - (£<7) (1 >//„ 


and from (56) 

£ = -D~ l y a . 


If the conditions (50) hold we readily obtain 


Q = [Va, Vaf 


0 

C/D' 

V 

-c/p 

0 



Now let us take a closer look at (50). Writing out 


(59) 


dF \ (,) d 2 F /a\ d 2 F .. d 2 F . 0 2 F . 

_ 1 +- -v + -y -f 

dy) dy 9 V dyd$ y dydjj dudy 


F*(y, y , fh a, u) = S t (y, y, y) - u t . (61) 

This is the classical case (see [14|). Conditions £ 1 , £ 2l £ 3 . 
and £4 reduce to 


£ 1 ) void 



at every point ( y , y , y). 

These conditions are precisely the conditions (2.1.17) in 
[14]. It follows that (61) represents the input-output behavior 
ol a Hamiltonian system if and only if 

s 1 = R^iy* y)ii k + r l\(y , y), 1 = 

where R ,a and 1) satisfy (2.2.9) in [14J. 

Case 3 1 


Ft(y, y, y< n, u) = S t (/y, y) - u lf 4 = 1,• • • ,m. 


From the result of Santilli, it follows that 

S, = X,k{y)y K +Y t (y), 1 = 1, 

where X^ and Y t satisfy 

dX tJ . OAj j, , BXl , _ ^ 
^ + Tip - u 


,m 


Thus X = ( X,j) defines the symplectic form w. This also 
follows from (18), since y* = £ k and Q = y‘ 0 Xikyt- 
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Example 1: As an example we consider the system given 
by die equations 

q = u 

p = sin q + ap 
: y = p- 

Hie corresponding input-output differential equation is given 
by 

F(y, y , ii, «) ~{y- ay) 2 + {y- ay) 2 u 2 - u 2 = 0. (62) 


nearly, by inspecting the state representation of the system, 
we see that the system is Hamiltonian with the Hamiltonian 
function H = cos <7 + up, if and only if a = 0. This is 
not so evident, however, from the form of the input-output 
differential equation above. We shall employ condition (50) 
of Crouch and Lamnabhi [4] to show this. Condition i), 
OF/dii = 0, is trivially satisfied. We calculate the quantity 

d_dF_ _i)F] _ 
dt dy Oy Oy dt du 

directly from F to obtain 

Z = (—4«)(1 - (y - at /) 2 )[(// J) - ay) 

+ a(y - ay) - u 2 (y - oy)] 

- 4 (i/ - ay)\2u(y - ny)(y - ay) 

- «(1 - (,</ - try) 2 )]. 



From F(y, y, y, u, ii) = 0 we obtain 


(j/ 3) - ay) = 


tJU 

(y - ay) 


- (y - ay)u 2 - 


ini{y - arf 
(>j ~ ay) 


Substituting this expression into the expression for Z and again 
using the definition of F, we see that Z = 0 if and only if 

au(l - (// - ay) 2 )(y - nty) s 0. 


This equation can be satisfied for all j/(0), y( 0), u(0) only if 
a =* 0, so we conclude that the system (62) is Hamiltonian if 
and only if a = 0, as we previously concluded. 

It is also interesting to compare the adjoint system with 
equation (62). The variational system is given by 

«*((£ - ay) 2 v - u) + y v (-av 2 (y - ay)) 

+ yo(u 2 (y - a y) - «(» - ay)) + iiv(y - «(?/)) = o. 

(63) 


Using the method of integration by parts introduced in (56), 
we see that the adjoint system is given by the equation 

y u = S((y-ay) 2 it-v) 


Ua = £au 2 (y - ay) ( u 2 (y - ay) - a\y-ay)) -{(y-ay). 

Thus £ = lfa/((y — ay) 2 u - u). An explicit expression 
for the adjoint variational system in input-output differential 
representation (34) is obtained by eliminating £ from the equa¬ 
tion above. Our theory guarantees that the resulting equation 
coincides with equation (63) if and only if a = 0. 


Example 2. In the case of a linear time-invariant system 

Ay + By + Cy + Dv + Eu = 0, u, y 6 R m 

with A , B, C, D, E constant m x m matrices, the conditions 
C\, £ 2 , £ 3 , £4 reduce to 

CE 1 + EC T = 0 
BE t - CD 7 + DC t - ED 7 = 0 
AE r - BD 7 - DB 7 + EA T = 0 


-AD t + DA 1 = 0, 


which is equivalently to the equality 

(.d + Es)(a t - b 1 . * + rV) 

= (A + Bs + Cs 2 )^) 1 -E T x) 


for all s e C. This last equality m turn is equivalent to the 
transfer matrix (7(s) = (A + Bi s + £\s 2 )~ J (Z) + E. s) of the 
system to satisfy the condition (cf. |11, [15]) C(s) = G T (~. s). 

Example 3 It can be straightforwardly checked that the 
Euler-Lagrange equations with external iorces u 


1 

dt 




u, ye R" 


for any Lagrangian function L(tp tf) satisfy (56) (sec also 
Santilli 1141). 

Consider now an Euler-Lagrange system 



in interconnection with a static nonhneanty N (see van 
der Schaft |15) for details of interconnections), and assume 
for simplicity that L(y . iji) = ^viy 2 — while the 

nonlinearity N is described by a differentiable Junction yy> = 
h(v 2 ). The interconnected system with outputs tji. y 2 and 
inputs i4i, ti 2 is given as (after elimination of uy and v 2 ) 

.. c)V . 

w 0i + Tjj (//l) " 02 “ 11 1 = 0 

oy i 

Ms/i + « 2 ) - 02 = 0. 


Computing the matrices A, li. C, D, E as in (51) yields 

A _ r _i i 

•^r(.vi + ' i 2 ) -1.' 

r= m 0 
00 

D= 0 

l 0 + "2) 

while both B and E are zero. It is readily checked that condi¬ 
tions Ci, C 2 , C 3 , £4 are satisfied, and thus the interconnected 
system is a Hamiltonian system for every scalar differentiable 
nonlinearity j/2 = h(v 2). 

In the multivariable case with y\, y 2 , «i, u 2 , vi, v 2 6 R m 
and L(yi, in) - %yfMyi - V(j/ x ), with M *= Af 1 > 0, 
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rt is straightforwardly checked that the interconnected system 
satisfies C\, £ 2 , £ 3 , £4 if and only if the nonlinearity yi = 

h(v 2 ) satisfies the integrability condition = ( 577 ) .and 
thus there exists (locally) a potential function P{v 2 ) such that 
«2 = 

IV. Characterization op Hamiltonian Systems 
from the State Space Representation 

In this section we derive another formulation of the criteria 
for self adjointness of the variational system (12) correspond¬ 
ing to an input-output differential representation (11). In this 
derivation, however, we work directly with the state space 
representations of the variational system (12) rather than 
the input-output differential representation (15) (which we 
did in the previous section). To do this we obtain a direct 
correspondence between the representations (12) and (15). 

We first observe that the input-output map of the variational 
system EJ,' in equation (12) may be expressed in the form 

jy. (0 =- / tt', (*, <7, Jo)u v ((T)(l(T (64) 

J o 

where we assume that />(()) = 0. and .ro is the initial condition 
of the corresponding state space system (10). We also note 
that the input-output map of the adjoint variational system 
in (13) is expressed in the form 

//«(/) = - f Wj'ia, /, u, ui\)u n {(r)tb (65) 
J o 

where we assume that p( 0) = 0. Thus (as formulated in |5], 
|6|) self-adjointness of the variational systems may be simply 
expressed as the statement 

W v (t. rr, u, .r ( |) = -VT/(f7, /, u , Jo) (66) 

for all / > a > 0, all piecewise constant controls tz, and one 
initial state j 0 . A»s we argued above, it is sufficient to check 
this identity for analytic controls //. Using the notation of (12), 
we may express the kernel \V v in the form 


and we define the sequence of time varying n x m matrices 
Ejt(t) by setting 

= «(/): 

So (t) = G(f). k > 0. (69) 

Noting that 

a) = F(/) $(/,*); 

~^(t. a) = -*(/. a)F(a) 
we easily obtain 
c) k 

(0*ON *) = T \m(t, a); k > 0 (70) 

i) k 

-j^Ht.a)G( «T) = (-l) l *(/,«T)E 4 (ff): k > 0. (71) 

Lemma I In the case of analytic data the identity (67) is 
equivalent to the sequence of identities 

(-i) A+,fl r*(/)E/(/) = s*(/) r r,(o 7 ; kj> o. (72) 

Proof: By applying (70) and (71) to (67) we obtain 

(-l)T k m(t, = -(~I)*E k (t) T *(a, t) r r,(a) 7 . 

By setting / =. ct wc obtain (72). Conversely, from (72) and 
analyticity we recover the identity 

(-1 )* + T t (f)*(f, a)G(a) = E l (/) r «&(a, t) r H(a) T . 

For k = 0, this is the desired identity (67). ■ 

Out main interest is to show that wc may replace the infinite 
set of conditions represented by (72), with a suitable finite 
subset. We require the following results. 

Lemma 2. Consider a time varying state space system 

i = F(t)x4 G(t)u, j(0) = 0, ./ e R l 
y=H{l)r. (73) 

Then 



W, (/, a. .r () ) = *)&(*)(= W, (I, a)) 

where $(/. a) is the transition matrix of F(t). For analytic 
controls i/, H, <1>. and G are all analytic in their arguments. 
Thus the sclf-adjoinlncss criteria (66) may be expressed in the 
form 

//(/)*(/, ff)G(ff) = —G(t) 1 $(cr, t) T II(1) T 

I >a> 0. (67) 

Our first task is to give an equivalent formulation of the 
conditions (67) in terms of standard (time varying) linear 
system objects. We define the sequence of time varying p x n 
matrices 1 \.(f) by setting 

r ^ )= ( 7 l + i?r(/) )" /iT(0; 

r 0 (/) = #(<)<* >o (68) 


+ [ T p (L)$(ti a)G((T)u(<r)(icr. (74) 

Proof: Since F 0 = H(t ). the statement (74) is true for 
p = 0. Assume by induction that (74) is true. Then 

d^ } , x 
,hp+ 

JLJ 

= + r„(0c(0«(0 

k =(1 

+ [ r,, + i(/)$(t, a)G(a)u(cr)da 

J o 

= (r,>-*£<,«)(/) + r,(/)E 0 (Ott(t) 

+ f F p +i(t)$(L <t)G(<j)il((j) da, 

Jo 

This is, then, the identity (74) with p replaced by p + 1. m 
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Lemma 3: Hie input-output map of tfy? controllable time 
varying linear system (73) satisfies the system 

£ A k (t)yW{t) = £ B k (t)uM(t) (75) 


fc=0 

if and only if 


fc =0 


N 


(76) 


52 ^*(t)r fc (t) = o 

fc=0 

Bk(t) = + £ J 4 9 +I (t)^(r jl _ fc (t)Eo(f)) (, ’-'* ) (j). (77) 

„=l n= Jfc V / 


q=k 


p=k 


Proof: The input-output map of system (73) is given by 

y(t)= f <t)G{<t)u{ct)cI(t. 

J o 

If this satisfies (75), we obtain the following expression using 
lemma ( 2 ) 

N p-1 

J >=1 fc =0 

N 

+5;A p (t)r P (/) 

7>-0 
N-l 

- £ fl fc «)« (fc) (0 = 0. (78) 


s 


d»(t, o)G(a)u(o) da 


Thus 

N .t 

V'A,,(t)r p (f) / $((, cr)G((r)u(o) da = 0 
Jo 

E^(€E( r H-‘ E ») (Ur) w u(,, (') 

p—1 k—0 r=0 

- j; B fc (/)«^(o = o. 

fc*0 

Controllability of system (73) and reordering summations now 
yields the desired identities (76) and (77). Conversely, the 
identities (76) and (77) yield (78) which by Lemma 2 ensures 
that the input-output map of (73) satisfies (75) as desired. ■ 
We wish to employ Lemma 3 in the context of the varia¬ 
tional system E" in ( 12 ) and the corresponding input-output 
representation EJ / 0 in (15). However, (76) is not written in 
terms of purely input-output quantities which we require for 
our purposes. We therefore make the following observation. 
If the system (73) is controllable, in the sense that 



rank[E< 1 (t)|E 1 (t)|,---|£N-i(')] = / (79) 


for all limes t and all controls «, then the condition (76) may 
be replaced by die equivalent condition 

N 

^i4 fc (t)r fc (t)[Eo(Ol^i(()l • ■ ■ |£jv-i(<)] = 0 . (80) 

We may now state and prove our main result in this section. 


Theorem 4: Consider a minimal input-output differential 
representation E,y 0 , given by (11) with p = m, associated 
minimal state space representation E, given by (11), and the 
associated variational system EJ|. Then under the assumption 
(1) and the assumption that is invertible for all analytic 
solutions (u, y) of (11), E,/ 0 is an input-output representation 
of a Hamiltonian system E, if and only if 

(-i)* +,+1 r fc (t)E,(0 = s fe (t) r r,(t) r ; o <k,i<N 

(81) 

for all t and all analytic controls v. 

Proof: As in [5]and |6], we know that E s is Hamiltonian 
if and only if (66) holds for all analytic controls u. We have 
shown that these conditions are equivalent to the conditions 
(72), (Lemma 1). These conditions imply those of (81), hence 
establishing the necessity of the conditions (81). To prove 
necessity we argue as follows. We first compute the quantities 
Efc and Ffc for the adjoint variational system E“ in (13). We 
make the substitutions 

F- 4 -F 1 - G —► —H r \ H —> G 1 
in definitions (68) and (69). We obtain 

£S(0 = (-!)* (/^ + F(/) T )Vtf(/) r ) 

= (-i) fc+ 1 rj;(f) T (= (~i) k+ 1 r k (t) T ) 

Tf(t)=(^Jj t -bV)^G(i) 

= (-i )*E£(i)(= (-!)%(/)). 


Thus 

ES(0 = (-l) t+l r fc (f) r ; r a k (t) = (-l) k E k (tj J (82) 

We wish to generate conditions under which the in- 
put-output map (65) of E£ coincides with that of the 
input-output map (64) of E". By Assumption 1, the set of 
solutions ( y v , u v ) generated by E" equals the set of solutions 
of E|y o represented by (15) or 

N N-l 

T A k (t)y[ k \l) = T B K (t)u«\t). (83) 

*•=0 A -=0 

(We also have that the set of solutions (y v . u v ) of E*,’ 
corresponding to zero initial conditions v( 0 ) = 0 , is equal 
to the subspace through the origin of the set of solutions of 
E V.) Hence, we may check self-adjointness of E", simply by 
checking that the input-output map (65) of E“ satisfies (83). 
By Lemma 3, however, the input-output map (65) satisfies 
(83) if and only if the (76) and (77) hold with T*. and 
E k replaced by rjj and Eg, given in (82). Note that the 
controllability assumption required in Lemma 3 is translated 
into controllability of E“, which is simply observability of 
E“. Moreover, we may substitute the condition (76) by (80) 
as long as the condition (79) holds for E J . Thus we obtain the 
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following conditions by substituting (82) into (80) and ( 77 ) 
N 

Y^A k wm T (-v k i-m) T \m) T \ 


-r 2 (() T '--.|(-i) Jv r J v- 1 w r ] = o 


B k (t) = f^A q+l (t)^(-ir- k+i 

q—k p—k 

■(SJ-*(0r o «) T )^-* ) ^. (84) 

We now note that conditions (81) are indeed sufficient to 
ensure that (84) may be simply rewritten as (77) and (80). 
These are satisfied by virtue of the fact that by assumption the 
solutions (y v . n v ) of E* are solutions of Ejy w . It follows that 
we have shown sufficiency of the conditions (81), under the 
apparent further assumption that (79) holds for E“. 

To evaluate the further assumption it is useful to make an 
explicit construction, which is required later on. It is easily 
seen that we may rewrite the system E.[y fj given in (83) in 
the form 

n /v-i 

i)y,W k> = £ (ih(i)u,M) k 

k- 0 *• = (! 

where A\’(L) - A^(t) = 7 . (Clearly the matrices A k and 

Ih are related to the operators D and N defining the adjoint 
variational system in (41).) Under the assumption that 
is invertible we may rewrite T”' in the fonn 

JV -1 TV “ 1 

vi N \l) + £ = E (B k (t)u v (t.)) k . (85) 

k~- 0 k.s= 0 

In this form we may write down the observable canonical form 
for the system (85), which will be a particular realization of EJ,’ 

/-I,v I 0 

— A s -2(0 U I 

V.J:,\7 \ -Ml) <1 <» 


y(t) = (/, 0, ■ • • ,0)(-T'i, • ■ -,Xfi) T . ( 86 ) 

Clearly this time varying system satisfies the condition 

Rank [r 0 («) T |r 1 (f) 7 ’| • • ■ |r W -i(<) T ] = Np (87) 

for all / and all analytic controls u. Thus the corresponding 
adjoint system does satisfy the controllability condition (79). 
Note that the conditions (81) are independent of the particular 
realization of EJ' which is chosen, and in particular there is no 
necessity for the chosen realization to be minimal (as a time 
varying system). ■ 


+ 


\ 

X) 


0/ \xs / 
[Bn- lit) \ 

B 0 (t) J 


MO 


Finally in this section we point out the relationship between 
the conditions (81) and the conditions (49) derived in the 
previous section. Although the conditions (81) are apparently 
expressed in terms of the system E*,or any other realization, 
we may interpret them directly in terms of E]y 0 . In particular 
we may apply the conditions (81) to the realization (85) 
constructed above. The presentation of the resulting conditions 
on the matrices Ak(t), Bk(t) defining as in (83) turns out 
to be different, but equivalent, to the conditions obtained by 
applying Theorem 2 in the previous section. We demonstrate 
the conditions obtained in this section on the system (52), 
We write the corresponding variational system in the form 
of (53) but assume C = / m , the identity matrix for ease 
of explanation. (The general conditions may be obtained by 
replacing A, 0, D, and E by (7~M, C'~ l B, C" V D* and 
G~ l E, respectively.) Now if the variational system is written 


A(t)y v 4- B(t)y v + Vv = ~E(t)u v ~ 0(t)u v 

this may be rewritten in the form of (85) as follows 

Thus the corresponding observable canonical form ( 86 ) be¬ 
comes 

-B{1) l\(.rA ( -E(t) \ 

<>y W v 




W) - Mi) 

y(l) = (/. 0)(:c, :r 2 ) T . 


( 88 ) 


(89) 


The first condition in (81) is simply 

-r„ (0Eu(0 - s 0 (/.) 7 T 0 (/) T 

or simply 

- H(l)G(i ) = G(t) T H(l) T . 

Substituting from ( 88 ) wc obtain 

m = 

Replacing E by C l E we obtain 

E(t)C{t) r + C(t)E(t) T = 0 . 

But this is simply C\, in Theorem 3. 

The next condition in (81) is simply 

r, (<)£»(*) = Si(/.) 7 'r 0 (o T 

or simply (H(t)+H{t.)F(t.))G(t) = ~{G(t) T -G(t) T F(t) T ) 
H 7 (t) or since H(1.) = 0 

H(t)F(l)G(l.) = G(t) T F(t) T H(t) T - G(t) T H(t) T . 

Substituting from ( 88 ) we obtain 

[/. 0] 

= \-E T , E r - D r ] 

- [-£ r , e t - b) 


-B I 
B-A 0 


-E 

E-D 


-B t B t - A r 
1 0 

1 


0 
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which simply reduces to the condition 

B(t)E{t)-E(t) 7 B{t) T +D{t) 7 -D(1)+E{t)-2E{t) 7 =0 

Replacing E, B, and D by C~ 1 E, C~' 1 B, and C~ i D, We 
obtain 

C^BC-'E - E T C~ T B T C~ 7 + D T C~ r - r _1 D 

+ (C- 1 #) - 2(C~ l E) T = 0 

However, from (89) we have 

C~ l E- -E 7 C~ r 

By expanding the above expression and using this identity we 
obtain 

- C~ x BE T C~ r + C~ l EB r C~ T + D 7 C~ r 
-C-'D + C 1 CE t C 1 +C~ 1 E 

- 2C~' EC 1 C~ T - 2E t C~ 7 = 0 

which may be reexpressed, using the mvertibility of C, as 

-BE r + EB 7 + CD 1 - DC t + CE t + EC r 

- 2 EC T - 2CE 1 = 0 

By differentiating (89), however, we obtain 

C'E t + EC T = -EC 1 - CE 1 

We therefore obtain the expression 

-BE 1 + EB t + CD t - DC 7 - iEC 1 - SCE 1 -= 0 

Now it is easily seen that this is just condition Co in Theorem 
3 Clearly, the remaining conditions C\ and £4 will be 
contained in the conditions (81) for N = 2 
We make two final comments on (81) Clearly, by inspecting 
(77) and (80), the number ol conditions in (81) may be reduced 
to TV > A > 0 , TV - 1 > l > 0, A > l For N = 2 this results 
in five conditions, but we already know from Theorem 3 that 
tn the case TV = 2 , there are only four independent conditions 
Thus we expect even the reduced set of conditions (81) to 
include many redundancies Furthermore, Theorem 4 requires 
the assumption that is invertible, which Theorem 2 does 
not 

Conditions (81) do provide a satisfying generalization of 
the Brockett and Rahami result, discussed in the introduction 
In particular, the condition that system (4) is Hamiltonian is 
simply given by the self-adjointness condition 

Ce x( *- a) B = -B T <- Al{t - rr h' 1 

By setting I\ = CA L , D*. = A k B, it is dear that self- 
adjomtnesv is indeed equivalent to condition (81) for N = 

ii - 1 
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Language Convergence in 
Controlled Discrete-Event Systems 

Yosef M. Willner and Michael Heymann 


Abstract —Discrete-event systems modeled as state machines 
in the framework of Ravnadge and Wonham are considered. In 
this paper, convergence of the language generated or marked 
by the system to a specified legal language is investigated. The 
convergence property is studied both with respect to open-loop 
(uncontrolled) systems and with respect to controlled systems. 
In the latter case, both questions of existence and synthesis of 
supervisors that force convergence are examined. Algorithms 
for verification of convergence and for synthesis of stabilizing 
supervisors are provided. Finally, the concept of asymptotic 
behavior of systems is defined, and questions related to this 
concept are investigated. 

I. Introduction 

I N the main paradigm of supervisory control of discrete- 
event systems (DES) |l]-[3], the system is modeled as a 
finite automaton that executes state transitions in response 
to a stream of events that occur nondeterministically and 
asynchronously. The supervisory control problem consists of 
synthesizing a supervisor whose task is to disable events in an 
orderly fashion, from a specified subset of controlled events, 
so as to confine the behavior of the supervised system to within 
a specified legal language. It is generally further required that 
the synthesized supervisor be minimally restrictive in the sense 
that, to achieve legal behavior, the smallest number of events 
is disabled. 

There are situations, however, when legal supervision is 
either impossible or impractical. For example, when a system 
failure occurs, it may be driven to a state at which it is im¬ 
possible to prevent the occurrence of illegal event sequences. 
Furthermore, even when the behavior of the system can be 
confined to within legal bounds, such a constraint may lead 
to the design of a supervisor that is overly restrictive. In such 
situations the question of convergence of the system to the 
legal behavior is of great interest. Intuitively, wc say that the 
system converges to the legal behavior, if, after a finite and 
bounded number of transitions, it begins to behave legally. The 
supervisory control task is then to force such convergence to 
take place. 

In 14] and [5] the problem of stabilization of discrete- 
event systems has been introduced, and the stabilization, 
or convergence, concept was presented in terms of legal 

Manuscript received April 20, 1992; revised May 31, 1994 Recommended 
by Past Associate Editor, P. J. Ramadge, Thus work was supported in part by 
the Technion Fund for Promotion of Research. 

Y Willner is with the Department of Electrical Engineering, Tech¬ 
nion—Israel Institute of Technology, Haifa 32000, Israel 
M. Heymann is with the Department of Computer Science, Tech¬ 
nion—Israel Institute of Technology, Haifa 32000, Israel. 

IEEE Log Number 9408278. 


and illegal states of the system. In [4| legal behavior was 
specified by a set T of legal states. A system was then 
defined to be stable if, for any arbitrary initial state, it reaches 
a state in T after a finite and bounded number of state 
transitions and thereafter remains in T indefinitely. A system 
was called stabilizable if there exists a supervisor under which 
the supervised system is stable. Algorithms for synthesis of 
stabilizing supervisors were also presented. Optimal stabilizing 
supervisors were presented in [8]. 

It is quite clear that the legal behavior of a DES cannot 
always be specified by a set of (legal) states, which constitutes 
a static specification, but rather by a legal language which 
constitutes a dynamic specification. In the latter case the 
system is said to converge to legal behavior if, after a bounded 
number of illegal state transitions, the system produces only 
legal behavior, that is, strings of the specification language. 

The concept of language convergence was firsl introduced 
by Kumar et al (who called it language-stability) in |9j, more 
recently in [16] and [18], and independently by Willner and 
Heymann in [151. A similar concept was also studied in a 
somewhat different setting by Ozvercn and Willsky in [19|. 
In [15] the convergence concept was formulated in terms of 
^-languages (i.e., languages that consist of infinite strings). 
In the present paper we formulate the problem in terms of 
languages over E* and extend the discussion far beyond that of 
[15]. Wc provide algorithms for determining the convergence 
of a regular language to another regular language and for 
synthesizing supervisors that guarantee that convergence lakes 
place whenever such supervisors exist. 

We show that for suffix-closed legal languages, i.e., lan¬ 
guages in which every suffix of a string is also a string of 
the language, the convergence problem can be resolved with 
algorithms of polynomial complexity. In fact, for suffix-closed 
languages the complexity is essentially of the same order as 
the static stabilization problem of [4], When the specification 
language E is not suffix closed, the convergence problem is 
resolved by an algorithm of complexity that is polynomial 
in the size of the recognizer of the system language but 
exponential in the size of the recognizer of the specification 
language. 

A further concept introduced in the present paper is the 
asymptotic behavior of a DES. The asymptotic behavior is 
the language that the system executes after performing an 
arbitrarily large number of state transitions. We discuss the 
problem of synthesizing a supervisor that guarantees the 
confinement of the asymptotic behavior of the supervised 
system within a given legal language. In the case of a 
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suffix-closed specification, the inclusion of the asymptotic 
behavior in the specification language is equivalent to the 
convergence of the supervised system to the legal language. In 
the general case, inclusion of the asymptotic behavior in the 
specification language ensures the convergence of the behavior 
of the supervised system to the legal language. The inverse 
property, that language convergence implies confinement of 
the asymptotic behavior, is not in general true. 

The paper is organized as follows. In Section II, a brief 
review of the basic elements of the framework of [ 1 | is 
given, and the concept of language convergence is introduced. 
Algorithms for verifying language convergence for regular 
languages are presented in Section HI. In Section IV we 
discuss the controlled convergence problem, i.e., the problem 
of existence and synthesis of supervisors that guarantee con¬ 
vergence. Problems associated with asymptotic behavior are 
considered in Section V. At various points of the paper our 
results are compared with the results obtained in [9J and [18]. 

11. Preliminaries 

Wc adopt the Ramadgc-Wonham (RW) [ 1 ] discrete-event 
control formalism, that is, the DES under consideration is 
modeled as a deterministic automaton 

P = (QX^b^ ih Q w ) (2.1) 

where Q is a finite set of states, S is a finite set of event 
symbols, partitioned into two disjoint subsets X), of controlled 
events and X!„ of uncontrolled events. The map b : E x Q Q 
is a partial function called the state transition function, r/o is 
the initial state, and Q m ( C Q) is a subset of final (or marked) 
stales. 

Let E* denote the set of all finite strings over X including 
the empty string (. A subset L C E* is called a language 
over X!. 

Letting b be extended in the standard way (see, e.g., 16]) 
to a function XT x Q -* Q, the language generated by P is 
defined as 

C(P)-~ f/,,)!} (2.2) 

where the notation “!” means “is defined.” The language 
marked by P is defined as 

£„.(/’) := {/ C C(P)\b(1.q o) G Q m \. (2.3) 

A sequence of states q\, ■ • ■ , i/„ G R C Q is called a path 
of P contained in Ii (starling at i/i and ending at q n ) if there 
exists a sequence of event symbols o\< -,it,i-i such that 
r/,+i = for each / = ],-■■,«- 1. When q f) - q ] we 

call the path a cycle (in Ii). A subset It C Q is called acyclic 
if P has no cycles in Ii. If P has no cycles in Q wc also 
say that P is acyclic. We shall say that a state r/ is accessible 
from a state q if there is a path of P starting at q and ending 
at q f . We shall denote by A{I\q) the set of all states of P 
that are accessible from q. 

For a subset Q C Q. we shall denote by P Q the automaton 
obtained from P by replacing the initial state by the set Q, 
that is 

P Q :=(ClX<b,(lQ m ). (2.4) 


The language generated by P t j is then defined as 

C{Pq) ■= {/ E E*| b(t y q)\ for some q G Q}. (2.5) 

The language marked by Pg is then given by (2.3) upon 

replacing P by Pg. When Q is a singleton {//}, we shall 
write P q instead of P { q } and l \ vn — P , 

If p ] = (Q\, E, , (/] 0 , Qi, w ) and P 2 — ( Q 2 , q 2 o y 

Q' 2 m) are two automata over the event set E, we say that P 2 
is a subautomalon of 7\, denoted P 2 C P\< if 

n) Q'l C Q} . Qiti 2 C Qrnl‘ tfJO = 1/20 

h) b 2 (it, 7 ) = by (it, ly) Vit e E.i/ e Q 2 s.t. f/)!. 

Thus, a subautomaton is obtained from a given automaton 
by deleting from it transitions and/or stales along with all the 
transitions incident on the deleted stales. For a subset Q C Q 
we define the restriction of P to Q-Q, denoted P\Q y as the 
subautomaton of P obtained by deleting from it all the states 
of Q and the transitions incident on these slates. 

For given deterministic automata P = (Q, E, b y qo< Q m ) and 
A = (A",E,£,j’o.A^n), the strict synchronous composition 
P\\A of P and A is defined as 

P||A = (gxXE 1 r»-(^.r 0 ).Q m x X m ) 
where a : T, x Q x X ^ Q x X is defined by 

-\s /(M^-'/)•£(<*••<•)) if andf(fT.j-)! 

)> . , | undelined otherwise. 

It is well known that C(P\\A) = C(P) n C{A) and 
£,n(P\\A) = C m (P)r\C m (A). 

A supervisor for a DES P is a maps' : C(P) —► 2^' such 
that for each / 6 C(P), S(t )(C E r ) is the set of controlled 
events that must be disabled next. The concurrent operation of 
the system P and the supervisor S, denoted by S/l\ is called 
the closed-loop system. That is. a transition in S/P can take 
place whenever it can lake place in I ) and is not disabled by S. 
The language C(S/]*) generated by the closed-loop system is 
thus given recursively by 

i G C(S/P) 

(V.s e C(S/P))sa € C{S/P) & SCT £ C(P) A IT 0 S(s). 

The language marked by S/P will be defined by 

C M (S/P) = C(S/P)nC m {P) (2.6) 

and consists of all the strings marked by P that are not disabled 
by S. 

We shall assume that the system P under consideration 
is trim, i .e., eve ry state q € Q is accessible from q { ) and 
C(P) — C m {P). A supervisor S is then called nonblocking 
if C(S/P) = C lu (S/P). 

For strings s, I e E* we let s-t denote their concatenation. 
A string s is a prefix of t , denoted s < L if t = sw for some 
w € E*. If w t , ,s is said to be a proper prefix of /, denoted 
** < L The prefix closure L of a language L C E* is the set 
of all prefixes of strings in L and L is prefix closed if L = L. 
The length of a string / is denoted |/| and the prefix of t of 
length i is denoted pr f (f) (jw 0 (0 = f). 
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For a siring t € E* we denote by suf,(£) the string obtained 
by deleting from t its first i symbols (suf 0 (£) = t and 
suf| t |(f) = f). For a language L C E*, the suffix closure 
of L is the subset of all suffixes of strings in L, that is 


suf(L) = {suf,(f) | V/ € L, V» < |<|}. 

(2.7) 

A language L is suffix closed if suf(L) = L. The following 
properties of the suffix operator are readily verified: 

• The suffix operator is monotone, i.e.. 

Lj C Ij 2 ^ suf (Z^ 1 ) C suf (£ 12 ). 

(2.8) 

• For a family of languages {L„ C E*} 


suf((jL„) = (Jsuf ( L a ), 

(2.9) 

suflU^o) C |Jsuf (L„) 

(2.10) 


with equality in ( 2 . 10 ) if for each a, is suffix closed. 

Let L C S* be a regular language and let P = 
(Q,T^b,(]Q,Q m ) be a (trim) recognizer for L , i.e., C rn (P) = 
L and C(P) = C m (P). A recognizer for suf(L) is then given 
by Pq - (Q,E,A,Q,Q 7 „), that is, C Jn {P Q ) = suf (I) 
(with C m (Pq) as defined following (2.5)). From (2.7), 
L C suf(L), so that L is suffix closed if and only if 
£iti{Pq) C C rn (P). This inclusion is equivalent to the 
condition that C m (Pq) D (E* - C nl {P)) = </>, which can 
be verified by an algorithm with complexity of 0 (|Q| 2 ) (see 
e.g., 110, Proposition 2.31). 

We conclude this section with the definition of language 
convergence that will be needed in the sequel. Let L , M C E* 
be two languages such that M ^ <j>. 

Definition 2.1: The language L is said to converge asymp¬ 
totically to M, denoted M <— L, if for each t £ L there exists 
an integer t > 0 such that suf,(0 6 M. 

Remark 2.1 ■ It is worth noting that the asymptotic conver¬ 
gence M <— L is equivalent to L C E* ■ M (where E* M 
denotes the language concatenation, i.e., s £ E* M if and only 
if s = r-t where r £ E* and t £ M). Thus, the asymptotic 
convergence problem can be investigated as a special case 
of the language confinement problem that was investigated in 
detail by RW in [IJ. 

Definition 2.2: The language L is said to converge finitely 
(or, simply, to converge) to Af, denoted Af <*= L, if there 
exists a finite integer k > 0 such that for each f £ L there 
exists i < fr, for which suf t (/) G Af. In that case the least 
k for which the above holds is called the convergence time of 
L to M and is denoted v\t{L). 

While finite convergence obviously implies asymptotic con¬ 
vergence, the converse is not generally true as illustrated by 
the simple example where L = oTli and M = {/?}. 

It is easily verified that the class of languages that converge 
asymptotically to a given language is closed under arbitrary 
intersections and arbitrary unions while the class of languages 
that converge finitely to a given language is closed under 
arbitrary intersections but only under finite unions. Finally, 


if Li,X/ 2 iM C E* are languages such that L\ C L 2 and 
Af / then 

(Af L 2 ) => (Af «- Li) (2.11) 

and 

(M<=L 2 )=>(A/4=Li). (2.12) 

111. Finite Language Convergence 

In the present section we shall develop necessary and suffi¬ 
cient conditions for finite language convergence. Furthermore, 
we develop algorithms for testing whether M <= L for regular 
languages L,M C E*\M ^ </>, For a variety of interesting 
special cases, algorithms of polynomial time complexity are 
constructed. In the most general case, however, our algorithm 
is of polynomial complexity in the size of the recognizer for 
L but exponential in the size of the recognizer for M. 

Let </> be two regular languages, let P — 

(C^E,^, </ ( ) ,Q m ) and .1 = (A', E,£,.r 0 , A' m ) be deterministic 
trim recognizers for L and A/, respectively, so that C m (P) = 
LC m {A) = M.ZJP) = C(P) and CJ^A) = C{A). Let T H 
be the subset of states in Q given by 

r l\ = {q e Q\C m {P q ) C sut(Af)} (3.1) 

(where, as defined earlier, P q is the automaton P initialized 
at q). 

The following is a necessary condition for (finite-) conver¬ 
gence of L to M. 

Proposition 3.! Let L.M c E* be the languages recog¬ 
nized by the automata P and A , respectively. Then A/ 4= L 
only if the set Q - 7\ is acyclic (in P). 

Proof' Suppose that P has a cycle C in Q - I\. Since 
P is trim, every state of P is accessible from q {) and there is 
a sequence of strings {L}, /, E C(P) = /,,/] < ^ < /*■■■ 
such that for each /, />(/,, q {) ) £ C. Assume that M <= L with 
convergence time /r(= ( j m(L)). Let ij be a string of the above 
sequence such that n := IM > *■ Then q = b(t j. r/o) G C and 
there exists s £ C in (P tl ) such that s £ sut (A7) (since the 
states of C do not belong to T s ). But t, ■ s £ C m (P) = L 
and by the convergence of L to Af, there exists / < k 
such that suf/(/ y • s) £ M. Since n > /, it follows that 
,s = suf n (tj • *) = suf TI _/ suff (tj ■ s) £ suf(A/). This is 
a contradiction, and the proof is complete. □ 

Next, we present a sufficient condition for convergence ofL 
to M. Let T be the set of all slates in Q defined by 

T := {q G Q\C m {P n ) C M) (3.2) 

and assume that </ () ^ 1\ for otherwise L C M and conver¬ 
gence is trivial. Next, let R be defined as the set of all states 
of Q - T that are accessible from q {) through a path that does 
not intersect T, that is 

R := {q G Q\3t £ L, f)(t, r/ 0 ) = q 

and (Vi < K|)4(pr f (/), q 0 ) G Q - T}. (3.3) 

The set R can conveniently be computed as follows. Con¬ 
sider the automaton P\T (obtained from P by deleting from it 
all the states of T and the transitions incident on these states). 
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Then R is the set of all states accessible in P\T from the 
initial state go, that is 

R = A(P\T,q 0 ) (3.4) 

(of course, if g 0 € T then L c M and 7? = 0 ). 

Proposition 32: If R is acyclic and R n Q m = 0 , then 
M <£= L. 

Proof: Assume first that T = 0. Then (since P is trim) 
R = Q so that R n Q m = Q m . If R n Q m = 0, then 
<y T r. = 0 and hence L = £ m (F) = 0 and M «=■ L holds 
trivially. Assume now that T £ 0. If /? n = 0, then 
for each t € L = C m (P) there must exist /,/ < \t\, such 
that q = A(prj(/),g 0 ) € 7 1 . Now the acyclicity of /V implies 
that this i must satisfy the condition that / < \R\. Hence we 
conclude that for each / E L there exists / < \R\ such that 
q - 6(pr,{t),qo) G 7\ implying that &uf,(f) E C m (P q ) C M. 
It follows that M L and ejt/(L) < \R\- □ 

Proposition 3.2 suggests the following algorithm for deter¬ 
mining the convergence of L to A/. 

Algorithm 3.1: 

1) Compute the set T of states defined by (3.2). If g y E T 
then L C M. Convergence holds trivially. Halt. 

2) Compute the set R as defined by (3.4). 

3) Check (e.g., by the algorithm of [7, p. 64]) whether P 
has cycles in R. If the answer is yes, the algorithm is 
inconclusive. Halt. 

4) If R n Q m = 0, then M ^ L Halt. 

The computation of R (which is a standard accessibility 
computation) and the algorithm for testing acyclicity of R as 
given in [7) are both of complexity 1 (7(|Q|). As regards the 
computation of 7 7 we proceed as follows. 

A state q E Q is in T if and only if C m (P q ) C C in (A) 
where A - (A\£,£,.ru,is a deterministic recognizer 
for = M). Equivalently, q E T if and only 

if C 1U (P U ) n C m (A)) = 0. A deterministic recog¬ 

nizer for L"* - £,„(A) is given by the automaton A = 
(A\ £, £ X 11t ), where A -- A' U {(!}.<! being an additional 
“dump” state, X w = {</} U (A' - A r „,). and ( : S x A —♦ X 
is given by 

, f£(<r.i') if j # i> and fttr, j 1 )! 

otherwise. 

The construction of the automaton .4 is of complexity 

o(\x\)- 

For a state 7 e Q, a recognizer for C m (I J V )fl (E* - C m ( A )) 
is given by the automaton /’Jl-A (see Section II) and we have 
that C m (P v ) C £„,(.4), or equivalently q € T, if and only 
if £„,(P 9 ||/1) = 4>. Thus, to compute T, we construct the 
automaton 

^ 5 (^||j4 = (Q x X, L, it, {q. J'o)|(/ 6 Q}iQm x .V,„) 

where a(<r, (q,j)) = ( 6 (ff,g),£(<r.x)) is defined if and only 
if 6 (tr,q)! and £(rr,.r)!. Next, we let C(Q m x X m ) be the set 

'in the present paper we shall assume |S| = 0(1) and we give the 
complexity hounds only in terms ol the number of states. 


M9 

of all states ( 7 , .r) 6 Q x X from which the set Q„, x X m 
is accessible in Pg||.4, that is 

C(Q,n X X m ) := 

{(q,s) £ Q x X|3f 6 E*,n:(/,( 7 ,i)) € Q„, x 

A stale 7 is thus in T if and only if (q, j-q) 0 C(Q, ri x A\„), 
whence T is given by 

T = Q-{qe g|to.ro) 6 C{Q m x *„,)}. 

The computation of C(Q m x ) is of complexity 0(|g||X|) 
and hence the overall complexity of Algorithm 3.1 is 

om\x\). 

An interesting case of the convergence problem is when 
the language M includes the empty string r. This implies 
thai all deadlocking behavior of L is regarded as convergent. 
Specifically, since for each string l E L, suf|/j(/,) = f eM, 
it follows (hat if all strings of L are of bounded length, then 
L converges to any language M satisfying ( E M. A further 
observation is the following. When < E M, then the acyclicity 
of R (as defined in (3.3) or (3.4)) implies convergence of L to 
M. Indeed, for t E L such that \t\ < \R\. suf| f |(() = r E M, 
and for t E L such that |/| > \R\, the acyclicity of R implies 
that there exists i < \R\ such that suf,(/) E M. Thus we have 
the following. 

Proposition 3.3: Let L,M C E* be regular languages 
recognized by automata P and A, respectively, and let R be 
the set defined by (3.4). If ( E M and R is acyclic, then 
M <= L. 

A special case of languages M that include the empty 
string are suffix-closed languages. Such language specifica¬ 
tions correspond to cases when one is interested in the eventual 
correctness of the logical behavior of the process P but does 
not insist on specific initialization. Thus, one is satisfied with 
the fact that the behavior of P “merges” with that of a given 
specificatkm language rather than performing complete strings 
of the specification and one wishes to find conditions for 
suf(Af) ^ C m (P) = L. 

l^t M be suffix closed. Then ( E M, and it can be further 
noted (hat the sets T H as defined in (3.1) and T as defined 
in (.3.2) coincide. Proposition 3.1 then states that if M <= L 
then Q - T is acyclic, and since 7? C Q - 2\ so is also R, 
Combining this fact with Proposition 3.3 gives the following 
interesting theorem. 

Theorem 3.1: Let L, M C E* be (regular) languages rec¬ 
ognized by automata P and A, respectively. Assume that M 
is suffix closed. Then M <*= L if and only if Q - T is acyclic. 

Theorem 3.1 provides the theoretical foundation for the 
following algorithm that determines the convergence of L to 
M in case L and M are regular languages and M is suffix 
closed. 

Algorithm 32: 

1) Compute the set T of states defined by (3.2). If go € T 
then L C A/. Convergence holds trivially. Halt. 

2) Check (e.g., by the algorithm of |7, p. 64]) whether P 
has cycles in Q - T. 

3) L converges to M if and only if the answer to 2) is 
negative. Halt. 
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The complexity of Algorithm 3 2 is <9(|Q||\|) (see the 
analysis of complexity of Algorithm 3 l for details) 

We turn next to the problem of language convergence in 
the case when M is a general (not necessarily suffix-closed) 
regular language The conditions of Proposition 3 2, and in 
the case when f € M of Proposition 3 3, are sufficient for 
convergence and, m view of their relative efficiency, should 
be tested first When the corresponding algorithms (Algorithms 
3 1 and 3 2 , respectively) are inconclusive, one must resort 
to a more elaborate examination As we shall see later, the 
algorithms m the general case arc no longer of polynomial 
complexity 

Let us first examine two examples that will prove helpful 
in obtaining some intuitive insight The first example shows 
that while the acyclicity of R is sufficient for convergence of 
L to M in case * € M (Proposition 3 3), it is not necessary 
when M is not suffix closed 

Example 3 1 Consider the automata P and A as given in 
Fig I, where all the states of both P and A are final states 
Thus, £ m (jP) = C(P) and C m (A) = C(A) and, obviously, 
C{A) 4= C(P) (Here and in subsequent figures the entrance 
arrows indicate the initial states ) Observe that T - {qi }, i e , 
q\ is the only state q € Q such that C(P q ) C C(A) and the 
set R = {r/ 0l tfe} is not acyclic □ 

Example 3 2 Here P and A are given by the automata of 
Fig 2 and again all states (of P and A) are final states 
To verify that C(A) <= C(P) note that 


C(P) — 6ft a*Y — 6ft(aa)*Y + 6ftoL(cm)*i* 


4P' 


Fig 3 

and that both fi{an j* 7 * and are sublanguages 

of C(A) 

Next note that 

C(P q2 ) = ft* 7* = («ft)V + ft(ft«)*7* 

and that («ft)* 7 * c £(j4 r ,) and ft(ft«)* 7 * C C{ \ x „) 

Thus 


C[P»)CC{A ri )\JC(A Jm ) = C{A lrx ^ } ) 

Note further that if 6(1,qo) = (12 for t 6 2]*, then £(/ i 0 ) e 

In Example 3 2 the language L - C(P) (that converges to 
M = C( 4)) satisfies the condition that the system P , starting 
at </o, reaches a state q such that 

C(P q )C C(\ x ) (3 5) 

for some subset \ C A We shall see below that this is actually 
a necessary condition for convergence of l to M Condition 
(3 5) is of course not sulhcient for convergence as can be 
seen from the example in Fig 3 

Here C(P) does not converge to£( 1) although C(P q ) C 
C{A,t) 

Next we present an algonthm that tests the convergence ol L 
to M for geneial regular languages L and M The algorithm is 
based on the construction of a new recogm/er for the language 
L in which the necessaiy condition that the automaton reaches 
a state that satisfies condition (3 5) is also sufficient The 
complexity of the proposed algonthm is exponential in ] \ |, 
the dimension of the state set of 4, but it is polynomial in |(^| 
the dimension of the state set of P 

Theotem 12 Let P = (Q S /) qu,Qin) and 1 — (\ 
£ £, j(),A r „ ? ) be given automata The following algorithm 
vcufies the convergence £ r „( 4) C Jlt (P) 

Algonthm 3 3 

1) Construct a deterministic automaton B - (V E,o 

uo.K,,), where V = Q x 2\ v„ = (r / 0 1,„ = 

Q m x A, and where a E x V —♦ \ is defined as 
follows Let x € 2 ^ be any element Then see (3 6 ) 
at the bottom of the page We assume that B is trim 
Otherwise construct the maximal tnm subaulomaton of 
B 

2) Test whether there exists in 8 a state u = (q, x) £ V 
such that q E Q m and x n A n , = </> If such a state 
exists, Cm {A) ik Cm(P) Halt 

3) Compute the set T C V defined by 

T= {v = (q X ) e V\C m (P q ) C C rn (A x )\ (3 7) 


(3 6 ) 


/„ fW^vhx') where = { i 0 } U {j' € JC|(3j € *)£(<r, c) = ./'} if 6{(r,q)' 

1 ’ [ l ' x> 1 ~ \ undefined otherwise 
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4) Test whether V -T is acyclic (in D) (use, e.g., the 
algorithm of 17]). 

5) If the answer to 4) is affirmative, then C in (A) <= 
C„,(P). Otherwise not. Halt. 

Proof: First observe from the definition ofC that a string 
( € E* is in C(li) if and only if /»(/.(®,,{x,,})) = (</, x ), 
where 

q = t)(l.q (l ) (3.8) 

and 

X = {-T € X\{3i < |(|) x = £( sur,(/),.r 0 )}. (3.9) 

Indeed, (3.8) holds since for each state (r/, \') C V the 
transition (r/, \'))\ if and only if A(rr, c/)!. To see that also 
(3.9) holds, wc proceed by induction on the length of strings. 
For t = r, (3.9) clearly holds with \ = {.To}. Suppose (3.9) 
holds for an arbitrary string /, so that 

V/ = {x g A'|(3/ < |/|) r - £( Mif,(/). ./■„)} 

and consider the string in for an arbitrary <r £ £, Employing 
(3.6) and substituting the above expression for \ t yields 

X = Xla -{•'■<>} u {•'■'c.V | (3.C G \,)C(fT, ./■)=./'} 

= {.(■()}U{.r'£A I (3/<|(|)((ff,C( suf,(0. 

= {■'<)} u {.r'eA" I (3/<|HK( suf,(/<r).x») = ./'} 

= {■'•'fcA | (3 /<|//t|)((.si//,((/t),./„) = /■'}. 

An immediate consequence of (3.8) and (3.9) is that 

C(B)=r C(P) (3.10) 

and since = (J m x A" 

= C m (P) (3.11) 

Suppose now that there exists a slate r = (q,.r) € V' 
such that q € Q m and 3 n A,„ — </>. Let I C J]* be 
any string satisfying o(L i»o) = "■ By (3.8) it follows that 
q = A(/.f/ 0 ) G Q w , i.e., t G £„,(/’). By (3.9) it follows that 
there is no / < |/| such that £(suf,(/),./■(») E A,,,. Thus, I has 
no suffix that belongs to C m {A). It follows that C,„(P) does 
not converge to C m {A). This proves the correctness of step 2). 

Suppose next that every state v = (q, \) E V for which 
q G Q„, satisfies yOA,,, <j>. Then, in view of (3.8) and (3.9). 
every string / 6 £„>(/’) has a suffix that belongs to £,„ (. 4 ). 
Clearly, if |(| < \V - 7’|, then the above implies that there 
exists / < |V - T\ such that suf,(/) G C m (A). We shall show 
next that when \V-T\ is acyclic, there exists / < |V-7’| such 
that suf, (l) E C, n (A) also if |(| > |V' - f\. If f = <f> this is 
obvious because then V is acyclic and |(| < |U| = |V — 7’| for 
all / e C{B). When f ^ <t>, the acyclicity of \V - T\ implies 
that if |f| > \V - T\ then there exists / < |V - t\ such that 
o(/w-,((), v Q ) = v = ( 7 , x) € t. Then, in view of (3.7) 

suf,(/) G £tn(Pq) c £m(A x ) 

or, alternately 

suf,(() G C m {A x ) (3.12) 


for some x € \. By (3.9) it then follows that there exists 
j < 1 such that 

x = ((suf, (WO),•'■«)■ (3.13) 

Upon combining (3.12) and (3.13), wc obtain that 
£(suf,(/),./•) = 

C(suf,(/),£(sufj(/»/’,(()),x 0 ) - 
C(suf,(pr,( 0 ) ■ suf,(/),xo) — 

£(sutj((),x 0 ) 

which implies that 

suf,(/)e£,„( 2 l). (3.14) 

Thus, the convergence time of C m (P) ioC m {A) is bounded 
by \V -T\ and hcncc £ m (,*l) <= C m (P). 

To see that the acyclicity of (V - T) is necessary for 
convergence of C in (P) to £,„(/!), suppose that there exists 
a cycle C in (V - T). Since IJ is trim, every slate of B 
is accessible from uo. Thus there exists a sequence {/,} of 
strings, /1 < 1*2 < I j ■ ■ • such that for each 7. 0 ) = 0 , = 

((/,. A/) 6 C. Suppose now that £, 71 (j4) <= C m (P) and let 

k be the convergence lime. Choose a string // of the above 
sequence such that 11 == IM > k. Since no state of C belongs 
to f, there exists ,s C C m (P qi ) such that -s ^ C m (A^) for 
every j e \/. Now, l(,s e C m (P) and by the convergence 
assumption of C m (P) to £ m (yi), there exists j < h such that 

suf - suf j{ti) ■ .S e £,r,(/l). 

lxtting j- :- t(suf ; (//), j‘o), it follows that .s e £^(^1^). But 
by (3.9) .r € a contradiction. □ 

The complexity of steps 1), 2), and 4) of Algorithm 3.3 
is 0(|Q|2l A '). The test whether £ m (P (J ) C £, r ,(.4 x ) can be 
performed by constructing a deterministic recognizer A x for 
£ n ,(A x ) (an algorithm of complexity (J(2* A I)) and checking 
whethci C ni (P q ) C £ m (/l v ) which can be done by a compu¬ 
tation of complexity (/)(|<7|2' A *). Thus, the complexity of step 
3) as well as that of the complete algorithm is 0((|<^|2 1A *) 2 ). 

Remark 3.1: An alternate approach to verifying the con¬ 
vergence of C m (P) to £jh ( A ) is based on the reversal of the 
automata P and A so as to obtain the reflections C m (P) R 
and C m {A) li of the languages C m (P) and £ rr? (yl). Then 
£ 7 „(.4) <*= £ m {P) if and only if there exists a bounded 
language L (i.e., a language all of whose strings are of bounded 
length) such that £,„(/T C C m (A) ri -L. (Here C m {Af-L 
denotes the concatenation of the languages C m (A) n and L.) 
The reversed automata I ll< and A 11 are nondeterministic in 
general, and the algorithm for testing the convergence involves 
computing deterministic equivalents to P n and A R . Such an 
algorithm, whose complexity is 0 (( 2 l' v ' + ^l) 2 ), has recently 
been described in |9] and |18J. Algorithm 3.3 above, whose 
complexity is 0((|Q| ■ 2 |A ') 2 ), is superior to the algorithm in 
[9] and [18] since it is only of polynomial complexity in the 
size of Q (although it is also of second-degree exponential 
complexity in the size of A). Thus. Algorithm 3.3 can be 
regarded as relatively efficient or at least tractable when 
the specification can be described by a small automaton as 
opposed to the algorithm of ]9] and [18], which is generally 
impractical. 



622 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 40, NO. 4, APRIL 1995 



Fig. 4. 

Nexl we illustrate Algorithm 3.3 by employing it to verify 
the convergence problem of Example 3.2. 

Example 3.3 (Verification of Convergence Problem of Exam¬ 
ple 32 —Algorithm 3.3): 

Step 1: The automaton 13 is given in Fig. 4. 

To illustrate property (3.9) let us examine state 2). Every 
string / € E* that satisfies a(t,n o) = (</ 2 , {j^o-J" i»-r 2 }) of 
the form for some i > 0. If 2 is even, then the suffixes 
e ,/7r*\A/to' satisfy the property £(f,.r 0 ) = so,Z((1a l . s 0 ) = 
x\ % and £( 6 /Ia\.ru) = ^ 2 - If * is odd, then these suffixes 
satisfy £(r,.ro) = x Q ^(/3a l ,x o) = and ((S/M'.jo) = Ji- 

Step 2. Since in the automaton A of Fig. 2 X m = X, we 
have {,r} H = {.r} fl JY = {.r} for each state .r 6 A". 
Hence * H AT m ^ </> for each ( q , *) E K. 

SfppIt is easy to verify that 


We recall from [1] that a language £ m (P) can 

be realized by a nonblocking supervisor S , i.e., C m (S/P) = 
M, if and only if M satisfies the following two conditions: 

• M is £ IT 1 (P)-closed, i.e., 

M = MDC rn (P). (4.2) 

• M is controllable with respect to L(P ), i.e., 

A/E,, n C(P) C M. (4.3) 


We further recall that, for a language A, the class 
CL(C m (P) n N) of controllable and £ m (F)-closed sub¬ 
languages of C m (P) fl N has a unique supremal element N p . 
This implies that a nonblocking supervisor S exists such that 
C rn (S/P) C N if and only if N p ^ (f . In view of the above 
observations we have the following immediate result. 

Proposition 4 / * CCP is solvable if and only if there exists 
a controllable and £,„(r)-closed sublanguage r/> / N C 
C m (P) such that E 4= N. 

A special case of interest occurs when the process P = 
((J'E,A,r/o.Qm) satisfies Q 7ri = Q, in which case C m (P) - 
C(P) and C 1V (S/P) = C(S/P). Condition (4.2) then be¬ 
comes that M is prefix closed and, since the class of closed 
and controllable sublanguages of C(P) is closed under arbi¬ 
trary intersections (see e.g., [ 1 ]), it contains a unique infimal 
element. Thus, in view of (2.12) when Q m = Q, the solvability 
test of CCP simplifies to the following. 

Let P u be the automaton obtained from P upon deleting 
from it all controlled transitions, i.e., P u = (Q, E, *V\ q () , Q m ), 
where 


LU, « /‘H*-'/) if ^ e A%«)! 

' ' 1 undefined otherwise. 


(4.4) 


Clearly C(P") is controllable and closed and for any super¬ 
visor S 


C(P U ) C C{S/P). 


T={ 1 , 2 , 3 }**. 

Step 4: There does not exist a cycle (ofC) in V ■ * T = {()}. 

Step 5: C m {A) <=■ C m (P). □ 

IV. Language Convergence in Controlled DES 

In the previous section we examined the problem of veri¬ 
fication whether a language L (finitely) converges to a given 
language M. In the present section we shall deal with the 
problem of convergence of the language generated by a DES 
P - b<qo,Q m ) to a given specification of logical 

behavior given by a language E C E*. Specifically, wc shall 
examine the problem of existence of a nonblocking supervisor 
S such that E 4= C m {S/P). Thus we formally define the 
following. 

Controlled Convergence Problem (CCP): Synthesize a 
nonblocking supervisor S for P such that 

E <= C m (S/P). (4.1) 


Proposinon 4 .1 can now be replaced (for the case Q 1tl = Q) 
by the following proposition. 

Proposition 4.2: Let P = (Q, E, A, r/o, Q m ) with Q m = Q 
and let E C E* be given. Then CCP is solvable if and only 
if E C(P U ). 

The above proposition yields a simple procedure for solving 
CCP in the special case when Q m = Q. First, one constructs 
P u and checks whether E <£= £(P U ). This verification can be 
done with Algorithm 3.2 (which is of polynomial complexity) 
when E is suffix closed, and with Algorithm 3.3 otherwise. 
When E 4= C(P U ), then the most restrictive supervisor, i.e., 
the supervisor that disables all controlled events, is a solution 
of CCP because this supervisor satisfies 

E*=C(S/P) = C{P U ). 

The above solution is, of course, not a very efficient one and 
in general better solutions might be sought. We shall return to 
this issue later. 
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Remark 4.1: In'[9] and (18J, a problem similar to CCP was 
discussed. A language L was called /-stabilizable w.r.t. E 
if there exists a sublanguage HcL, controllable w.r.t. L, 
such that E 4= H. The authors argued there that for a system 
P = is /-stabilizable w.r.t. a given 

language E if and only if E 4= C rn {P u ) t The justification 
for this conclusion was the claim that C m (P u ) is the infimal 
controllable sublanguage of C tn (P). When Q m = Q, this 
claim is indeed true. It is false, however, in the general case 
when Q m ^ Q , since in that case C, n (P u ) is not necessarily 
a controllable language. To solve the /-stabilizability problem 
as well as CCP in the general case, a different approach must 
be taken as discussed in detail below. 

We turn next to the problem of solving CCP in the general 
case when Q m ± Q and E is a general regular language. To 
this end wc shall make use of two known algorithms that are 
briefly discussed below. 

The first algorithm computes the supremal controllable and 
£ m (^-closed sublanguages of a given language. Specifically, 
let P — (Q,£, b,(]o,Q m ) be a DES and M C C m (P) be 
a given regular language. Let A ~ ,Y ril ) be a 

deterministic recognizer for M (£,„(/4) = A/). The construc¬ 
tion of a deterministic recognizer for the supremal controllable 
(with respect to C(P)) and £,„(P)-closed sublanguage of M 
is accomplished by an algorithm of complexity of 0(|Q||A'|) 
as follows. First construct the synchronous composition C - 
HI A ■= (Q X A\E,Gr,(r/ 0l . I’d), Q m x Y m ) as descried in 
Section II. Then £ trl (f') = CJP)nC m (A) = C m {P)nM = 
M. The supremal £„, (/^-closed sublanguage of A/ is then 
given by C ni (C') (see [14, Appendix B|), where C f is given 
by the restriction 

r' = r \Q m x 

The complexity of constructing ( v is 0(|Q||.Y|). By |2, 
Proposition, 6.1J, the supremal controllable sublanguage of 
£„,((") equals the supremal controllable and £ m (/^-closed 
sublanguage of M. A deterministic recognizer for the supremal 
controllable sublanguage of C in (C 1f ) can be constructed by 
the algorithm of 111J (see also [121), whose complexity is 
()(\Q\ 2 • |'Y| 2 ). An algorithm of complexity 0(|Q||A'|) has 
recently been given in [13]. Thus, the construction of the 
supremal controllable and £,„(/^)-closed sublanguage of M 
can be accomplished with overall complexity 0(|i^||-Y|). 

The second algorithm that we shall employ below appeared 
in |4] (see also [51, [9], [181) and is briefly described next. 
Let P = be a DES with unspecified initial 

state. Fix a subset Q C Q (which we shall regard as the set 
of legal states). The set Q is called a strong attractor for a 
state q E Q if, when initialized at </, the system P always 
reaches a state in Q after a bounded number of transitions. 
Formally, Q is a strong attractor for q € Q if A(P\Q,q) is 
acyclic and no state of A(P\Q,q) is a deadlock state of P 
(i.e., a state at which no transition is defined). A set Q is 
called a weak attractor for q E Q if there exists a supervisor 
S such that Q is a strong attractor for q in {S/P). Such a 
supervisor, when it exists, can be chosen to be static, i.c., for 
M 6 C{P), S(i) = A» whenever b(« % q 0 ) = />(Mo)(= '/)• 
Note also that a static supervisor can be written as a map 


S : X —► 2 s . The set of all states q e Q for which Q is 
a weak attractor is called the region of weak attraction and 
is denoted by S1r(Q). If Q = 0 , then Slp(Q) := <t>. In [41 
and [5] algorithms are given for computation of Qp(Q) with 
complexity 0(|Q| 2 ). An algorithm of complexity 0(|<y|) has 
recently been presented in [9] and [18], 

We return to our main discussion. To test the solvability of 
CCP, we first construct the automaton B = (K, Vm) 

as defined in Step 1 ) of Algorithm 3.3. In view of the fact that 
£(£?) = C(P) and C m (B) = C m (P), [(3.10), (3.11)), it is 
clear that B can be regarded as a new model of our process. 
Thus, CCP is solvable with respect to P if and only if it is 
solvable with respect to B. 

The automaton B offers an obvious advantage over P for 
testing the solvability of CCP in that its structure provides 
a simple way of identifying all strings of C m (B) that do not 
have a suffix in E. Indeed, let t E C in {B) and it = (q, \) G V m 
satisfy (\{1. (</ 0 , {.r 0 })) = n. Then, since V ln = Q m x 2 A , it 
follows that q E Q m and, see (3.9), t has a suffix in E if and 
only if \ n X m ^ <j>. Let F be the set defined by 

E ={(</, \ ) € V M \q E Q m and * n -Y,„ ^ r/>). (4.5) 

Define B' as the automaton 

/?' = (V,E,ry, vq,F) 

which is obtained from B by restricting its set of final states 
to F. Suppose now that S is a solution to CCP with respect 
to B. i.e., E 4= C m (S/B), and let t E £„,(/?) - £ m (B ; ). 
Since t has no suffix in E , it follows that t £ C W [S/B) and 
it follows further that £ m (S/B) C £„,(#'). Since C m (S/B) 
is controllable with respect to C(B) and is C tn (B) -closed, 
wc conclude that £ m (S/13) C supCL(£ m (Z?')), where 
su \)CL{£ m (B')) is the supremal controllable (w.r.t. C(B)) 
and £ m (B)-c loscd sublanguage of £ m (f3'). Define now the 
automaton 

B = (I\E,rMV V m ) (4.6) 

as a recognizer for sup CL ( C m {B f )). Such an automaton can 
be constructed by an algorithm of complexity f >(|Q| * ) as 

was described earlier. From the algorithm it is also clear that 
B is a subautomaton of B. Now, the inclusion £ m (S/B) C 
£ru{B) C C m (B) implies that C m (S/B) = C m (S/B). 
Hence, CCP is solvable with respect to B only if it is solvable 
with respect to B. To see the reverse implication, i.e., that solv¬ 
ability of CCP with respect to B implies the solvability of CCP 
with respect to B , observe that C m {B) is a controllable (w.r.t. 
C(B)) and £ tl 1 (S)-closed sublanguage of C m {B). Thus, there 
exists a supervisor S such that C m {S/B) = £ m {B). If 
there exists a supervisor S such that E <= C m (S/B ), then 
E 4= C m (S/(S/B)) = C m {S x S/B ), where S x S is the 
supervisor that consists of the conjunction of S and S (see 
e.g., |2]). We have just proved the following. 

Lemma 4.1: For a DES P and a regular language E , let B 
be the automaton defined by (4.6). Then CCP is solvable with 
respect to P if and only if CCP is solvable with respect to B. 

The following corollary is immediate. 
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Corollary 4.1: If C m (B) — 4> then CCP is unsolvable. 

In Theorem 3.2 (see Algorithm 3.3) we have shown that the 
convergence C m (A) <= C m (P ) is equivalent to the acyclicity 
ofV-t where f [see (3.7)] consists of all states v = {q, x) 
such that £ m (P g ) C £ m {A x ). Similarly, we construct now 
a subset T of states of B such that for each v = (q, x) € T 
there exists a supervisor S with the property that £ m (S/P q ) c 
£ m {A x ). We will then show that the solvability of CCP is 
equivalent to the possibility of preventing the system B from 
executing cycles in V - T. 

For a state v = (q, x) of B it follows from (4,2) and 
(4.3) that there exists a nonblocking supervisor S such that 
£ m (S/P q ) C £„,(A X ) if and only if the class of controllable 
(with respect to £(P q )) and £ m {P q )- closed sublanguages of 
C m {P q )r\£ m { A x ) does not consist of the empty language. To 
check this condition we proceed as follows. First note that A x 
is not a deterministic automaton since it has in general more 
than one initial state. Hence we begin by constructing a de¬ 
terministic recognizer A x = (IT. £, /), wo, W m ) for £ m (A x ), 
i.c., £* m (A') = £ m (u4 x ). Next, by the algorithm described 
earlier wc construct a deterministic automaton D v such that 

C m (D v ) = sup CL(C m (P q ) n CmiA*)). (4.7) 

The required supervisor S such that C m (S/P q ) C £ m ( 44 x ) 
exists if and only if C m (D v ) ^ </>. The construction of D " 
requires 0 (|Q| * 2 '“ V I) computations since the number of states 
of j4 x is bounded by 2 |A 1 and the number of states of P q is \Q\. 

Now, let T be the set of states defined by 

f = {veV\C m {D")*4>}- (4.8) 

Since the number of states of B is bounded by |Q| ■ 2 ^ 
(the number of states of B) the construction of T requires 
0 ((|Q| • 21 “ v *) 2 ) computations. 

The final step in our computation is the employment of 
the algorithm of [9] and [18] to compute the set ii D (T). The 
complexity of this algorithm is ()(\Q\ ■ 2 |A| ). 

We now have the following necessary and sufficient condi¬ 
tion for the solvability of CCP. 

Theorem 4.1 * Let P be a DES, and let E C £* be a regular 
language. Let B and T be the automaton and the set of states as 
defined in (4.6) and (4.8), respectively. Then CCP is solvable 
if and only if € tt B (T). 

Proof: Only If. By Lemma 4.1 CCP is solvable with 
respect to P if and only if it is solvable with respect to B. 
Suppose that CCP is solvable so that there exists a nonblocking 
supervisor S such that E «= C m (S/B). We shall show that 
the assumption that v 0 £ il B {T) leads to a contradiction. First 
we shall show that, under 5, there are no deadlock states in 
V—f. Indeed, let v = (</, x) £ V -T and t G C(S/B) be such 
that <i(£, t;q) - h. Suppose that v is a deadlock state, i.e., ta G 


C(S/B) =*s = e. If q<zQ- Q m , th en {q, x) & V m , implying 
that t 0 C m (S/B). Thus C(S/B) ^ C m (S/B ), contradicting 
the nonblocking properly of S. If q € Q fn then t € C m (P q ). 
By definition of £?, x D X m ^ so that t 6 C m (A x ) g 
implying that ( G C m (P q ) n £ m (a4 x ). Since the language 
{(} is trivially L m (P q )-closed and controllable with respect 
to C(P q ), it follows that {f} € CL(C m (P q ) n £™(4 X )), so 
that C m (D 1 ’) ^ 0. This implies that v G t , a contradiction. 

Since ( S/B) has no deadlock states in V - 7\ the assump¬ 
tion that vq £ J 2 B {T) implies that (S/B) has a cycle in V-T. 
With the employment of the same arguments as in the proof of 
Theorem 3.2, it is not difficult to show that the existence of a 
cycle in V-T contradicts the assumption that E 4= C m (S/B). 

If. The proof of the “if” part of the theorem consists of 
the following algorithm for construction of the supervisor S 
that satisfies the condition that E <= C m (S/B) whenever 
f’o 6 Sl B (T). 

Algorithm 4.1: 

Step I) Define the supervisor S] : V —► 2 E( as a static 
supervisor such that T is a strong attractor for 
H B (T) with respect to S\/B . The existence of this 
supervisor follows from the definition of il B (T). 

Step 2) For each v G V , let D' be the automaton defined 
earlier (see (4.7)). 

The required supervisor is given by .S' : L(B) —► 
2^ as found in (4.9) at ihe bottom of the page. 

The supervisor S is obviously well defined because for 
each t € C(B) it defines a unique set of controlled events 
to be disabled. Next, note that under the supervision of S, the 
system B reaches a stale v G T within a number of transitions 
bounded by \V - t\ because for states in il B (f) - f S 
acts as S\. When a stale v =■ (ry. \) e T is reached, then 
it is guaranteed by S that all marked strings generated by B 
(starting at u) belong to 

I^(D v )^snpCMC m (P q )nC m (A^ c £ ni (A x ). 

It thus follows that for I G C m (S/B), |/| > |V - 7 1 1, there 
exists i < |V" - T| such that (v(pr,(i), v {) ) = v = (ry, \ ) E f 
and suf ( (/) G L 11X (A X ). By (3.12)—(3.14) it follows that there 
exists j < / such that suf,(f) G E. If |/| < |V - T |, the 
existence of i (< \V - T |) such that suf,(/) G E follows 
from the definition of B. This concludes the proof. □ 

The complexities of computing the automaton B and the 
set T are 0(|(2| ■ 2^1) and 0((\Q\ - 2* A *) 2 )' respectively. The 
computation of il B (T) requires 0(\Q\ ■ 2^ x ^) computations. 
Thus, the complexity of verifying the solvability of CCP (i.e., 
checking the condition f ; 0 € O n (f)) is 0((|Q| • 2' A I) 2 ). In 
case CCP is solvable, the supervisor defined by (4.9) is a 
solution of CCP. 


(5](f)) where v — a(l,v o) 


if {V l }tt(pr,(f),u n) 0 t 


S{t)= [<TGE r |suf. 0 (f)<r ^£(D")}, 
where to is the least integer i 
l that satisfies a(pr,(t),v o) € f 


if there exists i such that 
oi(pr,(t), Do) € f 


(4.9) 
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When E is suffix closed, the solvability of CCP can be 
verified by an algorithm of linear complexity (i.e., of com¬ 
plexity 0 (|Q||X|)) as described below. The correctness of the 
following algorithm follows from Theorems 3.1 and 4.1. 

Theorem 4.2: Let P = be a given DES, 

and let E be a suffix-closed regular language. Let A = 
(X be a deterministic automaton such that 

C m {A) = E . Then the following algorithm verifies the 
solvability of CCP. 

Algorithm 4.2: 

Step 1) For each 7 G Q construct a deterministic automaton 
D q such that C w (D q ) is the supremal controllable 
(w.r.t C(I\)) and closed sublanguage of 

C m (P q ) n C m (A). 

Step 2) Compute the set Y of states defined by 

Y := \q£ Q\C .,„(/>*) 1 01- 

Step 3) By the algorithm of |91, |I 8 | compute the set 

Step 4) CCP is solvable if and only if <ju £ ilp(Y). □ 

The set Y can be constructed with the algorithm of [13] 
using the same method as was described in Section 111 for the 
computation of the set T. The complexity of constructing the 
set V as well as the overall complexity ot Algorithm 4.2 is 

U(WHA'I). 

Algorithms 4.1 and 4.2. which constitute tests for solvability 
ol CCP, also provide a construction of nonblocking supervisors 
that guarantee convergence of the closed-loop language to the 
specification E whenever CCP is solvable. The synthesis is, 
however, neither optimal nor unique since, in general, there 
does not exist a least restrictive supervisor that guarantees 
convergence. Specifically, there does not, in general, exist 
a supervisor S such that E 4= E w ( S/1 ’) and such that 
if S is any supervisor satisfying E <= E m (S/P). then 
C„,(S/P) C £,„{&/P). To see this, consider the following 
simple example. Let E m (P) = (ft ft and let h ~ ft. Assume 
further that Y, = (a} and Y„ = ]/!}• The solvability ol CCP 
is obvious, and for any A >0 there is a supervisor S k such 
that CM/P) -jy^ft (which clearly satisfies ft <= (ft ft). 
But for Aj < k>. eft'ft C (ft-ft and k can be arbitrarily large! 

The issue of optimal supervisors (in contexts other than 
"minimally restrictive”) is discussed in 117]. 

V. Asymptotic Behavior Of DES 

In nonterminaling systems, i.e., systems that operate in¬ 
definitely, one can distinguish between their transient and 
permanent (or asymptotic) behaviors. Intuitively, the asymp¬ 
totic behavior of a system consists of all strings that the system 
can execute after having already perlormcd an arbitrarily large 
number of transitions. 


For a language L C E* we define its asymptotic suffix, 
denoted as 

L^ := :.s£S*|V/>().3L€i:M/,|>iAf l *€/.}- (5.1) 

Clearly, L^ ts an empty language whenever L is a bounded 
language (see Remark 3.1). It is also easy to see thal L •* is 
suffix closed for every language L C Y*. 

Let P = (QXA.qq.Qm) be a DES. The asymptotic 
behavior of P is defined as E„,(P)rx ■ A recognizer for 
E m (P)^ can be constructed as follows. Let cy{P) be the 
set of all slates of P that arc accessible from r/ () and belong 
to a cycle of P, that is 

n/(P):={r/eQ|(3«,/ € E*)f/ = A(l, <Jo).» A A{«, <l) = q). 

(5.2) 

Lei Q^ := A(P.ni(P)) and let Q ,„:= (^n^. Define 
TV :=(Qn..Y.KQ^,Q„, v ). (5.3) 

The following proposition stales that 7 V is a recognizer 

fOr C w {P)nu- 

Proposition 5.1: £,„(/*)\> = £m(/V)- 

Proof: £„,(PK C £,„(/ J x). Let t £ C, n (P)^ be any 
string. By (5.1) there exists a string «>. |ii>| > |Q|, such 
that mi £ £„,(/*). Let I = |w|. be the path 

associated with w. Since / > |Q|, there exists a state q € Q 
that occurs along this path at least twice, i.e., there exist 
0 < m < < I -such that q = (/,, - q Tlius, 

<1 £ ctl(P) and q := b{w,q 0 ) £ A(P .<</(/*)) = Q -... Since 
wt € £,„(/’) and P is deterministic, it follows that 

I e C m (Q^.Y.I),q\Q, ,,* ) C 

Em (Q x , E. 5, Qx. Qmn. ) = £|>l (7 Xj )■ 

Em{P )tv 2 ) £m(/V). Let lit £ Em (7t*j). Then w £ E n1 
. YA. q. Qmo *.) for some q £ Q~^ ■ Thus see the equation 
at ihe bottom of the page. 

By (5.1) this implies that w £ E W (P V concluding the 
proof. O 

The construction of the set ry{P) can be accomplished by 
the well-known algorithm for computing strongly connected 
components of P (see 1 7, p. 64|) whose complexity (under 
our assumption that |E| = 0(1)) is D(|(,)|). The construction 
of A{P,cij(P)) requires O(|0|) computations. Thus, the com¬ 
plexity of the construction of TV is linear in the number ol 
slates of P. 

Our next goal will be to show thal for a given system/*, 
the convergence of E m (P) to a given language E can be 
determined by testing the asymptotic behavior of P. The 
following proposition gives a necessary condition for the 
convergence E <= £,,,(7*) in terms of the asymptotic behavior 
of P. 

Proposition 5.2: Let P be a given DES, and let 7£ C E* be 
a given language. If E <= C, n (P), then E m (P)^ C suf(/3). 


3 u £ E\ 3r/ € <•;/(/*). A(u,r/) = q (definition of Q-J 

=*■ 3 (t.H £Y*)q' = jt ( and q' = b{», q') (follows by (5.2)) 

=>• Is*nut £ E m (P). 
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Proof: Assume that E «= C m {P), let the convergence 
time be k and let » 6 C m (P) aCj be any string. By (5.1), 
for each t > 0 there exists t, € £*, |f,| > i, such that 
hs € C V ,(P). Choose U > k. Since there exists j < k, such 
that sufj(t,s) G E, it follows that 

* = suf|t,|(t.J») = suf| ti |_j (suf,(*;«)) G suf(£) 

concluding the proof. □ 

The following lemma states that every regular language 
L C E* converges to Loo- 

Lemma 5.1: For a DES P = (Q,£,A, qo,Q m ), C m (P)oo 

* Cm(P)- 

Proof: Since the language C m (P )« is suffix 
closed it follows that t e £ m (P)oo- Thus for each 
t € C m (P), suf|,|(/) = f € C m (P)oo implying that if 
\t.\ < (C^| there exists ? < |Q| such that suf,(/) € C m (P) «©• 

Let t € C m {P )- |f | > |Q|. Since the number of states of P is 
|Q|, there exists ij < |Q|, such that <Hpr t (f),r/ 0 ) = 7 where 
q £ c.y(P) C Qnc- Since by Proposition 5.1 £ m {P)oo = 
£m(/oc)» h then follows that &uf,(/) € £ w (Poo) = £m(P)oa- 
Hence the convergence time of £ m (P) to £ m (P)oc is bounded 
by |Q|, concluding the proof. □ 

It should be noted that if L is not a regular language, then 
the convergence <= L does not necessarily hold. Consider, 
for example, the language {a k fl k Y\^h > 0 }. 

Proposition 5.2 and Lemma 5.1 yield the following. 

Corollary 5.J: Let P be a DES. Then £ w (F) nc is the 
infimal (in the sense of language inclusion) suffix-closed 
language that £ m (P) converges 10 , i.e., 

£m(P )00 = nfAf C E*| suf(M) = M and M <= C m (P)}. 

A sufficient condition for the convergence E 4= £ in (P ) is 
given be the following proposition. 

Proposition 53: For a DES P and a language E C 
E*, if £m{P)oc C E , then E <= £ m {P). 

Proof: By Lemma 5.1, £ m (P)^ <= £ m (P). Thus, if 
£ tn (P)oo C £, it follows directly by ( 2 . 12 ) that also E 4= 
C m (P). □ 

By combining Propositions 5.2 and 5.3 for the special case 
of suffix-closed languages we obtain the following necessary 
and sufficient condition for E <= £ m (P). 

Theorem 5.1: Let P be a DES and let E C E* be a 
suffix-closed language. Then E £ m (P) if and only if 

£ m (PK C E. (5.4) 

Next we turn to the problem of supervisory control. In 
the previous section we considered the problem of language 
convergence, that is, the problem of synthesizing a supervisor 
that guarantees convergence of the supervised language to a 
specified legal language. We shall now be interested in the 
less restrictive control problem wherein only the asymptotic 
behavior of the supervised process is required to lie in a 
specified legal language. Thus, we shall consider the following. 

Asymptotic Control Problem (ACP): Let P be a given DES 
and lei E C E* be a given language. Synthesize a nonblocking 
supervisor S such that 

£m(S/P)n o c E. (5.5) 


In the case when E is a suffix-closed language there is an 
obvious equivalence between ACP and CCP. This equivalence 
is stated in the following reinterpretation of Theorem 5.1. 

Proposition 5.4: Lei E be suffix closed. Then ACP is 
solvable if and only if CCP is solvable. 

Let us now examine the solvability of ACP in the case 
when E is not suffix closed. Since is suffix closed for any 
language L % ACP is solvable (i.e., there exists a nonblocking 
supervisor that satisfies (5.5)) if and only if there exists 
a suffix-closed sublanguage M C E and a nonblocking 
supervisor S such that 

£ m (S/P )00 C M. (5.6) 

By Theorem 5.1, a supervisor S satisfies (5.6) if and only if 
M*=£ W (S/P). (5.7) 

Thus, to determine the solvability of ACP, it is sufficient to 
check whether CCP is solvable with respect to the supremal 
suffix-closed sublanguage of E. The existence of the supremal 
suffix-closed sublanguage of E follows from the closeness of 
the set of suffix-closed languages under arbitrary unions. 

Proposition 5.5: For a language /?, ACP is solvable if and 
only if CCP is solvable with respect to the supremal suffix- 
closed sublanguage of E. 

We conclude this section with an algorithm for computing 
the supremal suffix-closed sublanguage of a given regular 
language E. 

Algorithm 5.1: 

Input) A deterministic automaton 

A = (A r , E.(, j’o, A',,,) 

such that (without loss of generality) £(rr. x) j- .r { > 
for all (T G !], j 6 A". 

Output) A deterministic recognizer for the supremal suffix- 
closed sublanguage of £ m (A). 

Construct a deterministic automaton 

r==(i/,£,/wtU 

where U — 2 A \ uq = {^n}, f/ rr< = {u £ U\(u- 
{.rn}) C A' m and fi : E x U —► TMs defined by 

u) = 

{.tq} U {£(< 7 .:/') |.r 6 11 } if £(< 7 ,./’)! for all xE u 
undefined otherwise. 

(5.8) 

Proposition 5.6: £ m [C) is the supremal suffix-closed sub¬ 
language of £ m {A). 

Proof: First observe that for any two states u,w € V 
such that u C w (as elements of 2 A ), f1(< 7 , w) \ only if (i(o< u)!, 
and if then /i(rr,n) C Moreover, if fi(rr,u)\ 

and x € u , then £(<7 ,j;)! 

Let s = * 7 ] • ‘ ■ o n € £ m {C) be any string, and 

let uo,' <a « 1£ n be the associated path in C\ i.e., u % = 

and u n € U m . Then since 
uq = it follows from the above observation that there 
exists a path ^ 0 , • ■ ■ ,x n in A such that x, = £(< 7 n ;r,_i) and 
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Xi G Ui — {x*o} for all i = 1 , ■ ■ ■ ,n. From the definition of 
U-mt x n £ u n ” {# 0 } C X m , so that s 6 £«*( A), implying 
that C m (C) c X m (A). 

To see that C m {C) is suffix dosed, consider sufj(s) = 
&j +1 ■ ■ * a n for some 0 < j < n. We must show that sufj(.s) e 
Cm(C). Note that u 0 C u :i , whence since = fi(<Tj+i,Uj) 
is defined, so is u i := , m 0 ) and u\ C Proceeding 

inductively, we obtain a sequence of states u 0 , u\ , •, u n -j 

with == uo),Uj+i = ^((r T - + j + i, Ui) and u t C iii+j. 

Since u n _j C u n € l/ m , we conclude that G U m , 

whence sufj(.s) € £ m fC) as claimed. 

Finally, to see that C m (C) is the supremal suffix-closed 
sublanguage of £ rn (A), we must show that if ,s = <t\ • ■ • <r„ G 
£n.(A) is a string such that sufj(.s) G £ m (A) for all j = 
0, ■ -,n, then .s G £ m (C). Suppose that .s £ £ m (C7). Let 
/, 1 < / < n, be the smallest integer such that iij =■■ 
u f „i) is defined for all j = 1, ■■■./- ] but 
is undefined. This implies [see (5.8)] that there exists A;, 0 < 
k < l - 1, and a sequence of states ;/:o, X \, ■ • ■, 1 

such that .Tj = ^(fTfc -1 j, ;;■)_]) for all j = 1, ■■-./ - k - J 
and is undefined. But then suffc(.s) £ £(A), a 

contradiction. 

□ 
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Concurrent Vector Discrete-Event Systems 

Yong Li, Member, IEEE , and W. M. Wonham, Fellow, IEEE 


Abstract —The vector discrete-event system (VDES) is a com¬ 
pact serial discrete-event system model, in which the system 
state is represented by a vector with integer components and 
the transitions by integer vector addition [13], [14]. Continuing 
the study of VDES, we introduce in this paper concurrent VDES, 
extending the base VDES model to capture strict concurrency, 
or possible simultaneity of events. We characterize the effect 
of strict concurrency on control, and show how to synthesize 
nondeterministic controllers allowing maximal concurrency or a 
controlled VDES plant. 

1. Introduction 

ONCURRENCY is a critical issue in designing computer 
operating systems, data base management systems, and 
distributed computer and communication networks [4], [7], 
[1J. It is one of the key characteristics of real-time systems 
in general [18]. Researchers in the computer and communi¬ 
cation areas have recognized that, compared to serial systems 
(in which events occur serially), concurrent systems present 
special challenges. The simultaneous event occurrences in a 
concurrent system make it harder to enforce the orderly flow 
of events according to a specification of desired behavior. 

This paper studies concurrency from a control-theoretic per¬ 
spective. We adopt the vector discrete-event system (VDES) 
introduced in [13] and [14] as the base model for controlled 
plants, incorporate simultaneity of events, and study its effect 
on controller synthesis. We call the presence of simultaneous 
events strict concurrency or simply concurrency. 

As background, we first define VDES and review some 
concepts of VDES control. 

VDES is a compact serial discrete-event system model. 
The state of a VDES is coordinatizcd by state variables 
,ri, j' 2 i ■ *, J'n ranging over the integers Z. The state space is 
the set of n-dimensional integer vectors 

•n 

X := i X J . 2 I x,eZ)=Z". 

L^ri J 

The event set £ is partitioned into disjoint subsets £ c and £ u , 
the controllable-evcnt subset and uncontrollable-evcnt subset, 
respectively. The state transition is described by a partial 
function 

b\ £ x X —► X\ (o^ A) X -+■ E (y ( 1 ) 
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where E a E X is the displacement vector for a. For X 6 X 
we write A" > 0 if r, > 0 (/ = i, X) is 

defined, denoted by b(n, X)!, if 

X 4" E,y > 0. 

A VDES is formally defined as a 4-tuple 

G = (£, X 7 6, A" () ) (2) 

where Xo is the initial state. 

A state-feedback controller for C is a function 

/: X rc 2 l (3) 

where F is the set of control patterns defined as 

F := b|E M C 7 C £}. (4) 

An event a is enabledat a state X if o 6 /(.Y); otherwise, 
o is disabled. 

We describe system behavior by predicates. A predicate 
on the state space X is a function P: X —> {(), 1}. The 
predicate operators “-V’ (negation), “A” (conjunction), and “V” 
(disjunction) are defined as follows 

(-iP)( X) = 1 P{X) - 0 
(I\ A I\){X) = I <=> I\{X) = 1 and P,(X)=[ 
Av/^-lKilAl^)). 

A partial order is induced on the set of predicates on X 
according to 

Pi *P*&Pi AP 2 = A- 

In this paper wc identify a predicate P on X with the subset 
of X induced by P 

{X e\’\P(X) = i}. 

We will employ the expressions “A e P' 9 and ki P(X) = 1 ” 
interchangeably. 

For each a 6 £ the predicate transformation A/ ri , the 
weakest liberal precondition [16], is defined for a predicate 
P on X by 

{ 1 if 6 (a, A)! and A(a, X) E P, 
or if not ^(oj, A)! 

0 otherwise. 

For a predicate P on X , wc define a predicate i?(G, P) 
which characterizes the set of states reachable from Ao by 
way of states satisfying P\ 
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1 ) if Xo € P, then X 0 <E ft(G, P); 

2) if A' € R{G, P), ftgS, 6 (rv, A')! and A(cv, A) e P, 
then 4(a, A) € i?(<7, /»); 

3) every state in i?(G, P) is obtained as in 1 ) and 2 ). 

We call a predicate P on X controllable with respect to G if 

P X R(G, P) and (Vo e E „)P < M n {P), (5) 

Note that the empty (or identically “false”) predicate is 
trivially controllable. 

For a VDES plant G under the control of a controller /, we 
define the closed-loop predicate 7?(//C) as follows |13]: 

1) Ao 6 P(//G); 

2 ) il X £ P(//G), n € A r )! and a £ /(-V), then 

*) € P(//G); 

3) every state in U(f/G) is obtained as in 1 ) and 2 ). 

It was shown in [13) that a predicate V can be synthesized 
by a deterministic controller / for a serial VDES model G (i.e., 
P(//G) = P) if and only if P is nonempty and controllable. 

A controller / is called balanced [13] if for any A\ A’' £ 
P(//G) and any a £ i] with A(nr, A')! and />(a,A r ) = A' 
we have that a £ f(X). A balanced controller is the one 
which, among controllers synthesizing the same closed-loop 
predicate, enables the most events at any reachable state. 
It can be checked that any controller may be replaced by 
a balanced controller without changing the reachable set 
|9, A.2|. By concentrating on balanced controllers, wc gain 
technical convenience in our proofs without loss of generality. 

In the serial VDES model, the event-occurrence condition 
is defined for single events (in D) only; it is assumed that at 
any given instant of time, at most one event can occur. Also, 
at each stale, the controller (3) issues a predetermined control 
pattern; such a controller is deterministic. 

In this paper wc first extend the serial VDES (2) to include 
concurrency (that is, to allow simultaneous occurrences of 
events); this extended VDES model is called concurrent VDES 
(CVDES) (Section II). Then wc study the control of CVDES 
by both deterministic (Section III) and nondelermimsiic (Sec¬ 
tions IV and V) controllers. A nondetcrministic controller 
issues, at each state, a control pattern which is “randomly" 
selected Irom a set of control patterns. It is shown that 
nondcterministic controllers are more powerful than their 
deterministic counterparts in the presence of concurrency. Wc 
also discuss how to synthesize nondeterministic controllers 
which allow the controlled CVDES to operate with maximal 
concurrency according to a specified criterion. 

In the control literature, simultaneous events in DES (but not 
VDES, which had not yet been introduced) were treated in a 
preliminary way in the thesis [ 8 J, via the ideas of well-posed 
supervisor, well-posed language, and concurrency product. It 
was shown that the supremal well-posed sublanguage of a 
given language need not exist, hence there need not exist an 
optimal supervisor in the presence of simultaneous events. 
These results were reported in |I 0 ] and [ 11 ], where non¬ 
determinism was introduced to obtain an optimal solution. 
The completion of a set of state-feedback controls (SFBC), 
forming a poset but not a lattice, using nondeterminism (ex¬ 
ponentiation) to construct a supremal element, was proposed 



Fig An ATM model. 


independently in |2]. In [12J the optimal control problem for 
VDES with strict concurrency was formalized in the style of 
|10|, 12], and [11|, and the optimal nondeterministic SFBC 
obtained. The current paper completes [12] with full proofs of 
the authors' earlier results. Wc remark that the nonexistence, in 
the presence of strict concurrency, of a supremal deterministic 
SFBC, as distinct from merely multiple, maximal SFBC, was 
pointed out by Krogh for controlled Petri nets (CPN), when 
the latter were introduced in [5J. The existence of a unique 
maximal nondeterministic feedback policy, in the context of 
the forbidden stale problem for CPN, was later noted in [3] 
and [6|. 

Throughout this paper, we use the example of an asyn¬ 
chronous transfer mode (ATM) switch. A general high-level 
model of an ATM switch 115| is shown in Fig. 1. It consists of 
N input-port controllers (IPO, an interconnection network and 
N output-port controllers (OPC). 1PC and OPC are composed 
of buffers for incoming and outgoing ATM cells, 1 respectively, 
and the control logic for these buffers. The interconnection 
network is responsible for switching incoming cells to their 
destination output ports. The outgoing cells to an OPC arrive 
from K routing networks (RN). The buffer within an OPC has 
capacity C\ while the buffer is full, any arriving cell will be 
discarded. An OPC can disable the transmission of cells from 
a RN by backward signaling. 

In this paper we focus on a single OPC, modelling the 
buffer within the OPC and synthesizing control strategies for 
the buffer, For definiteness, assume that K = 3 and C = 5. 

We first model the buffer as the one-dimensional serial 
VDES G shown by its Petri-net graph in Fig. 2, with E ctl = 
E a , = /; ni = 1, E fU = -1, and A r 0 = 0. In this model, 
o, (i = l. 2, 3) are controllable and m is uncontrollable. 
The state transition graph of the buffer is displayed in Fig. 3. 
The control specification is that the buffer must not overflow, 
i.c.. (A’ < 5). Then the (optimal) controller enforcing this 
specification in G is easy to obtain 



^4} 

w 


if A < 5 
otherwise. 


( 6 ) 


In the serial model G, no two events can occur simultaneously, 
e.g., cells from two RN’s never arrive at the OPC at the 
same lime. This assumption is not realistic, considering the 
independence and high-rate of transmission of cells from 

1 An AI M cell is a segment of a 53-byte data stream. 
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Fig. 2. Tlie VDES model for an OPC buffer. 



<'i '*> f „ 

Fig. 3, The serial stale transition graph of an OPC buffer 



Fig 4. The concurrent stale transition graph of an OPC buffer, 
and extend the state transition function as 


W E ro „ x * -> X: (p, X)~X + E>. (9) 


RN’s in an ATM switch. In this paper wc will show how 
to incorporate simultaneity into the VDES model of the OPC 
buffer and how to synthesize controllers in the presence of 
simultaneous events. 


II. CONCURRHNCY 

We formalize concurrency by extending a VDES G to 
include simultaneous events. Define the extended event set by 

y 

con ■— * - 


An event v — 02 , ■ • ■ ,«*) € E f<m represents the simulta¬ 

neous occurrence of ar\. 02 , • ■ •. ru- We define a simultaneous 
event as an unordered set: all its component events are thought 
of as occurring at the same instant of time, with no causal 
ordering among them. We reserve the symbol “(•■■)” for 
simultaneous events simply to differentiate them from ordinary 
(uninterpreted) subsets of E; apart from this, “(■ ■ ■)” and 
have the sdme mathematical meaning as subsets. In 
particular, 0 e E ftifl is the null simultaneous event, denoting 
that no event in E occurs. Our definition of simultaneous event 
rules out multiple simultaneous occurrences of the same event, 
but our results could easily be extended to accommodate that 
situation. 

For a e E, (a) 6 E f() „ is a singleton event; with slight 
abuse of notation, wc write (a) € E t0 „ as a whenever no 
confusion results, as they really represent the same physical 
event. 

Define a companion vector for E 0 


if £„(/)< 0 
\() otherwise. 


(z = l,2, .n) 

(7) 


We can regard J a (i) as the minimal value of x, required for 
the occurrence of «. Then define the occurrence condition of 
the simultaneous event v as 


b rnn (c.xy.^x>^2j„. (8) 

nrfc( J 


Note that 4 rori is an extension of b in (1) in the sense that 
6 ron : («, X) 1 -+ 6(n, X). 

On this understanding, from now on we drop the subscript 
“con” from 6 <wl , and write simply b. The extended model is 
called concurrent VDES (CVDES) and is denoted by 

G t(m '■= (J^ron, X, ^ A()). (10) 

As an illustration, consider the one-dimensional VDES 
model for the OPC buffer in Fig. 2. The extended event 
set E ton = 2 s with E := {nq, r> 2 , n?i}. For instance, 

p := (<*i, r> 2 , rvj) G E (OI , denotes the event that two cells 
from RN l and RN 2 arrive at the OPC at the same time as 
a cell is transmitted from the OPC. The simultaneous event < 
is defined at every state except A r = 0 since 

5> = 1 - 

ryff 

The state transition graph of G fon is shown in Fig. 4, where 
ci denotes e 2 denotes the events 

«. (/ = 1, 2, 3) 

cj denotes 

(a,, tij, a 4 ) (/, j = 1, 2, 3, 1 ± j) 

c 4 denotes the events 

{(h,Oj) (?, J = 1,2,3; I / j) 


rr t denotes 


(r*i, a 2 , ct-i, r* 4 ) 


This condition ensures that the resources represented by state 
variables are sufficient to support the simultaneous occurrence 
of all component events in e. We define the displacement 
vector for e as 

E, 

r\£t 


and denotes (r*i, « 2 , n^). 

The following simple properties of CVDES will be used 
later. 

Claim 1: Let c 6 E ( . on , and ei C r, c 2 := e - rj. If 
f)(e, X)! then X\ := Jf)! and A'i)!, and then 
6(e,X) = *(P 2 , A,). 
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Proof: Let ej C e and A(e, A')!. Then 
X>5>.> £ J n 

«-*£r fiftf 1 ! 

with J a being the companion vector E n as deiined in (7). So, 
6(t'i, A)!, according to (8). From (7) 

(Vo )E n > -./ 0 . 

Thus 

Xi := A(n, A) = A + > A+ £ (-./„) > £ 

cvfefi (*€<'2 

since 

Q fc* nt£f ] 

So, A(c 2 < -Vi)!, as required. By the definition, fi(r, A r ) — 
^2iA’i). □ 

We call //> E U* an instance event string of the simultaneous 
event r e E ro »i if each element of r occurs exactly once in 
i/ 1 . For example, w — nr jorj rirj is an instance evenl string of 
r — (rkj. (%j). 

Claim 2: Let w be an instance event string of < 6 E fim . 
Then for all A” € A\ A)! only if k(w. A’)!, and then 
b(uK X) = b(<\ A'). 

Proof By the definition of A. □ 

Claim 3: Let S C S and P be a predicate which is 
controllable wrt G. Assume A' E P. Then 

(Vr 6 2^)(A(r. A) E /> or not A(r, A)!) 

(Vr E 2 tSr,1 " )(A(r, A) E P or not A(c. A)!) 

Proof: (=>) Automatic. 

(4^) Let r € 2 s and ^(r. A)!. To show that b(<\ X) E 7 > , 
let (i ( D >],, < i :=c - i [. By Claim 1 

X\ := , A)!, A(f j 2,Ai)! 

and 

A(r,X) = A(r 2 -Y 1 ). 

By hypothesis, A'i € /\ Let w E XI* be an instance event 
string of c 2 . Since P is controllable, by Claim 2 

A(r 2 , -Y 1 ) = A(ii»,.Y 1 )6P 

namely h{<\ A r ) E as claimed. □ 

For CVDES, a controller controls singleton events in £, 
directly; a simultaneous evenl can be controlled if and only 
if at least one of its component events is represented in . 
Thus the uncontrollable events in G f0n are S ro? , (U := 2~“ 
(so, 0 E and the controllable events are T, coux := 

^ro»i ^ron.H- 

Moreover, a simultaneous event is disabled by a controller 
if and only if one or more of its component events is disabled; 
or equivalently, a simultaneous event is enabled if and only 
if all of its component events are enabled. We formalize this 
requirement by defining r rotl , the set of admissible control 
patterns of G co „. A control pattern of G ron will be an element 


on 

V € 2 E,m \ Thus each if is a set of simultaneous events, 
interpreted as enabled in Formally 

r< OM :={r,|>,^2\i: u C 7 Ci:} (11) 

or equivalently 

I Vow := {V I V = 2 1 , 7 E T>. 

Therefore, each control pattern // E F to „ contains among its 
singleton events all uncontrollable events in E u ; and r/ is 
generated by its singleton events. Denote the set of singleton 
events in a control pattern /; E r c , ur , by T). Then i) = 2^; we 
call this property of control patterns control dependency. Note 
that the null simultaneous event 0 E £ fori is included in each 
control pattern //. This can be interpreted to mean that “no 
event occurs” is always allowed. 

For the OPC buffer example, 1l C on % u = {0< 1 * 4 } and a 
control pattern in V con takes one of the forms 

" 4 } 

{0, <* 4 , a,, ( tA ) i - i, 2 , 3 

{ 0 , rv 4 , a,, a,, (,.i, c,,i, c, w , <\ J>4 } i, j ~ L 2. 3 : / / j 
{0, 0 4 , Oi, O 2 , ( [ 4 . f' 2 , 1 , (%i, rj.2^ < l,Ji < 2 Ci ( 2,a^ 

r2,.i,47 *‘1.2,3, 4 } 

where M, 2 ,J 4 denote (o ( , nr,). (o,. <>,, nr*), (o t , 

(^ 3 , o 4 ), respectively. 

Note that a simultaneous event is included in a control 
pattern above if and only if its component events are included: 
In the case of the simultaneous event, say v\ (2 , this means that 
two cells from RN I and RN 2 are allowed to arrive at the 
OPC at the same time instant if (and only if) these two RN\s 
are both enabled for transmission by the OPC at that time. 

Deline a controller for G rou as /': X —♦ r ft> „. Let 
Rif*/Gcon) be the closed-loop predicate (i.e., the set of 
reachable slates in the closed-loop system) synthcsizxd by /' 
in G lHm r 

A question of interest is the following. 

Q 1 . For a given predicate P on A\ how to synthesize a 
controller /' such that /?(/' /G Cim ) = / 7 ? 

When the serial model G is extended to G f(m to include 
concurrency, the control action of a controller /: X —► T 
for G is extended accordingly and is captured by a controller 

Iron for Cl con 

(VA G *)/,„„( A) := 2'< a '>. 

At a state A, / ron enables all singleton events enabled by 
/ at A, as well as all simultaneous events generated by 
these singleton events. We call this controller the concurrent 
extension of /. Conversely, for any /': X -> r r0fl , if we 
define a controller /: X T by 

(VA r E X)f(X) :-/( X) 

2 The formal definition of R{f /G<„ r <) can be given in the same way as 
that of R[f/G) in Section 1. 
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where /( X) denotes the singleton-event set of f f {X ), then 
fcon = /' since 

(vx e X)f\ X) = 2 /(X) . 

This controller / is called the serial reduction of /'. We say 
that /' is balanced if its serial reduction is. 

It should be pointed out that, in implementation, the physical 
control action of a controller f ton is identical to that of 
its serial reduction /; both directly enable the same set of 
singleton events. But, owing to control dependence, f ron 
also implicitly enables all simultaneous events generated by 
enabled singleton events. As far as the closed-loop predicate 
is concerned. / ror , is more permissive for G ron than / is for 
G, simply because the former allows more events to occur 
at any state than the latter. We say that concurrency has no 
effect on the control action of / if R(f/G) = R{f fv ,JG C(tn )\ 
in this case, the synthesis of f ron for G con is reduced to that 
of / for G. 

Another question of interest is then the following. 

Q 2 . Under what condition does concurrency have no effect 
on the control action of /? 

We address Q\ and Q 2 in the next section. 

III. Concurrent Weli -Posedness 

We have seen in Section I that the balanced controller (6) 
synthesizes the controllable specification P ~ (A' < 5) in 
the serial model G for the OPC buffer. We now show that 
there is no balanced controller which synthesizes P in the 
concurrent model G CLm (Fig. 4). Suppose otherwise, and let /' 
be such a controller. Then X t , X 5 £ R{f'/G (OU ). Since /' is 
balanced, we have that 6 f'(Xk) 0 = 1, 2. 3). By control 
dependence, 3 G /'(A'i). But X) = 4 + 3 = 7 £ 

P , a contradiction. 

This shows that controllability alone is no longer a sufficient 
condition for a predicate to be synthesized as a closed- 
loop predicate for a CVDES plant; and that R(f/G) = 
If (fron/Gcon) is not always true. 

A predicate P is concurrently well-posed (CWP) wrt G coll 
if for any X E P and any a. fi £ E with X) £ P , 
6(ft , X) £ P and />({(*, /^), A')!, we have A((ra, /i). X) £ P. 

It can be easily checked that the specification P for the 
ATM example is not CWP. 

The following theorem answers questions Q\ and Q 2 posed 
in the previous section, and asserts that CWP together with 
controllability is a necessary and sufficient condition to guar¬ 
antee that there exists a (balanced) controller synthesizing P 
in G t ., m ; and that when the closed-loop predicate synthesized 
by a controller for G is CWP, concurrency has no effect on 
its control action. 

Theorem 4: 1) There exists a balanced controller /': X —> 
r Mrt such that 

HU 9 /Gran) = P 

if and only if P is nonempty, controllable, and CWP. 

2) Let J \ X — I 1 be a balanced controller for G and f (on 
be its concurrent extension. Then 

R(f/G) = Ii(Un/G ron ) 

if and only if R{f/G) is CWP. 


(V«G£<)/a(X) 


-{i 


Proof: 1) (IF) Let the predicate P be nonempty, control¬ 
lable, and CWP, Define a controller f by 

\{6(a,X)l and 6(a, X) £ P 
otherwise. 

02 ) 

Then as shown in the proof of [13J Theorem 6, R(f /G) = 
P. It is easy to check that / thus defined is balanced. It 
remains to show that R(f/G) = Itdcon/G CO n)- We only 
need to show thal /?(/,„„ /G con ) ^ 1 V(//G), as the reverse 
inequality is automatic. First note that X n E J?(//G), Xu £ 
If ( fron/ G f on)- For induction on the length of event strings in 
E* nn , oppose that A' £ R{f/G ), X £ R(J«m/G"m)- Let 
V E E f07M A(f, X)! and v £ / lWI (X) (i.e., (V« E e)at £ 
/(X)). We wish to show that 


f){e, X) £ /?(//G). 


(13) 


The proof is again by induction, this time on the cardinality 
of r\ 

When |e| = 1, v = n for some at £ E. In this case, (13) is 
true since a £ f ror ,(X) implies at £ f(X). 

When |r| = 2, e = (rv L . r>>) for some at i, rv 2 E E. Since 
(rvi, at 2 ) E f,on(X ), at\, at 2 E /(A) and 

A(«i.-Y), h(at‘2* X) £ R(f /G) = P> 


But P is CWP by assumption, so 

h((« l .<> 2 ),X)£P = R(f/G). 

Now, suppose thal (13) is true for |r| ^ H 1. We show thal it 
is also true for |r| = k ■■+ 2. Let r — f' U (/^i. (i 2 ) with |('] = k. 
Since ^(c, A )! we know from Claim 1 that r', ( - (fi\) and 
r- - (fi 2 ) arc all defined at A'. It is easy to see that, when c is 
enabled by f run at X, so are r' U {(i\) and ( 1 U ((1 2 ). We 
then have by the induction hypothesis 

/>(,', X)E R(f/G) 
ble'U (ft),X)€7?(//G) 

<H''U (A),X)E/?(//G) 

that is 

A«/#j>. X))ell(f/G) 
x))&m.f/G). 

Again, since /’(= tt(f/G)) is CWP it follows that 

A(r' U (ft, !h), X) = H(fh, fh), W, X)) € Ii(f/G) 

if we notice that b{(fi i, /i^). A(fA'))! (Claim 1). 

(ONLY IF) Let 

micn/Gcon) = P 

for a balanced controller By an argument similar to the 
proof of 113] Theorem 6, we can check that P is controllable. It 
remains to show that P is CWP. Let X e P and A((rv, (i), X )! 
with 


Then 


«(«, A') € P, b{p, X) € P. 

X, A(u, A'), Hft, X) € Ii(fr on /G ro n). 
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Fig. 5. A VDES example. 



/ A*=[0,2,Uj 


Pip. 6. The slate transition graph of the VDES example. 

Bul f rutl is balanced, so 

a, (1 € J i mi (A) 

and therefore, by control dependence 

(«, /*> e/,„„(*). 

We then have 

t>({n.fi)<X)eIl(J'„ n /G (on )-*r 

and that P is CWP. 

2) Easily follows from 1). □ 

The above result indicates that when P is both controllable 
and CWP the construction of a balanced controller f (on 
synthesizing P in G rmi is reduced to that of a controller f 
synthesizing P in G; such a controller / can be constructed 
as in (12). 

If there exists a supremal element in the set of all con¬ 
trollable and CWP subpredicates of a given predicate spec¬ 
ifying legal states, an optimal solution can be found to the 
deterministic-controller synthesis problem tor G ron . Unfortu¬ 
nately, such a supremal element may not exist. To see this, 
consider the Petri-net graph G in Fig. 5, with both a and fi 
controllable. The state transition graph for G con is shown in 
Fig. 6. Let 

Pi := {Ao, X x} 

P 2 := {X a , X 3 } 

P := Pi V /V 

It can be easily checked in Fig. 6 that both Pi and P 2 are 
controllable and CWP. Bul P is not CWP (although it is 
controllable), since (^(a, A«) = A’a € P, Xo) = Xy € 
P, bul b({a, 0), Xq) = i P. 
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This shows that optimal deterministic control is not always 
possible/ revealing the irregular nature of concurrency in 
our model. However, as we show in the next section, this 
irregularity is present only for deterministic control; optimality 
is always achievable with nondeterministic control. 

IV. Nondeterministic Control with Concurrency 

In the previous section we showed that there is no balanced 
deterministic controller synthesizing the specification P in 
the concurrent model G con of the OPC buffer. Even though 
wc can find unbalanced deterministic controllers synthesizing 
P in G rtm , these controllers could be too restrictive to be 
of practical use. For instance, the OPC which permanently 
enables rvi and disables (\ 2 and (i.e., only RN 1 is 
allowed to transmit cells to the OPC) will synthesize P in 
G ( , m , but this controller is obviously undesirable. In fact, 
ovring to control dependence, any unbalanced deterministic 
controller synthesizing P will, at the stale A 4 , enable one 
predetermined RN and disable the other two; and at A 3 , enable 
one or two predetermined RN’s and disable the rest. This 
type of predetermined preference shown by the deterministic 
controller is undesirable, if not unacceptable, for the OPC 
design. (Suppose RN 1 is enabled and RN 2, RN 3 are disabled 
at Xy. Then it is possible that the OPC oscillates between the 
slates X 1 and A'r„ denying transmission by RN 2 and RN 3 
indefinitely.) However, this restriction can be removed by use 
of nondeterministic control. For instance, instead of enabling 
a predetermined controllable event wc can randomly 4 enable 
one of the three controllable events whenever the state X.y is 
reached. Similarly, we can randomly enable two controllable 
events at the state A 3 . The actual random selection mechanism 
might depend on, say, relative sizes of queues for the waiting 
cells in the three routing networks; but we omit details of 
implementation here. 

The above discussion indicates that nondeterministic control 
can be more powerful than its deterministic counterpart in 
the presence of concurrency. In the following, we formalize 
nondeterministic control and exploit its power. 

A nondeterministic controller (NDC) for G is a function 
f n : .V — > 2 1 \ A nondeterministic controller (NIX’) for G C0n 
is a function /' : X —>■ 2 r,un . Intuitively, the control pattern 
issued by an NDC /„ (or, f n ) at A is a randomly chosen 
element among the elements of f n (X) C T (or, X) C 

IV or,)- 

Subject to concurrency in the plant (modeled by G ctm ), a 
(physical) NDC modeled by f n will behave like a (physical) 
NDC modeled by /„ if . on defined as 

(V A € *)/„,„„( X) := {2 1 | 7 € / ri (A)} 

because of control dependence. We call / n , cori the concurrent 
extension of Conversely, for any NDC / 7 ', for G con , if we 

'One special case where the optimal controller exists for a specification P 
in G t <,» is when m ij>C{F) turns out to be CWP. In this case, .*upC( P) is also 
the supremal element of the set of controllable and CWP subpredicates of P . 
Then wc can construct an optimal controller /*,„ such that /?(/,*,„ /(?, «„) = 
f*upC{P). 

4 Here “randomly*’ jusl means “according to some hidden or unmodelcd 
mechanism of selection.” 
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define an NDC /„ for G as 1 

(VAG *)/„(*)“ W I •?€£(*)} 

where rj denotes the set of singleton events of then /', = 
We call /,, the serial reduction of /'. It should 
be emphasized that, in implementation, the physical control 
actions of /„ and / n , ro „ are the same; they directly enable 
at any state a pattern 7 of singleton events selected from the 
same subset of F. But, owing to control dependence, /„, ron 
implicitly also enables at that state all simultaneous events 
generated by 7. 

For a VDES plant G under the control of the NDC we 
define the closed-loop predicate R(f n /G) as follows: 

1) A 0 € 7?(/„/G); 

2) if X G RUn/G), a 6 E, A(o, A')! and ( 3 7 6 
f n (X))a g 7 . then 6(«, A) e fl(/„/C); 

3 ) every state in R{f„/G) is obtained as in I) and 2). 
Similarly, we can define the closed-loop predicate 

R(f'n/G cou ). 

To extend the definition of balanced deterministic con¬ 
trollers. we call an NDC /„ for G balanced if the following 
two technical conditions hold. 

1) For any X, X' G R(f„/G) and a € E with A)! 
and X' — b{a. A), there exists a control pattern 7 G /„(A) 
such that « G 7. 

2 ) There is no redundant control pattern; that is, for any 
state A and 71, 72 € /,,(A), if 7! C 72 then 71 = 72. 

We call an NDC balanced if /„ is, and confine 

attention to balanced NDC. 

Theorem 5 ; Let P be a predicate on X with Ao G P. There 
exists a balanced NDC /„ : X —> 2 r such that 

R(fn,rnii/G (u „) = P 

if and only if P is controllable. 

Proof' (IF) Assume that P is controllable with respect to 
G. Let /: X —► F be defined as in ( 12 ). Then R(f / G ) = P. 
We define an NDC f° : A - 2 r as 

fn(X) = {E u U •{(*} | a G /(A) n L r }. ( 14 ) 

Each control pattern 7 in /"(A’ ) has the form U {nr}, rv 
being a controllable event which is enabled by / at A. We 
need to show that 

R(fn,ron/G<on) “ P ■ 

First, we show that 

P A R(K so JG con ). 

By the assumption, A« € P, and by the definition, Ao G 
R(fu' CO „/Gco,i). For a proof by induction, suppose that A G 
P with 

A = f>(a, X') 

for some X' G R(f/G) = P and A' G R(f^ o JG r0u ). By 
the definition, o G /(A'), a G 7 for some 7 G /“(A'), and 
(or) G r1 for some r/ G /" ro „(A'). By the induction hypothesis 

A G R(fl ro JG con ). 


This proves P A R(f" ron /G C on)- Next we prove the reverse 
inequality. Again, by the assumption, Ao G P, and by the 
definition, A 0 G R{f° con IG CO n)- For a proof by induction, 
suppose that 

A G R(f°, co JGeon) 

with 

x = 4(r f Jf # ) f reR(/; ron /G cw ). ref 

and 

G T) for some r/ G A'). 

We show that A" E P. By the definition of /" rfjn 
v C E„ U {tv} 

for some a satisfying 

a € E, 0 f(X f ). 

Case / • a £ e. In this case, r C E l( . Since P is controllable 
with respect to G 

b(w, X') £ P 

with iv any instance event string of r. So from Claim 2 
X = <S( f , A") = (S(ir, X‘) £ P 
Case 2 ■ a C r. Since a 6 E r IT /(A'), we have 
A(«, X') £ R(f/G) = r. 

Let w be an instance event string off - {r*}. Since f — {rv} C 
E u , and P is controllable with respect to G, there follows 

*(«;, b(a<X’))eI1(f/G) = P. 

We then have from Claim 2 that 

A(f, A") = A(t«, A(u, X')) £ P. 

This completes the proof that 

R(I° n , to JG fon ) 1 P. 

It is straightforward to check that is balanced. 

(ONLY IF) Let / TlfroT , be a balanced NDC. The definition of 
admissible control patterns implies that uncontrollable events 
are always enabled by / r ,, cc>M . It then follows easily that 
R(fn,nm/Gt»n) is controllable. □. 

The above result has two implications. One is that (bal¬ 
anced) NDC’s synthesize the same set of closed-loop predi¬ 
cates in G con as deterministic controllers in the serial model 
G\ nondeterminism of controllers "offsets” the effect of con¬ 
currency on control. In other words, in contrast to deterministic 
control, the CWP condition is not required for a predicate to 
be synthesized by a nondeterministic controller; the NDC /,'* 
constructed in (14) always synthesizes a controllable predi¬ 
cate P. Since for any predicate P there exists a supremal 
controllable subpredicate of P [13} the second implication is 
that we can construct a balanced NDC f n whose concurrent 
extension f n , cn n synthesizes the largest possible closed-loop 
subpredicate of P in G con . In fact, there may be many 
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such controllers f n . For the ATM example, the concurrent 
extensions of both /* and / 2 defined below synthesize the 
specification P in G con 

fh(X) ■= f{«i, «2. «3, « 4 }) (A' = 0, 1, 2) 
fn(X) := <*4}. {<>21 «4}, <*' 4 } } (A‘ = 4) 

A'):={{« 4 }} (A >5) 
/n(A'):=/‘(A') (X #3) 

/«(A) := {{ai, a?, « 4 }, {(»!, f»j, rt 4 }, {r» a , «,,, « 4 }} 

(X = 3). 

The NDC /£ differs from f r \ at the state A' = 3 only, where it 
enables two controllable events instead of only one. Therefore, 
the concurrent extension of //; allows more concurrency than 
f 7 \ in the concurrent model G tnn . 

In the next section we investigate how to pick, among all 
NDC’s synthesizing a given predicate, one which optimizes 
another performance measure—namely, concurrency. 

V. Nondeterminlstic Controllers 
Allowing Maximum Concurrency 

For the controlled CVDES G fOUS concurrency is measured 
by the simultaneity allowed by a controller. A deterministic 
controller J lim allows gieater concurrency ot the plant than 
another deterministic contiollcr r / fon if 

(V <Y G ' Y ) fton ( X ) 2 ( fton ( X ) 

i.e., at any state there are more events enabled by / ro „ than 
by tit on • 

We now introduce a measure of concurrency permitted to 
the controlled plant by NDC’s. For a set S and two elements 
A, B € 2 ^ 2 \ we write .1 < B (wrt S) if 

(Vn G A)(5 b G B)a C b. 

For a predicate P define a relation “<’’ on the set of all NDCs 
of G by 

In <g n (r<IP) if (VA r 6 P)f„(X)<g„(X) (wr/£). 

Similarly, define tor P a relation “<” on the set of NDC’s 
of G (on by 

fn ton S .f/ri.icjn (rt’IP) if (VA 6 F)/„ j(0rl (A) 

< 9nson(X) (wrtH ron ). 

The meaning of “<” is easy to see: f„ t r,m < 0 n,ron (rrlP) 
implies that at any state of /\ for any control pattern which 
is chosen by there is a control pattern chosen by g n , ttm 
which specifies more enabled simultaneous events. Therefore, 
the relation “<” on the set of NDC’s for G r(W imposes an 
ordering on the degree of permissiveness and concurrency they 
allow of the plant. It is easy to check that 

fn < fh (rel P) & fn .(’on — f]n,con (rd P). 

The following simple result says that the ordering “<” on 
the set of NDC’s induces a consistent ordering among the 
closed-loop predicates. 


Claim 6 . If /ri ,i'on S t/ri.roti {t"d W(/n,oon/l?con)) then 

F( fn on !Cron ) n ,con /Gran)- 

a 

For a controllable predicate P, let /" be defined as in (14). 
Define 

F*(P) :={/ n |/:</nMP)}. 

F+(P) comprises all controllers which are more permissive 
in their control action than /". It follows from Claim 6 and 
the fact R{fn, LO „/G, on ) = P that 

(V/n € F+(P))P 1 li(fn,on/G roil ). (15) 

For a controllable predicate 1\ we also define 

F (P) := {fn | (V X G P)(V 7 e /„(A'))(Vr € 2^) 

■ h(e, X) G P or not t(r, A r )!}. 

F~(P) comprises all controllers whose concurrent extensions 
enable only legal simultaneous events. It follows that 

(V/„ eF-(P))li{f nttm /G im )lP. (16) 

The next result says that the concurrent exiensions of 
balanced controllers in F~(P) n F+(P) are exactly the 
controllers which synthesize P in G ritn \ F (P) determines a 
lower range, and F 4 (P) an upper range, of such controllers. 

Theorem 7 Assume that P is controllable and ATo G P. 
Then for any balanced controller 

RUn,u>n/G (01) ) = P & f n G F~~{P) D F 4 (P). 

Pi oof; (<=) Directly follows from (15) and (16). 

(=>) Assume for a balanced controller f nxon : X —♦ 2 rt,,n 

P(fnton/G lon ) = P. 

To show that f n G F + (P)> suppose for a contradiction that 
not (fn < fn) (r<IP). 

Then 

(3-YGP) not < f u (X)) 

that is 

(3 A' G P)(37 € /;;(X))(V 7 1 E /„(*)) not ( 7 C 7 i). 
Since all candidate control patterns in /"(A") have the form 

there follows 

(3X6 7»)(3r»e£,)(J: B U{«}€^(A') 

& (V 7 E f„(X))n £ 7 ). 

So 

(3 X E P)(3« G E c )A(a, A)!& 6 («, X) G P 

^(V 7 e / tl (X))« jf 7 . 

But this contradicts the assumption that (or /„) is 

balanced. 
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The conclusion that /„ € F~(P) follows directly from the 
assumption that /?(/„, ,on/G eon ) = P and the definition of 

□ 

For a controllable predicate P, denote the balanced con¬ 
trollers in F + (P) n F~(P) by F(P). 

Lemma 8: F(P) is a poset with respect to the relation “<”. 

Proof: It is easy to see that the relation “<” is both 
reflexive and transitive. 

To show that it is also antisymmetric, let /„, g„ € F(P) 
with 

fn < Ui< , !ln < In {rclP). 


Proof: We first show that f* € F + (P) n F (P), That 
/* e F + (P) follows from the fact that for any A e P 

(V 7 eOT(3 7 '6/;(X))7CV 


f°n(X) < K(X). 

That f* 6 F~ (P) follows directly from the definition and 
Claim 3. It is also easy to see from the definition that fn 
balanced. Thus /* E F(P). 

Let f n £ F(P). Then f n £ F~(P), and 


We Show that f Tl = g„ ( relP ). For any X £ P, Jet 7 £ /„(A). 
Then 7 ^ 7 ! for some 71 £ g n {X) since f n < g n . Also, 
since g v < we have 71 C 72 for some 72 E fn{X). So 
7 C 71 C 72. Since /„ is balanced, there is no redundant 
control pattern in / T 1 (A). Thus 7 = 72, implying that 7 = 7^ 
This proves that 

(V 7 €/„(A))(3 7l e Sn (.V))7 = 7i 


(VA 6 P)(V 7 € f„(X))(Vv 6 2^(c, X) € P 

or not b(v, A)!. 

From Claim 3 

(VA fc P)(V 7 € /„(A))(Vr € 2‘ ,nS ' )A(e, A’) 6 P 

or not b(i\ A")!. 


or, equivalently 


Therefore 


/„( X) C <]n(X). 

Repeating the above argument with / and </ interchanged, we 
get 

fn(X)Dgn[X). 

So 

(VA E P)f n {X) = //„(A) 

i *. 9 f n =g n (rclP). □ 

By the definitions of F(P) and /", we know that f\\ is 
the infimal element of the poset F(P). Wc now construct the 
supremal element of F(P). 

For a controllable predicate P on X and X £ P define 

M(X) ;= {r | r C L\ and (Vr' E 2‘ )b{(\ X) 

£ P or not b((', .V)!} 

and 


(VA E P)(V 7 E f„{X ))(3 7 ' E N( A)) 7 C 7 

and f n < f* (rrl I’). It follows that f* is the supremal element 
in F{P). □ 

Therefore, for a nonempty controllable predicate P, f* ton 
will synthesize P and, among all controllers doing so, it will 
allow the plant G (on to have maximally concurrent closed-loop 
behavior. 

For the ATM example, >.], = {<vj. (*), r*j}, and 

f 10, {<*]}, {fv 2 }, 

{«!, {f»2. Oj}. UH. '*'2- o»}} X < 2 

{0. {in}, {ff-i}, I'l'i}. l'*i. }, 

~ {(»). a,}, (r» 2 , f*j}| A = 3 

10, tora}. 1",}} A = 4 

{0} A = r, 


N( A):= {S u Uf|f € Af(A) 

and (V c 1 e M(X))i C ^ c =r 


Then define a controller /* as 





X € p 
otherwise. 


(17) 


Notice that /*(A) does not contain any redundant control 
patterns. 

Assume that for given state X, predicate P on X and 
e, e! C U r , checking A(c, A) € P, b(r, A)!, r C e', and 
e = c' are basic computational steps. Then the complexities 
of computing both M(X) and N(X) are (7(2'“'' x 2'“ r ), 
(Note that |M(A)| < 2l E< L) The computational complexity 
of /* is thus 0(|P| x 2 2 ^'!). 

Theorem 9: The NDC /* defined in (17) is the supremal 
element in F[P). 


fn(X) := (V(A') = 


f {{«]. 02 , fy.», <*4}} 

H f >l, n-2, 04 }. {oj, f»3, rv 4 

J 1 « 2 , 03 , 04 }} 

’ ll'l’j. 04}, 1<»2, 04}, 

{f*3. o 4 }} 

lu f M}} 


A < 2 

A = ;i 

A = 4 
A - 5 


/n(A) := {{o 4 }}, A > 5 

The control actions of J* con are: for A < 2 , enables all three 
RN’s for transmission: at A = 3, randomly enables two RN’s; 
at A. = 4, randomly enables one RN; and for A > 5, disables 
all RN’s. This is intuitively the best that any controller can 
possibly achieve in the presence of concurrency. 
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VI. Conclusions 

The VDES is a compact model for a useful class of discrete- 
event control systems and allows computationally-efficient 
control synthesis [13], [14]. Continuing the study of VDES, 
this paper explored another feature of VDES: concurrency, 
or possible simultaneous occurrence of events. We formalized 
concurrency by extending the event set from E to t con := 2 s ; 
an extended event e '= (r*i, q 2 , ■•■,«*) in S ron represents 
the simultaneous occurrence of O], (* 2 , The VDES 

model incorporating concurrency is called concurrent VDES 
(CVDES). 

It was shown that, owing to the effect of concurrency on 
control, a nonempty predicate on the state space can be syn¬ 
thesized by a (balanced) deterministic controller in a CVDES 
if and only if it is controllable and satisfies an additional 
condition called concurrent well-posedness. Since concurrently 
well-posed predicates are not closed under disjunction, it is 
not always possible to synthesize an optimal deterministic 
controller for a CVDES. 

This led us to nondeterministic control of CVDES. It 
was shown that controllability is a necessary and sufficient 
condition for a predicate to be synthesized by a balanced 
nondeterministic controller in a CVDES. Therefore, optimal 
nondeterministic-controller synthesis (in terms of closed-loop 
predicate) is always achievable. Next we characterized the 
set of nondeterministic controllers synthesizing a given con¬ 
trollable predicate I\ denoted by F(P). A partial order was 
defined on this set to compare degrees of concurrency of the 
closed-loop system allowed by nondeterministic controllers. 
Finally, we proposed a formula for constructing the supremal 
element of F(P). If P is a given legal specification, then 
the supremal element in F(supC(P)) is a nondeterministic 
controller enforcing P which is optimal not only in the sense of 
reachable states but also in the sense of concurrency permitted 
to the controlled CVDES. 

Wc remark that the results in this paper can be easily adapted 
to any automaton DES for which Claims 1, 2, and 3 can be 
established. 

In future research it may be worthwhile to exploit the 
specific structure of CVDES so that the supremal element of 
F(supC(P)) can be constructed in a computationally-efficient 
way, along the lines of 114]. 

Throughout this paper, wc used a simple ATM example 
to demonstrate concepts and results. Though a small part of a 
complex system, this example does show that concurrency can 
arise naturally from and pose unique problems for the conrol 
of real-world systems such as telecommunication networks. 
For future research, it will be interesting to model larger 
systems using CVDES and address control specifications of 
wider scope. 
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Structure of Model Uncertainty 
for a Weakly Corrupted Plant 

Tong Zhou and Hidenori Kimura, Fellow, IEEE 


Abstract—In this paper, we investigate the structure of the 
transfer function set which includes all eliminate the transfer 
functions deduced from the plant available information. It is 
shown that when an upper bound of the plant transfer function's 
H^-norm has been supplied, and the noise contaminating the 
time domain identification experiment data is not too significant, 
such a transfer function set can be parameterized by a linear 
fractional transformation of two transfer function matrices. One 
of them is a fixed transfer function matrix which is completely 
determined by the plant available information and the noise 
magnitude. The other is a norm bounded, structure fixed, free 
transfer function matrix. Moreover, it is shown that the problem 
of analytically obtaining the fixed complexity nominal model that 
best approximates this transfer function set is as difficult as the 
//-synthesis problem. 


I. Introduction 

R OBUSTNESS is one of the principal properties expected 
for control system, due to the high complexity ot plant, 
plant parameter variation according lo operation conditions, 
etc. In order to carry out such a study quantitatively, an initial 
step is to develop a mechanism to measure the size ol the 
uncertainty of the plant nominal model error. Afterwards, it is 
necessary to investigate the problem of designing controllers 
for transfer function set defined by this mechanism and the 
problem of identifying the transfer function set measured by 
this mechanism from plant physically obtainable information. 

In the last decade, most of the research effort has been 
focused on the problem of developing controller design the¬ 
ory and uncertainty measure mechanism. For example, gap 
metric, graph metric, 7Y^-norm, £ J -norm, etc., have been 
developed to measure the uncertainty of nominal model error, 
and parametric uncertainty, additive unstructured uncertainty, 
multiplicative unstructured uncertainty, coprime factor un¬ 
structured uncertainty, etc., have been exploited to describe 
the model of a control plant with a nominal model, while 
H* optimization theory, structured singular value theory (or 
/i), quadratic stability theory, etc., have been developed for 
robust controller design |4], [5], [8], 116], [18]. However, it 
is until recent years that the problem of identifying transfer 
function set measured by a mechanism suitable for robust 
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controller design from plant available information has begun 
to receive a considerable amount of attention, c.g„ [3], [9], 
[12J, [14], [151, [19], [21), 125), |26), [30], |32], 134], and 
137]—1391- 

Several approaches have been proposed to attack this prob¬ 
lem. Briefly, these approaches can be divided into two kinds. 
One of them is a stochastic approach in which the uncertainty 
bound of the nominal model error is sometimes called “soft 
bound” [12], [24]. The other is a deterministic approach, and in 
this case, the nominal model error uncertainty bound is called 
“hard bound” [14], |15|, 1321, [21], [38], [39]. 

In a deterministic approach, there are two kinds of phi¬ 
losophy in identifying the transfer function set of a plant 
from its available information. One of them is to obtain 
the smallest transfer function set which can include all of 
the transfer functions that are deduced from the available 
information about the plant. This philosophy has a strong 
information theory support, because it means that with the 
increment of the information about the plant, the uncertainty 
about the plant dynamics will decrease 131], [35|. Another 
one is to find the smallest transfer function set that can not be 
falsified by the plant available information. This philosophy 
is based on a scientific principle which says that a model 
is correct if it can explain the observed plant input-output 
characteristics, and among the models that can not be fal¬ 
sified by the knowledge about the plant, the best choice is 
the “simples!” or “most powerful” model [25|, [33J. The 
former is called worst case deterministic approach and the 
concepts are first proposed by Helmicki et al in the field 
of identification for robust control |15|. The latter has been 
attacked by Zhou and Kimura. In that case, the problem 
of identifying the smallest unfalsificd transfer function set 
has been reduced to a convex optimization problem, which 
is computationally tractable, under the condition that the 
denominator of the plant nominal model has been prescribed 
[37H39]. 

While it is generally impossible to obtain the “actuaP' 
model of a plant even in the ideal case, i.e., when a plant 
is linear and time invariant, because perfect information about 
a plant is usually not obtainable from measurement, due to 
the finiteness of identification experiment time length and 
unavoidable noise existent in identification experiment data, 
both the transfer function sets, which are deduced by the 
philosophy of unfalsified modeling and the philosophy of 
worst case deterministic approach, respectively, will play 
important roles in robust controller design. This is based on 
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the following arguments. That is, the worst case deterministic 
approach will give a transfer function set that can guarantee 
the actual plant performances when a controller is designed 
such that the desirable performances are met for all of the 
transfer functions belonging to the transfer function set, pro¬ 
vided that the actual plant dynamics can be appropriately 
approximated by a transfer function. On the other hand, the 
unfalsified modelling will bring out a transfer function set 
that a controller should be designed to make every transfer 
function in that transfer function set satisfy the desirable 
performances in order to let the actual plant achieve these 
performances. 

In unfalsified modelling, the well known Carath6dory-Fej6r 
extrapolation theorem (or Schur-Takagi-AAK Theorem) plays 
an essential role [13J, [22], [2]. The importance of this theorem 
in identification and model validation has been independently 
found by Poolla et al. and Zhou et al. [281, [36J, [37]. 

It is still a challenging problem, however, to obtain the 
smallest transfer function set which can include all eliminate 
the transfer functions that are deduced from the available 
information of a plant and is compatible with robust controller 
design, although a number of robustly convergent algorithms 
have been proposed, e.g., [14], [15]. Basically, such a problem 
is to approximate a function set by a single function belonging 
to a prescribed set, e.g., this function should be in W rx \ 
and/or it should have a complexity as low as possible for 
the convenience of robust controller design, etc. Therefore, 
the structure of the transfer function set will play essential 
roles in finding out the identification algorithm. For instance, 
in information-based complexity theory, it is well known that 
if a function set is symmetric, then it will be best approximated 
by its center, no matter what metric is used in measuring 
the approximation error, provided that there is no restriction 
imposed on the function that is used to approximate the 
function set [31]. 

In this paper, we investigate the problem concerning the 
structure of the transfer function set which includes all of the 
possible transfer functions of a plant, based on some kinds 
of the plant obtainable information. It is shown that when an 
upper bound of the Ti^-norm of the plant transfer function 
has been given and the plant time-domain inpul-output has 
been measured over a finite time interval, if it is only the 
plant output that has been contaminated by a not too significant 
magnitude bounded noise, then this transfer function set can be 
described by a linear fractional transformation of two transfer 
function matrices, one of them is completely determined by 
the plant available information and the noise magnitude and is 
therefore fixed, another one is uncertain but norm bounded and 
structure fixed. Moreover, it is also shown that the problem of 
obtaining the fixed complexity plant nominal model that best 
approximates this transfer function set has an equal level of 
difficulty as that met in /i-synthesis problem. 

An outline of this paper is as follows. In Section II, the 
problem is formulated. The structure of the transfer function 
set is investigated in Section III, while the application of the 
main results to identification for robust control is discussed 
in Section IV, Finally, conclusions of this paper are given in 
Section V, 


Many of the results of Section III of this paper have 
appeared, without proof, in [40]. 

Notations 

1Z The set of real numbers. 

C The set of complex numbers. 

7The set of m x n real matrices. 

Qmxn xh e se t 0 f 1rl x n complex matrices. 

T> The closed unit disk defined as 

V={z\\z\< 1 }. 

Ik Identity matrix of k x k dimension. In cases in 

which confusion will not result, the subscript k 
is omitted. 

Scalar 0 or matrix with suitable dimension and 
all elements being zero. 

diag{ai Diagonal matrix or block diagonal matrix defined 
a 2 ■ • ■ } by scalars or matrices , a 2 , ■ • •, <* n 

X f Transpose of vector or matrix A". 

drl (A) The determinant of a square matrix A. 

7T(A) The maximal singular value of matrix A. 

- H ^ Transfer function set or transfer function matrix 

set formed by transfer function or transfer 
function matrix that is analytic on the closed unit 
disk P = {z | |a| < 1}. 

|| F( The H IX -norm of transfer function F(z) 6 7 
which is defined as 

ll^OOIk = max| : |<i|F(0|. 

BH ^ Transfer function set which consists of all 

transfer functions that are analytic when \z\ < 
and have TY^-norm not greater than I, i.e., 

BW ot = {fl(*)|||flW||x<l}. 

Linear fractional transformation (LFT) 

Pi , | p* |, A') = P n 4 P U K{1 - P n K) 1 P n • 

Homographic transformation (HM) 

nM { pL p'L ’ A ) = (PliK + Pyl)[l> ^ K + 

Structured singular value (n.y(M)): For given positive in¬ 
tegers ci . c 2 , ■ ■ •, C'i ; r A , r 2 , ■ • ■, r.s; mi, v\ 2 . ■ * ■, define 
a matrix structure 

*:={A| A = ■■■ Me, 7i■■■ 

7 s/r s Al A/r}, 

ft, € 7?, 7, e C, Ajt E C mk *™ K , 

1 < i < T, 1 <;j < S< 1 < k < F\. 

Then, the structured singular value of matrix M with 
dimension 

+ X/* + ) x ( + X/. 4 £m, 
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is defined as 

{ min |^(^). -le.v^etci-A/Ajao) ; 

0, if there is no A € X which makes 
/ - M A singular. 

II. Problem Formulation 

Mathematically, the problem of identifying a nominal model 
and its error uncertainty bound in the worst case by a de¬ 
terministic approach is a problem to best approximate a 
function set by a suitable single function under a metric 
compatible with robust controller design. While the transfer 
function that best approximates the prescribed transfer function 
set gives the plant nominal model, the approximation error 
will provide the uncertainty bound of the plant nominal 
model error. Both of them are essential in robust controller 
design. On the other hand, some restrictions are generally 
imposed on the function that is used to approximate the 
function set, which is determined by this function’s intended 
application. For example, in robust controller design, a lower 
complexity nominal model is commonly highly appreciated; 
moreover, it is usually required to let the plant nominal 
model have the same number of unstable poles as that of the 
plant “actual" transfer function; and so on. In these senses, 
such an identification problem differs significantly from the 
traditional identification problem, in which it is desirable to 
find some parameters that minimizes the “prediction error" of 
the nominal model which is defined by a priori chosen model 
structure and the parameters to be found [24], [7]. 

To approximate a function set, it is preferable to investigate 
the structure of the function set first, because the struclure 
of the function set plays an essential role in developing the 
approximation algorithm. For example, a symmetric function 
set will always he best approximated by its center, no matter 
what metric is utilized to measure the approximation error, 
and it is generally hard to find out the optimal approximation 
algorithm for a function set with arbitrary structure |3IJ. In 
worst-case deterministic identification, the function set is a 
transfer function set and every transfer function belonging to 
it is deduced from the plant physically obtainable information. 

Generally, two kinds of information can be obtained from 
a plant. One of them is in frequency domain and another in 
time domain. 

Plant frequency domain information can be obtained by 
applying sinusoidal input with frequency corresponding to the 
frequency point of interest. 

The problem that will be investigated here is formulated as 
follows. 

Assume that the plant is linear time invariant and stable. 
Moreover, assume that the W^-norm of the plant transfer 
function is not greater than 7 . Given a series of time domain 
identification experiment data of the planl 

(wo* yo), ( IJ h 01)1 * *' * ( w »t' u o ^ 0 

which is corrupted by an additive magnitude bounded noise 
series 

(■Wo, V ], l>2, ■ ■ , 0,,), v i € Kl — H ^ 1 ^ 11 


in the plant output y u 0 < i < n. The problem is to find 
out all oi the transfer functions that do not contradict the 
aforementioned information about the plant. 

From the problem formulation, it is obvious that the transfer 
function set that consists of all the plant possible transfer 
functions can be approximated, which means that a single 
transfer function can be found that approximates the transfer 
function set with a finite error. For instance, the zero transfer 
function will approximate this transfer function set with an 
error not greater than 7 , provided that the error is measured 
by 7^-norm. Moreover, when the plant perfect information 
is supplied, that is, when k —► 0 and 11 —► oc, the transfer 
function set will degenerate into one transfer function. In 
fact, this transfer function is the planl actual transfer function. 
Furthermore, compared with the maximal magnitude of the 
plant unit impulse response M and its relative stability margin 
P which are widely used as the a prion information about the 
plant in the literature dealing with worst-case deterministic 
identification problem, c.g., [14] and |15], an upper bound of 
the W^-norm of the plant transfer function is easier to obtain. 
Note that when a planl is linear time invariant and stable, 
according to the well known maximum-modulus theorem [291, 
an upper bound of the plant transfer function’s x -norm can 
be obtained by measuring the plant frequency response, which 
is generally possible. In these senses, the formulated problem 
is we 11 -posed. 


III. Structure oi tiie Transfer Function Set 

Under the condition that the plant is linear and time invariant 
during the identification experiment, the uncertainty about the 
plant transfer function results from two factors. First, the 
measure time length of identification experiment is limited, 
which means that v is generally a finite integer. Therefore, we 
can only obtain finite sampling points of the planl unit impulse 
response. The plant remainder unit impulse response will make 
us uncertain about its transfer function. Second, there always 
exists effect of noise during identification experiment, which 
implies that we can only obtain the intervals in which the first 
n sampling points of the actual plant unit impulse response 
exist, and we are also uncertain about the real value of the 
first n sampling points of the plant unit impulse response. 

As the first step, we shall investigate the uncertainty on 
the plant transfer function caused by the finite length of 
identification experiment time length. 

This problem is closely connected to a classical extrapo¬ 
lation problem, Schur extrapolation problem [2], [61, [17|. 
First, wc introduce the following results which are obtained 
by Adamjan et al. 111 and [22J. 

AAK Theorem: For given real number y { ), y \. • ■ ■ de¬ 
fine 


fin fin ~ 1 

Hi 

00 

Hu — l fin —2 

00 

0 


G = 


91 

<70 

0 

0o 

0 

0 
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If W(G ) < 7 , then 
g{z) = jz n+1 - 


-We«W 0 ) 


WV 7 ' q(zHz)+p(z)' 

exhausts all the transfer functions 


a(z) = flo + 91* + 92Z 2 + ■ ■ ■ + g„z n + z n+ '^2h,z', ( 2 ) 


Moreover, every transfer function which has the form ex¬ 
pressed in ( 1 ) satisfies the conditions represented in ( 2 ) and 
(3). Here 

Ocj no 

p( z ) = '%2 pj z1 ~ 1 - n( z ) - ^2 ( 1 j zJ ~^ 


V = [pi P 2 ■■ ]' = 7to. 9 = [91 92 ■■■]* = TGRr. 


Theorem I: Assume that Ip Vo < 7 2 I/ , / 7 . Then, every 
transfer function which is consistent with the given in¬ 
put-output data (u,, y,°) |” =0 , uo ^ 0 , and has 7Y 00 -norm 
not greater than 7 can be represented by 

e? +1 ^- i +(l” +i ^ +2 - j M*) 

7 Ei +l ^- 1 + (Er + 1 «?*" + 2 - , )'W’ 

c(z) € BH°°. 

Moreover, every transfer function belonging to transfer func¬ 
tion set 

G n ( 7 ) = { 9 ( 2 ) I g( z ) 

E t +1 + ( E’i 1 + 1 p> , i42 - j )>(^) 

7 Er i ^'- i +(sr , 9^" +2 -'M=) ! 

■ f(r)eBr /,v } 

has Tf^-norm not greater than 7 and matches the given 
inpul-output data. Here 

p 1) = \pU U 2 M = iUtfi rt v-YAror l <' 


2 7 ri2\~l 


7 ? = ( 7 2 i - c; 


= (1 0 0 


<? = [ 4 ? $ 




!hi 

Sr. — 1 

■ • .91 

9 o 

0 

»() 

0 

, 9 i>-l 

9»-2 

, 9 n 

0 

0 

"1 

a 0 

; 


; 

; 

; 

u = 

ih 

00 

0 

0 

0 ■■■ 

Litri 

"n- 1 

tin 

0 

0 

0 

0 



0 

0 

0 

0 

0 ■ ■■ 



0 0 

0 

■ 


: 

V(,= 


1 0 

0 ■■■ 




U/n 

?/»!— 1 

1 ) 1 

0 ■ 




0 0 

1 ... 





n 


In retrospect, the aforementioned extrapolation problem is 
an extension of the Carath 6 dory-Fej 6 r extrapolation problem 
and has been well investigated by Schur et al Moreover, 
it is well known as a classical moment problem related 
to the Nevanlinna-Pick interpolation problem [ 1 J, [2|, [13], 
[22|. This classical moment problem has found wide applica¬ 
tions in broad-band matching, cascade synthesis, network and 
semiconductor modeling, etc. 117]. The ideas and concepts 
developed in solving these classical moment problems have 
accepted a wide attention in control community, also. For 
example, the concepts of Hankel operator, Toeplitz operator, 
J-los&less factorization, J-spectral factorization, etc., have been 
widely used in solving the so-called H 00 optimization prob¬ 
lem, robust stabilization problem, model reduction problem, 
etc. [5], [ 10 ], |I 1 ], 116], [20], [23]. The solution of this 
extrapolation problem was first given by Adamjan et al. in 
the explicit form expressed in ( 1 ) [ 1 ], [ 22 ], and a proof based 
on elementary matrix manipulation can be found in [41]. 

Based on the AAK Theorem, we investigate the uncertainty 
on the plant transfer function caused by the unmeasured plant 
output. 


Proof ' For a plant with transfer function G(z) = 
1 <) if (m/, t/[ } ) |"_ () are the input-output pairs of 
the plant, then, it is necessary that 

r .9(V 7/0 0 I/O 

U \ 9i 9o mi 


that is 


L</». .'/»-] 


9oJ Lw.iJ 



’.9o 

0 

■ ■ 0 - 


9i 

.90 

0 


1 


; 


-fin 

9n-\ * 

■ ■ 00- 
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Since no ^ 0 , we have 

G = YoU- 1 = U~'Y n . (6) 

Here 

'flo 0 ■ • • 0 " 

a !h Ho 0 

Ct ~ _ 

-ih\ Un— 1 ,70 - 

On the other hand, it can also be proved that if 

G = r„(/-'= V 'Yo, ( 7 ) 

then, when u, || l _ 0 are applied to the plant, the output of the 
plant will be //' |"_ 0 . 

Therefore, a plant with transfer function g(z) = o Hj z ' 
has input-output pairs (u n j/f) |"_ n , u„ ^ 0 , if and only if 


q~TGRe = T ^ (7 2 Ai+i - G )' 1 0 p 

1 l ' ne i [0 Oj [ 0 7 -2 / 00 J [0 

= p7 2 W-f?V‘pl (] 


Note that 


Define 


7 i 2 = a* G^U-'Y'YoU-'. 


p: = 7(7 2 A.+i-f' 2 ) _1 P = itnrUU'-YjYnr'U'e, (16) 

q: = 6’(7 2 /„ +j - G 2 ) -'c = mtfU'U - Ko'yorWf. 

(17) 

On the other hand 


'wo » 

(/'<■ = 


f,' = Vo/7- 1 = U-'Y q . 


0 1 t 1 1 r r 

0 0 0 

• = M(I . = UoP (18) 


Note that 


Lt/„ U„_ 1 l/(). 

which implies that 


Hn ■■■ <n flu] m -or 

flrt — I flu o 0 ■■ 10 


0 oj LI ■■ o OJ 

[flit 0 • • • 0 

</l .70 ■■■ 0 


0 1 Therefore 

0 

■ (9) 


L</„ //», i ■ ■ ■ /Vo. 


■() • ■ ■ o r 
o ■ ■ ■ 1 o 

7i= ... : : 

[ 0 0. 

then using the same notations as those m the AAK Theorem, 
we have 

G = TiG = y’.loi 7-1 = TiV _ 1 >b. HO) 



P z 

= «0 

(19) 


7 - 

wo^'ifl 0 . 

( 20 ) 


«V 

»+i 


1>(Z 

) = 5>' 

1 

II 

-M 

( 21 ) 


KO - 

i» + i 

l 

( 22 ) 


X 

« fl 


<l(z) 

1 

1 

(23) 



(24) 



we conclude 


From the AAK Theorem, we conclude that every transfer 
function which is consistent with (a r tj°) |JL 0 . u 0 ^ 0 , and 
has ft^-norm not greater than 7 can be represented as 

,, „,, +1 u,u e ; i+1 <ry- ( - +2) + ('h> Ei +1 Pj :l ~ j ) f ( z ) 


g 0 ] p ol_ p ; 2 ol 
0 0 lo oj " [ 0 oj' 


- f'v 2 / C 2 ) -] — (T’AiH G ) 0 (] 2 ) 

“(7 I-G) - o 7-ar ’ 


U-Ui-fi 2 )- 1 0 Ip 

= 7/? " - 0 7- J /oc 0 


_ r7(7 2 ^+i “ g r 1 * 5 
0 


, ,, +1 uo rr +k zrr ty-w*) 

U 0 Ei +1 Pj ‘ zJ ~ 1 + («o ZV 1 <ljZ’ ,+2 ~ J )f (2) 
E + (Zx +1 jp j * n+2 ‘ , )<i=) 

7 e " +1 +ie? + 1 7 ?= ,,+ 2 -')'( 2 )' 

■ ((c) E#?* 0 " 1 (25) 

and every transfer function belonging to transfer function set 
G n (l) has W^-norm not greater than 7 and is consistent with 
the given input-output data. 

This completes the proof. ■ 

Theorem 1 implies that when there is no noise contaminat¬ 
ing the identification data, then all of the transfer functions 
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that are consistent with the identification experiment data and 
have H^-norm not greater than 7 can be expressed by a 
homographic transformation of two transfer function matrices, 
one of them is a fixed transfer function matrix which is 
completely determined by the upper bound of the plant transfer 
function’s ft^-norm and the identification experiment data, 
another one is a free transfer function except that its H 00 -norm 
is bounded by 1 . 

The results in Theorem 1 can be extended to the most 
general case, i.e., when YqVo < ^U'U. According to the 
input-output extrapolation-minimization theorem [28], [36], 
[37] (in [281, it is called the extension theorem), in the case 
that W(YqU~') = 7 , there is only one transfer function 
that is consistent with the prescribed input-output pairs and 
has W^-norm not greater than 7 , represent it as f(z). In 
fact, in this case, the Ti^-norm of this transfer function is 
equal to 7 . This fact implies that when rffV’ot/ -1 ) — 7 , we 
can completely determine the plant transfer function from its 
input-output data, provided that there is no noise existent in the 
identification experiment data (however, this is extremely rare 
in actual identification process). Note that f(z) 6 Q n (l + n) 
for arbitrary o > 0, and Q n (7 + «]) C 6«(7 + o 2 ) whenever 
r*i < (Y 2 (this will be shown in the next corollary), it can be 
proved that {f{z)} = lim (l _<H <?„(7 + <0 = Qnh), which is 
to be shown. A detailed statement of this proof can be found 
in [41J. 

The condition V ( fYo < 7 2 U i U for the transfer function 
set parameterization is natural. From the input-output 
extrapolation-minimization Theorem [28], [36], [37], it can 
be claimed that if V' ( [Yo > f 2 U f V, then no transfer function 
exists that matches the given input-output data and has H K - 
norm not greater than 7 . Therefore, to prevent the transfer 
function set Gnil) from being void, it is necessary to make 
the assumption V'jV'o < f 2 U f U. 

Now, we would like to investigate some of the properties 
of the transfer function set Q n { 7 ) defined in Theorem 1. First, 
we discuss the relation between 6 »( 7 ) and 7 . 

Corollary I: Let matrices IK Vo and vector r be the same 
as those defined in Theorem 1 . Assume that 7 2 U f U > V 7 ( f Yo, 
7 $U t U > Vq Vi,. Define transfer function sets Q n { 7 ^, £,,( 72 ) 
through 7 ], IK Vo; 72 , IK V'u, respectively, as 6 ^( 7 ) in 
Theorem I. If 71 > 12 , then 6 „( 73 ) D 6 , 1 ( 72 )- 
Proof: Let 

■*8 " 

„o 

= ir 1 ' 1 (26) 

U'/r! J 

From Theorem 1 , it is obvious that 


Since 71 > 72 , we have 

Sn( 7 l) 2 Sn( 72 ). (29) 

This completes the proof. ■ 

Corollary 1 tells us that when we apply sinusoidal input to 
a plant in order to obtain an upper bound of the ft 00 -norm of 
the plant transfer function, and we are not sure about it, then, 
it is safer to choose a larger one, in order that the actual plant 
transfer function is included in the transfer function set that 
will be approximated. 

The next corollary suggests that if there is no noise in 
the identification experiment data and the identification ex¬ 
periment data length intends to infinity, then the transfer 
function set Q u ( 7 ) which is deduced from the plant available 
information will degenerate into one transfer function, i.e., the 
plant actual transfer function. 

Corollary 2: For given input-output data (u„ //[*) |J!_l 0f 
(w,, \fi) U'ifo, and a positive number 7 , define matrices V n \, 
V'nio; >7,20; transfer function sets Q n 1 ( 7 ), 6 , 12 ( 7 ) as 
those in Theorem 1 . If / 0 , V,( U) V„hi < 7 < 2 f// ll f^i, 
V r ,^oVn 2 () < 7 2 ^/, 2 ^ t. 2 , «2 < «L then, 67 , 1 ( 7 ) Q 61 . 2 ( 7 )- 
Proof: Let 


0o 

0i 


- VJ 


V] 


(30) 


Ul%2 J 


Since n 1 > ;/ 2 , from the structure of matrices ff ril , it 
is obvious that 


Here 


Vn 1 




Vnl 0 

Ai X) 


u n'l 

2+2 II n 241 " 112 


(31) 


L Mill 


Mill — n2 -I 


A '2 — 


Min 
Ml 


0 

M() 


Gn( 7 i) = {.</(*) I ,'/(~) = , 9 o + U\ - +- 1 - 

+ *"£>.**. ||.7(s)|U<7i[- (27) 

Gn( 12 ) = {g(z) I a(z) =9o + 9lZ + -"+ !h,Z n 

+ 3 "VZ /l « 2 ‘> ll.9(«)IU ^ 7z 1- (28) 

7 ^U J 


Therefore 


Hence 


L^n 1 - « 2 - 1 M r* 1 — 71 2 — 2 M() J 


= 


j /- 1 0 

x;'\- 


= fli. 0 < i < n2. 


(32) 


(33) 


nr ■' 
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From Theorem 1, we have 


Here, A is a norm bounded uncertain matrix defined by 


Onl(l) = l</(z) | ?(*) = Vo + i)x . + + 

||<7(2)|k < 04) 

Qnlil) = {9(2) | 9(2) = <h + (hZ + + r/, l2 ~ ‘ 2 

+ =" 2+I f>k |M-)|U 05) 

From ;/l > //2, we conclude that 

^ 1 ( 7 ) ^ 0ii J »(7) 06) 

This completes the pi oof ■ 

Corollary 2 also implies that as the information about the 

plant increasing, the “size 11 of the transfei function set that 
includes all of the possible plant transfei functions or the 
uncertainly about the plant transfei (unction will generally 
dccieasc This is consistent with physical intuition 

From the above investigation, we see that all of the possible 
plant transfer functions arc paramelcti/ed by a fixed ho 
mographic transformation with a norm bounded fiee transfer 
function and the transfei function set that consists of all the 
plant possible liansfer functions will monotomcally degenciale 
into the actual plant tiansfer function as the identification 
experiment time increasing to infinity in the case that the 
measured plant input output pairs arc not corrupted by noise 
Now, wc investigate the variation of the homographic 
transformation in the face ol noise that is existent in the 
measured piant output 

First we consider the change of vcctois //' ry° defined in 
Theorem 1 

The next well known fact plays an important idle in devel 
oping the relation between the variation of vector r/ 1 and 
the noise m the measured plant output 

Far / F 01 matnces 4 e Tv" \ B <F R" A,n it (/„ H 
AB)~ ] exists, then B{ l, n -F 1 B) 1 = (/„ 4 B \) l B 
For bicvity, we assume without loss ol generality that 
7=1' 

The vaiidtion ol vcctois p { \ r/° accoiding to the noise 
existent in the measured plant output is indicated m Theorem 


A = thug{bi)It( n + 1)2 ^]/2(nHF + 1)*}> 

/», € 72, |M < J 0 < ^ < /# 

and the other matrices aie constant real matrices defined by 

HUi. v» l!U- v\ IT-o - 99 and h as 




(1 

o ■ 


" f yo 

0 

0 ■ 


‘A 

Vo 

0 


Vi 

vn 

0 

y, - 

-’An 

‘I'r, J 

Vo- 

V = 

-</n 

<Al-l 

VO- 


w, 

' 0 

£,1 T 
o j Ti 

1 IV, = 

7, 

'K\ 0 ‘ 

0 £ T 


h- 

[7.i+i 

r> 

l?\ 

— h [//(-t- 1 

Ah i 

A.+i] 


"0 

1 

l) 

O' 


0 

0 

1 

0 


0 

0 

0 

1 

1 

0 

0 

0 

0. 

J 


11+1 


7 ( - [C(n + J) h(nM) ^(n-H) J 

7 , = [% s , <?„]', 

lu = diaq[l, u 0 0 / M11 0 0}, 

S, /„ 41 0 ()/„ H 0 OK 


S„=diaq{0 (>/„+,<> 0/„ H } 

and V 1 has the same definitions as those in Theoiem 1 
Pi oof Let 


2 

Iheoiem 2 Assume that (/', - </, - v,, | v, \ < h, 0 < / < n 
Moreover assume that max|, |< h rr(),f / ') <* 1 Let 




' (I 

-V r 



(57) 


l>' = UfVU! ->/!,) J < 


•/ =Ui , 1 v-i,X) l < 


p=.u(v*u-\ ti ir'< <i = nvw-i 'i)-‘ 

Then, for all possible vectors />' and </' 


v 1 ' 


v 


I"'-' 1 1 

tf 


_q. 


1 


h{v+ 1)* ■+ 

'V* - > ' 

-v // 

I— 1 

3= 

>1 

, 


J 


( 


P 

q. 


’When we tan normalize the plant output and the noise magnitude 

in ordeT 10 make i = 1 In detail we may use !/,/*> n/- instead of v, h 
respectively 


It is obvious that N N T aie invertible if and only UUl - 
) , U*U - are invertible 

Since max| ( \< t *a(Y r V~ ) ) < 1 by assumption, we claim 
that IBU ~ Y r \ is invertible and U*U — Y*Y y is invertible 
for all V, 

On the other hand, from the property of matrices //, 1, F T , 
that is, V f/ y f V ure commutative | M], it can be proved that 


7-1 


AT 1 = 



'u(u'u-\ f y)- 3 

y f ((yf/' -yy*)- 1 ] 

(38) 


Y(U'U- Y*Y) 1 

ir'fUW -Y\'y'y 


uiuw - y^)-' 

Y^Ul)' -YX)” 1 ] 

(39) 


r,(r; # r/ - r r f r r )- 3 

u^uu 1 -y.y,*)- 1 J 
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Define 


Af= y M r ~ y (U'U-YfYr) 


Then, it is clear 


(40) 


M = N- 1 /n Q +1 , M r = AT 1 /n 0 +1 


On the other hand, we have 


Nr 1 = 


U* -Y l 


0 (Y - Y r y 


-Y U ' \Y-Yr 0 


= h(n + l)+N~^l f]] V 1 


in which 


+iv - 1 r° n- 1 

+ E 0 


E = Y-Y r = 


0 

0 

"0 

■■ 0 

"n-l 

• U 0 


Therefore 


M r = n: 


- 1 l in+l 


= Jat - 1 _Ar-if° E> 
1 E 0 


■ , _._i ro e 1 

f 2 (n+i) + ™ E 0 


= M-N~ l \l f ) t |[/ 2( n +1 ) 


-i [ 0 E* 
E 0 


Let b, = £v,, 0 < i < n. Since v l &TI, |n, | < k, we have 
6, € Tl, |M < 1, 0 < / < n. 

From the structure of matrix E, we have 


'VO V 1 ll n 

0 I! o W„-l 

E* = . . . 

.0 0 ■ ■■ Vo . 

= [/„+! T 2 Tl ... 77 ; 

r^-+i 

^a, + i 


= EiKE t . 


= £>*2 


"J^Xri+l -J U/n+l ■ 


Here, 7? is defined as f„+i, and A = diag{b 0 l n +i bil n+l 
■ * ■ ^nfn+l ) 1 


Therefore 

[0 E* 


0 EiKEr 


E 0 ” EtAEt 0 


0 Ei 1 [A 01 \E\ 0 ' 
E\ 0 0 A 0 E r ' 


Note that 

[A O' 

0 A ' 


= diay{boI„+\ M«+i 

bn.1 n+] b{)ln+l ^lTi+l ' ' ^ti^n+l} 

- 6 0 diag{l„ +l 0 • 0 J„+i 0 • 0} 

+ biding{U I„+1 0 ■■ 0 /, 1+ 1 0 - 0} 

•+■•■ + b„ding{i) 0 /„+j 0 /„+i} 

= [^2(n+l)^ l‘2(n + \y m ‘- ^2(n+l) J ] 

• diag{bol2(„+iy bil 2 („+i)i ■■■ 
Mac+iWPo S i ■■■ S n Y 
= Ti AT, (47 


0 £‘1 _ [0 £i|sts \E[ (I 
E 0 J “ E* 0 J ' ' [ 0 E, 

= W r iAW',. 


Therefore 

M,=M-N - l W,AW, [/, ()H „ + N ‘ 1 W'iAH’ I ] -, A/ 

— m - yv _1 iy,At/ 2( „ +1)l + w, N-'WiJy'w.M 

(49) 


we have 


>1 _ \u(uni - Y* ) ) ] t ' 
q\ ~ [y^'r -y'rr 1 * _ 

)?] _ \U(V'U- Y 1 t Y 1 )~ ] t 
q' ~ Y 1 (lJ t U - YjY t )~ l i 


= A/r. (50) 


= M,( 


we conclude 


V' -Y f 

-y v 


^2(71-hl) 1 


U f -1" 

-y v 


W t A W r 


This completes the proof ■ 

Theorem 2 suggests that when there exists magnitude 
bounded noise in the measured plant output, then, if the 
noise is not too significant, the parameters that describe the 
possible plant transfer function can be represented by a linear 
fractional transformation of two matrices. One of them is 
completely determined by the measured plant input-output 
pairs and the noise magnitude and is therefore fixed, another 
one is free, except that its structure is fixed and its maximum 
singular value is bounded by one. 

Let 

* {z)= \rry J *'' +2 - j rry- 1 } 


J-l 
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then, according to Theorem 1 and the discussion followed it, 
a transfer function, say, g(z), which has °°-norni not greater 
than 1 and matches the input-output data (u u y[) |"_ 0 , has 
the form 

g{z) = HM(<S>{z), c(z)), 

provided that Y/Y, < U'U 
Let 

Z, = [*"+■ s» *]', Z 2 = [l z /']' 
we have 

., . _ \f> rt Zi q' • z, 1 r.v * „rt i r 7 . i 


Here, N n = [?)‘<f]. N u = -[pY] W* r , N 21 = W( 

h?. m -rrv 




*1 _ fr <r /?1 
<l"Z 1 V r1 Z 2 J - [ q r* it* Z, 


On the other hand 


\p rt </"]- J, 


pi_ri/' -w 

</ -1 f/ 


> ^ 


W,\ W, 


7/* -v 
-i r 


— [/' ( </] _ r /]H, f / 2 („-t i) + 

v -i n 1 1 _1 

_y ( jt », 


Hur'\V 


[ v -f/'J = v , j 

On the other hand, a simple compulation shows that 

(, A = ^ fl 9{W2(n + l)> + 

= w r 

in which 

A = ^4(mi)*}* (57) 

= [i 4( n+ 1) 1 ^KnHl) 1 ^(n+l)<]< (58) 

7, = [.<?„ .<?! .S’,,]' (59) 

Sa = diaq\J 2 (,ni) 0 0 0 0 }. ( 60 ) 

.Si = /2(n +1)- 0 0 12(li+])- 0 0} (61) 


<?„ = <hnq{() o f J(ll+I) . 0 0 / 2( , 1+1) ,} (62) 


[-) U 1 J 

= [/'' <l'\ ~ \J> r/]11/A[7 i( „ 4l) ,-l Wf 

IT _ \ i I _ 1 _ 

f 1 H f A 

-) W ’ 


vf\t' >'1 ' 


-r 'i 1 


0 In 11 
In fl 0 


w* fn = ii> ,t <n?. 

Therefore 

p" (/’'] pVn l) h(n(\) 

</' p'* 0 Nu\l r * 


0 

^iJL 

>12 

0 

0 

N u . 

>22 

0 

0 

N 22 _ 

iV 2 ! 

0 

0 

TV" 21 J . 


P - (^11 ^ p2(„+.,l n _ pVl2 0 

u ~ [ o 7V„J[ 7, J' [ 0 Nu 

fl _ pV.ll 0 ]p2(.l + l)l n _ [7V22 0 

a ‘ t) n 2) \[ /1 J *"-[o jv 22 


y f 

*=/?n f 7?i2 r /)A7, [/4( ri4l )i - /, ] 1 /?2i 

= /?ji + /?i2^/A[7 4( „ 4 di - 7, H 21 I 1 A] -1 /,/? 2 i (63) 
Therefore 

*(.) v r ;i[^ "i 

[ 7 " /V'J [0 z 2 

- {7?1, +7? u 7«A[7 1( „ fir -T,77 22 7 1 ,A]- 1 7’ i 7?2 1 } 

Zi 0 ' 

0 z 2 

= ^n(-) + Vy u A(/ 4(n+l) , - W' 22 A] _ 1 W7i(z) (64) 


^n(z) = Hn[f; ®], W r i 2 = 7?i 2 7|. 


v^ 2 i(z) = t,h 21 ^ ® , vy 22 = y i ,7J 22 T, 


From the construction, it is obvious that H'u(z) is a transfer 
function matrix belonging to W°° with dimension 2 x 2 , iy 12 
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W n (z) = 


W X 2 = 


is a matrix belonging to 7j 2 x 4 ( 11 + 1 )^ W 2 i(z) is a transfer 
function matrix belonging to with dimension 4(n+l ) 4 x2, 
while matrix W 22 belongs to 7j4(n+i) 4 x4(ri-fi) 4 

From the above discussion, the following theorem is ob¬ 
tained. 

Theorem 3: Let 

, \Wn(z) W l2 ] 

v ' L W 2l (z) W 22 J 

then, *(«) = Ti{W(zY A). 

The above theorem makes it clear that when there exists 
magnitude bounded noise in the measured plant output, and 
the noise is not too significant, the homographic transformation 
that describes all of the plant possible transfer functions, can be 
parameterized as a linear fractional transformation of a fixed 
transfer function matrix and a structure fixed, norm bounded, 
but free matrix. 

The next theorem is the main results of this paper. 

Theorem 4: Partition Wn{z), W\ 2 , W^iiz) as 

_ rw'iY(s) wtf(z)] _ tw}} l 
M Wi ?/(*) Wft(z) J’ 12 [w%\' 

Wn(z) = [W}l{z) Wjfc)] 

in which W\i(z) has a dimension 1x1, Wyj has a dimension 
1 X 4(n + l) 4 , (z) has a dimension 4(// + l ) 4 x 1, 

and the others have compatible dimensions. Moreover, see 
the matrices at the bottom of the page. If maX|, i |<^?f((F + 
E)U~ ] ) < 1 , then, transfer function set 

0 = L(Z) | g(z) = Ti(F{z), A(,)), m*x?f(A(;)) < l) 

l \ z \ <l ) 

exhausts all of the transfer functions that have Tf^-norm not 
greater than 1 and match the possible plant input-output pairs 
that are derived from the measured plant input-output data 
( 141 , iy,) |7_ 0 which are contaminated by a magnitude bounded 
noise series v t , v t E 7?, \v,\ < ()</<./# in if, in additive 

form, and vice versa. Here, the matrices IJ . Y are the same as 
those defined in Theorem 2, and the matrices A are defined 
by (43) and (57), respectively. 

Proof: From Theorem 2 and Theorem 3, it is obvious 
that transfer function set 

Q - {»(*) I »(*) = HM&mz). A). e(z)). 

A. 6 7?., <r(A) < 1 . r(z) 6 (65) 



Fig. I. Identification experiment, 


equals the transfer function set which includes all of the 
transfer functions that do not contradict the plant available 
information. 

Note that every transfer function in Q has a structure like 
that depicted in Fig. 2. 

The internal relations of Fig. 2 are 

y(=) 1 p/i 1 (~) <n 2(2) 9 i-s(*)l C"i(-)’ 

1 /( 2 ) = r/ 2 j (r) q 22 (z) q 2J (z) w, (z) , (66) 

_w 2 (s)_ //ji(z) <l.r 2 (z) ()i s(:)_ 

"l(~) = (•=:)■ (67) 


Note that 


V 2 {-.) - Aw 2 (z). 




which is invertible on the closed unit disk V — {z | |i| < 1}. 
On the other hand, the relation in (66) can he rewritten as 

1 -<lvA~) 0 ] [ y(z) 

0 -qnb) 0 iiij(5) 

0 (-) 1 J 

r i> 'yn(-) Vi i(^)i r u(~) 

= -1 92i(~) <nAz) i'i(i) . (70) 

0 <7ii (*) >’2(z) 


1 -r iu(z) 0 

o -<hA--) 0 

o -<712 (~) 1_ 

is invertible if and only if r/ 2 2 ) is invertible, (70) equals (71) 
(found at the bottom of the next page) which implies that 

G = G. (72) 
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Fig 2 Structure of transfer function belonging to £/ 


u 

S.<‘> 





V 

f 2 .w 

» u (‘) 

w 




A(z) 


Fig ^ Structure of pluit possible tnnshr junction 


This completes the proof ■ 

From Theorem 4 we see that the transfei 1 unction set which 
includes all of the plant possible transfer functions can be 
parameterized by a linear fractional liansformation of a lixed 
translei function matrix and a structure fixed norm bounded, 
free tiansfer function matux 

Hie structure of the possible plant transfer function is 
lllustiated in Fig 3 

The results of Theorem 4 can be extended to the case when 
max|, |<*?F[(1 -| h )V *] < 1 In this situation all the plant 
possible transfer functions can also be paiameten/cd as Q in 
Theorem 4 The idea behind the proof of this conclusion is to 
firstly paiameten/e all the possible transfer functions lor the 
plant whose transfei function's noim is not gieater than 
J-l-n' for arbitrary o "> 0 and whose input-output pairs tan 
be deduced from the measured input-output data (n, y,) |J 0 , 
represent it as (7(1 + o) and then, take the limit of the transfer 
function set GO 4- rv) as a tends to 0 from the positive side 
Based on the relations that GO 4- o) 3 G for any o > l), 
and Iim^ 0 f5(l 4 n) = G wc obtain the transfer function 
set with the desirable properties The proof is elementary but 
tedious, and is therefore omitted here The details of the proof 
can be found in |4I] 


64D 

When the noise magnitude bound w does not satisfy the 
condition mentioned above, i e , maxj, (< K ?f[() 4- E)U~ X ] < 
1, let 

r*« o o 

__ do 0 

lA ^oJ 

For those A 0 , A, A„. such that (±V + V) f (iy + V) > 
M rt U, or (Y + sV)'(F | sV) > U'U, and A, e ft, 
|A,| < J, the plant under investigation cannot have a response 
{ij, 4- ) Ij’-y when the input senes </, |"_ 0 is applied to it, 

since according to the input -output extrapolation-minimization 
theorem 128], [ 16] [37], every transfei function that can 
match these input-output pairs, le , (u t tj t H *<\) 1 ^, will 
have a W^-norm gieater than l Videlicet, these noise senes 
{-t b t ) |;* =0 are impossible according to the plant assumptions 
Theiefore, they should be excluded from the noise set V = 

{ 1 1 IIL«1 1 ! C 7? |i,| < h} From the above discussion, it 
is apparent that in this case the transfer function set that 
contains all of the plant possible transfer functions equals 
transfer function set 

G- {</(*) I ) = ^(F(~) A(-)) 

iji.i\7T(A(»)) < 1 (A 0 Ai A„) G P A 
I \' i 

Here P* = {(A„ A, A„) | + V)‘(U + V) < 

y-V'U e 7? 1^,1 s. 1 }, and the other matrices or transfer 
function matrices have the same definitions as those in 
Theorem 4 

IV APPL1C AT ION TO IDI NT1F1C AHON FOR ROBUST CONTROL 

In Section III we showed that when the magnitude bounded 
additive noise existent in the measure plant output is not too 
significant the transfei function set that consists ot all plant 
possible transfer functions can be parameterized as a linear 
fractional transformation ol a fixed transfer function matrix 
and a structure fixed norm bounded, tree transfer function 
matrix The importance of linear fractional transformation 
has been widely recognized in control community in recent 
years, as well as homogiaphic transformation [5] [ 11 ], [ 20 ], 
127] Note that all of the transfer function set descriptions 
widely used in robust contiol theory, such as nominal model 


y(~) ' 


’1 - 912 ( 2 ) 0 

1 

0 911 ( 2 ) 911 ( 2 )' 


'«(*)' 

«'l( 2 ) 

= 

)) — 922 ( 2 ) 0 


-1 921 ( 2 ) 92 j(’) 


Hl( 2 ) 

1112 ( 2 ). 


0 - 912 ( 2 ) 1 


. ( ) 931 ( 2 ) 9 jj( 3 ) 


. 112 ( 2 ). 


«(*) 

"i(~) 


912(^)922 (“) <hn Z ) - 912 ( 2)922 ( > )92l(-) 9ia(2) - 912(~)922 ( 2 )92d(2) 

9 22 *(“) -‘hi ( 2)921 (') -922 ( 2 ) 923 ( 2 ) 

9)2( 2 ) 922 *( 2 ) 911 ( 2 ) - 9 u(*)9» (s)* 2 j(') 931 ( 2 ) - 912 ( 2 ) 922 *( 2 ) 923 ( 2 ) 


= f(z) 


’ ll{z) 
»} ( 2 ) 
" 2 ( 2 ) 


(71) 
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with parametric uncertainty bound, nominal model with ad¬ 
ditive unstructured uncertainty bound, nominal model with 
multiplicative unstructured uncertainty bound, nominal model 
with coprime factor unstructured uncertainty bound, etc., can 
be easily transformed into a linear fractional transformation 
representation, which conversely implies that linear fractional 
transformation with structure fixed, norm bounded, uncertain 
transfer function matrix is in fact one of the most general 
descriptions of plant model with intended application as robust 
controller design. Moreover, a transfer function set, which is 
parameterized by linear fractional transformation of a fixed 
transfer function matrix and a norm bounded, structure fixed, 
uncertain transfer function matrix, can be directly used in 
robust controller design, because such a parameterization just 
fits the framework of structured singular value theory or 
/(, although this robust controller design theory is still not 
complete and more powerful design algorithms are needed to 
be developed 14], [81, [27]. 

In controller design, however, the direct utilization of the 
transfer function set given in Section 111 suffers from several 
drawbacks. For instance, the plant nominal model generally 
has a high degree and the number of the uncertain parameters 
is large. In fact, they generally equal to the identification 
experiment data length and the identification experiment data 
length +1, respectively. Here, by nominal model, we mean 
the transfer function corresponding to A(z) = 0. Moreover, 
the dimension of the matrices used in the parameterization is 
extremely large, e.g., the dimension of the uncertain transfer 
function matrix is 4(« + l ) 4 + 1 , in which n stands for the 
identification experiment data length. But in controller design, 
it is preferable to use low complexity nominal model and 
a simple description of the nominal model error uncertainty 
bound, because siich a plant model will represent the dominant 
characteristics of the plant dynamics which is more convenient 
in control system analysis and synthesis, and will lead to 
a low complexity controller which is highly appreciated in 
realization. Furthermore, we note that in the plant possible 
transfer function parameterization, there are both dynamic 
uncertainty and real perturbation. The dynamic uncertainty is 
due to the plant unmeasured output, while the real perturbation 
is due to the effect of noise. If this parameterization is directly 
applied to robust controller design, it will lead to a mixed 
/(-synthesis problem, which is well known as NP hard [ 8 ], 
[27], 

In this section, we would like to briefly discuss the problem 
of obtaining the desirable fixed complexity plant nominal 
model and its error uncertainty bound from the aforementioned 
plant available information. 

First, we introduce some notations that will be used in this 
section. 

For a given matrix structure X, let BX represent the unit 
ball defined as BX = {A | A £ X, 3r(A) < 1 }. When X(z) 
is a transfer function matrix structure, we define BX{z) as 
BX(z) = (A(z) | A(z) € ^'(z),maX| 2 |<io r (A(z)) < 1 }. 
Moreover, for a given matrix structure X and a transfer 
function matrix H(z) which belongs to and has suitable 
dimension, with a little abuse of notation, we define nx(H) 
as the maximal structured singular value of H(z) according 


to X over the closed unit disk V ~ [z \ \z\ < 1}, i.e., 
maX| 2 |< 1 /i i v(JT(z)). 

Note that for a transfer function matrix H(z') = D -f 
C(z'I - which is analytic on the complement of the 

open unit disk, i.e., V c optn = { z' | |z'| > l} 2 , 2 let z = l/z\ 

then, H(z') can be rewritten as H(z) = z/). 

It is obvious that H(z) e and from the main loop 
theorem that plays an important role in /(-analysis and //- 
synthesis [27], it can be proved that /(, v -(H) < Li for a 
positive real number fi if and only if //y(M) < 1. Here, 

M = C, f • an d X is a matrix structure defined as 

X — {A | = diag{ A, d], A £ X , f £ C}. Especially, 

when nx{H) = 1, it can be shown that = 1, also. To 

express the results in this section more concisely, we adopt the 
notation fi.x(ti) for a transfer function matrix H(z) £ 
and a compatible matrix structure X. 

Using a similar argument as that in the proof of the main 
loop theorem, we can obtain the following lemma. 

Lemma I: For a (rn + ji) x (m + p) transfer function 


matrix hm = [ 2 ;$ 2 ;;j: 


e , a matrix structure 
X\ with dimension rn x rn , and a mairix structure X 2 with 
dimension p xp, define a new matrix structuic X as A': = 
Ui 




^2 


|Ai £ X ], A 2 £ «V 2 }. If // v_.(// 22 ) < 1. then, 
/J.v(ti) £ 1. if and only if max AlF ('¥/(//. A.)) < l. 

Proof' Since /(c, (7/ 22 ) < 1, we have eliminate at every 
zeP = {s | \z\ <‘i) 


dvi{I p - // 22 ( 3 )A 2 ) * 0 


(73) 


if A 2 £ BX 2 . Therefore, for every z € T>, Ai £ A'i, 
A 2 £ BX,, a simple computation will show 


(lt't | /r/i-f jj 


77 11 (z) Hn(z) 
tin (z) H 22 (z) 


At 


= det(l p - ti 22 (z)A 2 ) ■ d,l(I m - LFi(II(z), A,)A,). ( 74 ) 

Hence, /(,v(77) < 1 if and only if max^,^», // v , (LFi 
(77, A a )) <1. 

If max^ 2 fe B.v 2 M,V|(-? 7 ( 77 , A 2 )) = 1 , then, there exist at 
least one £ V, one A 2 £ B X 2 < and one Ai 6 X u such 
that ir(Ai) = 1, and 


det(l m - Fi(H{z a ), A 2 )A|) = 0 
which implies thal 

d(-l(] - \tin (so) 7/| 2 (2 0 )l [Ai 

' { Jm+ ‘’ [H 2l (zo) Ihz(zo) 


A 2 


(75) 


= 0 . (76) 


Since 


= max{ff(A]), <f(A 2 )} = 1 (77) 


2 The analyticity of a transfer function matrix on equals the 

commonly utilized concept of stability of a transfer function mairix in control 
engtneenng. 
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we have 


Then, Q„ J Q if and only if 


Mff)< i 


(78) 


If t‘ \ (H) = 1 . then, for arbitrary n > l). < 1 + r» 

According to the definition of structured singular value, we 
have that for any 2 € Z>, Aj e -V,, ff(A,) < t-J-, 

*(A 2 ) < iir ' ‘ ‘ 


rff t 


( 


l 


m+[> ~ 


Hn(z) 
M 2 i( ) 


Hu{-) 1 [Ai 

Hu{- 


A 2 


^ 0 (79) 


which implies that 


7 21 - 1 

/’too/ First, we note that since Q C W 50 , it is necessary 
that at every : € V, (/ - F 22 (OA(a)) are invertible, for 
arbitrary A(0 € VV(c;) 

Assume thal there exists a z u G T> and 1 2 1 
(/<«</{« 1 V 1 / fi„J} € BX_, such that 


**(V - F 22 ( 2 „)A) = 0 (82) 


A 2 )) < 1 + a (80) 


Since o is an arbitrary positive number, we claim that 


max /n, (/)(// A 2 )) < i 

A £-6 \ 


(81) 


This completes the proot ■ 

Al lust we investigate the problem that concerns under what 
condition a transler function set described by a nominal model 
and an additive unstructured uncertainty bound can include 
all the possible plant transfer functions when an upper bound 
of the plant transler function s norm and a series of the 
plan* weakly corrupted time domain input-output pairs have 
been provided 

Partition the tiansfer function matrix r(->) defined in The 


orem 4 as i (i - 


where I n( ) has a 


/ it ) /it) 

./ i( ) * ( J 

dimension 1 x 1 and the othei transfer function matrices have 
compatible dimensions 

Define transfci function matrix structure V(c) and matiix 
structures V l as 


Al I — {A( )\A(.) = <l"iq{({ ) b {] I M b, I\ 

<( ) f H* b, t R 0 < / £ n\ 


If |cjj| < 1 from \(\\ < 1 and the Nevanhnna-Pick 
interpolation theorem [ 22 |, we conclude that there exists 
an r(~) G BH ^ that satisfies f(^o) - rv Let A(z) -= 
tli(iq{i{") bo I b } [ b n I} then, it is obvious that A(z) € 
BX(*) and d(t(l - F^i* u)A(zo)) = 0 which is a contra¬ 
diction 

If |zo| = 1 , let z' {) - ft o for arbitrary |/i| < 1 , then, 
using a similar argument, it can be proved that, there exists a 
A(*) E B\'(^) that meets the condition A((teo) - A From 
A(^) € B.\( ), we have 

A(so) - Inn A(?/5o) (83) 

Since li is arbitrary except thal \ft\ < 1, we conclude that 
there exists a A(~) E B\{^) such that 

( hi(I I n (zo)A(z {i )) = 0 (84) 

which is also a contradiction 

Therefoie, for every z \z\ < 1, d<t(I - Mu(") A) ^ 0 
whenevei A C BV which implies that 

< 1 (85) 


A> {A | A = diaq{< A„/ A,/ A,,/} 

< f C 7 b, € 7\ 0 < / £ n} 


H A £ then for arbitraiy tiansfei function 6 0 , 
there exists a A„( ) € BH^ such that 

7(0 = 7u(0 + W«(2)A„(2) (86) 


= |a|A- a eTAev! 

Then, accoiding to Theorem 4, when the plant transfer function 
is derived from an upper bound of its H ^ -norm and a series of 
weakly corrupted time domain input-output data, the transfer 
function set Q which consists of all the plant possible tiansfer 
functions can be parameterized as 

S = { 7(0 17(0 = W (0 *(-)) a( 2 )€RV( 2 )} 

Theotem * Foi a given transfer function 1 / 0 ( 2 ) 6 H x and 
a ummodular tiansfer function of K *-, W, (z), define a transfer 
function set 

& = {V(0 |0(a) = w(*W(*)A«(s) A„(«) eBN*)+ 


Thai is there exists at least one A u ( 2 ) £ BW* which satisfies 

+ hz(-)Mz)[I - h2{z)A(z)]- l r 2 i (z) 

= 7o(0 + W o (z)A a (0 (87) 

lor arbitrary A(~) £ RX(z) 

Hence 

A„( 2 )= K\^n(-)-%(z) + F u ( )A(0 
(I-F 22 (z) A(z)) ^’ 21 ( 2 )] 

'il Ft i (0 7*22(2) J’ 

A(0] 


( 88 ) 
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which implies that 


max 

A(x)€8X(z) 


; ^\(W-Hz)(F n (z) ~ 9o(z)) W~ l {z)F 12 (z) 

IV F 2i ( 


(z) 


A(*) 


< 1 


F 22 {z) ) ’ 

(89) 


because A 0 (s) e B7i°°. 

Therefore, using similar arguments as in the proof of 
^*(^ 22 ) < 1 . we conclude that 




m ax, 

&6BX 

From Lemma 1 and (85) and (90), we have 

M; 


_(W-\F n -g 0 ) W~'F 12 \ 

F 21 F 22 ) S1 - 


(90) 

(91) 


On the other hand, if 


(W a - l (F u -ao) W-'F 12 \ 
F 21 F 22 ) 


< 1 


then, according to Lemma 1, we have 


max max 

££BX\z\<l 


Ti 

A 


’(w; 1 


< i. 


(z)(F n (z) - go(z)) W~ 1 (z)F 12 (z) 
F 2 i(z) F 2 2 (z) 


')■ 


(92) 


Assume that there is a transfer function g(z) E G, but 
g(z) & Ga- Since g(z) E G, there exists a A (z) E BX(z), 
such that 

9 (z) = F u (z) + F 12 (z)A(z)(I - F 22 (z)Mz))-'F 2 i(z). 

(93) 

Lei A a (z) — W^ 1 {z)(g(z) — go(z)) 4 , then, it is obvious 
that A 0 (.z) € W°° and 


l|Aa( 2 )||oo > 1 

(94) 

which implies that there exists at least one zq E V, at which 

|A.(ao)| > 1. 

(95) 

Let A = A(^q), then, it is apparent that 


A € BX. 

(96) 

Note that 


Aa( 2 0 ) 





( w a H*o)(Fn{zo) - 90.(*o)) 

\ F 21 (zo) 


Fn{*o) ) ’ 



we conclude that ||A 0 (^)||oo > 1 contradicts with (92). 

> //llbneforc*- dim-is no transfer function that belongs to Q 
but does not belong to Ga* 


This completes the proof. ■ 

Noting that unimodular transfer functions of H°° are ca¬ 
pable of representing all the kinds of frequency-magnitude 
characteristics, the assumption that W a (z) is a unimodular 
function of W°° does not sacrifice any generality concerned 
with the application of the results in Theorem 5. 

From Theorem 5, we see that for a weakly corrupted plant, 
the problem of checking whether a transfer function set, which 
is defined by a nominal model and an additive nominal model 
error uncertainty bound, includes all of the possible plant 
transfer functions that are deduced from an upper bound of 
the plant transfer function’s W°°-norm and measured time 
domain input-output data, is equal to a //-analysis problem 
[4], [27]. Here, by weakly corrupted plant, we mean that the 
normalized measured plant input-output satisfies the condition 
max|„.|< K ff((y + E)U~ l ) < 1. 

Finally, we discuss the problem of obtaining the smallest 
transfer function set which is parameterized by a fixed com¬ 
plexity nominal model with an additive nominal model error 
uncertainty bound, and includes all the plant possible transfer 
functions that do not contradict the aforementioned available 
plant information. 

Mathematically, this identification problem can be formu¬ 
lated as finding out the fixed complexity nominal model 
go(z) E 'H 00 , such that for a given frequency weight function 
Wai(z) (which can be chosen as a unimodular transfer function 
of Ti 00 ) and the unfalsified transfer function set, i.e., G , the 
cost function 

J(9o(z)) = max }\W wi {z)[g(z) - </u(2)]|k ( 9 «) 

is minimized. For control engineering background of this 
problem formulation, refer to [7J, [9], [24], and [38]. 

Let H™ denote the transfer function set that includes all 
transfer functions which belong to H x and have degree not 
greater than m. Moreover, assume that 

a = min max \\W ai (z)[g(z) - jryo(z)]||oo, (99) 

<fo tj(z)£0 

9o(z) = arg min max \\W (li (z)[g{z) - g 0 (z)]\\^ (100) 

then, the desirable transfer function set Gopt will be given as 

Qopt = {. 9 ( 2 ) I 9{z) = 9 *,(z) + aW~ 1 (z)A a (z), 

I|a 0 (2)|U < i}- doi) 

On the other hand, when an upper bound of the 7i°°- 
norm of the plant transfer function and a series of the plant 
weakly corrupted time domain input-output data have been 
supplied, transfer function set G can be parameterized as a 
linear fractional transformation and the cost function defined 
in (98) can be rewritten as 

J(9o(z)) 

= max ||W^a<( 2 )[ff(z) - ffo(2)]IU 

g(*)€b 

■= ||W ai (z){F n (*) + F J 2 (z)A(z) 

&(z)€BX(z) 

■ [I - F 22 (2 )A(z)]- 1 F 2 i(2 ) - »o(2)}|k 
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= max 




( z )(Fll{z) - 0o(z)) W al (z)Fl 2 (z) ' 


Fv(z) 


A (z) 


Theorem 6: Assume that 


F 22 (z) 


( 102 ) 


inm max 

flo(*)£H,T; A(2)6B.Y(-r) 


rUW^zMdz) -this)) W ai (z)Fu(z)\ 
'Ll F n (z) F 22 (z) )' 


A(~) 


= dct(I - F 22 (z) A 2 ) 

x det^l - H / ' ai (z)[Fii( 2 ) - 0 o(s) 

+ F 12 (z)A 2 (I - F T2 (z)A 2 r'F 21 (z)}(jA^ (107) 

at every z 6 D, for all Ai € C and A 2 G BX_. It is obvious 
that if fli > fo > 0 , then 

J(fh, Qo(z)) < -Kft, ffo(z)). (108) 

Therefore, when fit > ft > 0 , we have 

mill .I(ft,gn(z))< inin J{ft, go{z)). (109) 

so(*)eW£ soUifWS 


and # 0 (z) is the solution of this minimization problem. Then 

-j»S) -faWa ,fia 
7^ F 21 F 22 


r/i-i 


')] 


= 1 . 


Proof' Since rv = 0 is a trivial case, we assume without 
loss of generality that a > 0 . From 


min 

Qn{z)e H? t 


max 




^*21 ( ^ ) 


W„(z)F a (z)\ 
F 22 {z) ) ' 


A(z) 


X 


— (X 


( 101 ) 


and at g*)(z) the cost function achieves its minimal value, we 
have 


max 

l(;) 



iH ’ a ,(a)(/’i 1 (z)-05(s)) 



^(~~)^(z)\ 

f 22 («) / ’ 

(104) 


From the above discussion, it is apparent that the problem 
of obtaining the smallest transfer function set, which is deter¬ 
mined by a fixed complexity nominal model and its additive 
error uncertainty bound, and includes all of the plant possible 
transfer functions deduced from the aforementioned plant 
information, can be transformed into a /4-synthesis problem, 
with combination of one free parameter search, i.e„ to find 
the smallest fi such that at this point the minimal structured 
singular value (with respect to go{z) 6 H™) of the transfer 
function matrix 

■^W ai (z)(F n (z) - </,,(*)) ^-W' a ,(*)/’„(*)• 
-J?F 2i (z) F 22 (z) 

according to the matrix structure X is equal to 1 . From the 
properties of ihc cost function J(fi, #o(-)) defined in (106), 
i.c., J(fi, </oU)) will not increase with the increment of fi, 
the algorithm for the search of the desirable fi can be easily 
developed, A candidate procedure for this search may be as 
following. 

Assume that two initial values of fi, 0 < fi* < ftf have 
been given which satisfy 


Since /u (F 22 ) < 1 according to the proof of Theorem 5, 
from Lemma 1, we have 


PT 


■f±W nl (F n -'l») -feW a .F l2 \ 
LV 7^21 "22 ) 


- 1 . 


(105) 


This completes the proof. ■ 

For the given transfer function matrix F(z) and the 
frequency weight function W n i(z), define a scalar function 
flo(z)) as 


flo(z)) = I'y 


r / ^W«i(F'n ~ </o) -jfW ar Fn' 
21 f 22 


1 Fi 


74 


( 106 ) 


for fi > () and 00 (z) G 
Note that since /f_v(^W) < 1 , wc have 
rl ■” 1 

dp " I ~ ' -±F 21 (z) F„[z 


jjW ai (z)(Fn(z) - f/ 0 (z)) -jj‘W ai (z)F ]2 {z) 
) 


A 2 


min J{fi", 00(2)) > 1 . min J(/ffJ\ 00(2)) < 1 . 

q u [z)£H^ 

In the TVth iteration, let fi - fi^~ l 4- (] - pjv)/^ - " 3 . If 
m iHflQ (a )€W£ !ja(x)) > L let fi? = fi , fi[* = /fl/^ 3 ; 
otherwise, let fij? = ftf = fi. If 0 < p^ < 1 for all 

the iterations, it can be proved that this algorithm will converge 
to the desirable fi. For detail, refer to [39]. 

In summary, we claim that the difficulty in developing the 
optimization algorithm for obtaining <75 (z) and a that describe 
the desirable transfer function set G op t lies on /t-synthesis 
problem and the large dimension of the transfer function matrix 
F(z). 


V. Conclusion 

In this paper, we investigated the structure of a transfer 
function set which includes all the possible plant transfer 
functions, under the situation that an upper bound of the 
H^-norm of the plant transfer function is prescribed and the 
plant time domain input-output data are only weakly corrupted 
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in its output. Results show that such a transfer function 
set has an elegant structure, i.e., all of its element can be 
parameterized by a linear fractional transformation of a fixed 
transfer function matrix and a structure fixed, norm bounded, 
free transfer function matrix. The fixed transfer function matrix 
is completely determined by the plant measured input-output 
data, the noise level and the upper bound of the H 00 -norm of 
die plant transfer function. Moreover, it has been shown that 
the problem of obtaining the smallest transfer function set, 
which is defined by a fixed complexity nominal model and its 
additive error uncertainty bound and includes all of the plant 
possible transfer functions, can be transformed into a series of 
//-synthesis problems. 

While in this situation, the plant possible transfer function 
can be compactly expressed, the results in the last section sug¬ 
gest that it may be futile to pursue the analytic solution for the 
problem of searching the transfer function that belongs to H°°, 
has a fixed complexity, and approximates all the plant possible 
transfer functions in the smallest additive error measured 
by a frequency weighted 00 -norm. Moreover, although this 
problem can be converted to a series of //-synthesis problems, 
it is still computationally difficult, due to the large dimension 
of the parameterization of the plant possible transfer function 
and the difficulties met in //-synthesis problem itself. From 
these points, further research in developing computationally 
tractable approximation methods is still needed. 
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Explicit Formulas for Optimally 
Robust Controllers for Delay Systems 

Harry Dym, Tryphon T. Georgiou, and Malcolm C. Smith, Member, IEEE 


Abstract —This paper considers single-input/single-output sys¬ 
tems whose transfer functions take the form of a strictly proper 
rational function times a delay. A closed-form expression is 
presented for the controller which is optimally robust with respect 
to perturbations measured in the gap metric. The formula allows 
the Hnc loop-shaping procedure of Glover-McFarlane to be 
carried out explicitly for this class of systems without the need 
to first find a rational approximation of the plant. The form of 
the controller involves a certain algebra of “pseudo-derivation” 
operators. These operators, and their matrix generalizations, play 
a central role in the derivation of the controller. A discussion 
of the main properties of these operators will be given. An 
example will be presented of a controller design to achieve 
disturbance attenuation and robust set-point following for a plant 
with two lightly damped poles and a nontrivial time delay. The 
performance is compared, and shown to be superior, to that of 
a Smith predictor. 


Notation 

R and C denote the real and complex fields. 
denote the standard Hardy spaces in the right-half plane 
(RHP). £ p denotes the standard Lebesgue space and || || yj 
its norm. The inner product in £ 2 is denoted by (•,■). For 

x 6 £ 2 (-oc, 9 o) we denote.the Laplace transform of x by 

-— T 

x. If G(s) is a matrix function of s,G*(h) G(-.s) is 

the conjugate of G(m), If G(.s) is real rational, G*(.s) = 
G(-») T . H 7 i , £2 , etc. denote corresponding spaces of vector- 
valued functions. (Wj) 1 - denotes the orthogonal complement 
of H2 in £ 2 (- jog , joo ). We denote by II + (respectively, 
//_) the orthogonal projection from £ 2 (-joo.joc) onto 
(respectively, ('H g) 1 ). Dimensions will be suppressed if 
they can be inferred from the context. The symbol spec(A) 
designates the eigenvalues of A. 

I. INTRODUCTION 

I N this paper we consider the class of single-input/single¬ 
output delay systems of the form P(.s) = p~* T P 0 (a), where 
PoW a strictly proper rational function and r > 0 is a time- 
delay. We present a closed form formula for controllers which 
are optimally robust with respect to gap/normahzed coprime 
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factor uncertainty. This formula is a generalization of one 
given in [191 (see also [181) for the case of a first-order Po(.s). 
The approach also builds on the work of [24] which developed 
state-space formulas for computing optimal II ^-performance 
for such systems. 

The derivation of the controllers is based on a synthesis 
of state-space techniques, associated with the rational part 
Po(«), along with the use of a certain commutative algebra 
of operators. These operators are defined as follows 

0..1 := MzJiH 

M — fV 

(for a £ C and /(s) any merotnorphic function which is 
analytic at a), and they represent an analog of the “back¬ 
ward shift” operator in the continuous-time setup. Some early 
use of these operators ([2| and |3|) was in the context of 
reproducing kernel Hilbert spaces and scattering iheory—see 
[7] and [ 8 ] for more recent accounts, numerous references, 
and applications to interpolation theory. The formula for the 
optimal controllers contains infinite-dimensional “distributed 
delays” resulting from the action ot the r'Vs (or a suitable 
matrix generalization tin) on the delay element <" sr . 

The controller formula derived in this paper allows the 7Y X 
loop shaping design procedure of Glover-McFarlane to be 
carried out explicitly for the delay systems considered without 
the need to first find a rational approximation ot the plant. 
We illustrate this fact by designing a controller to achieve 
disturbance attenuation and robust set-point following for a 
plant with two lightly damped poles and a nontrivial delay. 
This type of system is a suitable nominal model in a number 
of process control applications. We will show that our design 
has significantly better performance than a Smith predictor. 

Research on the subject of 'H^ control for distributed 
parameter systems has been on-going since the mid 1980’s. 
One of the objectives of this research has been to develop 
explicit formulas which provide insight into the limitations on 
performance imposed by infinite-dimensional elements such as 
delays. An additional goal has been to understand the effect 
of such elements on the structure of the controller, thereby 
giving an alternative design approach to finite-dimensional 
approximation of the plant. Early work was concerned with 
computing the optimal performance for problems such as 
weighted sensitivity minimization [10], [13], Subsequent ef¬ 
forts focused mainly on developing computational method¬ 
ologies for increasingly general situations [14], [22), [281, 
[29], [34] and for explicitly computing the controllers. The 
computation of the optimal performance involves finding tlu 
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maximum singular value (norm) of a certain generalized 
Hankel or Sarason operator. The construction of the optimal 
controller requires computing the associated singular vectors. 
Extra care is needed in case the infimum is not achieved 
in the optimization problem [ 11 ] or in case the underlying 
operator is not compact. (We mention incidentally that neither 
of these two cases arises in the problem treated here.) The 
computation of singular values and vectors usually exploits 
special structures in the family ot systems considered, such as 
the presence of general inner factors multiplied by rational 
functions. In the case of delay systems, the computations were 
originally cast in the form of a certain two-point boundary 
value problem in the lime domain [13], [24], | 35 ] which in 
turn led to a solution in the form of a transcendental equation. 
Analogous computations can be carried out in the ^-domain, or 
in the 2 -domain after a bilinear transformation, as in the skew- 
Toeplitz approach in [ 1 | and [22|. The general methodology 
has been shown to apply to quite a general class of distributed 
parameter systems and provides a framework for obtaining a 
numerical solution [12], [231. An alternative viewpoint for Hrx, 
control of distributed parameter systems, based on an infinite 
dimensional state space approach, has been considered in [ 5 | 
and 16] and the references therein. 

The current work is part of a continuing effort to realize the 
original goals of the above research program. A formula for 
an H v controller is derived which takes a particularly simple 
form and is easily programmable. Moreover, the controller 
minimizes a certain cost functional which has proved to be 
especially convenient for design. Wc would like to mention 
that, in some recent work [30], an algorithm has been outlined 
for the computation of suboptima] controllers of delay systems 
lor robustness in the gap. At the moment, it is an interesting 
open problem to seek an explicit formula of the type given in 
this paper for the suboptimal case. 

The paper is organized as follows. In Section II some 
background material is reviewed on the properties and com¬ 
putation of gap/normali/cd coprime factor optimally robust 
controllers. In Section 111 the singular value/vector equations of 
Partington-Glover, and the procedure to find the gap optimal 
controller from them, arc summari/ed. In Section IV the 
(scalar) operator <) n is introduced, and the main properties of 
the operator, which will be used in the controller derivation, 
will be described. In Section V the general procedure for the 
computation of the optimal controller will be carried out tor the 
case where P(m) = t~ HT /{h - rr). The purpose of this section 
is to motivate the computational steps that are necessary in 
the higher order case. In Section VI a generalization of the 
0 Q operator to the matrix case will be presented together with 
its properties. In Section VII the general controller formula 
is derived and, in Section VIII, an illustrative example is 
presented. 

II. Background 

Wc consider the feedback configuration of Fig. 1 , denoted 
by |P, K] t where P and K are the transfer functions of the 
plant and the controller, respectively (and are ratios of H Qc - 
functions). We say that [P, K\ is stable if the closed-loop 



Fig. 1 Standard feedback configuration. 

transfer functions, mapping d t to Vj (/,.y - 1,2), belong to 
. If [P, A'| is stable then the following quantity can be 
defined 

h r .K- (/+ KPr'(LK) 

which is the gap metric robustness radius for the feedback 
system. That is, bp is the radius of the largest ball of uncer¬ 
tainty about P, measured in the gap metric, which is stabilized 
by K. For the definition of the gap metric, connections with 
normalized coprime factorizations, and associated results on 
robust stabilization the reader is referred to 116] and [19], 

It appears that bp^ is a useful quantity to maximize for a 
feedback system, especially if it is weighted appropriately* 
In fact, the maximization of b\yp y a , for some weighting 
function W chosen by the designer, is the basis of the 
Glover-McFarlane ' H ^ loop shaping method [21]. This prob¬ 
lem is also the same as the optimal robustness problem in the 
weighted gap metric (sec [41 and [ 151). From the point of view 
of compulations, it is sufficient lo consider the unweighted case 
only. This will be our point of view until the example in the 
last section. 

Wc now summarize some background results on the opti¬ 
mization of l) P f K . Let P — N/M , where A/, N E is a 
normalized coprimc factorization of P (i.e., NN* + MM* ~ 
1 ) and let V — (~N,AI). We define the optimal robustness 
radius 

(H ■ h r k ■ (1) 

AM 1.1/ 

It can be shown that the following formula holds 

&„„,(/’) Ml-Pf') 172 (2) 

where T is the Hankel operator defined by T = I1-F*\-h 2 
(see [19] and [16]). 

If F is continuous on the compactified right-half plane 
then it follows (from Hartman’s Theorem) that the operator 
F is compact and hence that it achieves its norm ||/ 1 ||, 
which is equal lo the maximal singular value of the operator. 
Moreover if 0 < A = \\r\\ < 1 then there is a unique optimally 
robust controller which can be determined from A and any 
corresponding singular vector. Specifically, let x 6 ^ 2 * 1 / = 
(iii. 2 / 2 ) e satisfy 

r.i=Xy\ ( 3 ) 

r*r = Ax (4) 

and define a = £T+F*Xx. Then the optimal compensator has 
a transfer function K = - u\/v 2 ■ See [18] for a more detailed 
explanation of these facts and further references. 




638 


IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 40. NO. 4, APRIL 1993 


m. COMPUTATION OF SINGULAR VALUES AND VECTORS 

I 

In this section we consider a single-input/single-output 
system with transfer function of the form P(s) = e~* T Po(s) 
where Pq(s) is a strictly proper rational function and r is a 
time-delay. Since is inner we can find the normalised 
coprime factors of P(s) by finding the normalized coprime 
factors of Pq{s) using the state-space construction of [ 20 ] 
and [31] and then multiplying the numerator by Let 
P 0 (s) — C(al - Ao)~ 1 B be a minimal realization with state 
dimension n . We define R to be the stabilizing solution of the 
Riccati equation 

A 0 R + RAo - RCSCR + BB* = 0 (5) 

and set A = Ao + EC where E = - RC *. Then spec(A) is 
in the open left-half plane, and 


Next* by substituting for x and y in ( 8 ) and (7), we obtain 
the relations 

u(0) =A~ 1 i?w(0), (13) 

w(t) = A ~ 1 Sv{r) (14) 

where S is the unique solution of the Lyapunov equation 

A*S + SA + C*C = Q. (15) 

(Incidentally, observe that 7? satisfies the Lyapunov equation 
AR + RA* - 1 - BB* + EE* = 0 .) This leads to the equation 

0 = L(A)w(0) 

where 

L(A):=|[-5, A/]exp(ffr)[*]}. (16) 


F = (~N,M) = {0J)^C(sI - Ay^Bv^.E). ( 6 ) 


We assume that P(s) ^ 0 which implies that ||r||>0. 
Since F(a) is right invertible over which is equivalent 
to P being stabilizable, ||r|| < 1 (see [16, Sections III and 
VIIJ). Finally we note that, since is continuous on the 
compactified right-half plane, the optimal controller for P(s) 
can be found from the singular values and vectors of f. 

We now summarize the formulas of Partington and Glover 
[24] for the computation of the singular values of r. The idea 
is to give a time-domain realization of (3) and (4). These are 
the equations 


/ no 

e 4 ^ u " r ^C*x(u) du , 

w{f) = -A*w(t) - C*x(i) fort) < / < r, 


Xy\(t 



+ r) 


-B* 

. «a(0 


£* 


and 


e A f u;( 0 ) for f > 0 

[no 

v(0)= / e Au (-Byi{u + r) 4- Ey 2 (u))du, 
Jo 

v(t) = Au(/) — Byx(r - t) for 0 < / < r. 

\ /jl\ _ f Cu(t) for 0 < ( < t 

^ f, -(Cr^“ r )i;(r) for t >r 


(7) 


( 8 ) 


(9) 


These equations can be solved by considering a two-point 
boundary value problem on the interval [0, r]. On this interval 
the differential equations become 


where 


Thus 


d v 
dt w 



A 

X~ l BB 

~ [-A-'C'C 

-A * 

[«(<)] _ e Ht 

«(0) ‘ 

NOj ~ 

w( 0) 


( 10 ) 

(ID 

( 12 ) 


Proposition l [24]: A necessary and sufficient condition 
for A to be a singular value of T = II.F*\n, is that 
det L{ A) = 0 . □ 

It follows that, to find the optimal controller for P{s)> we 
can use the following procedure: 

PI) Find the largest A in [ 0 , 1 ] such that dot L( A) = 0 and 
then find a nonzero vector w 0 such that L{X)u)q = 0 . 
(By the assumptions on P(*),\ / 0,1.) Then find 
oo = A“ A 7fio ( ) and solve for u(/) on [ 0 , r] using ( 12 ) 
with u( 0 ) = vq and u;( 0 ) = ui 0 . 

P2) Calculate A x(t) on [U,oc) from (9). 

P3) Find v = /7+F*A./, whence K(,s) = —u\/fi 2 [18J. 


IV. The Operator 

Let f(s) be analytic in ft C C, and define 


(Oaf)M 


/(»)-;(«>) 
.s — (f 


for a £ ft. Then d fV defines a linear operator and has a number 
of interesting properties (see, e.g., [71 and [ 8 J). Several of 
these properties will be exploited in this paper and we list 
these below. Note that d n resembles a derivative, but without 
a limiting process. With a — 0 the operator is the adjoint of 
the shift operator on Pin of the disc. Thus, can be thought 
of as a generalized backward shift. It is readily verified that 
it satisfies the resolvent identity 


()<> - <Jfi = (<* - [i)0 n 0fi (17) 


for any pair of points a, ft € C which are points of analyticity 
for the class of functions acted upon. A direct consequence is 
that O^Ofi = O f jd n . The following pseudo-derivation identities 
take the place of a product rule 

(V<*fg){s) = (0<>f ){*)!)(*) + f(n)(0*9)(»)* (18) 

(drtfsH*) =/(*)(&«'/)(*) + (Onf)(»M<*)- (19) 


(They differ from the usual product rule of differentiation by 
the fact that one of the two terms involves evaluation of one 
of the functions at the point a.) If a + 7 ^ 0 then 

1 + 7 ) 

;—;-:— 

«+ 7 («+ 7) 


( 20 ) 
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If a + 7 # 0 and g is analytic at a, then (18), (19), and (20) and hence PF'e~* T is entire since it is the Laplace transform 


give the following two identities 


of a function with support on the interval [0,r). Further 


(21) 

d lx PP;e-* r = [ T (d a v-**)r(u-T)dn 

Jo 

* + 7 « + 1 \ + 7 

(22) 

= [ [ r)du 

Jo Jo 

If ftc(7) > 0, and g € ‘H-i then 


= - / / F' (v - U] r{a-T)dudv 

Ji) Jo 

n+ 9 =• 

,s — 7 

(23) 

- - j J — :r)d,rdv. (27) 

Finally wc present the following result which will 

be needed 

On the other hand 


in the controller derivations of Sections V and VII. 

Proposition 2: Let 7?r(7)>() and cv 4 7 ^ 0. Suppose 
F 6 Hoc admits a representation of the form F(s) = 
c+fu*' e~ 9t f(t) dt with v constant and /(/) 6 £i[0. ou). llien 
IJ+ F*d«c ' HT /(.s + 7)*’ is analytic in C\{—^}, and 

+ = (24) 

for k — 1,2, ■ ■ ■. 

Proof ; We consider first the case A* - 1 . Under the given 
assumptions it is readily checked that 


Il + F*0 tl — 7 — =- 7 —/7fF* 1 


.s + 7 nr 4 7 1 .s 4 7 

1 F*(—~f) 

<* + 7 .■>■ + 7 
F*(- 7 ) 


'V 


A + 7 


^iUl^F* 


J 


^ + 7 


: / X -» r ,-,(0 

./ -oe Vo 


f// 


PF;i) a e-' r =P [ (■'"(-r n(x - T) )rfj' F <~ H f*(t)d1 
J 0 t/- nu 

= F j (-c n<J - T) ) j f—"/*(v-x)dvdx 

which turns into expression (27) after interchanging the order 
of integration and then taking the projection. V 

Step 2: The function ll+F'd-^F MT is an entire function, 
and 


n+F'O'td^i ” = d'JI+F*i)^<-* T 

for every complex number n such that n 4 7 / 0. 

Proof of Step 2: It follows using Step I that 
+ " r is entire. Clearly /7+r* r"" 7 ' = r* 

() 7 ^ Mr is also entire. Wc now verify the formula for 

each of the components of F(, s) = r + Fi(.s) separately. 
The first is easy 

77+r*c7 a cL-,< - sr = " T = iUl+c'i). ^e“ ,T . 

Next, for (* / -7 we have 

= n + F: 0, ‘~ - ( - 2 c-* T 

a + 7 

__ 2. f - wr 

1 u + 7 

= — ^ pf:<~* t 

n 4 7 

=cU)_,PF,v-" r 
= 0 l> PF;0.,e-" T 
= y„f7 + F;y_,r- 8T 

using Step 1 . The operator P is needed here because F 1 V" T 
may not belong to /^(jR) ar| d hence IT+ cannot always be 
applied to it directly. V 

With the help of the preceding analysis, we can now prove 
the proposition for the case k = 1 . For tv ^ -7 

[I + F*0 a —— =n + F*dJ l 

•v + 7 l S + 7 J 

= t) Q 7f + F*|t)_v“" r + 

using (26) and Step 2. This establishes the required equality. 


(25) 


(26) 


Equation (25) shows that II+F*i) iy 1 /(.s 4 7 ) is analytic in 
C\{” 7 ). The proof now proceeds in two steps. We first 
introduce the natural projection operator, in the Wiener algebra 
on the line, defined as follows 


with (f e £i(R) and .s G jR. 

Step I: Let Fi{s) = F(m) - c = j { ^ c~ st f(i)(]i. Then 
PFfc~* T is an entire function and 

PFld a v- T = d n PF{v- mT 

for every choice of ft € C. 

Proof of Step I: The proof is an adaptation of [9, Lemma 
4.4J. Observe that 

,0 

PF^ T e"* 1 f*(t)dt.e~ aT 

J DC 

= P f e~* v f*(u-T)du 

J — CXI 

= f T c“ au f* [u — t) (ill 
Jo 
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To complete the proof we note that the result for the case 
of k 3 = 1 can be applied to 


Proof: Setting B — C=\,Aq = (j gives A = - 7 , R = 
a 4 * 7 and 5 = 1 / 27 , whence 


, J 1 \ / fc-l\ 

(.9 + 7)1 s+ 7 + “ 1 ••• ( .s + 7 H-— j 

for n sufficiently large, to give 

n+F*d a e~* T g n (») = d a n + F*<-* r 9n (s). 

By letting n —► oo we can check in turn that 

9n(») -*</(■•») = ^ ii‘ C 2 {jR), 

daUn(») ->d n g(») ill £ 2 (./R). and 
ff + FV-* T . 9 n (.s) ->n + F'f-' T g(«) 

pointwise in C\{ — 7 }■ Thus 

II + F*d n c-“ T g(3) = Uni n + F*0 o c~ iT g„(a) 

n —♦ rx 

= Urn O n II+F*r~“ T g n (a) 

= 0 cv n+F*(-* T g(s). □ 


V. First-Order Case 

In the present section wc consider the simplest example 
of a plant having transfer function of the form “time-delay x 
rational function;” P(s) = e~* T /( s-cj). An expression for the 
optimal controller was derived in [18]. A simplified formula 
involving the operator c) a was given in 119]. Below we present 
a new derivation of the optimal conn-oiler making explicit use 
of certain properties of the operator i) n . The computations 
illustrate several key steps which motivate analogous steps in 
the general case. 

Proposition 3: Let P(s) = F _,,r /(.s -• o) where ct is any 
real constant and r > 0 . 

a) The optimal robustness radius for gap ball uncertainty 
is given by 

fropt(P) = (l-ALx ) 1/2 (28) 


where A m ,, x is the largest solution of the following 
transcendental equation 


p 2„r _ Z±Ji (IZliV 

cr-/i\7 + /'/ 


(29) 


where /i = /i(A) = y/ 7 2 — A -2 and 7 = \/T+~a 2 . 
b) The transfer function of the optimal controller is given 
by 

/f„ pt (») = ~—ri 
\/<t 2 ~ M 2 - 

(30) 


where p = y/^ 2 - A^axs in the case p ^ 0. (A general 
formula covering also the case p = 0 is given in 
Theorem 1 .) 


Then 



with eigenvalues ±p where p = 1 / 7 2 - A" 2 . Note that p 
can be complex but ±p ^ 7 . An eigenvalue-eigenvector 
decomposition of H leads to 


rl Hi 


(A " 1 

\7 + (< 



0 


) 


( 7-/i -A"*W 

\-(7 + /0 A -1 / 2 / t ' 


By substituting into (16) it follows that 
L( A) = ()<* r 2 ' JT 

_ -A ~ 2 + 27(7 - //) (fT + 7)(7 + />) — ! 

-A 2 + 27(7 )- /<) (fT + 7)(7 - /1) - 1 
= !l ~ (') 2 (fT + 7)(fT + /<) 

(7 + I 1 ) 2 (fT +7)(fT - //)’ 


This gives (29). Also (28) follows from (2). Setting w {) = 1 
gives U[) — A 1 /?, which leads to 

»(/)=d o)^( a ; /; ) 

=«( " f 


on [ 0 , r], where 


a — 




-(fT + 7 )(fT- //) 
2(i\ 

(o -f t)(ct + /i) 
2pX 


(The calculation of n and is eased by judicious use of the 
fact that 1 = 7 2 - ct 2 .) Thus 


M 0 =r 


f av** + /*-' If 

| r T(t-r)( mi/l r +/r ^r) 


on [ 0 , r] 
on [r, ou). 


Hence 



= (-«(7 + - nn - 

H + 7 
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where the first step'uses direct integration and the second step 
follows from (22). Next 


=(~«(7 + vy ,T o„ - p(i - fi)c 

n ( f ST /(s — 7 ) \ 

+ \(’ s + fr )/(« - 7 ) ) .1 + 7 
= (“ «(7 + lt)t J,T 0, , - 0(7 - li)i~' IT (i- u ) 
( -l/27(« + 7) 

V(l + (<T + 7 K)«- ,r /(« + 7 ), 


where we have used Proposition 2 in the first step (which 
is applicable because ±/i + 7 ^ 0 ), and ( 20 ) and (23) in the 
second step. Using (20), the first element of T1+ F*\r becomes 


-1 

27(* + 7) 


(m^ T + /ir"' ,T ) = 


~(tr + 7) 

A(l 2 - p 2 )(* 4 7) 


of 71 and can be evaluated explicitly in terms of the Jordan 
decomposition of A and the values of F and its derivatives at 
the points of the spectrum of A , as in (39) below. The present 
contour integral formulation is, however, often easier to work 
with. In particular, it follows from Cauchy’s theorem that (32) 
yields 




F(s) - F(n) 
s — n 


for .*m* e 12, when A = a is a scalar. This is consistent with 
the definition given in Section IV. On the other hand, if F 
is equal to a scalar function / and A is either a matrix or a 
scalar, it is standard to define 

f{A)= 2V, l ( iT ~ A )~ X fi s ) dz 


w here the equality f ollows by substituting for (\Ji and = 
\/(ct -f //)/(rr - //)(7 - n )/(7 4 - fi). The second element of 
n+F*\x is equal to 

+ (-/*(7 - ' ,Jr )(I + ~~Y-i< + + T) 

«(7 + /i) f „r + / j (7 -//) .. /(r \ d , \ 

P - 7 -/*-"> / J*■+ 7 

after using (17). In fact, by substituting for n.ri and ( ,fJT , the 
coefficient of cL t in A is easily seen to be zero. Next, using 
similar substitutions, we get 


where 71 is any simple closed curve inside 1 ) which encloses 
sjh>c(4) (see e.g., 126, Chapter VI.31). Using the identity 

(zl - si) - 1 - (zl - , 1 ) - 1 = (si - .4 )- 1 (si - A)(zl - si)- 1 

we see that 

(Oyf)(*)=(«! -A)“‘{JL [ -L-f( z) ,lzI 

l- y -/(.)*} 

= (*/ - ^) _ 1 (/(.s)f - f(A)). 


A = 


2/iA 

(ft + 7)y/fT a - y 

‘2/i\(s + 7 ) 


+ + (a- //)£>-,,^ 

{-( rr + /')'V + (<t - )<>-,. }<•" ,r 


((T+7)\/^-/'V ff2 2 /< 

2/tA(.s+7) 7 2 - y 


where the second step uses (2J). Taking the quotient of (31) 
and the abpve expression for A gives the required result. □ 


VI. The Matrix ^-Operator 

To facilitate the controller computations in higher order 
cases we will need a matrix generalization of the operator 
0 n and its properties. 

Let A € C r,x " and let F(s) be an 7 / x 7 * matrix whose 
elements are analytic in an open simply connected region 
0 c C for which spec (4) C 12. Then we define 

(0 A F)(») = ~ I -^(zl ~ Ar'Fiz) dz (32) 


Let .1 e C" x " and let F(s) be an n x r matrix whose 
elements are analytic in an open simply connected region 
12 C C. Take n e 12 with rv ^ .spin*(-4). Then we can show 
that the following generalizations of ( 20 )—( 22 ) hold 

cU.s I - A)- 1 - -(si - A)-\nI - A)- 1 

= -(nl - A)-\sI - A)-\ (33) 
O n ((»l - A)- ] F(.s)) = (aI - A)-\i) n F(s) 

- (si - A)' 1 F(s)). (34) 

0 n ((si - A)- l F(s)) (si - A)~ l (OnF(s) 

-(nI-Ar'F(")). (35) 


For the matrix <) operator, a version of the resolvent identity 
still holds. Let A,B 6 C ? ' x " with AB — BA then 


(1a — <1b = {A — B)()a<)b (36) 


for a inside 7 ], where 71 is any simple closed curve (with 
positive orientation) inside 12 which encloses spw(/l). We 
also define {d A F)(») by the same formula if either A or F(s) 
is scalar. The above definition (32) is independent of the choice 


where the operators act on a matrix F(m) whose elements are 
analytic in an open simply connected region f2 C C containing 
Hp<'c(A)U$pec(B). To see this, take two simple closed curves 
71,72 inside f 2 , both containing spor(24)Uspoc(B), and with 
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7 i strictly inside 72 . Then, for a inside 71 


for i > 1. Thus 


/ - 2 — (w/-B)- 1 F(tii)*ocfa 
7-.. - z 



■ (wj - Z?) ^(w) dw 


— / — 

2tt j U) - a 

■(wl - A)~ l (wl - B)~ 1 F(w)dw 


where the last step uses the fact that w is outside of 7 ]. The 
required result now follows from the identity 

(A-B)(wl-A)~ l (wl-B)~ 1 = {wI-A)wI-B )“ I . 


Let / = /+ + /_ e C 2 {-Joc,joo), where /+ e H 2 and 
f- 6 ft^- Then 


- ('♦<«. 270 h*i) ‘ («*>• SoTT?)) 

for 72e(s),>0. Now consider L € C rixn where spec (L) is 
in the open right-half plane and let g € H 2 . Then 

n + {al - L)~'<j(s) 


■£.S5fW 0tf - ir, ' 0, »* 

/ J " T ^ r (zI-L)- 1 9 (z)dz 

./—IOC * 


- zi 

./ — ;oc 

= (%?)(«)• 


(37) 


Let I = P.JP ~ 1 be a Jordan decomposition with A- Jordan 
blocks of size n, and corresponding eigenvalues p,, and 
accordingly let 


(Ut 

P = (V and : 

\u; 

Then 

( rf -ir>-gv,{t 5 -L 7 a £ ;*}ie os, 

where Z m is an rn x rn matrix with ones on the first 
superdiagonal and zeros elsewhere. If 71 is a simple closed 
curve containing * and /a, and 7/7 is a circle of radius /? about 
zero, then 

_I_ [ _i_i_ dz 

J~ n (2 -•)(*- /*)' 

1/1 1 

= liin -—r / 7 -- 7 - 77 (7z = 0 

fl-rx, 2nj J jlt (s - a) (z - n) 1 




= (»'/)(«)- 

Hence, from (38) 


(diF)(a) = ££ VX\ l U;(dlJ')(s). (39) 

1 = 1 / = I 

VII. General Controller Formula 

In this section we will generalize the formula of Proposition 
3 to the case where P(,s) = and Po(«) = C(.s/ - 

Au)~ l B is (a minimal realization) of degree n. We recall the 
notation of Section 111, namely A = Aq 4 EC,E ~ —B( y *. 
with R,S<H defined as in (5), (15), (11). 

To carry out the derivations of this section, a mild gencncity 
assumption will be needed on Po(.s), namely that the reflec¬ 
tions of the zeros of PoM about the imaginary axis are disjoint 
from the poles of P 0 (*)- As shown in the lemma below, this is 
equivalent to the spectrum of If and A being disjoint, which 
is the assumption we will need in the compulations below. A 
discussion on the validity of the formula when the assumption 
fails is given in Remark 2. 

Lemma I: Let A, B, ( \ and I o(.s) be defined as above, and 
let If be defined as in (11). Let {C, : / = 1. ■ ■ ■, k\ be the zeros 
of P 0 {s) and let {tt, :/= 1 ,■■■,/} be the poles of Pu ( m ) 
(since /o(.s) is a scalar function these two sets arc disjoint). 
Then the following statements are equivalent: 

a) {C, : f = 1, •■■,*•} D {-?f7 :/ = 1, ■■■,/}- (9. 

b) (A,B) and (A,E) arc controllable. 

c) spee(Z/) fl spec(A) = 0. 

Proof: Define {£ # + : t — + } and {(, : / = 

1, ■ •, k~] to be the set of zeros of Jo(.s) in 'Rc(s) > 0 and 
7Z(‘(s)<0 respectively. Similarly, { 71 -+ : / - 1. ■ ,/ + } and 
{ 7 r # *~ : 1 = 1, •■■,/*} define the set of poles of Po(s) in 
the same regions. We now introduce the following subsidiary 
statements: 

a,) {C : 1 = 1, fc-}n{-g: 1 = J, 

a 2 ) {Ci + :i = l, -- -,A: + }n{—7r“ : / = 1, • ■ •} = 0. 

bi) ( A,B) is controllable. 

b 2 ) (A, E) is controllable. 

b' 2 ) C(—75/ — A)~ 1 B 0 for all i\ €E spcc(A). 

It is clear that a) aj) and a 2 h since imaginary axis poles 
or zeros of P 0 (*) can only belong to one of the sets in 
a). The lemma will be proved by establishing the following 
equivalences: ai) O bx);a, 2 ) <=> b 2 ) <=> b' 2 );I>i), b' 2 ) <=> c). 

Let Po(fl) = p{s)/q(») where p(.s), q{n) are coprime poly¬ 
nomials and q(s) is monic. Let v{n)/r($),q(a)/r{s) be nor¬ 
malized coprime factors of Po{s) over (so r(.s) is a monic 
polynomial of degree n). 
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ftO <=> hi). Asilime thal (A,B) is not controllable. Then 
for some a (TZe(a) < 0). r(«) = p(a) = 0. Since p* p+q*q = 
r*r then q*{ot)q(at) = 0. Therefore <y*(rv) = q(-Tx) = 0 since 
p(fl) and g(.s) arc coprime. Thus ai) fails. Conversely, if ai) 
fails then for some nt (7 Zr(o) <0), p(n) = q(-n) = 0. This 
means that r*(nr)r(a) = 0 which implies r(n) = 0, since r(s) 
is Hurwitz. Therefore, p(») and r(.s) have a common factor. 
Since (C\Ao) is observable, so is (6\A), which means that 
(A,D) is not controllable. 

a* 2 ) <=> l> 2 )* The proof is the same as for jii) <=> bi) but 
with p and q interchanged and B and E interchanged. 

ri' 2 ) <=> b f 2 ). Suppose 1>2) fails. It follows that there exists 
an <\ (72r(rv) < 0) such that r(<k) = />(—?!) — 0, Since 
p*p + q*q = r*i\ it follows that q*(n)q( rr) - 0 and hence, 
since p(s) and q(: s) are coprime, that f/*(o) ^ 0 and q(<\) - 0. 
Thus a- 2 ) fails. Conversely, if * 12 ) fails, then we can write 
I\i(h) = (,S'-«)/( ( s-|-a)T > ] (,s) where 77 is a zero of p(.s), o is 
a zero of ry(.s), and hence Pi (.s) has degree 7 / - 1. We now find 
a normalized coprimc factorization l\(s) = TVi(s)A/j(.s-) — 1 
over ft^.Then P n (*) = ((js— 7Fj/( A + cO^Vj(^))A/|(«)" 1 a 
normalized coprime factorization. Therefore, since the degree 
of ((.s - 77)/(,s + rv)/Vi (a), A/j (,s)) must be n,~rv is in the 
spectrum of the matrix A. Finally, since normalized coprime 
factorizations arc unique modulo a unitary constant then bjj) 
fails. 

hi), 14) <=> r). Let n C spcH(.l). Then (\I + A* is 
nomingular since .1 is Hurwitz. Thus 


dot ( 0 / - //) - dot 


\X l ("C 


-A l /j/r 
ai + .r 

- del (a/ - .1 + F,r 

■ dH (o/ + /n. 


] rv) 

(40) 


Clearly 

dpt («/ //) / 0 B 4 (,\l + .IT'C’ ^ 0 

<«• f'(— (7/ — .1) ’77/0. 


Thus c) l/ 2 ). Also, if (.1, B) is not controllable, then there 
exists an rv E spec‘(.4) such that r 7 (n/I - A) = 0. B ” 0 
for some r / 0. This implies by (40) that rv 6 spe<(//). Thus 
c) b[). Conversely, let hi). 14) hold. Take n E spoc(/l) 
and note that fi := A 2 B*(al F /1*)“ 1 ^ 0. We now claim 
that 0 / - zl 4 fiBC 1 is nonsingular. To see this, note that 
C(s7 - <4)“'77 - p{«)/q{«) - p(»)/ ( lr1 (*7 -*yl) (since r/(.s) 
is monic) and that p(.s) and d( , L(.s7 - /I) have no common 
factor (since (.4, B) is controllable and (f \ .4) is observable). 
Since 

<lc( (L + flC(xT - .-I) -1 77) 

= del (.s7 - /l) -1 dpi (hI - A + /777C) 

it follows thal dpi (aJ - A + fiBC) = dpi («/ - /l) + /lp(.s), 
and hcncc that <1**1 (n/ - /1 + /7/7C’) = (ip( o) ^ 0. Therefore, 
rt £ sppp(//) using (40). This proves c). □ 

We now follow the steps PD-P3) outlined in Section Ill. 
Let A be the largest solution of del L(A) = 0 in [0, l] and let 
Wo ^ 0 solve L(A)v'o = 0. Then 

7;(/,) = (7,0)p"^ A ;^).o 


on [0,t]. This gives 

(C',0)c m r ^)w 0 on [0, r], 

M0= (\~ 'n\ 

<fV- 4 O-r)(/. 0 ) f .«rM ^ j ulQ on [ TiX) ) 

by (9). Direct calculation shows that 

/ T c- Ki v m ,11 = -c) h (,-' t )c Ht 

J 0 


and 


Thus 


jf t—'rW - r > dt = i~* r (sl - A) ml . 
\.r=C(,-' T (sI-A)- l (JA)) -(/, Q)d u (v-' r )) 

( 4 .) 

We now assume that II is diagonalizable, for simplicity, and 
write II = )£,w f v*p,. (In the proof of Theorem 1, we show 
that the result also holds without the assumption.) Thus, the 
term in parentheses in (41) can be WTitten as 

£((*7 - ,1) - 1 , -(/.()) - 0„, (</-'”•)(/, 0)).JV>;. 


From (34) we see thal 

= (//./- A)- l W H ,(t~' T )l -{*1 - A)-'t~' r ) 

which gives 

Ai = -C\J2Uhl - A)O tl ,((*l ' A) -1 p -,T A',) 
where 

A, := (7.0 l )ii.,..;r J, 'f A ” / l/? ip„. 

We will now calculate // =- II+F* A.r. 'Hien 
iii =-H+c* r B*[Hl + A*)- l C*C 

>|)« >„,((*/ -AY 1 , " T X,) 

i 

- - /n*/+. it i 


■C"C(,«I-A) 1 (ii,1 - A)X,) 

by Proposition 2. (Note that, with the aid of a partial fraction 
expansion, the proposition can be applied elementwise.) Using 
(15) wc have 

(* 7 +j 4 *)~ 1 C*C , (.<>/-. 4) -1 = -S(h!-A)-'+(hI+AT'S. 

Thus 

«i = - w.) 

= ~Y D'sw-Ay'x, 

i 

- -B*S(sI- A)' Mr). 


( 42 ) 
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Now define 

#,(«) := - A)(»l - A)-\~' T X t . 


Then 

-i >2 =n + ( 1 - in*' + ^T 1 <?*)E^,(s.W) 

i 

= 5 >*( 9 .(») -EW+W + A*)~'C* 9 l ( 8 )) 

= ^£> M ,( 9l ( A )-£*a_ A .(7- 9( ( S )) 

t 

= '£^M-e*(h,i + a*)- 1 

.(d»,-d- A ')(C-gM)) 

= £((1 - E*{p,I + >l*)- 1 C*)^ i (. 9l ( S ))) 

1 

+ E*d_ A - f £(#»,/ + A'y'C'g.i^ (43) 
Proposition 4; The following identity holds 

^( /il j+A*)- 1 c'* 9 ,(«) = o. 


In the proof of the proposition we will make use of the 
following fact. 

Lemma 2: Let X, Y e C nxn be such that 
rank (X) + rank (Y) = v 


and X + Y be nonsingular. Then 

Y{x + yy l x = o. 


Proof- We assume that 0 < r - rank (A") < otherwise 
the statement is trivial. We now write A' = A' i A " 2 and 
Y = ny 2 , where A, € C"* r ,A 2 £ C'* n ,yi e C nx <''- T) 
and y 2 € d B_, > x ". Thus 


y(A' + K)~ 1 A 

= ny 2 


= yiy 2 


('“’ft)) 

ft) 


-1 


*iA 2 




= yi( ( *(n-/)xr-An-OX(n->))fn ^ 1 *2 

\ u (ri—r)x» ) 


= 0 . 


□ 


Proof of Proposition 4 It is sufficient to show that 

E v,rv, = 0 (44) 

I 

for an arbitrary constant matrix T, where 

u t =(p t i+A*r\ 

V t =(p,I - A)(I,0)w t o:e« r ^~j R 

In fact it is sufficient to prove (44) for an arbitrary T of rank 
one, since the general fact then follows by linear superposition. 


We first take T = Zjy*, where z 3 is an eigenvector of .4*, i.e., 
A*Zj = 7 jZj, and y* is an arbitrary row vector. Observe that 


Hr 


( A l H ) n - (»(r)) - (a - , 's)” ( t > (45) 


from (12)-(14). Thus 


E U ' TV > = z >y* (e ~ ) 

•(a 40 w(t) 

= zju * ((J,o)E ~ ^ 111,11; - (x,o) 


7*7 / \~ 1 


— z,y*({I A))H(IJ + 7 ,/) 


-(i4,())(ff + V) 


■'W 

= Zjg*{(0,X'' 4 7 ( /) -1 ) 


u(r) 


(a-'*) 


I>(t) 


(46) 


Since, by assumption, —7 is distinct from the eigenvalues of 
H, we have 

-1 


f x + v A-'nw V 

(« + V) -4-+vJ 

■(a- 


-) 


* A 

‘A - 1 C*C(A + 7 ^ 7 ) 1 A- 

where A = (-A* + 7 y l I) + \ ~ 2 C*C(A + i J I)~ 1 BB*. Note 
that A is nonsingular since dot (H 4 7 ( V) ^ 0 Thus 

(O.A- 1 /^*)^+%/)'• 

= \- 2 BB*A~ 1 (C*C 4 - 5(yl 4 - 7 , / ))(j4 + 7,/) _1 

We now claim that -4 7 ,/) = 0 . U suffices to 

show that 


A ~ 2 C*C(A + 7 {-A* + 7 ,J) = 0 (47) 

since C(A + i J I)~ 1 B i- 0 by Lemma 1 and C* is a 
nonzero column vector. To show (47) let Y = X~ 2 C*C(A + 
7 t I)~ x BB* and A" = (-yi*+ 7 ; /). Observe that A = X + Y 
is nonsingular and rank(X) + rank(Y) = rank(A). This 
follows since X is rank deficient and Y has rank at most 
one (B,C are vectors). Thus the conditions of Lemma 2 are 
satisfied and so Y(X + Y) -1 X = 0 . This shows that (47) 
and hence (46) is zero. Since y* is arbitrary in (46) and z } 
is any eigenvector of A* this proves that (44) holds for any 
v x 71 matrix T having range the span of the eigenvectors of 
A*. This completes the proof if A is diagonalizable. 

The same conclusion holds in the nondiagonalizable case 
as well. To see this consider a sequence of matrices Ak (k = 
l, 2 t ■ ■) tending to 4, each of which is diagonalizable, and 
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let Pk(a)(k = 1, i2, - ■ •) denote the corresponding perturbed 
transfer functions. It can be easily seen that Pk(s) 7 5 (.<r) 
in the gap metric (this follows because the graph symbol F 
depends continuously on A\ see [16] for the definition and 
expressions for the gap metric). Further, due to the metric 
property of the gap (see, e.g. [16]) the optimal robustness 
radius b Q pt(P ) depends continuously on P(.h), Since the 

largest root of dot(t(A)) = 0 is A = yj\ -bl pX (P), A 
depends continuously on A as well. Clearly R and S depend 
continuously on A , hence so does L{ A). Note that L( A) 
remains singular as A varies. Now suppose that wo,jl are unit 
vectors satisfying Lk(Xk)wn t k = 0 ,k = 1,2, • (where 
is defined via (16) for the case of each P*.(.s), respectively). 
Without loss of generality wc can take to be a convergent 
sequence (otherwise we select an appropriate subsequence), 
and let the limit point be w { ). Then L(X)wo = 0, so this limit 
point can be taken as the definition of wo. Since // is assumed 
diagonalizable and since it depends continuously on ^4, we can 
assume that, in each case, H*. is also diagonalizable (possibly 
for k sufficiently large). Further, we can select eigenvectors 
for Hk tending to corresponding ones for II. Compiling the 
above, it is seen that £, II,TV X depends continuously on the 
perturbations of A. By the earlier part of the proof this 
expression is identically zero in the case of each Ak- Thus 
in the limit it is also zero lor the given 7 J (.s) and A. This 
completes the proof. □ 

Proposition 4 and (43) show that 

i*a = -£(i" £’(/'./ + -4*)- ] r'*)y, 1 ,( f/l (.s)) (48) 

i 

= MV.H, (»,(*))• (49) 

t 

We now expand i) fii [y ,(,•<)) using (35) so that the ^-operator 
acts only on the infinite-dimensional clement 

- Ar'i-'X,) 

= - A)(nl — /l) -1 {d lh (r~' T )I 

= (\„l-Ar 1 ((i l I-A)X,O lll {c- r ) 
-C(«l- A)- 1 X,e~> , ' r . 

Using the following equalities 

x t 

= (AA))M*(lf)0ff(<-' T y /T ( A j R y» 0, 

i 

= (/,0)M*(ff)( A ~ l7? )'«'o 


and (49) we obtain 

«2 =C(sl - A)" 1 (/,0)M*(tf)( A 

- C\sl - (c~" T ) 

- (50) 

Wc are nov\ ready to slate the form of the optimal com¬ 
pensator. 

Theorem I: Let P(m) = <~ ST P 0 (,s) be a single- 
inpul/single-output plant with I\)(s) = C(til - Ao)~ l B 
a minimal realization and r> 0. Suppose that Pq(h) satisfies 
condition a) of Lemma 1. Then the optimal robustness radius 
for gap ball uncertainty is given by the formula 

b opl (i>) = (i - aL*) 1/2 


where A nlrix is the largest solution of the transcendental 
equation (let L(A) = 0 on [0, IJ (which is defined by (16)). 
The transfer function of the optimal controller is given by 




C(sl - A)~'B 2 - C(sI - A^Bhir.n) 


(51) 


where L(A)«>o = 0 

U’l = I J I Wo. 

B 2 ^(/. 0 )A/*(H)«ii, 

/i(r. .s) = (0. A~ J ir)M 9 (H)0 H {e-* r )e Hr w i 


wilh JfS,H dehned as in (5), (15), (11), and A = A max 
throughout. 

Before proving the theorem we will establish the following 
result. 

Lemma 3: Consider any (nonzero) strictly proper rational 
function /3>(-s) Then /> n1)t (< ~ sr /o( 5 )) is a strictly decreasing, 
continuous function of r. 

Proof: Let P$ = No/Mo be a normalized coprime 
factorization over Then (e~* T N, M) varies continuously 
with r in the W x nonn. Thus \\IP (”) \t{ : \\ vaT ^s 
continuously wilh r, and the same is true for f>o P t(f , “ l,r / > o( fi O) 
using (2). Thus, the statement of the lemma requires proof of 
the following fact 


H- ( 


-v' T >NZ 



n -t 


Mo* 


Hi 


(52) 


for 7*2 > ri (cf. (2)). We will prove a slightly more general 
fact than (52). Let F e 7-i!^ 2 be continuous (and non¬ 
constant) on the compactiiied right-half plane, co-inncr and 
right invertible over Hoj. Let $(») € be unitary 

($*<!> = = /) with del. $(*) not a constant. We will 

show that ||/7_ 4>‘F*M>||ZZ_F*|«J. First note that, for 
any r e 

jr|l3 = \\ii-$*n..F*T + n-$*n+F\c\\ 2 2 
= \\ii-4>*n.F*x\\l + \\n.vn+F*x\\l 
= wn.F'ic ill + \\n.rn + F*x\\l (53) 
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The second equality follows since the inner product 

(n-$*n-F*x,n-$*n+F*x) = o. 

We now take x to be the maximal vector for II-F*\h 2 and 
write v = /7+F*i\ Let us suppose that II-$*v — 0, i.e., 
the second term in (53) vanishes. This means that v = 
for some li e so v has a nontrivial inner factor. But 
this contradicts the fact that r is outer and v/x is left 
invertible over Woo for the assumed conditions on F (19]. 
This establishes the required result. □ 

Proof of Theorem 1: Under the assumption that H = II \ 
is diagonalizable the result follows from Proposition 1, (2), and 
steps Pl)-P3). It follows that A', ipt (.s) = -ui/ii 2 , where 111 
and U 2 are given in (42) and (50). To show that the formula is 
also valid when H is not diagonalizable we use a perturbation 
argument. We break the reasoning into two steps. 

Step 1: We claim that H\ has distinct eigenvalues for 
almost all A if the conditions of Lemma I hold. To see this 
we write C(sl ~A)~' l B = p{s)/r(s) where p(s) and r(.s) are 
coprime and r(,s) has degree n (this is possible since (AJ9) 
is controllable). We note that p*p and 1 *r are coprime (this 
follows as in the proof of «i) <=> iq) in Lemma 1). Thus 
\ 2 r*r-p*p has distinct roots for almost all A (this follows by 
taking derivatives with respecl to eliminating A 2 to give a 
nonzero polynomial in and verifying that these root locations 
can only be achieved by finitely many A). The claim now 
follows by noting that the eigenvalues of H\ coincide with 
the zeros of the expression 

t - A~ 2 C>7 - Ar l BB w [-sI - A')~ ] C* 

which are the samfe as the zeros of A 2 r*i - p*p. 

Step 2: Suppose for some r and A = A max (r) that H\ has 
repeated roots. Take a sequence r % {1 - 1,2,---) with Ti <r 
and t, —+ t. By Lemma 3, A, = A in?lx (T,) < A and A, A. 
Using Step 1 we can assume, without loss of generality, that 
//a, has distinct eigenvalues. Now suppose that w t are unit 
vectors satisfying L(A a )w*. Without loss of generality wc can 
take w, to be a convergent sequence (otherwise we select an 
appropriate subsequence), and let the limit point be u; 0 . Then 
~ 0. Let s be defined as in (41) with H — H\ , and 
let t x be defined by the same equation but with wa replaced 
by w t ,r replaced by r, and II by . We note that —**-/■. 

Define u = // 4 .F* Ai* and let (u), be defined similarly but with 
t replaced by j\ and t replaced by r, in F. Then (v), —> u. 
The proof is now completed by noting that, with the first and 
second components of /) defined through (42) and (50), we 
also have (tx ) 4 -+ v by virtue of the fact that M*(H) and 
are continuous functions of H. Thus (42) and (50) 
are correct expressions even when H is not diagonalizable. □ 
Remark I: The form of K „ pt (*) given in (51) is the ratio 
of two H 2 functions. Consequently, it is not a coprime 
factorization over Ti ^ since there will be a common zero at 
infinity. In fact it was proved in 119] that (.s + 1 )# is a unit 
in Hr* and that u/:i belongs to H and is left invertible. 
Thus (51) becomes a coprime factorization over Hoo on 
multiplication of the numerator and denominator by (a 4 - 1 ). 


Remark 2: (On the failure of conditions of Lemma 1.) If 
the poles and zeros of JMb(a) do not satisfy the genericity 
assumption a) of Lemma 1 then M*(H ) may not be defined. 
Examples where such a situation arises are: Po{a) = {s - 
l)/a(,y 4 -1) and — (a *f !)/*(# - 1). It seems to be the 
case in such situations, however, that (f, , as well 

as the second term in (50), is bounded in the limit as P 0 (- s ) 
is approached. If a well-defined limit can be proved 10 exist 
then the controller formula of Theorem I would also be valid 
in this sense when Lemma 1 fails, by virtue of known results 
about the continuity of v/x (see [25J). 

Remark 3. It should be observed that the infinite dimen¬ 
sionality of K ii[lX (s) is confined to the term /i(r, .s), and 
more specifically to It is possible to approximate 

uniformly in the norm, which then implies 
that the corresponding finite dimensional controller converges 
to A op t(,s) in the gap metric. (This follows since the corre¬ 
sponding coprimc factors then converge in the H ^ norm, cf. 
Remark 1.) To approximate ()h{( ~' r ) uniformly, it suffices to 
find a sequence of functions /*(*), with uniformly bounded 
norm, such that r j-ST is approximated umlormly on any 
finite interval of the imaginary axis. If II has an eigenvalue 
of multiplicity r on the imaginary axis, however, then it is 
also necessary to have the first r derivatives of A(.s) tending 
to those of (" wr at that point. Possible choices satisfying 
these conditions arc: /*(*) =■ 1/(1 4- ^r/k) h , j\ (.s) - (L - 
0.5.st/A-) a /U 4- O.lj.s r/k) k or general Fade approximants. An 
alternative approach to finite dimensional controller design for 
the H^c design problem of this paper has been given in 119] 
which explores approximation of the time-domain vector 1 (f) 

Remark 4 An alternative approach to the design technique 
presented in this paper is to first approximate the plant 
t - r Fu(.s), or simply the delay element r -sr , by a rational 
transfer function, and then design an optimal controller using 
finite-dimensional techniques, it is well known and an easy 
consequence of the metric property of the gap that the ro¬ 
bustness radius b CtVt (-) is a continuous function of the data 
(i.e., the system transfer function). Therefore, any controller 
designed using a finite-dimensional approximation of the plant 
model, which is close to the plant in the gap meiric, will 
qualify as a suboptimal controller for the nominal model. The 
perfomancc degradation will be reflected in a reduced value for 
bj \/ v and, accordingly, in an increased value for the 'W^-norm 
of (/,)(/ 4- KP)~ 1 (LK). In general, however, controllers 
obtained in this way may not be close in the gap to the optimal 
one. One such case where this occurs is when the maximal 
singular value of the operator r has multiplicity greater than 
one. Such an example is P(, s) = ,s/(,s 2 4“ 1) (given in [ 171 
and discussed in (32]). At the moment, sufficient conditions 
which guarantee the convergence of suboptimal controllers to 
the optimal one are not known. 

VIII. Design Example 

In this section we will use the formula in Theorem 1 to 
design a controller to achieve disturbance attenuation and 
robust set-point following for a delay system. We will take 
a plant with two lightly damped poles and a nontrivial delay. 
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Fig. 2. Scheme for optimal robustness controller. 




Fig. 5. Nyquist diagrams ol return ratio transfer functions P{*)K{s) and 

P(s)h\(,). 


Fig. 3. Smith predictor control system. 



Fig. 4. Bode plots of optimal robustness and Smith predictor compensators: 
I\ and A\ 


This rype of system is a suitable nominal model in a number 
of process control applications. We will show that our design 
has significantly better performance than a Smith predictor. 
The plant wc will consider is 




-1 .5b 


J 

.s 2 + 0.2.S + 1' 


We will demand increased damping of the oscillations in 
response to disturbances at the plant input, together with robust 
asymptotic tracking of constant reference inputs. To achieve 
these goals, we follow the 11 ^ loop shaping (optimal robust¬ 
ness) design procedure of Cdover-McFarlanc [21 ]. We select 
an appropriate weighting function W(s) and compute the 
optimal controller AVW for W'(.s)/ T (.s). The final controller 
for P(f i) is then given by A'(s) = Kw(s)W()i). 

Since an integral term is required in the controller for 
robust tracking, a proportional-plus-inlegral weighting func¬ 
tion W(«) = kp + ki/if was selected. The values k. p = 1,4 
and k, = 0.4 were found to give an acceptable loop shape 
and a sufficiently large robustness margin of 6 op t — 0.3296. 
(Increasing k v and k L gives larger loop gain but causes fc opt 



Fig. 6. Response ol y to a unii step at r/ with optimal robustness and Smith 
predictor compensators 



Fig. 7. Response of V to a unit step at r with optimal robustness and Smith 
predictor compensators. 


to decrease.) The final K(s) obtained is stable (except for the 
pole at & — 0) and is nonminimum phase as evidenced by the 
lagging phase characteristic between 0.3 and l .0 rad/sec (see 
Fig. 4), A suitable implementation of the controller is shown 
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in Fig. 2, where fc 0 = JKtv(U). (Note that the specifications 
are also met with K(*) in the forward path and unity negative 
feedback. The arrangement of Fig. 2, however, ensures that 
the response from r to y is not adversely affected by the 
nonminimum phaseness of Kw (*). Moreover, any admissible 
command response—a stable system with at least second order 
roll-off in series with a time delay of r seconds—can be 
achieved in this setup using an additional stable filter outside 
the loop. In particular, the command response could be made 
the same as the Smith predictor below.) 

The idea of the Smith predictor [21] is that a precompensator 
Ko is designed for Pq to give a desired command response 
from r to y in the delay-free situation. The precompensator 

K K ° 

* 1 + (1 - e-")P 0 K 0 

stabilizes the plant ( j ~* t Pq, if Pq is stable and gives the 
same command response but with a delay of r seconds (see 
Fig. 3). For this example, A'o was designed using the 
loop shaping (optimal robustness) procedure using the same 
weighting function as above: JV(a) = ].4 + 0.4/a. This gives 

(2.302a 2 + 0.470a + 0.612)(# + 0.286) 

~ .s(., 2 + 2.093 a + 0.719) 

The Bode plots for both compensators K and A\ are shown 
for comparison in Fig. 4. The Nyquist plots of the return 
ratio transfer functions PK and PK S are shown in Fig. 5. 
Although there are some broad similarities in the forms of 
these plots, it should be pointed out that a basic property of the 
Smith predictor is that open loop poles of Pq are cancelled by 
the zeros of the compensator K 8 - Thus, the open-loop lightly 
damped poles in P are not shifted by feedback in Fig. 3, in 
contrast to the situation in Fig. 2. The adverse effects of this 
cancellation become evident if the response at the plant output 
y is compared lo step disturbances d at the plant input (see Fig. 
6). We remark that the bad properties of the Smith predictor in 
case the plant has poles near the imaginary axis, have already 
been noted in the literature, and this has led to modifications 
of the scheme being proposed [33]. 

Finally, a comparison of the command response to step 
inputs at r is shown in Fig. 7. Note that in Figs. 4-7 the 
solid line denotes the optimal robustness compensator and the 
broken line the Smith predictor. 
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Stochastic System Identification with Noisy 
Input-Output Measurements Using Polyspectra 

Jitendra K. Tugnait, Fellow , IEEE , and Yisong Ye, Member , IEEE 


Abstract — Two new classes of parametric, frequency domain 
approaches are proposed for estimation of the parameters of 
scalar, linear “errors-in-variables” models, i.e., linear systems 
where measurements of both input and output of the system are 
noise contaminated. One of the proposed classes of approaches 
consists of linear estimators where using the bispectrum or the 
integrated polyspectrum (bispectrum or trispectrum) of the input 
and the cross-bispectrum or the integrated cross-polyspectrum 
(respectively, of the input-output), the system transfer function 
is first estimated at a number of frequencies exceeding one- 
half the number of unknown parameters* The estimated transfer 
function is then used to estimate the unknown parameters using 
an overdetermined linear system of equations. In the second class 
of approaches, quadratic transfer function matching criteria are 
optimized by using the results of the linear estimators as initial 
guesses. Both classes of the parameter estimators are shown 
to be consistent in any measurement noise that has symmet¬ 
ric probability density function when the bispectral approaches 
are used. The proposed parameter estimators are shown to 
he consistent in Gaussian measurement noise when trispectral 
approaches are used. The input to the system need not be a linear 
process but must have nonvanishing bispectrum or trispectrum. 
Computer simulation results are presented in support of the 
proposed approaches. Performance comparisons with several 
existing approaches based upon computer simulations are also 
provided. 

I. Introduction 

PARAMETER estimation and system identification for sto¬ 
chastic linear systems have been a topic of active research 
for over three decades now 17], [19], [20], 137], [45]. It is often 
assumed that the measurements of the system output are noisy 
but the measurements of the input to the system are perfect. 
The problem considered in this paper is that of identification of 
stochastic linear systems when the input as well as the output 
measurements are noisy. An interesting example of system 
identification with noisy input may be found in [381 where 
the problem of (off-line) estimation of certain parameters 
associated with the dynamics of a submerged undersea vehicle 
is studied. The various control inputs and the corresponding 
motion variables can only be remotely sensed and, hence, are 
contaminated with sensor noises. In multivariate time series 
problems where one is interested in exploring the relationship 
(transfer function) between two groups of variables, it is more 
logical to “symmetrically" model the system by allowing all 

Manuscript received September 24, 1993, revised June 6, 1994 Recom¬ 
mended by Past Associate Editor, B Pasik-Duncan This work was supported 
in pan by the National Science FoundaUon under Grants MIP-9101457 and 
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measured variables to be noisy [29]. Such models are called 
errors-in-variables models in the econometrics literature [6]. 

In this paper we consider a specific class of systems where 
the input process is non-Gaussian and the measurement noise 
at the input as well as the output is Gaussian if the input 
process has symmetric probability density function (PDF). 
The noise processes are allowed to be non-Gaussian with 
symmetric PDF if the input process has asymmetric PDF. 
Clearly, this model may not be always appropriate but there are 
several situations of practical interest where such assumptions 
arc valid. For instance, a pseudo-random binary sequence is 
often used to probe a control system for identification purposes 
[7], [19], [20J; such sequences are clearly non-Gaussian with 
nonvanishing trispectrum. The problem of differential time 
delay estimation given measurements at two spatially sepa¬ 
rated sensors can also be cast in the framework ol this paper 
[10|-| 12], [26], [281. 

Past approaches to the problem of stochastic linear system 
identification for errors-in-variables models may be divided 
into two classes: those that exploit only the second-order 
statistics and those that use higher (higher than second) order 
cumulant statistics. A good survey of the work done prior 
to about 1980 is given in [2], For later work, see |3]-[9], 
1161—f 181, [29], and [32|. Higher order statistics have been 
exploited in [5], [6], [12]-[14], |31], [33H35J, [39], [40], and 
[44]. Soderstrom [1], [21 allows only white additive noise at 
the input, and furthermore, the input and the output noises are 
assumed to be mutually uncorrelated. Most of the early work 
in this area has been done in econometrics. When only second- 
order statistics are exploited, it is known that, in general, 
there does not exist a unique solution [3]—[6], [9]. Therefore, 
attention has been focused on characterization of the class of 
transfer functions which fit the data. 

The use of higher order cumulant statistics [231, [24] can, 
in principle, yield consistent parameter estimates. Deistler [5J 
(also [14]) has shown how to estimate the transfer function 
of a single-input/single-output (SISO) system in the frequency 
domain by use of the higher order cumulant spectrum of the 
output and the higher order cumulant cross-spectrum of the 
input-output record. Cross-cumulanls between input-output 
have been exploited for FIR (finite impulse response) filter 
identification in [12] and [44]. Instrumental variable type 
approaches have been considered in [31] and [351 where the 
consistency results have been proven only for i.i.d. (inde¬ 
pendent and identically distributed) inputs. In [34J a novel 
cost function involving the third-order cumulants of the in¬ 
put-output data has been proposed, and it has been shown to 
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be proportional to a conventional mean-square error criterion 
based upon noiseless data. Consistency of the approach of 
[34] has been established under several restrictive conditions 
such as the system input is a linear process. Also [34] requires 
that the noise processes, if non-Gaussian with symmetric PDF, 
should be linear processes. In this paper we do not require any 
such constraints. It should be noted that unlike the second- 
order statistics case, one can not, in general, model a stationary 
random process with a given higher order cumulant spectrum 
as having been generated by driving a linear system with an 
i.i.d. sequence [27). In 140] several linear/iterative approaches 
using the auto- and/or cross- third-order cumulants of the 
input-output processes have been presented. Conditions under 
which the proposed approaches will yield consistent parameter 
estimators have not been provided in [40]. 

In 128] and |39| the square root of the magnitude of the 
fourth cumulant of a generalized error signal is proposed 
as a performance criterion for parameter estimation. Both 
single-input single-output and multiple-input multiple-output 
models have been considered in [39). Strong consistency of the 
proposed parameter estimator has been established for linear 
inputs in 139] for Gaussian noise processes. The approach of 
[28] and [39] results in a nonlinear estimator that requires a 
good initial guess for convergence; unfortunately, no method 
for reliable initialization was provided in [39]. 

In this paper two new classes of parametric frequency 
domain approaches are proposed for estimation of the pa¬ 
rameters of scalar, linear errors-in-variables models. One of 
the proposed approaches is a linear estimator where using 
the bispectrum of the input and the cross-bispectrum of the 
input-output, the system transfer function is first estimated at 
a number of frequencies exceeding one-half the number of 
unknown parameters. The estimated transfer function is used 
to estimate the unknown parameters using an overdetermined 
linear system of equations. In the second approach a quadratic 
transfer function matching criterion is optimized by using the 
linear estimator as an initial guess. Both the parameter estima¬ 
tors are shown to be consistent in any measurement noise that 
has symmetric PDF. The input to the system need not be a 
linear process but must have nonvanishing bispectrum. These 
two classes of approaches can be modified to exploit integrated 
polyspectrum, either bispectrum or trispectrum. The integrated 
polyspectrum is defined as a cross-spectrum between the 
process and a nonlinear function of the process; see Section 
II for further details. As discussed in Section 11, integrated 
poly spectrum (bispectrum or trispectrum) is computed as a 
cross-spectrum; hence, it is computationally cheaper than the 
corresponding polyspectrum particularly in the case of the 
fourth-order cumulant spectrum. If the non-Gaussian input to 
the linear system has a symmetric PDF, its bispectrum and 
the integrated bispectrum will vanish whereas its trispectrum 
and the integrated trispectrum will not, provided that the 
fourth cumulant of the input 7 4u is nonzero. Extension of the 
bispectrum-based approaches to trispectrum-based approaches 
is computationally complex, and the resulting estimators are 
likely to have poor statistical performance because of the high 
variance of the trispectrum estimators [24J. Herein lies the 
significance of the integrated polyspectrum-based approaches 


which apply with almost equal computational and program¬ 
ming ease to both cases, those involving integrated bispectrum 
as well as those concerned with integrated trispectrum. 

The paper is organized as follows. In Section II, a more 
precise statement of parameter estimation problem under con¬ 
sideration is provided along with a definition and some analy¬ 
sis of the integrated polyspectrum of interest in this paper. 
The bispectrum-based approaches are described in Section 
III. The integrated polyspectrum (bispectrum and trispectrum) 
based approaches are described in Section IV. Consistency 
of the proposed parameter estimators is established in Sec¬ 
tion V under some mild sufficient conditions. Finally, two 
simulation examples are presented in Section VI to illustrate 
the proposed approaches and to compare them with several 
existing approaches. Certain technical details may be found in 
the Appendix. 

II. Model Assumptions and Cumulant Spectra 

Let u(i) and ,s(/) denote the “true” input and output, re¬ 
spectively, at (discrete) time /, related via a finite-dimensional 
ARMA(// m , Hi,) model 

n„ hi, 

• s (o + ^ 2 ~') = - >)■ (2-0 

i=] i=i 

As in 11] and [7), we have assumed for convenience only that 
no term bou(1) is present, i.e., there is at least one delay in the 
system. The processes {u(/)| and {s(i)} are not available for 
measurement. But we can measure noise-contaminated input 
and output 

.r(/)=u(0 + M0 ( 2 - 2 ) 

Ij(t) ~s(t) -1- v 0 (t). (2-3) 

The model (2-1) is assumed to be exponentially stable and 
minimal, i.e., the following condition is assumed to be true. 

HI) A(,:) = 1 + a i z ~ l / 0 for \ z \ > 1 where z 

is a complex variable. Moreover, A(z) and B{z) = 
M - ' are coprime (they have no common fac¬ 
tors). 

It is also assumed that all of the processes involved (i.e., 
•r(7), ?/(/), o,(t), and v a (t)) are zero-mean and jointly station¬ 
ary. Furthermore, the noise sequences {n,(f)[ and {^(O) are 
independent of {<*(/)}, hence of {s(f)). 

Define the vector of the unknown parameters in the system 
model as 

0(ti u , Ti(,) := (ai, o 2 , • • •, Hn„, &1. h. •••, b nt ). (2-4) 

Let 0 q, n a o and n b o denote the true values of the respective 
veclor/parameters. Also define the following set 

0(n o , n b ) := {0(n o , «fc)|A(r, 0(n a , n b )) # 0for|x| > 1} 

where A(z: 0(n o , n b )) denotes A{z) explicitly parameterized 
by 0(n o , rib). Where there is no risk of confusion, we will 
suppress explicit dependence upon the model orders, e.g., we 
will denote 0(n a , n b ) by 0,0(n„, n b ) by 0, etc. The objective 
is to estimate the transfer function B(z)/A(z) (equivalently, 
0) given a data record y(i), 1 < i < N }. 
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Consider the third-order cumulant function C xxv {i, k) de¬ 
fined as 


k) := E{r{1 + i)x(t + k)y(t)}. (2-5) 

Denote'the cross-bispectrum of input/output by B ra . v (wi, w 2 ), 
given by 

oo oc 

ry(u 1 ^ 2 )= ^’) 

—00 tX) 

• exp{-j(wii + w 2 A;)}. (2-6) 

Similarly, let D xtt (ijJi. w 2 ) and il >ss (wi,w 2 ) denote the 
bispectra of the processes {x(f)} and {.<«(/)}, respectively. 
From the above definitions it is easy to see that 

~ (2tt) 2 /; /; * ***(^1, ^ 2 ) 

• exp{j(a;i/ + dbi ^ 2 - (2-7) 

Define 

w(f) := * 2 (/) - £7{a 2 (f)} and ?/>(/) := * 2 (/). (2-8) 

Then both the cross-spectrum between the process {i/r(f)} and 
)} and the cross-spectrum between the process {//'(/)} and 
{$(/)} are given by 

oc 

$ u ,(w):= ^ E{u>(t + k)s(1)} vxp{-jujk} 

A = - X 

-X 

= ^2 C„ s (k, k) t*xp{-yu»fr} 

= S,M (2-9) 

It then follows that 

1 r 

C„,(A., k) = — / rxp {.ywA-J (lui 

to J-n 

= C„(U, -*■) = r;„(0, -fr) (2-10) 

where * denotes complex conjugation. Compare (2-7) with 
(2-10) to deduce that 


Turning to the trispectnun, it is defined as ((23] and (24]) 

ex' oo oo 

i, w 2 , ^ 3 ) ’ = C' SS * M (<, A:, 1) 

»= (%. frs=—00 Isz—ou 

• oxp{-j(aiii + w 2 A' + wj/)} (2-12) 

where 


k, l) := E{.i(1)a(i ■+ i)»(t + k).s(t + /)} 

- E{s(l)»(t + i)}E{*(t + Ar)a(f+ /)} 

- £;(*(0*(/ + + /)a(f + 0} 

- E{s(t)s(i + I)} 

• E{s(t + k)s(t + 1 )} (2-13) 


is the fourth-order cumulant function of the process {*(()}. It 
then follows that 


1 /‘H’ /*7T 

f A*, /) = J J J ^"^(^1^2, uJi) 

(2-14) 


exp {4- uj' 2 h + u>il)\ 
duj] rfu)2 du)\. 


Define 


and 


f(0.= ^(/)-3s(0*V(')} 

r(l)=f(l)-E{r(1)\ 


(2-15) 


Then both the cross-spectrum between the piocess {?■(/)} and 
{*(/)} and the cross-spectrum between the process {r (/)} and 
{s(/)} are given by 


.?,,(*)■= £ / r i’{r(/ + A )s(f)} oxp {-ywA] 

A — Oj 

= ^ A\ k) oxp {-juiA} 

k — cX 

= S,,(w). (2-16) 


It then follows that 


1 r* 

= ^ w 2 ) du; 2 

/3„*(wi, u)da/i 

= S:,H. (2-11) 

Notice that the cross-spectrum between the signal s(t) and 
its square can be interpreted as an integrated bispectrum of 
a(t). This integrated bispeclrum will form a basis (along with 
the integrated trispectrum, to be defined later) for unknown 
parameter estimation. It is easy to see that since bispectrum 
of a Gaussian process is identically zero, so is its integrated 
bispeclrum. Therefore, the integrated bispectrum of {?/(()} 
equals the integrated bispectrum of (a(/)}. In the sequel it 
will be easier to work with the centered (zero-mean) n 2 (t), 
i.e., w(t). 





2t r 



S r ,(uj) oxp{ya;A'} (L> 


= a...(o,o, -k) 

= C»,„(0. 0, -*■)■ (2-17) 


Compare (2-14) with (2-17) to deduce that 


s;m = 


( to ) 2 

=s;, m. 


/*7T /*7T 

/ J T " 

>/-7T J ~TT 


»(w, U2i UJj)(i(J 2 d^3 

(2-J8) 


Notice that the cross-spectrum between the signal .s(/) and 
a function of its cube can be interpreted as an integrated 
trispectrum of .s(/). 

Given the above definitions we are now ready to state the 
remaining model assumptions. 
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Assumption Set I: Assume that the bispectra of the noise 
processes {v 4 (t)} and {w 0 (t)} are zero and that the noise 
processes are statistically independent of the process {u(t)}> 
hence of {*(£)}. It is also assumed that B uutl (u i, lj 2 ) ^ 0 
and that HI) holds. Assume that all moments of the various 
processes involved, viz., a{t), u(t) etc., exist. Also assume that 
the third and lower order cumulant/cross-cumulant sequences 
of the various processes involved, viz., C sut (ti, t 2 ) etc., 
satisfy the following summability conditions 

oo 

l 1 + l T j|]K 7 -'iZi r fc _i)| < 00 

Tl , , Tfc L = — fX) 

for each j = 2 , ■ ■ ■, k -1 and each k = 2 . 3 , ■ where z l (t) € 

\s(t ,), u(t), x(t ), y(f), v,(t ), v n (t)}. The above summability 
conditions are sufficient for the corresponding bispectrum to 
exist [24], |421, [43]. They are also sufficient for the asymp¬ 
totic results concerning the bispectrum estimator discussed 
in Section I1I-A-1) and concerning the integrated bispectrum 
discussed in Section IV-A to hold. □ 

Assumption Set II: In addition to those stated in Assump¬ 
tion Set I, assume that \u(t)} is such that at least one of the 
following conditions hold true: 

AS1) =: 7 J,, ^ 0. 

AS2) {u(M} i s a finite-dimensional, linear non-Gaussian 
process generated by driving a bounded-input 
bounded-output linear filter by a zero-mean, i.i.d. 
sequence {<■(/)} 

u(1) = £ MW*-') ( 2 - 19 ) 

where < t < oo} is the impulse response 

of the filter and E{('\t)} ^ 0. □ 

Assumption Set III: Assume that the trispectra of the noise 
processes and {MO} are zero and that n oise 

processes are statistically independent of the process {u(/)}, 
hence of {.s(f)}. Also assume that T utlinl {u i, lj 2 , W 3 ) ^ 0 . 
It is assumed that HI) holds and that all moments of the 
various processes involved, viz., s(/), u(t) etc., exist. Also 
assume that the fourth and lower order cumulant/ cross- 
cumulant sequences of the various processes involved, viz., 
C HllA - y {T U t 2 , 7 “;i) etc., satisfy the following summability con¬ 
ditions 

[1+ »,(Ti, •••- T*-l)| < 00 

■,n 

for each j = 1, 2. ■ ■, k - 1 and each k = 2 3, ■ ■ • where 
z,{t) € {.S(0, «(*)■ ‘ c (0, !/(0> «o(t)}. The above 

summability conditions are sufficient for the corresponding 
trispectrum (or cross-trispectrum) to exist [24], [42], [43]. 
They are also sufficient for the asymptotic results concerning 
the integrated trispectrum discussed in Section IV-A to hold. 
Assume also that {u{t)} is such that at least one of the 
following conditions hold true: 

AS3) 7 4u := £{u 4 (*)} - 3[£{u 2 (f )}] 2 # 0 . 

AS4) {«(*)} is a finite-dimensional, linear non-Gaussian 
process as in AS2) except that now 74 , ^ 0. □ 


Suppose that Assumption AS1) is true. Then we can not 
have S u - 2 U (uj) = 0 because then 7 a„ = 0 contrary to the 
assumption. Suppose that Assumption AS2) is true. Then 
using (2-19) it can be shown that [23J and [24J S u j„(w) = 
73 fG 2 (-w)G(w), where 73 , = E{( 3 (t)\, G(u) = 
g(k) (\xp {-jwk} and G 2 (w) = Y£L-<x. Q Z ( k ) e~ Julk - 1* 
then follows that unless G(u>) = 0 , we have S u j„(u>) & 0. 
Thus, under either AS1) or AS2) (or both), the integrated 
bispectrum of u(t) is nonvanishing. Turning to the integrated 
tnspectrum, it follows that under Assumption AS3) (i.e., 
74 u ^ 0 ), we must have S rtill (u>) ^ 0 (where r u (t) :^ 
?y 3 (0 - 3u(t)JS{w 2 (t)}), else we have a contradiction. Under 
AS4), 74 u = 74 f Efc / 0 unless G(u) s 0 or 74 , = Q. 

III. Bispectrum Based Approaches 

In this section two new parametric frequency domain ap¬ 
proaches are proposed for estimation of the parameters of 
linear errors-in-variables models. One of the proposed ap¬ 
proaches is a linear estimator where using the bispectrum 
of the input and the cross-bispectrum of the input-output, 
the system transfer function is first estimated at a number 
of frequencies exceeding one-half the number of unknown 
parameters. The estimated transfer function is used to esti¬ 
mate the unknown parameters using an overdetermined linear 
system of equations. In the second approach a quadratic 
transfer function matching criterion is optimized by using the 
linear estimator as an initial guess. The input to the system 
need not be a linear process bul must have nonvanishing 
bispectrum. The estimators are later analyzed in Section V 
under Assumption Set I. 

A. Linear Estimator 

It follows from (2-l)~(2-3), (2-5), and (2-6) that 

B„ y (w ,, «j) = +"'>)/*,„( Wl , W2 ) 

where H(z) = T /[l + 5Z"=i ri i 2 ”'] and H* i s the 

complex conjugate of H. Therefore, if B rrr (u) i, w 2 ) ^ 0 then 

H*(^' + ^) = B TXU (u>u U2)/B„Aui' « 2 ). ( 3 - 1 ) 

The above relation occurs in [5] and [4 11 Our new linear 
approach is to first rewrite (3-1) as 

II(e JU> ) — B* ru (uj i, w - u>i)/B* rr (u>i. u) - wi). (3-2) 

Thus, for a given lj several estimates of H(c J ' J ) can be 
obtained from (3-2) by using different values of . This, in 
turn, can be used to devise parametric estimators for the model 
coefficients; details are in Sections UI-A-2), IU-B, and 1II-C. 

1) Estimation of Bispectrum and Cross-Bispectrum: We 
now discuss estimation of the bispectra given data 
{#(*). j/(f), 1 < t < N} and some large sample properties 
of the estimators. Of the various approaches available in 
[21 1 , [22], [25], and [30], we will follow the approach of 
[22] and [30]. Suppose that the given sample sequence 
of length N is divided into K nonoverlapping segments 
each of size Lb samples so that N = KLq. Let 
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denote the discrete Fourier transform (D£T) of the ith block 
{*(«# (*- l)Xo), 1 < t < Lb) (i - 1, fc, ■ * ■ i K) given by 

L i$ — 1 

: X (,) j(w*) = ^2 X V -+ 1 + (* “ 1 )Lb) Pxp(-jwkt) (3-3) 

fa=0 

where 

2ir 

Wk * Lb k ’ k = °’ 1 ’ ' ‘ ‘ ’ Lb ~ L (3 ' 4) 

Similarly define (<</*). Then the bispectrum estimate 
-"$***■(»», »)' at bifrequency (u/ m , w„) is given by averaging 
the “biperiodograms” over K blocks and spatially smoothing 
Over a rectangular window of size MxM where M = 2M+ 1 
is an odd integer 


. M M r . K 
B xxx (m, n) = Y_ ft Y 

r~ — M 333 — A/ *- 

■ (2-X w (w m+r )X^(u n ^, 

•[X< i >(w m+r + w n+ »)]*) (3-5) 


and similarly we have 


B xxy (m, n) - K Y 

r——A/ s = — M L i— 1 


(^A'«(w m+r )X«K + *) 

[F (0 K, +r +u; n+ ,)]*j (3-6) 


The principal domain (nonredundani region) of B xxx (rn, n) 
is the triangular grid 

Dx*x = {(m, n)\m = Affc - M, n = Ml ~ M , 

1 (m, n) G 75 XX3: } (3-7) 


var-fRe {B xx *(m, «)•}} = ^^2“ 

•5 x (w m+n ) + o(^-) 

.— var {Im {.'J3 xa ;*{m, n)}} (3-10) 
where S x (uJ m ) denotes the power spectral density of {ar(t)} 
at frequency u; m = 27r m/L®. Note that iVA^ = KM 2 /Lb- 
Therefore, as JV —> oo such that A;v —> 0 and —► oo, 

we obtain an unbiased and mean-square consistent estimator. 
Moreover, under the above asymptotic conditions, the esti¬ 
mate B xxx (m* n) (m, n / 0), is complex Gaussian (with 
asymptotically independent real and imaginary parts), and 
independent of B xxx (m\ n!) (m', r\! ^ 0) if rn' ± rn and/or 
n r ^ n, such that both (m, n) and (m', n') are in the interior 
of D xxx . (See [42, Section 4.2] for a definition of complex 
Gaussian random variables.) Similar results holds for cross- 
bispectrum estimates leading to 

w)} 

= Bj-xy (Wmi w„) + 0( A,v), (3-11) 

var {Re {fl,* v (ro, «)}} = 2 ^ 2 S x (w m )S x (u>„) 

Sy{ui m + n ) + O 

= var {Im { B xxy (rn, n )}}. (3-12) 
The estimate B xxy (m, n) (m, n ^ 0) is complex Gaussian 
(with asymptotically independent real and imaginary parts) and 
independent of B xxu (m\ n r ) (rn*, n' ^ 0) if m! ^ rn and/or 
?#/ ^ n , such that both (m, n) and (in', n') are in the interior 
of Dxr.y- As noted before following (3-10), as TV -+ oo such 
that Aa t —<► 0 and NA 2 N —♦ oc, we obtain an unbiased and 
mean-square consistent estimator. One way to accomplish this 
is to choose A jy = N n ~ iyJj with 0 < c < 0.5. In terms of K< 
M , and one may pick K = 1, L B = JV, and Ai — 7V‘ ,+,, - ,rl 
with 0 < c < 0.5. Alternatively, one may choose M = 1, 
L b = and A" = A fU 5+r with 0 < c < 0.5. The 

smallest bias results with c —^ 0 whereas the smallest variance 
occurs for c —> 0.5. 

2) Overdetermined Linear System of Equations: Returning 
to (3-2) and using the estimates of the bispectrum and 
cross-bispectrum, we have 


where D xxx = {(*, /)|0 < k < L B J 2, 0 < / < k, 2k + 
l < Lb). The principal domain of B xxy (m , n) is a larger 
triangular grid [41] 

D xxy = U%UD% (3-8) 

where D- x l J y = {(m, n)|m = Mk - M, n — Ml - M. 1 < l 
< k < Lb/2M, ( m, n) € D Txy , n > 0}, D-rJy = 
{(m, n)|(m, -n) £ Di%] and T) xxy = {(k, £)|0 < k < 
Lb/ 2, -L b /2 < l < L B /2, k > \l\}. 

Define An - M/Lb - MK/N. The quantity A.v is a 
measure of the “bandwidth” of the bispectrum estimate. Under 
the cumulant summability condition stated in Assumption Set 
I, it follows from [24, Chapter VI], [42, Chapter 4], and [43] 
that 

^ E{B xxx (m, n)} = B xxx {uyn, **>»,) + O(A.v), (3-9) 


jj, \ {m, n m) 

H(w,,;w m ) := —— t -- for (m, n-rn) G 

n - m) 

and 

n = IM ~Jf, 1 < l < -- 1. 

2 M 

Let H B (^ Uti ) denote the least-squares estimate of H(e ju, ' n ) 
given H(uf n ]uJm) for a fixed n with rn ranging over (m, n — 
m) € /? x . ry . Then we have 

Hb{^ ) = BxxAm, n - m)B* xy 

[(m, n-~m)€D XJ: y 

•(to, n - in) y2 I BxxAm, n - ro)| 2 (3-13) 

L(m,n-m)eO„„ 

for n - IM— M. 1 < l < (Lb/(2M)) - 1. 
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From the definition of we have 

71 a Tifc 

a l H(e ,u "y~ JUn> + ]r = H{e JU ") (3-14) 

1 = 1 7 = 1 

for any u n . Noting that ajs and b,'s are real and Ii(c lu) ) is, 
in general, complex-valued, we rewrite (3-14) after replacing 
H(e JU> ") with its least-square estimate as 

n P Mh 

7 = 1 

= Re{// B (c^'')} (3-15) 

and 

** fl f ' u 

- a * lm { 6, Im {r _J " J " ? } 

- i 

= Im {f/jy(r^ n )}. (3-16) 

We solve (3-15) and (3-16) forties and b,'s with n - IM-M , 
1 < / < Lb/2M - 1 using an ordinary linear least-squares 
formulation. 

B. Nonlinear Estimator 

We follow a quadratic transfer function matching approach. 
Let Il{e^\V) denote the transfer function of (2-1) with the 
system parameters specified by the parameter vector 0 as 
defined in (2-4). Define 

, , , 13* Jin. n) 

-(3-i7) 

The following result is from [24| and |43J. A proof of Lemma 
I also follows from the mean-square convergence implied by 
(3-9)—(3* 12). 

Lemma /: Under the Assumption Set I, we have 
lim.v ^ n) - 13jj, (uJiri . uj tt ) i.p. (in probability), 

and liniyv-, * /L rf/ (m, n) = 13 u;„) i.p. • 

Using Lemma 1 and the asymptotic distribution of the 
bispectrum estimators, the following useful result can be 
established. 

Lemma 2: Under the Assumption Set 1 as TV —► -x., 
the following results are true for any fixed u/ fi ) and 
(uw, uv) in the interior of the set L) Jty > 

A) ) |^o)) converges 

in distribution to the complex normal distribution 
K(0. (r'i,,) where 

2 _ g(m, iQSsjuJmjSj (ui„)Sy(u > m +„) 

<T ""’ \L3r,A^„. w„)| 2 


a(, n . n)=l + |rt— 

&yy\ u m+n) 


{ ^vi/(^rn-fF.) J 


and ($ni, n = y W n ). 

B) and H{( ,J ^ rn '^ n ^) are statistically in¬ 
dependent for (m, n) ^ {m\ n’). • 


Sketch of Proof: It follows from the asymptotic results (3- 
7M3-12) discussed in Section III-A-1), Lemma 1, Slutsky’s 
Theorem [36, Section 4.1], and [42, Theorem P5.2], □ 

Lemma 2 motivates the following cost criterion which is 
the negative log-likelihood (up to some constants which do 
not depend upon 0) of the asymptotically complex Gaussian 
vector (m, n) € D% ry } where D® ry is the 

interior of D xjy . Choose 0 to minimize the cost 


E 

(f»«. *.)€!>?„ 


n) 



In the above wc have assumed that rr 7 2 l71 ^ 0. If this is not the 
case then we delete that particular value of (m, n) from the 
summation. In practice, S,(u; r „) and S v (w m ) are unknown; 
therefore, we replace them by their consistent estimates ob¬ 
tained by block averaging and/or frequency-domain smoothing 
as in (3-5M3-5). Thus we have the following practical cost 
criterion 


jV>) = E 

(»n, «)€/•>!', v 


- H(< 


a; xv (m. ip 



(3-19) 


where 

=y( w. n)A\(w)AV(w) 

■ 4- n)/\l\vrr{nL N)| 2 (3-20) 


and (f(m. n) follows similarly by replacing the variables in 
<)(m. n) with their consistent estimates. Mimicking Lemma 
1 and using Slutsky's Theorem, it is easy to show that <t 2 U7 
tends to < r 2 ,,, i.p. as TV —* oc provided 13 Jtrx ((jj 7n , u; n ) 0. 
Minimization of (3-19) w.r.t. 0 is a nonlinear optimization 
problem that is initialized by the linear estimator obtained by 
solving (3-15) and (3-16). 


C> Summary of Algorithms 

Summarizing the above two algorithms, we fit an 
ARMA(/i tt , Ob) model as follows. 

i) Given the record length TV, select the block length Lb, 
number of blocks h\ and the smoothing window size M. 
See the discussion in Section III-A-1) [following (3-12)] 
for possible choices. 

ii) Estimate the required bispectra using (3-5) and (3-6), 
and the spectra S xx and S yy using similar equations. 

iii) Estimate the model coefficients via an ordinary 
least-squares formulation using (3-13), (3-15)—<3- 
16). (Use all possible bifrequencies in (3-13) unless 
\Bxxf{nL n - m)| is too small. Since the bispectrum 
estimates at distinct bifrequencies are approximately 
statistically independent, the estimates at distinct 
bifrequencies carry “new” information.) Stop if desire 
only linear estimators. 
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iv) Using the estimate obtained in iii) as an initial guess, 
refine the model coefficient estimates by minimizing 
(3-19). Again, if possible* use all possible bifnequencies. 

IV. Integrated Polyspectrum Based Approaches 

In this section, the frequency domain approaches presented 
in Section III are discussed using integrated polyspectra. As 
discussed in Section II, integrated polyspectrum (bispectrum 
or trispectrum) is computed as a cross-spectrum; hence, it is 
computationally cheaper than the corresponding polyspectrum 
particularly in the case of the fourth-order cumulant spectrum. 
If the non-Gaussian input to the linear system has a symmetric 
PDF, its bispectrum and the integrated bispectrum will vanish 
whereas its trispectrum and the integrated tnspectrum will 
not, provided that the fourth cumulant of the input 7 4u 
is nonzero. Extension of the approaches of Section 111 to 
trispectrum-based approaches is computationally complex and 
the resulting estimators are likely to have poor statistical 
performance because of the high variance of the trispectrum 
estimators [24]. Herein lies the significance of the integrated 
polyspectrum-based approaches which apply with almost equal 
computational and programming ease to both cases, those 
involving integrated bispectrum as well as those concerned 
with integrated tnspectrum. 

A. Linear Estimator 
Define 

>‘2r(t) =-.r 2 (0 (4-1) 

and 

r 3x (l) = At) - 3j-(f)£V(0} - *V(0) (4-2) 


1) Estimation of Integrated Auto- and Cross-Polyspectrum: 
We now discuss estimation of the integrated polyspectra given 
data {^(O, y(t)> 1 < t < N} and some large sample 
properties of the estimators. Suppose that the given sample 
sequence of length N is divided into K nonoverlapping 
segments each of size Lb samples so that N = KLb■ Let 
denote the DFT of the /th block {.r(/-|-(/-l).Lfl), 1 < 
t < Lb) 0 = L 2, ■■ , K) as in (3-3) and (3-4). Similarly 
let denote the DFT of the /th block of ?-,,(/) given by 

Lb- 1 

n (,) M = Y [At + \ + u - i)l b ) 

f=U 

- .lr(( 4-1 + (/ - \)L d )P 2 j] 

■exp (~ju k t), (4-4) 

i‘ 2 v - N Y (4 ' 5) 

/-] 

when Vj(t) = r is (t) [see (4-2)J, and by 

/ n — 1 

^ j2 ( f + 1 + 0 - i'xp(-jukl) (4-6) 

f =0 

when r,(f) = r 2x (t) [see (4-1 )| To set the mean of {r, (t)} in 
the 7 th block to zero, simply set f^ ,J (0) = 0. Then the estimate 
S 7i r (m) of S fjX (u/ w ) based upon the record yO)* 1 < 

t < N \ is given by averaging the block pcriodogiams over K 
blocks and spatially smoothing over a rectangular window of 
size M where M = 2M + 1 is an odd integer 

JI r k 

s r , IjjE 

l:-M S » 1 

y- i? (0 K, +i ) [tf'W+i) (4-7) 
L>i 


Replacing s(t) with u(t) in (4-2) yields r$ u (t) which is 
identical to r u (t) in Section II, In the following we will use the 
notation r T (t) to refer to both r 2T (f) and r$ a (t) f meaning (4- 
1) when discussing integrated bispectrum and implying (4-2) 
when discussing integrated trispectrum. 

Under the Assumption Set II (or III), we have 

Sr lV (u) = = Y + h’)y(i)) cxp(-juk). 

k——ou 

It follows from (2-1 >-(2-3) that under the Assumption Set II 
(or III) 


S rr y(»)=n m (' JU )S,.s(u) 

where H(z) = B{z)/A(z ), S r ,, r (w) - S Iuti (u) and S, rV 
(w) = SV„»(w). Therefore, if S rj , r (ui) ^ 0 then 


where m = Ml - M, 1 < I < l-n/2M - 1. The estimate 
6', iV (m) is given similarly by 


S, lU (in)=M 1 



Z 




I 


(4-8) 


where rn — Ml - M , 1 < l < Lr/2M - 1. 

Under the cumulant summability conditions stated in As¬ 
sumption Set II (or III), it follows from [24, Chapter VI], [42, 
Problem 7.10.8], and [43] that for large N, the real and the 
imaginary parts of the estimate S r±y (m) (rn 0) are bivariate 
Gaussian, and S 1xy (m) isjndependent of S 7jy (n) (n ^ 0) for 
tn ^ n { m, v = 1M - M, l < l < Ly/2M - 1) such that 

E{5,, v (m)} = 5 r< „(w m ) + ()(A N ), (4-9) 

var {Ro { S r J j/(m)}} 2)VZkiv 1 Tr ) 

+ no{Sl u (u,rn))} + 0(N-'), 


H*(en = S rtV (w){S,,Av)]- 1 . 


(4-3) 
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cov {Re {S riI/ (m)}, Im {S’,. iW (rn)}} 

— 2NAjy ^ 0(N *)• (4-10) 

In the above, as in Section I1I-A-1), & N = M/L u is the 
bandwidth of the cross-spectrum estimates. Moreover, as in 
Section III-A-1), we must have A at -* 0 and NA n -> oo 
as N —> <x to obtain an unbiased and mean-square consistent 
estimator. Similar results hold for S rrJ .(m). 

2) Overdeiermined Linear System of Equations: Returning 
to (4-3) we set 

H ir (r JUn ) = S* (U (n)[S* jl {'n)}~ 1 . (4-11) 

Mimicking Section III-A-2), we_solve (3-15) and (3-16) for 
a,’s and b,\ with n = IM - M, l < I < Laj'lM - 1 and 
using the above estimate of 


B. Nonlinear Estimator 


We follow a quadratic transfer function matching approach. 
Let IJ(r lu> \0) be as in Section II1-B. Lemmas 3 and 4 are 
counterparts of Lemmas 1 and 2, respectively. 

Lemma 3: Under the Assumption Set 11 (or III), we have 
liniA —*x, Sr,A"<) = i p and liuijv — ^ £,,„(»») = 

■5/, yi^m ) i-P- • 

Lemma 4: Under the Assumption Set II (or 111) as N -> nc, 
the following results arc true for any fixed 0 < u m , < tt. 
A) Lei W m - ) - H{c^ |6I„)]. Then 

[l\v ll' m lm//'J r converges in distribution to a real 
bivanate normal distribution ( 0, E) where de¬ 
notes the /^-element ot matrix E) En = l\ + ReTa, 

e 22 - - Ror 2 , >: 12 = imr 2 


1 1 — 0.5f/(m)rS, t , i (cu m )sSy V (u; rM )|»b rj t (cj n i )| 


-2 


d(m) = 1 + |'*m| 

r 2 =0 


2 1 [uJ m ) 

*S/y (^m) 


- 2Re 


V Vm S>JJ' 


and tX 1n — & t j i/(^ru )/‘%j 1 (^m ) 

B) Hjp(e iuJ,n ) and are statistically indepen¬ 
dent for m ^ w'. • 

The negative log-likelihood of the asymptotically complex 
Gaussian vector {Hjp{e lu,,n ), rn E l) r ,j \ where D 1r , 
{m\m = IM - A/, 1 < / < (Ld/(2M)) - 1}, is similar 
to that in Section JII-B. As in Section II1-B, we choose 9 to 
minimize the cost (cf. (3-19)) 


i(iP) 




E 

iii€D, 3 r 





K, (4-12) 


where 

o? n = d(rv)S 1 rlj (rr>)SyyM/\S, r ,(m)| 2 (4-13) 

and d(ni) follows similarly by replacing the variables in d(rn) 
with their consistent estimates. As in Section 1II-B, it is easy 
to show that & 2 m tends to rr 2 } i.p. as /V oo provided 
S rtcJP (u) m ) ^ 0. Minimization of (4-12) w.r.t. 9 is a nonlinear 
optimization problem that is initialized by the linear estimator 
of Section IV-A. 


C. Summary of Algorithms 

Summarizing the above two algorithms, we fit an 
ARMA(n a , rib) model as follows. 

i) This is same as item i) of Section Ill-C. 

ii) Estimate the required integrated poly spectra using (4-7) 
and (4-8), and the spectra S,.^, S, lX and S yv using 
similar equations. 

iii) Estimate the model coefficients via an ordinary least- 
squares formulation using (4-11), (3-15), and (3-16) 
with Hu(.) replaced with Hfp {.). (Use all possible 
frequencies in (3-15) and (3-16) unless |5 r ^ i .(ni)| is 
too small. Since the spectrum estimates at distinct 
frequencies are approximately statistically independent, 
the estimates at distinct frequencies carry “new” infor¬ 
mation.) Stop if desire only linear estimators. 

iv) Using the estimate obtained in iii) as an initial guess, 
refine the model coefficient estimates by minimizing (4- 
12). Again, if possible, use all possible frequencies as 
in iii) above. 


V. Consistency 

In this section we discuss the (weak) consistency of the 
proposed approaches (as N —► x). Let U(r iuJ ; 0 O ) denote 
the transfer function of the true model (2-1) with ii a - n w o, 
n - n l)0 and 9 = 0 O - Consider the system ot equations (3-14) 
rewritten as follows alter replacing H(e JUJ ) with 9q) 

ITl i=1 

= H{(^\ fl 0 ), 1 < / < L. (5-1) 

Note that in (5-1) lji represents an arbitrary frequency, not 
necessarily equal to 27 rl/Lp. 

The following result is central to the consistency results. 
Lemma 5: Given the transfer function H(e ,u> \ 0 O ) of model 
(2-1) at frequencies 0 < u>\ < u 2 < • • ■ < < n such that 

7?ri T- rib < 2 L and min (n a “ 7? a0 . no - riho) > ()■ Then the set 
of solutions to (5-1) is such that H{e } ^\ 9) = • 

Pmof: If (5-1) is satisfied for some 9 (defined in (2-4)), 
then i/(p^; 9) = fl u ) for 1 < / < L. Define 

A(z- 0, So) = A(z ; S 0 )B(z- 0) 

= E h >w z " 

. T =-0 1 = 1 

n a y)-t-T ifr 

= ^ )*-' (5-2) 

1 = 1 

where au(^o) 1 so that Q\(9, 9q) = 6](^). Also define 
B{z',9, 9 q) = A{z\ 9)B{z‘9 0 ) 

" n i “w 

= E! a ’^"’ |E ,, '^"‘ 

j =0 =1 

= £ E,(Mo)*-’ (5-3) 

» = t 
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where ao(0) := 1 so that bi(8, 8$) =,, bi(8o). Clearly if 
H{e JUl \ 6) = 8a) for some l, then 

I(c J “';Mo) = fl(e w ;Mo) (5-4) 

since H{v lw \ 0) = B(e Jul \ 8)A~ i (t JU : 8) for any u and 8. 
Using (5-2) and (5-3) this in tum implies that 

n 

£MMo)-MMo)K J u, '' = 0 (5-5) 

1=1 

where 77 = max(n a o + n a 4- wi>o) and we define 
3*(0, 9q) — 0 for n a o + no 4-1 < i < n and b t ( 0, So) = 0 for 
n a + Hbo + 1 < t < 77. Since the coefficients a,(9 % 0 o ) s and 
BqYs are real-valued, it also follows from (5-5) that 

53 6» 0 ) - = 0. (5-6) 

1=1 

By the hypothesis of the lemma, (5-5) and (5-6) are true for 
1 < / < L. This implies that the (77 - l)-degree polynomial 
equation 

n—1 

5]|fl,+i(tf.«o)-K.+i(^Ms , =0 (5-7) 

»=0 

has 2 L roots. Under the hypothesis of the lemma, 2L > 
n a + rib > 77 because (min v a - r/„o> *h> ~ Wfco) i 0. Now 
(5-7) can have only 77 - 1 < 2L roots; hence, we must have 


Proof: Equations (3-15) and (3-16) can be expressed as 
a linear system of equations 

Fjv0 = fAr (5-9) 

where 9 is the (H a +flb)-column vector of unknown parameters, 
the matrix F v is 2 Lx (n a +n*,) consisting of appropriate parts 
of the left side of (3-15) and (3-16), and is an 2L-column 
vector consisting of the right side of (3-15) and (3-16). By 
(3-13), Lemma 1, and Slutsky's Theorem 

lirn ^a(H Ww ) = ff(f JWn ;»n)i-p.- (5-10) 

V—»oo 

Let F^ and denote Fjv and fjy, respectively, when the 
transfer function estimates are replaced with their true values. 
Then by Slutsky's Theorem we have 

lim F/y = Fol i.p. and lim fw = f x i.p.. (5-11) 

JV—►>. \ — * X 

Since min (n a - n a o, fii , - tiwj) = 0, by Lemma 5, (3-15) and 
(3-16) have a unique solution given by 9 - 9$ provided that 
//„(<^) = ; fl 0 ) in (3-15) and (3-16). Hence, Fo. 

has full rank so that [F^F*]" 1 exists; therefore, by Slutsky’s 
Theorem and (5-11), liin\„ x [FjvF/vj = [F^F^]' 1 i.p. 
Similar arguments lead to 

(i N :=\ Fj&FvJ-^fv 

^ 0 :=[F 7 ,F^]- ] F^ as N - x. (5-12) 

This proves the desired result. □ 

Define 

0 N :-=a.K( iuf f N B) (8)} (5-13) 


o*+j(0, 0o) 555 b t +i((f,0o) for 0<i<w-l (5-8) 


leading to H(e J 9) s 0o). □ 

A. Bispectrum Based Approaches 

Lemma 5 combined with the results of Theorems J and 2 
(discussed later) motivate the following definition of persis¬ 
tence of excitation of inputs to linear systems for the purpose 
of consistent parameter estimation using the bispectrum. This 
is completely analogous to the concept of persistent excitation 
when using second-order statistics (see [7, Section 5.41). 
Similar to the second-order case, if the input is persistently 
exciting as defined below, the matrix F x defined in the 
proof of Theorem 1 will have full rank, leading to parameter 
identifiability. 

Definition 1: A stationary input {u(t)} is persistently excit¬ 
ing of order L w,r.t. the bispectral statistics if its bispectrum 
Buuu{umi Un) is nonzero at L distinct bifrequencies in the 
interior of its principal domain. • 

Theorem /: The linear parameter estimator discussed in 
Section IH-A is (weakly) consistent under the Assumption 
Set I if (3-15) and (3-16) are solved at L frequencies 0 < 
< wj < * ■ ■ < Ui < 7 t such that n a + 74 < 2 L, 
min (n a - n„o, n b ~ n*o) = 0, and, B liUU {w m , u> n - m ) £ 0 
for all bifrequencies used in (3-13), • 


where J^\0) is given by (3-19) and (->c C B is a compact 
set. Convergence of this nonlinear estimator is discussed next. 

Theorem 2: Suppose that Assumption Set 1 holds and 
Jx*\9) utilizes at least L distinct bifrequencics (u m , u„) 
in the intenor of the principal domain of B auu (uj m) u>„) 
such that n a + m < 2 L, and B llun {u ) m . u;„) ^ 0 for all 
bifrequencies used in (3-19). If 0 () 6 Be, then 9^ defined in 
(5-13) converges i.p. to a set A) as /V —> oc where 


A) 


{ # 


B(z: 9) 
A(z ; 9) 


B(z ; g 0 ) ] 
A(z\9 { ))/' 


Proof: See the Appendix. □ 

The following corollary follows from Theorem 2 and some 
standard results (see, e.g., [7, Section 6.3]). The additional 
condition imposed in Corollary 1 ensures that the set Do 
consists of a single element Oo- 
Corollary 1: Under the hypotheses of Theorem 2, if 
min(n„ - n<,q, ni - 74 , 0 ) - 0 , then 9 N converges i.p. to 
9o as N —> oc. • 


B. Integrated Polyspectrum Based Approaches 

The following results mimic the corresponding results of 
Section V-A with obvious modifications. Therefore, we will 
simply state the main results without any proofs. 
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Definition 2: A stationary input {«(£)} is persistently ex¬ 
citing of order L w.r.t. the integrated polyspectral statistics if 
its integrated polyspectrum .? r „ u (w) is nonzero at L distinct 
frequencies in the interval (0, n). • 

Theorem 3: The integrated polyspectrum-based linear pa¬ 
rameter estimator discussed in Section 1V-A is (weakly) con¬ 
sistent under the Assumption Set II (or III) if (3-15) and 
(3-16) are solved (after replacing H B {e JUn ) with Hip(c }u ")) 
at L frequencies 0 < u>i < w 2 < • ■ • < uji < ir such 
that n a + n b < 2 L, min(n B - n a0 , n b - n b0 ) = 0, and 
5 r<1 ti(wm) / 0 for all frequencies used in (3-15) and (3-16). • 
Theorem 4: Consider the nonlinear parameter estimator de¬ 
fined by 

0 N := ar K { fl in£ 

where Be C B is a compact set. Suppose that Assumption 
Set II (or III) holds and J^ P) (0) utilizes at least L distinct 
frequencies uj m in the interval (0. tt) such that n u + n b < 
2 L, min (n a - 7i tt0 , v b - n b0 ) - 0, and S 1nV {uj m ) ^ 0 for 
all frequencies used in (4-12). If 0 O e B r , then 9^ converges 
i.p. to 0 q as N —► cxj. • 

VI. Simulation Examples 

We now present two simulation examples to illustrate the 
proposed approaches. Example 1 is from 139], and Example 
2 is from [34] and [401. In both the examples the proposed 
approaches were applied using a block length Lq = 64 and 
a smoothing window M = 1 (Af = 0, i.e., no smoothing). 
The software package NL2SOL [15] was used for nonlinear 
optimization using the results of the linear approaches as initial 
guesses. Also the nonlinear cost functions were simplified 
somewhat by setting </(m, 7i) = 1 in (3-20) and d(m) = 1 
in (4-13); this has no effect on the consistency results but will 
influence the accuracy of the estimates. 

A. Example I [39] 

The system to be identified is given by 

,y(/) = 1,5s(/ - 1) — 0.7 *(f — 2) 

+ u(i — 1) — 0,5 a(i — 2). (6-1) 

Thus, we have bi = 1.0, b 2 = -0.5, = -1.5, and a 2 = 0.7. 

The input {u{f)} is generated as 

u(0 = 0.3 u{t - l) + e(0 (6-2) 

where {?(/)} is an i.i.d. binary sequence with values one and 
— 1, each with probability 0.5. Thus, we have £{e 2 (f)} = 1, 
") 3 f. = 0, and 74 < = -2. The noisy input-output observations are 
given by (2-2) and (2-3). The mutually uncorrelated colored 
Gaussian noises {/;,(£)} and {u 0 (£)} wc generated as 

v l (t)=0.8w 1 (l-l) + /i(*) > (6-3) 

Uo (0 = -0.9ti o (t-l) + /o(f) (6-4) 

where {f^t)} and are mutually independent i.i.d., 

zero-mean Gaussian sequences. The variances of / t (f) and 


m 


TABLE 1 

PARAMETER ESTIMATES: EXAMPLE I, 30 MONTE CARLO RUNS, SNR » 
20 (IB, N = 4000 = Input-Output Data Record Length in Each 
Run, a = One Standard Deviation. [IT: Integrated Trispectrum 
Based Approach, MSE: A Mlan-Squarl Error Criterion 
Minimizing the P rediction Errors, See Criterion (76) in [39|] 


Approach 

Parameter! 



fri 

fra 

True Values 

-1.500 

0.700 

1.000 

.0.500 

estimate statistics: N 

= 4000 


IT: Linear 

mean 

-1.4257 

0.6364 

0.9832 

-0.4391 

(Sec. 4.1) 

u 

±0.0242 

±0.0192 

±0.0136 

±0.0331 


RMS 

(0.07B2) 

(0.0664) 

(0.0173) 

(0.0888) 

IT: Nonlinear 

mean 

-1.4754 

0.6774 

0.9839 

-0.4845 

(Sec. 4.2) 

a 

±0.0075 

±0.0053 

±0.0083 

±0.0149 


RMS 

(0.0257) 

(0.0232) 

(0.0182) 

(0.0215) 

1*9] 

mean 

-1.3363 

0.6189 

0.9562 

-0.4537 


V 

±0.2366 

±0.1195 

±0.2093 

±0.2302 


RMS 

(0.2877) 

(0.2153) 

(0.2138) 

(0.2349) 

[35] 

mean 

-1.5631 

0.7494 

1.0142 

-0.5761 


tr 

±0.2585 

±0.2138 

±0.0113 

±0.2647 


RMS 

(0.2661) 

(0.2194) 

(0.0182) 

(0.2754) 

MSE 

mean 

-0.9490 

0.2587 

0.9902 

0.0782 

(from [39]) 

a 

±0.0076 

±0.0060 

±0.0108 

±0.0220 

! 

RMS 

(0.5511) 

(0.4413) 

(0.0145) 

(0.5732) 


f a (t) are selected to yield a signal to noise power ratio of 20 
dB at the input sensor and the output sensor, respectively. 

Thirty independent realizations of 4000 input-output data 
pairs were generated. The methods proposed in Section IV- 
A were applied using the integrated trispcctrum, along with 
several existing approaches. For the linear estimator, we used 
1 </<Lu/ 2-1= 30, i.e., 30 frequency points in the 
interval (0, 7r). The nonlinear estimator minimizing J^ P \o) 
given by (4-12) was initialized by the linear estimator. The 
frequency points used in (4-12) were as that for the linear 
estimator. 

Table 1 displays the arithmetic means, standard deviations 
and root mean-square (RMS) errors of the results ot the various 
parameter estimators. The approach of [391 is based upon the 
fourth cumulant of a generalized error signal at zero lag, and 
the approach of [35J uses the fourth-order cross- and auto- 
cumulants of the input-output data at various lags. The results 
pertaining to the approach of [351 have been taken from 
[35]. For comparison the results of a least-squares criterion 
(labeled MSE in Table I) are also shown. It is seen that the 
MSE criterion relying only on the second-order statistics of 
the input-output record, produces a substantial bias (and low 
variance) for this example whereas the proposed frequency- 
domain approaches proposed in this paper yield very good 
results: the variance is comparable to the MSE criterion and 
the bias is much reduced. The proposed approaches perform 
much better than the approaches of [39] and [35], 

B. Example 2 [34], [40] 

The system to be identified is given by 

«(t)=l-5a(t-l)-0.7s(t-2) 

+ «(*) +0.5 v(t - 1). (6-5) 

Thus, we have b 0 = 1.0, i>i = 0.5, ai = -1.5, and a 2 = 0.7. 
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The input {u(/)} is obtained as 

a e(t) - 0.2 e{t - 1) + 0.3 c(t - 2) (6-6) 

where, {e(/.)} is an i.i.d., zero-mean, unit variance (one-sided) 
exponential sequence. Thus, we have E{e 2 (t)} = 1. and 7 3fl 
= 2. The noisy input-output observations are given by (2-2) 
and (2-3). The mutually correlated colored Gaussian noises 
{ty(<)} and are generated as 

v>(t) = /.(*) - 2.33/, (f - 1) + 0.75/,(t - 2) 

+ 0.5/ l (t-3) + 0.3/ l (t-4) 

- L.4/,(f-5), (6-7) 

v 0 (t) = w,(t) + 0.2i;,(f- 1) 

- 0.3U|(/ - 2) + 0.4u,(t — 3) (6-8) 

where {/*(/)} is an i.i.d., zero-mean Gaussian sequence. The 
variances of v,-(t) and ?;„(/,) are selected to yield a signal to 
noise variance ratio of 5 dB at the input sensor and the output 
sensor, respectively. 

One hundred independent realizations of 4000 input-output 
data pairs were generated. The methods proposed in Sections 
m-B and 1V-A (integrated bispectrum based) were applied 
along with several existing approaches. The nonlinear esti¬ 
mator minimizing jjJ P) (0) given by (4-12) was initialized 
by the linear estimator of Section IV-A for which we used 
1 < / < Lb /2 - 1 =30, i.e., 30 frequency points in the 
interval (0,7r). The frequency points used in (4-12) were 
identical to that for the linear estimator. A similar procedure 
was fallowed for minimization of J^\0) where we selected 
all the frequency points in the interior of D TXy . 

Table II displays the arithmetic means, standard deviations 
and root mean-square (RMS) errors of the results ot the various 
parameter estimators. The approach of [31J uses the third- 
order cross- and auto-cumulants of the input-output data at 
various lags; so do the approaches of [341 and [40]. The results 
pertaining to the approaches of [34) and [40] have been quoted 
from [40]. It is seen from Table II that the frequency-domain 
approaches proposed in this paper yield the best results and 
the approaches of [311 and [34] the worst. 

Note that 0^ minimizing J^\0) is, asymptotically, a 
maximum likelihood parameter estimator under the restriction 
that only the third-order statistics are used. The other ap¬ 
proaches ([31], [34], and [401) also restrict themselves to third- 
order statistics of input-output record, but are not maximum 
likelihood. Since maximum likelihood parameter estimators 
are, in general, efficient, the proposed nonlinear estimator is 
expected to outperform other “nonoptimal" approaches. This 
Seems to be the case for the simulation example considered 
here. Similar remarks are apply to integrated polyspectrum- 
based approaches. 

VII. Conclusions 

Two new classes of parametric frequency domain ap¬ 
proaches were presented for estimation of the parameters of 
scalar, linear errors-in-variables models. One of the proposed 
classes of approaches consists of linear estimators where 


TABLE H 

Parameter Estimates: Example 1, 30 Monte Carlo Runs, SNR = 
20 dB, N = 4000 = Input-Output Data Record Length in Each 
Run, n = One Standard Deviation, [IT* Integrated Trispfctrum 
Based Approach, MSE: A Mean-Square Error Criterion 
Minimizing the Prediction Errors, See Criterion (76) in [39)1 



Parameters 

Oi 

da 

*o 

h 


True Value* 

-1.500 

0.700 

1.000 

0.500 

1 Approach 

estimate statistics: N 

= 4000 


1 IB: Nonlinear 

mean 

-1.4015 

0.6618 

1.0333 

0.4410 

(Sec. 4.2) 

a 

RMS 

±0.0047 

(0.0101) 

±0.0031 

(0.0185) 

±0.0102 

(0.0384) 

±0.0368 

(0.0686) 

| Biep.. Nonlinear 

mean 

-1.455S 

0.6556 

1.0647 

0.4173 

(Sec. 3.2) 

<r 

RMS 

±0.0245 

(0.0508) 

±0.0230 

(0.0504) 

±0.0435 

(0.0779) 

±0.0457 

(0.0950) 

| [31] 

mean 

a 

RMS 

-1.0394 

±0.4005 

(0.5310) 

1.0351 

±0.3834 

(0.5092) 

1.00B4 1 
±0.0330 
JU0341) 

0.1561 

±0.4232 

(0.5453) 

1 [34] 

mean 

a 

RMS 

-1.6200 

±1.1680 

(1.1740) 

0.8430 

±1.4300 

(1.4371) 

0.0700 

±0.2450 

(0.2627) 

0.3600 

±0.9710 

(0.0708) 

jj f40] 

mean 

-1.4444 

0.6496 

1.0506 

0.5377 

(Crose-cum.) 

<7 

±0.0472 

±0.0406 

±0.1278 

±0.1485 

1 _ 

RMS 

(0.0720) 

(0.0647) 

(0.1410) 

(0.1532) 


the bi spectrum or the integrated poly spectrum (bi spectrum 
or trispectrum) of the input and the cross-bispcctrum 
or the integrated cross-poly spectrum, respectively, of the 
input-output are exploited. Based on these polyspcctra the 
system transfer function is first estimated at a number of 
frequencies exceeding one-half the number of unknown 
paramelers. The estimated transfer function is then used to 
estimate the unknown parameters using an overdetermined 
linear system of equations. In the second class of approaches, 
quadratic transfer function matching criteria are optimized by 
using the results of the linear estimators us initial guesses. Both 
classes of the parameter estimators are shown to be consistent 
in any measurement noise that has symmetric probability 
density function when the bispectral approaches are used. The 
proposed parameter estimators are shown to be consistent in 
Gaussian measurement noise when Irispectral approaches are 
used. The input to the system need not be a linear process 
but must have nonvanishing bispectrum or trispectrum. We 
emphasize that unlike the second-order statistics case, one can 
not, in general, model a stationary random process with a given 
higher-order cumulant spectrum as having been generated by 
driving a linear system with an i.i.d. sequence [27]. 

Computer simulation results were presented in support 
of the proposed approaches. Performance comparisons with 
several existing approaches show that the proposed approaches 
outperform them. On the theoretical side, the proposed es¬ 
timators arc consistent under sufficient conditions that are 
more general than that for existing approaches. Moreover the 
integrated polyspectrum-based approaches apply with almost 
equal computational and programming ease to both cases, 
those involving third-order statistics as well as those concerned 
with the fourth-order statistics. It should be noted, however, 
that higher-order statistics based methods typically yield high- 
variance estimates requiring “large" record sizes to reduce the 
variance. 

We considered only SISO models. There are no essential 
difficulties in extending them to multiple-input multiple-output 
models although details remain to be worked out. 
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Appendix 

In this Appendix we provide a proof ot Theorem 2 First 
we need some auxiliary results 
Lemma 6. Under the Assumption Set 1, it follows that 

hrn^ ./^V) - -4 fl) (0) 

(m n 


- H(( 


l(“> 4^ )um2l/r^2 


1*)l ]/[*L(M (Ai) 


i p uniformly in 0 for 0 E B^ where 0< c B is an arbitrary 
compact set, Sj is a fixed collection of L biJrcquenues and 
m = ?("». h)S’, (w,„) S'.K) .S\(w m ^)/|7J. U (u)„, 
u>„)| 2 • 

Phw/* By (VI7), (V20), Lemma 1, and Slutsky’s The¬ 
orem, we have 

(A2) 

V ► x 

lun H(<>^ ^ 

A» -s-K 

"J.OMip (AI) 

Consider 

7>.(fl) -|77(c' ( “ u >) + “ 

- |//(, ,u + “ //((»<* +- ’l 

(A4) 

|W( ' ( * ^ M|2 ' H “ ‘iMVn 

+ 2R( {77(i jU ^ , |0)[//(< J< “' H< " H) 

W(r )1*0 (A5) 


By compactness ol (-)< and continuity of //(( ' l ' i ' '|ff) in 

0 as well as in u m and uwe have 


sup sup | H( 
*> ^ r I—7T n] u< e< 


lU 4 —' ) 


|0)| < M < ^ (A6) 


By (A2), (AI), and Slutsky s Theorem, given an> r and b > 
0 there exists an integer N(( b) such that 

P{|[|//( ^ +u >)| 2 - l u(<‘ { - ’K)l 2 ]0<'} 

> 1 (A7) 


r{|[7/(f H " 4 " } ) - + " ) K)K7,f,l <4 

> 1 - b (A8) 

lor every N > N{t b) Hence it follows that given any < i (= 
» + 2Me) and b .> 0 there exists an integer N{( i, b) such that 

P{|D m „(0)| < fi) > 1-AW 6 He, V/V > AT(»i, b) (A9) 

That is, convergence of D„„,(9) is uniform in 0 The desued 
result now follows by using the uniform convergence of 
Slutsky's Theoiem, and noting that 

$ E \ D m U m<i^ 

(m n)6 Si 

(A10) 

>f |A,,„(f)l < ft D 


Lemma 7' Under the Assumption Set I, we have 
<4 B) (0) = o foi any 6 € 0 «• d e D 0 

Proof From (Al) 

for(m, n) £ S L C D° ry 

=> H(( JUr \0o) -H(( Jui \0) Vt j E (—7r 7 r], byLemnuS 
=> 0 E A) 

Conversely 

l9e £>«=>#('' (t +u >|0u) = 

loi (rn n) £ S L C y 

=► A B) (») = i> 

This completes the proof 

Proof of Theorem 2 Define a set 

»(p) = »( - 5(A), P) 


S(A) />) = Wee mf \\0 ~0\\ <p\ 

f>£Do 

Since S (A) is open, B(p) is a closed subset of a compact 
set B f c B, hence compact By continuity ot J^\0) in 0 and 
compactness ot 0(/j), there exists some 0* e B(p) such that 

mi 4 fl, w=4 0, (n= Mp)>o 

(9hH(,0 

= (All) 

where we have used Lemma 7 in deducing that ii(p) > 0 By 
Lemma 6, given any ( < /v(p)/d and t) > 0 there exists an 
integci A](i h) such that e B(p) and VA^ > Ai(e, b) 

P{ l{ n) {0) S r”\6) - . > ,,(p) - <} > 1 - b (A12) 

The above equation may be rewritten as 

P{ ml 

V/V^Njie, b) (AJ3) 

Similarly, by Lemma 6, given any t £ /i(^)/d and b > 0 
theie exists an integer 7V 2 (t, b) such that V0 E B c and 
VN > N 2 {( b) we have 

P{-^ B) (f) <4 B) W + 4 > 1-* (A14) 

In particular, we have 

P{^ B) (»o)<.4 £,) (('o) + r = i}>l-6 

V7V > Ni(e, b) (A15) 


,g t o) 
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however, it follows from (A14) and (A 1,5) that 

P{ inf < r < n(p)/ 3} >1-6 

VN > N 2 (r , 6). (A 16) 

Consider On as defined in (5-13). Then we have 


P(*« € S(D 0 , p)) 

- p U& y/'m < ^ tf'm 

> P {inf < f < ;‘(p)/3, 

p(p)-c< inf A fl V)} 

Ptn(p) 

>1-P{inf Jif’(9)>^)/3> f } 

> 1 - 26 VN > N(t. 6) = max{IV](e. 6), N 2 {t, 6)} 

(A17 ) 


where we used (A13) and (A16) to obtain (A17). This proves 
the desired result. □ 
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Minimum Bias Priors for Estimating Parameters 
of Additive Terms in State-Space Models 

Bertrand Hochwald, Student Member, IEEE , and Arye Nehorai, Fellow, IEEE 


Abstract — We treat the problem of estimating parameters 
of additive terms, sometimes called bias terms, in state-space 
models. We consider models that depend linearly on the state 
but possibly nonlinearly on the parameters, where both the state 
and observation are corrupted by additive noise. A prior density 
for the parameters is introduced that, when combined with the 
likelihood function to form a posterior density, minimizes the 
bias of the posterior mean. The result is a useful prior based on 
Ignorance. Two examples and simulations illustrate the use of the 
prior. 


I. Introduction 

T HE problem of determining unknown additive terms in 
state-space models has been addressed by many authors; 
for some early examples, see [41 and [8]. These models 
appear frequently when, for example, noise of unknown mean 
is encountered, sensors with systematic errors are used, or 
moving objects are being tracked. We consider the problem 
of obtaining the posterior mean estimate of parameters that 
appear nonlinearly in the additive terms. The posterior mean 
is important because of its wide use in estimation theory 
and its close connection to the Kalman filter estimate of the 
additive terms [4]. For example, when the parameters appear 
linearly in the additive terms and have a jointly Gaussian 
prior distribution, the Kalman and posterior mean estimates 
coincide. 

We then suggest a method to choose a prior density lor 
the parameters that conveys ignorance. When information 
about parameters is unavailable, there are various criteria 
for choosing nomnformative priors. One is convenience: If 
we assume that the parameters have a distribution amenable 
to simple recursive updating equations, then estimates are 
often easy to compute. Another is lack of favoritism: If we 
assume the parameters have a uniform distribution over some 
interval, then it is widely accepted that we arc not giving 
preference to any particular parameter value. There are also 
information theoretic criteria that, for example, maximize 
the Kullback-Liebler-Lindley distance between the poste¬ 
rior density and the prior |6|, [9|. These criteria, however, 
while broad in scope, often cannot predict precisely what 
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effect the priors they recommend will have on a particular 
estimate. 

By focusing on the posterior mean, we are able to show 
the effect of any prior density, and we choose a prior to 
minimize bias (to be defined shortly). If the true prior is used 
to construct a posterior density, then the posterior mean has 
the well-known minimum mean-square error property. This 
optimality is measured as an average over possible realizations 
of the parameters drawn in accordance with their distribution. 
If, however, a prior other than the true one is used to construct 
a posterior density, then this ensemble average property of the 
posterior mean is generally lost. 

In this paper we obtain a prior density function that satisfies 
an optimality condition within a realization instead: When used 
to make a posterior density function of the parameters, the 
prior, asymptotically in the number of observations, minimizes 
the posterior mean’s bias over all possible priors. Let 0 be the 
parameter we wish to estimate and 0/ be the posterior mean 
computed at time / using the prior f H . Supposing denotes 
the true value of 0 and E^ denotes expectation conditioned 
on 0 O 9 show that E-1- 0(/ _l ), as / —► x, 
for most fa. But the proposed f 0 (which can depend on /) 
satisfies E* o 0/ = 0 () -I- Hence the name “minimum 

bias prior.'’ 

As more observations become available, the density wc 
propose varies. Contrast this with the traditional prior densily 
that reflects what is known about the parameters before ob¬ 
servations are made. We draw on the work of Hartigan [5|, in 
which minimum bias priors are first developed. He determines 
that (asymptotically) the first-order effect of the prior appears 
in the posterior mean. Effects on the higher order moments 
of the posterior density are negligible by comparison. The 
minimum bias prior makes the first-order effect as small as 
possible. 

In Section II the state-space model with parameterized 
additive terms is described and the estimation problem formu¬ 
lated. Wc show how to recursively calculate the unnormalized 
posterior density function of the parameters in a simple and 
efficient manner. In Section III we introduce the minimum 
bias prior. The prior is developed in [5| under the assumption 
that identically distributed observations of the parameters are 
available. Our model will not satisfy this assumption, requiring 
us to extend the prior’s theory. A method is also given to 
approximate the prior with an adaptive scheme, since an 
exact closed-form solution cannot always be obtained. The 
prior used in the adaptive scheme is, however, particularly 
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simple. TWo examples and simulations demonstrate the results 
in Section IV, and we conclude in Section V. 

II. Model and Problem Formulation 

Consider the state-space model 

x t =Ax t -i+Bg(0) + wt- U ( 2 . 1 a) 

Vt -Gx t + Dh(B) + v t (2.1b) 

where t = 1, 2, is the time index. The vector y t £ 
R mxl is the observation, x t £ R nxl is the state, and 
0 = [Si, ■ ■ ■, 0 P ] T £ ]R /)Xl is the unknown parameter vector. 
The noise vectors w t ~ Af( 0, Q). v t ~ Af( 0, /?) (have zero- 
mean Gaussian distributions with respective variances) and 
are assumed to be independent of ?/, for all s and i. 
It is supposed that Q £ is positive semidefinite and 

B 0 R mXm is positive definite for all /. Tlie parameter 
vector 0 and initial state .to are random and independent. 
Furthermore, we suppose that g(0) £ Ii qX] and h(0) £ JR TXl 
are known functions of 0 representing the additive terms. All 
matrices arc assumed to have the appropriate dimensions and 
to be known. 

A common approach to simultaneous estimation of the staLe 
and parameter vectors is to extend the state-vector to include 
the additive terms by letting x f ^ [xj , /; T ( 0 ), h T (0 )] 7 , 
whence 

[A B 0 " 

0 I 0 | .f/. i 4 'V (2.2a) 

[0 0 / 

Ut - [C 0 D]x, + n,. ( 2 . 2 b) 

If we assume that the initial vector Jr {) has some covariance 
matrix, then the model is in a form suitable for applying the 
Kalman filtering equations to estimate xt [4]. This provides 
recursive estimates of g and h. If 0 ►—* (y(0)~ h(0)) is one-to- 
one we may invert the estimates of g and h to get estimates 
of 0. 

While this method is attractive because of its computational 
simplicity, we may not want to be forced into assigning g and h 
a covariance that can. at best, be a guess. This is true especially 
because the parameter of interest is 0 and not (g(0)< h(0)). 
Moreover, the criterion we consider in this paper is bias, and 
the extended-state Kalman filter, which essentially obtains the 
posterior mean of g and h under the assumption that they 
have a joint-Gaussian prior, may have a bias when estimating 
0 that is higher than that obtainable with the minimum bias 
prior. (Due to the non linearities in g and h> this holds even 
when g and h can be estimated with no bias.) Example 2 in 
Section IV demonstrates this. 

We use the term realization to denote a series of measure¬ 
ments obtained via (2.1) in which 0 is fixed at 0 o. When we 
speak of an average within a realization, the expectation is over 
all possible values of the observations (assuming 0 is fixed) 
and the relevant density is f Y t \ 0ir where Y* is shorthand for 
H\ , ■ ■ ■ , y t , This expectation is denoted 

Suppose a collection of / observations from one realization 
is available and 0 f = f 0fo\r< dO. Our goal is to minimize 


the asymptotic bias, E g o 0 t - flo* by modifying fe\y* t through 
selecting an appropriate prior, fg. We will use fa to denote a 
prior density for the parameters, whether it is the true prior 
or not. The true prior plays no role in the bias criterion 
so there should be no source of confusion, and it will not 
matter what law is used to choose 0. Similarly, we will not 
distinguish between posterior densities and means computed 
using different fa. 

Assumptions: 

Al) f T „\» = jV(mo. Pa )■ 

A2) The noise is “white,” implying that f Xl |. P( ] 7 y< - i f * — 
fr t \x t -],0 and f yi \ Xt 'Y‘ i ,0 — 

We begin by computing the joint posterior density function 
of j't and 0. Define 1(0) — [ g T (0 ), h T (0 )] 7 . 

Lemma 1: 

fx t ,9\Y l = ’ /tf|V* (2-3) 

where 

(2.4) 

/•IV oc/» ■ pxp{tt?V(0) + / r (0)A ( /(0)} (2.5) 

and z t , fv f . M*, and At may be computed recursively. 

Proof: See Appendix A. 

Given 1 observations, the posterior mean of ,r f and 0 is 
/ [■'•?’■ 0 r ] T L t ,e |i ' dstM’. (2.3H2.5) yield 

h=^M t I 1(0)1, ■ m >{«fl(0) + l T (0)A t l(0)} 

* c]0 -f- Zi ■ ( 2 . 6 a) 

■ exp + P (0)A,l(0)} tW (2.6b) 

where r t = f f e ■ rxp {(t{ 1(0) + l T (0)A t l(0)\d0 is a nor¬ 
malizing constant. Observe the notation: the estimate of x t , 
and 0i the estimate of the constant $o. 

In what follows we concern ourselves exclusively with 0 f 
and examining its behavior as a functional of fg. Generally, 0 t 
converges in probability to as t ~► cx . We will, in fact, give 
conditions that imply convergence of 0 t to 0 () in expectation, 
and choose fg to optimize the speed with which this happens. 

III. Minimum Bias Priors 

The minimum bias prior minimizes the first-order effect of 
the prior on the posterior density [5]. When 0 is a scalar we 
show that there is a closed-form solution for the prior. When 
0 is a vector, we can show only that it satisfies a certain set 
of differential equations that do not necessarily yield a closed- 
form solution. To handle this, we propose a simple adaptive 
approximation of the prior. 

The minimum bias prior depends on the likelihood function. 
Jeffreys [7] first suggested using the likelihood function to 
design priors, with the Jeffreys prior being a well-known 
example. It turns out that in one dimension the minimum 
bias prior is the square of the Jeffreys prior (see [2] for other 
uses of the Jeffreys prior). In more than one dimension, the 
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minimum bias prior and the square of (he Jeffreys prior no 
longer necessarily coincide. 

We would like to apply the results of Hartigan in [5] 
but some justification is needed. He considers a model for 
Which independent identically distributed observations are 
available. That is, in his case, log fyt\ B = log f Vr ^ 
and {log f Vt \e}t=\ t 2, is a collection of independent iden¬ 
tically distributed random variables; many of his results are 
based on the asymptotic behavior of this sum. In our case* 
log fy<\0 — S»=i and l h e summands are not 

identically distributed. As we shall momentarily see, however, 
the summands are independent and have the same distribution 
asymptotically, allowing us to use the results of [5] anyway. 

The first step is to observe that from (A.2), (A. 6 ), and 
(A.7), f Vi |i i- 1 ,# = Af(N t l(0) 4 CAz t - 1 , S t ), where Nt = 
CAMt-i + [CB, D] and S t = C(AP t ^ A 7 4 -Q)C 7 + /f. The 
asymptotic behavior of log/ v ,|yt-i # is principally governed 
by Nf and 5/ which are shown to converge under the following 
assumption. 

Assumption ■ 

A3) The pair ((7, A) is detectable and (A, Q 1 ^ 2 ) is stabi- 
lizable (1 ]. 

Lemma 2: The matrices N t and S t tend to limits exponen¬ 
tially fast as t —» (X. 

Proof We have S t — AP t \ t _ X A T 4- P where P t \ t „ i = 
APt-\ A T + Q. That P t \i~\ tends to a unique positive definite 
matrix exponentially fast under Assumption A3) can be found 
in [1]. Therefore, the lemma is proved for S t . To prove the 
result for N t , we observe that it is sufficient to prove the 
result for M t . From (A.4), Al t = (A - K t CA)M t -i 4- [(I - 
K t C)B, -K t D], where K t = P t \t_iC T Sf l . It ,s well known 
that K t converges' exponentially quickly to some K^ such 
that the eigenvalues of A — AK^C have modulus less than 
one, whence A - K t CA has eigenvalues with modulus less 
than one for t sufficiently large. Therefore M t converges at an 
exponential rate as well. □ 

Define 

rit(O) = log f yi ^ < 10 . t = I, 2 . • • •, 

= — 5 [log, (lot, 27T.S', + (y t - N t l(0) - CAz, ,) J 
■St-'Uji-Nm-CAz^)} (3.1) 

Nr» ^ Ymu^Ni, S f , and tkW = 

[log dot 2irS rK +e? K e»(0)] where ~ Af (N^ 

[l((?o) - /((?)], .S' rac ). We use Lemma 2 to show that r/,(0) 
converges in distribution to r/ rx (0) exponentially quickly. 

Lemma 3 

i) The random variables ry s and r/* are independent for 
ft j* /. 

ii) The random variables q f converge in distribution, as 
1 —t oc, to r/oo. Furthermore, the expectation of any 
power of any derivative of with respect to 0 tends 
exponentially quickly to the expectation of the same 
power and derivative of ?/<*>. 

Proof: See Appendix B. 

Lemma 3 shows us that log f Vt \y< -i,*, / = 1, 2, ■ • •, are 


almost identically distributed. The fact that they are not exactly 
identically distributed turns out to be of no consequence. 
We show this in the lemma that follows, extending Lemma 
1 of Hartigan’s paper to our model. First, some regularity 
conditions on f# and q t (0) similar to those appearing in [5] 
are stated. 

Assumptions 

Bl) #o is an interior point of 0 where 0 C R p is closed. 

B2) 1(0) is continuous for 0 E 0 and continuously four 
times differentiable in some neighborhood of 0 o- 

B3) The parameters are identifiable; For 0 ^ [/(0) - 

1(0 o)]| is bounded from below by a positive constant, 
uniformly in t. 

B4) log Jo, (O/d0) log fo and (d 2 /O0c)0 T ) log /# exist 
and are continuous in a neighborhood of 0o. 

B5) The prior density fe may be improper [£ 7^(0)], but 
fa fyt - 1 exists and is a bounded function of 

,</t, uniformly in Y ^" 1 and /. 

B 6 ) Either 0 is bounded, or for some k > 0 , 

sup|(j| > i e [ r lt(Q) “ 7 /t( 0 u)] h as fi rst moment bounded 
from above by a negative constant and bounded fourth 
moment, both uniformly in /. 


Lemma 4 For any c > 0 there exists a b > 0 such 
that h t\e u is 0 (r' f *) with probability 1 - G(t ~ 2 ), 

uniformly in 0 for \0 - f? () | > <. 

Pi oof Sec Appendix C. 

To extend his results completely we need to also show that 

d H lh( 0) | 

2 ^ m* 


i-i 


— tv m 4- O r ( t l 




i — 1 , ,/>. s = 1 , 4 

for some r fH E JR. But that this is true with r iS = 
E®c, (0)/<)0i\t>- #u } follows directly from Chcbyshcv’s 

inequality and Lemma 3. Wc are now in a position to use the 
results on minimum bias priors. 

Let V(<1 , 0) be a smooth nonncgative loss function with 
unique minimum at 0 = d and 

0 t — aig mill I V(f], 0)f#\} > (10. 

For a scalar 0 , under assumptions B1-B6 we have the asymp¬ 
totic expansion 

rug mill L#„r(0(, 0) - lh'[q\ - (l >\2 + /m )//>2 

- 5(112 ■+ V2i)/V2n]|e=,(=fl u 
+ o{r l ) (3.2) 

as / —+ x), where 


^ Q Ion h 

<n = 


m 


V 12 


= E«{ 


d 1 °R h »)g o 2 log fy -|« 


00 


A P & loR h 

Pi = 


09 2 


}• 


0(9' 


See 15, p. 1147] for the derivation. 
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For the quadratic loss function, V(d, ft) = (d ~ 0) 2 , 
it is easily verified that ft t is the posterior mean and 
arg iiiiii 0 £B E«i o V(0t, ft) = Thus, in this special case, 
(3.2) gives an asymptotic expansion of the expected value of 
the posterior mean. The influence of the prior on the expected 
value is seen in the r/i term. 

From Lemma 3 we see that j# 2 1 = 0(/" 1 ) and the terms 
within the brackets of (3.2) are 0(1). Therefore, in general, 
+ 0(t~ l ). Because 1 12 = l 21 = 0, we can 
conclude that E njt - ^0 + o(r ] ) if and only if 


[ ( /i - (pn + Pa'I/pv]#^,- (3.3) 


The prior that satisfies (3.3) is therefore the minimum bias 
prior. 

Observe that 


+,„)M = ~ H (- jp* - 1 ' }) 

upon interchanging the operations of differentiation and ex¬ 
pectation (which is easily justified). Hence, a prior satisfying 
(3.3) is given by 

f l ( >g j\ 1 \() 1 

(34) 

Equation (3.4) is, for a scalar ft. a closed-form solution for 
the minimum bias prior. It is, in fact, the square of the 
Jeffreys prior. For the multivariate case, the situation is more 
complicated. Using the results |5, p. 1149| for a vector ft , 
where the loss function is given by \'( d . ft) = ( d , -ft,) 2 * 

K 6i] ft t — ft {) + o(t J ) if and only if 

( ]t “ (*1 ) 1 a (p> i k 4 pi /A) 

1 A==1 . fi-Oo 

/ =r 1, p (3.5) 

where 

.a 0 log }j_ 

Oft, ' 

A ,, f < )2 log J\ *\e d /) *\»\ 

=■ ht \"oeM, J' 

A F fi > Iff 

P " k " tjH '() 0 , i ) 0 l <) 0 k 

and J = E#{y 2 log. i^/OOOB 1 }. Equation (3.5) consists of 
t> differential equations that the multivariate minimum bias 
prior must satisfy. We apply (3.5) to the model (2.1) and 
simplify the results in the following theorem. 

Theorem I: Efl o 0 f = 0o + if and only if 

l UOi J9=9 n 

/ = 1, p (3.6) 
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where 


(V.) 


J = 


/,* 


m 2Af am 


OB 
i)l 7 {0) 


2 A t 


m T ’ 

d 2 l{B) 


dO, "" dBjOBi’ 
it ^ ft • V 


(3.7) 


(3.8) 


and A, is defined recursively in (A.8b). 

Proof ■ Sec Appendix D. 

As it must. Theorem 1 yields (3.4) in the special case p 1. 
To verify this, observe that 


,/ = 


<m 

Oft 2At Oft 




Of 1 (ft) 

Oft 


2A t 


d 2 l(S) 

Oft* 


and ihercfore =■ ~O.J/Oft. It follows that J~ l U i — 
} 2 ')~ ] 0J/0ft = ^<9 log (-,/)/cW. Hence, from Theorem l, 
the minimum bias prior satisfies qi = (Hog(~ J)/0fl. The 
solution to this is (3.4). 

The set of differential equations given by (3.6), with a 
closed-form solution for scalar ft, may not have a closed- 
lorm solution in the multivariate case for all ft. The problem, 
however, can be made tractable by observing that Theorem 
1 requires the differential equations to hold only at 0o is 
unknown, but with an estimate we may construct a simple 
prior that satisfies the conditions of Theorem 1 approximately. 
This prior can then be used to obtain the next estimate of 0o, 
which in turn feeds ihe next prior, and so on, reducing the bias 
through an iterative procedure. This bootstrapping method is 
now staled and proved to have a beneficial effect on the bias. 

Theorem 2. Define o{ft) =- [i'i(0), ■ • ■. t'p(ft)] 1 , where 


!',(«)= ^[logdol(-7)] + lr{.7- l (/,}, 
P- 


(3.9) 


Lei (-) be a closed bounded set such that i'(ft) is continuous 
for all ft € 0 and sufficiently large /. For every f, suppose 
some mechanism is used to “project” ft t _i onto 0. Let ftt„\ 
be this projection and 

= pxp (b 1 • i)) (3.10) 

be the prior used at time /. Then, = 0o 4- o^" 1 ) as 

I —> <x. 

Proof: See Appendix E. 

This provides a practical method for achieving the minimum 
bias effect without having to explicitly solve for the minimum 
bias prior. The prior defined in (3.10) has a simple form and is 
readily calculated at any time t It is adaptive since it depends 
on ft t -\ and therefore also V'*"’ 1 . 

Remarks: The projection mechanism should modify only 
estimates that fall outside 0 by, say, assigning them boundary 
values of 0. The actual mechanism will not be important 
because fto is an interior point of 0, and given enough 
observations, |0 f -0 o | will be large with only small probability. 
Hence the projection mechanism will generally modify the 
estimates only when 1 is small. The mechanism’s main use is 
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to provide a deterministic bound for the random variable 0*, 
and to ensure that the prior given by (3.10) is always well 
defined. 

The choice of initial prior (before any estimate of 0o is 
available) is arbitrary and will not affect the convergence 
behavior of E* o 0t for large enough t. A convenient choice 
that generally provides good initial parameter estimates is the 
uniform density, that is f# = c for some arbitrary constant 
c > 0. In fact, a good rule of thumb is to employ the uniform 
density until the terms neglected in the asymptotic expansions 
can safely be ignored. At this point, the adaptive prior may 
be employed. Otherwise, if (3.10) is used too early, erratic 
estimate behavior could result. That is, a poor estimate of 
0 O used to construct 1 ) may lead to a possibly worse 

estimate, and so on. Nevertheless, even this process will 
eventually correct itself as t becomes larger. The smallest t 
for which the minimum bias prior should be applied can be 
determined by experimentation. 


IV. Examples and Simulations 

We give two examples, one that has a closed-form solution 
for the minimum bias prior and one that does not. 

Example /—Linear in the Parameters: Let g(-) and h(-) be 
linear so 1(0) — 0. The posterior density of the parameters is 
given by Lemma I 

/•II . x/ flC xp {aj0 + 9 r A t 9}. 


Calculating the minimum bias prior using Theorem 1 yields 
J = 2A f , U, = 0, / = 1, • • •, p. Therefore, the prior satisfies 
q t = 0, / = 1, t p m A closed-form solution is given by 
the uniform density /# - r for some arbitrary r > 0. The 
posterior mean, when = c, is 0* = — (1/2)A^ 1 since 
fa |> t a exp {af0 + 0 ] A f 0} is a Gaussian distribution. It 
follows from (A.8) that E^,0/ = 0o, and it is straightforward 
to verify that using the prior f e = r is equivalent to employing 
(2.2) and initializing the inverse covariance of 0 to zero. 

This example shows that when g and h are linear, closed- 
form solutions for 0 t and the minimum bias prior arc available 
at every time step. The minimum bias prior, in this case, yields 
a completely unbiased estimate for all /. We next give an 
example with nonlinear g in which the adaptive prior must be 
employed to minimize bias. 

Example 2—Determining the Thrust of an Accelerating Tar¬ 
get: We look at the problem of estimating the thrust of a 
moving target in two dimensions, where we are allowed vector 
measurements of the x and /y-axis coordinate positions and 
velocities. The target is assumed to have a fixed unknown 
thrust a along an unknown direction ip during the observation 
interval. The direction is taken with respect to the /y-axis and 
the observation intervals have length T. The velocity obeys 


M0 1 M* -1) 

M0 0 Uy(l -1) 

. To sin ip , 

T[a cos if - <j) J ,V * 


(4.1) 


where 

'2 0 
0 1 

and g is gravitational acceleration. The unequal noise variances 
are due to the greater atmospheric disturbance parallel to the 
ir-axis. The equations of position r(/) are assumed to obey 


E w' t v> t T = 


r(i) = r(i - 1) + j [n(/ - 1) + «(/)]. (4.2) 


^ 7 

Letting .r, = [M0. ii y ((), MO. MO] 7 and combining (4.1) 
and (4.2) we see that 


and 


■r t 


ri o o m 
0 1 0 0 
T 0 1 0 

.0 T 0 U 


Ta sin f 
T[a cosp - g] 

(T 2 / 2 )u sin p 
L(7 l2 /2)[n cos p - (/] J 




(4.3) 


K w/wf = 


I r 

j) 


o 

0 

T /2 


T 

n 

T 2 /2 

0 


0 

T /2 


0 

7-7 ij 


We assume the observations follow the model i/ t — Cj t + 
where E v f vj - 1 and C 1 describes the sensors’ outputs as a 
function of target velocity and position. 

We wish to estimate 0 ~ [«, p ] 1 using the minimum bias 
prior. For simplicity we suppose that C T C “ 7, niu = 0 and 
Pi) = I [see Assumption A I)]. We have 


T 


0 


D = 


0 T 

\T>/2 0 

0 T l f 2j 

g(6) — [fi sin f, a cos ip - g ] r , 

h(ff) =0 


From (3.6) 

-— = — 7~ [1 - 2^f cos 2 f + (£, - 1) cos' f\, (4.4a) 

On cic^t 

^ fto 0 \ (t i\ 2 1- 

—;r- = ——[1 + 2G + (Ci - 1) cos p] sin p cos p 

Op G 

(4.4b) 

where = (A<) 2 , 2 /^) 1 , i. We do not know of a closed-form 
expression for fa satisfying these equations. 

Instead, we use the prior density of Theorem 2. Then, at 
time t 

/•(MO = exp + vMMi)) 

where MM 1 ) and 1 / 2 ( 6 t_j) are the right-hand sides of, 
respectively, (4.4a) and (4.4b) evaluated at a = ii t -\ and 
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Fig I Posterior mean estimates loiii puled with adaptive minimum bias 
and uniform priors, aveiaged over KKJ Inals The minimum bias prioi was 
applied when / > 2"» Also shown are the extended-slate Kalman hltcr and 
maximum likelihood estimates. Solid lines correspond to minimum bias and 
uniform priors, dashed to Kalman Idler, dash-dotted to maximum likelihood 
(a) Estimates of «o = -10 lb) Estimates ol r -o = 0 r > 


ip = £ f-1 . Observe that from (4.4), v{6) is continuous except 
when a = 0. To fulfill the conditions of Theorem 2 it therefore 
suffices to let 9 = G flV 2 b < M £ 50, ip E [ tt, 7r)} 
where fo is any small positive number. This imposes a weak 
and practical restriction on the choice of parameters since the 
choice a = 0 is not identifiable in \p and therefore does not 
satisfy Assumption B3) anyway. 

We simulated the system with 7 = 0.25, cio = —10, and 
Vo = 0.5. The posterior mean estimates with the minimum 
bias and the uniform priors and, for the sake ot comparison, 
the extended-state Kalman filter and the maximum likelihood 
estimates are all shown in Fig. I. Plotted are the estimates’ 
responses averaged over 100 trials; the averages approximate 
the expected curve values. We show the first 50 out of the 
total of 250 time steps to demonstrate the effect of application 


table i 

BIAS PERFORMANCE OF VARIOUS ESTIMATES 


Estimatt 

Deviatio 

n around 

¥>0 

min. bias 

5.23 

1.03 

uniform 

6.80 

1.41 

Kalman 

7.45 

1.16 

max. like. 

7.92 

1.15 


of the minimum bias prior at 1 — 25. The prior was uniform 
for / < 25 (sec remarks after Theorem 2). The extended-state 
Kalman filter used the fact that the slate-space model is linear 
in (j(0) to get estimates of a sin p and a cos <£-(;, which were 
convened inlo estimates of a and tp. The g{0) portion of the 
initial covariance matrix was set to 10*/ to approximately 
convey ignorance, and the initial mean was [0, - <j ] 3 . The 
maximum likelihood estimate was evaluated in the usual way. 

Table 1 is a measure of the bias of each method. We show the 
magnitude of the difference from the true parameter of each 
curve summed from time steps 25-250. From the table it is 
clear that the posterior mean using the minimum bias prior has 
the least deviation of all the estimates. That the minimum bias 
prior causes less deviation than the uniform prior is predicted 
by the theory. That it happens to cause less deviation than 
the Kalman filter and maximum likelihood estimates may be 
just fortuitous. Nevertheless, this example demonstrates the 
viability of the minimum bias prior. 

V. Conclusion 

The minimum bias prior was proposed as an alternative 
to other prior densities based on ignorance because of its 
beneficial effect on the posterior mean. Within a realization, for 
t sufficiently large, the dominant effect of any prior appeared 
as bias. The minimum bias prior ensured that - ft) — 

o(1~ ] ), as / —* x, as opposed to 0(1 J ) for other priors. 

In Example 2, the proposed estimate was shown to have 
bias performance superior to the maximum likelihood and 
Kalman filter estimates. Some additional tests were performed 
indicating that this improvement held for other parameter 
values as well. It would be of interest to determine analytically 
if this is a general phenomenon. 

When the bias contribution to the realization mean-square 
error, E^ 0 (ft - ft)) 2 , is significant, we can expect the minimum 
bias prior to reduce this error. We performed some numerical 
experiments to see if E($/ 0 (ft - ft)) 2 was lowest for the 
minimum bias estimate in Example 2, but the realization mean- 
square errors of all the estimates in Table I were essentially 
equal. We discovered that this was due to the overwhelming 
contribution of the variance over the bias to the realization 
mean-square error in this example. This, of course, raises 
the question: Is there a systematic way to choose priors to 
minimize this error? 

Bayes estimates other than the posterior mean can be ana¬ 
lyzed by choosing nonquadratic V in the asymptotic expansion 
(3.2). The minimum bias prior and its adaptive version vary 
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with V, and so do their interpretations. This is a potential 
subject for further investigation. 

Appendix a 

‘ Proof of Lemma 1 

The notion of recursively calculating density functions in 
state-space models can be found, for example, in |3] and 
the proof of Lemma 1 follows the same, now standard, 
ideas. Due to the structure of the state-space model (2.1), the 
recursive scheme for computing the posterior density function 
is particularly simple. 

Given 9, (2.1) obeys a standard linear state-space model 
with known additive terms, implying that 

hi |.r ( .,,® =Af(A.r t -1 + Dg(9), Q )i 
fy,is„e=Af(CT t + Dh(9), It)- 

Using Assumptions AI )-A2), given 0 and Y* wc observe thal 
the density of r f is Gaussian. That is 

/jr,|l '.9 ~N{rn t . It) (A.I) 

where m t and P t may be computed iteratively. The equations 
for the recursion are given by the Kalman filter: 

i) mt = Amt -i + D<l{9) + - C(Anii _ [ + Bq(9)) - 

Dh{9)\, 

ii) K t = (AP i - J A T + Q)C T S{\ 

Hi) P t = {1 - KtOiAP-iA 1 +Q), 
where 

St = C(AP,-iA r + QY' 1 + R (A.2) 

for t = 1. 2. • ■ ■. By inspecting i), we conclude that 

THf = M t H0) + z t (A.3) 

where 

Zf =(/ — ht(7)Azt-] ■+■ A/t/f, (A.4a) 

M t =(/ - K t C)AM t -i + [(/ - K t C)B. ~K t D] (A.4b) 

with Mo equal to zero and cu = m () . This gives (2.4). 

Plainly, f Xl , 0 \Y< = L,\Y',fi ■ f$\Y'* which is (2.3). We have 
dispensed with f Tt |> so we need only lo evaluate Jb\y'- 
By a standard argument 

fe \* fv * /v* \v — fa 1 yo (A.5) 

r=l 

and it follows from (2.1) and (A.I) that 

/*|v-* = WNtW) + CAzt- 1 . S,) (A.6) 

where 

N t = CAM,, i + [CD, D}. (A.7) 

We may now prove (2.5) by induction. At t = 0, since Y° 
is the empty set. we set no and Ao to zero and (2.5) holds. 


Now assume (2.5) holds at time t — 1 >0. We need to show 
thal it also holds at time t. Equations (A.5) and (A.6) yield 

few * f»\Y‘ -> • fy,\\ * 1,9 

oc fe ■ <*xp {«£.,/(*)+ / T (0)A,_,/(0)} 

•Af(yt\ Ntl(9) + CAzt-i, St)- 

It is straightforward to find, upon substituting (A.3) for , 
that the posterior density retains its form. The iterative equa¬ 
tions are 

tv, =n t -1 + N?s;\y t - CAzt-i), (A8a) 
A, = A,_, - %N?Sr l Nt (A.8b) 

for / = 1, 2, ■ ■ •. 

Appendix B 
Proof of Lemma 3 

The proof of pail i) uses a conditioning argument to show 
that certain random variables, of which and rj t are functions, 
arc independent when ,s / /. This will imply the independence 
of //„ and rif. The proof of part ii) is a simple application of 
Lemma 2. 

i) Define Y f ) ~ y t - N t l(ti) - CAz t i, and assume 
that 6 is given. Clearly £r # (0. V') is a linear combination of 
Gaussian random variables and therefore also has a Gaussian 
distribution. Letting A/(0o, 0) = Ni\l(0 () ) - /(#)], we obtain 

E eM<>- V , )=E w „{K« ll c,(6'. V')|)"-'} 

= E#J.V, [/(»„) - 1 ( 9 )}) 

= 9) (B.l) 

because / V( ^ - Af(N,l(0 > () ) + CAz t - j. St)- Further- 

more, for s < / 

V f ) =F* o {K ( ,„c,(0. V’)^. r')|)"-’} 
= E# o {f„( 0, V')Eo (l e/(d. r , )|V ,_l } 
= [E»„M0, ((?„.«) 

= A ,( 9 0 , 9 ) Aj ( 9 tl . 9 ) (B.2) 

so e, and c< are uncorrelalcd. We show that they are jointly 
Gaussian and therefore also independent: it follows thal 7 /, = 
— ^[log dot 2 rr.S\ + S~ *f„] and ij< = - 7 [log, dot 2nSt + 
t] Sj ~ 1 c,] are independent as well. 

Wc have |«„ = fe,\e„e u ■ fe |»„. where / f ,| S() is 
Gaussian. Moreover, /, ()t = f f.,w l .c,» u A 1 

dr'- 1 = J / E( |y, 1 A ,| C> 1#0 f/y f_1 . From (B.l) and 
the distribution of //,, i,# () = Af{A t {9 0 , 9), S t ), and 

because is a linear combination of Y", J\ < i is also 
Gaussian. Therefore, /«,|-,, 9 0 is Gaussian, and so is /. tif j# (l . 

ii) The random variable t (9, Y 1 ) is Gaussian with 

mean 5 t -1 ^ 2 A { (0o, 9) and covariance I. According to Lemma 
2, A t (9 u , 9) -* AocM,, 9) = N ^[1.(90)-1(9)} andIS, - -V, 
as t —*• 00 . Consequently, Sf l ^ 2 et{9, Y f ) —* Sno^ 2 ^ in dis¬ 
tribution. Since r/ ( depends continuously on . < ) t -1 ^ 2 f ( (d, F'). 
we have ry —> »/«., in distribution as well. 

To show convergence of the moments of the derivatives of 

we give two examples: ry itself and the square of its first 
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derivative. All of the remaining cases are shown in exactly 
the same way. For r) t we have 


E*o Vt{9) = -5 [log tiot 2nS t + Af 5 f -1 A, 

+ lrS t - 1 E 9 tJ ( £( -A ( )( £t -A,) T 
+2AJS, 1 E« 0 (e, — A ( )] 

= -f [log dot 27rS, + Aj 67'A, + m], (B.3) 


Similarly, Er/*, = - 5 [lop, dot 2 t tS v + A^S^A^ + m]. 
The result now follows from Lemma 2. For the square of the 
first derivative 


E*„ 


09, 00 , 


_ EV 0f±_ o 1, ,Te-l Off 


00 , 


00 , 


Ol T (0) \rT q-lr? T 


am 


N,' srts, + A, AO 


for any 0 obeying f < |0 - 0 o| < & 
P 


T h ( 0 ) ~ r/r(0(>) > |///1 + 0(1) | 

< P |E ~ »/r( 0 O ) > + 0 ( 1 ) J 

<pJEW)-^)]-N#) + o(i) 


> -5^iW| 

< 3/(/- l)^(fl) + f/i 4 (g) + Q(») 
(l/lti)*^*) 

^ 3t(f- 1 )/x| + //7 4 + 0(1) / Wi -2\ 

(l/16)f 4 ,4 } 


as t -* 00 . Equivalently, /v'|#//i''|e„ is 0(f ,-t *) with 
probability l - 0(/~ 2 ). The constants in the O(-) terms are 
independent of 9 so the approximation is uniform. From 
Assumption B 6 ), the result holds for \9\ > k when k is 
sufficiently large. We therefore have uniformity in |0-0 O | > r 
for any c > 0 . 


Appendix D 
Proof of Theorem I 


Similarly 


Olh. (0) ti/K {0) 

,J 00 , 00 , 


N ' 1 q - 

00 , 


[S^ + A^A^)Sz}N n 


tm 

d9j 


and the result again follows from Lemma 2. 


We prove Theorem 1 by evaluating the expressions of (3.5) 
and simplifying. From (A. 6 ) we obtain 

t 

h< lf , = Yl^(0,:N,l(0) + CAz,^,S r ). 


<=\ 


So 


0 log J\ 


00 , 




r =1 


09, 


Appendix c 
Proof of Lemma 4 

This lemma is proved heie using Chcbyshcv’s inequality in 
much the same way as Lemma 1 of 15]. Our proof, however, 
accommodates the fact that log J\ t\ e is a sum of independent 
random variables that do not have identical distributions. 

We start with log(.A q o/f\ *|* u ) = EUi {^(0 )-ib (0o)}« 
where q r {9) is defined in (3.1). From (B.3) 

E #o {r, f (0)- 7 / t ( 6 » 0 )) = -\*f(0o, 9)S^ A,(0 O , 0) 

+ \*J(9o, 0 o )S t - l A t ( 0 o , 0 O ) 

=-±A7(9o, 0)Sr l *t(Oo,9)- 


Therefore, Assumption B3) and Lemma 3 imply that 
{ 71 ,( 0 ) - ii,(Oo)} tends exponentially quickly, as t -* oo, 
to some m(0) < 0 when 0 ^ 0 O . In general, n,(9) - t/,(0 o ) 
has xth central moment converging exponentially quickly to 
some /i„(0). The moments are continuous functions of 0, so if 
we let /i, = max f <|#-# 0 |<fr M 6 ) for any 0 < r < k, then 


where 6,(9, V 1 ) = y, ~ N r J(9) - CAz,- i. Also 


O 2 log h '| 

00,00j 


-E 


M T (9) arlB-l 


fr{ [00,00, 


N* S~ e r 


*m N T S -i N «wi 

00, ' r ’ 00, 


(P log />•< \0 

00,00j 00 k 


-E 

r= 1 


dH T (9) 


00,00 } O0k 

O 2 l T (0) 
00,00, 
Ol T (0) 
00, 

o 2 i T (0) 

O0,O0 k . 


N?S~ l e r 




00k 
0 2 l(9) 
d0jde k 

N*8? N r d JW 


Nj S~ 1 N r 


OOj J' 


Therefore 


r—1 3 


O0 k 


01 ( 0 ) 
OOj ‘ O0 k 


(D.l) 
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where the last equality is a consequence of (A. 8 b). Further¬ 
more 


Pi) k 


\ r tt Law, 


n?s; ] 


ai T (0) 

00 , 


NTsr'N, 


a\{9) 


±» jm n 


9 = 1 


oh 


Oh 


(D.2) 


From (B.l) and (B.2), E«e> = &t(&- 9) = 0 ; whenever 
a ft U E,e.e? = A„(6, 8)A J(0, 6) = 0 ; E e e f e 7 = S, + 
A f (0, 0)&l(8, 9) = S t . It follows that 


J'.j n = - 


O8 l O0 i 2Xl 00 k 


(D.3) 


Similarly 


(D.4) 


_ & i T { 9 ) nk <m , <m 

Ptjk 08,00, * 08 k O0,O8 k ' Wj 

<m8) om 
08,08) * 08, ' 

Combining (D.3) and (D.4), we obtain 

Fui+P,,*- 2A, ^ 2i , d6)< 

and using (3.5) and (D.l), we sec that the minimum bias prior 
must satisfy 

- t 

IJL7/71 1 , oP\ 0 ) . crt(g) 

00, Xl 00,00 k 

= [I 7^" [Ior a °' (_ ' /)] + ir } 

L » J W - 

7 — 1, ■ • , p. 

This concludes the proof. 


v{0) is continuous on 0 uniformly in f, we are assured that 
<71 = {0/08) log ) is still ( 2 ( 1 ). Hence, the 

expansion (3.2) becomes 

Egjt = 00 ~ Pi l [Efl?i - (P 12 +P 3 )/P 2 ]| 9 _ So 

+ o[r i ) (E. 1 ) 

as / —+ 00 . It remains to show that [E« qi - (pu + p&) 

/P2]fl=#„ = o(l) • 

From [5, Theorem 2], Of converges m probability to #o, and 
since 0 O is an interior point of B, 0 t converges in probability to 
(the projection mechanism has no effect in a small enough 
neighborhood of 0 O ; see remarks after statement of theorem). 
For any b > 0 we have 

|E*k/i - (pu + P.\)/P2\o-0u = |E^ u />(^-i) - i'(0 {] )\ 

\i'(0t-i)-i'(8u)\ 

Efl n * {|9,-i -«o| >t<) 

H#»-|)-K«ti)| (E.2) 

Let ( > 0 be chosen. Because v(0) is continuous at fln, 
the first tenn on the right-hand side ol (E.2) is bounded 
by c for I) small enough. Wc know 0/_i 6 (-) is bounded 
and i/(0) is continuous tor 0 € (-), so the second term is 
bounded by a constant multiple oi P{|0<_] - 0 u| > ^}. 
For t sulhcienily large this tenn is also bounded by f. Tlius, 
[E 9 < 7 , - (pi 2 +Pj)/P2]fl=8u = <»(I) as I —* x, and E,,„0, = 
Oi , — ]>/o(l) + n(t 1 ) — 0 () + <»(/~ 1 ), which was to be shown. 
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of j/t. uniformly in )' f ~’ and 1. Because A, ji converges as 
t -* oo and © is bounded, u(0) is continuous in 0 uniformly 
in (sufficiently large) t. Thus, fe(0t-\) is bounded and B5) 
is satisfied. 

The fact that the prior is now a function of the observations 
modifies the asymptotic expansion (3.2). Since 8 t -i e 0 and 
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Nonlinear Li Optimal Controllers for Linear Systems 

A. A. Stoorvogcl 


Abstract^-in this paper we study the L i optimal control problem Tor 
linear systems. We will show that by allowing the class of controllers 
to Include nonlinear controllers, we can make the closed-loop L\ norm 
strictly smaller than we could do using only linear controllers. 

I. Introduction 

The L\ optimal control problem has been studied extensively in 
the literature. The L\ problem was originally formulated in |1J and 
[8|. A solution to the problem was presented in [3] (for discrete-time 
systems) and [2| (for continuous-time systems). In these papers it 
became obvious that, when searching for optimal linear controllers 
for these problems, we need infinite-dimensional (continuous-time) or 
very high order (discrete-time) compensators. Especially for discrete- 
time systems, there is now a good theory available for the design 
of linear compensators (see e.g., [5]). The approach taken in these 
papers, however, is a method based on linear programming, and the 
method is therefore essentially constrained to linear compensators. 

The objective of the current paper is to study whether we can 
improve by extending the class of compensators to include nonlinear 
compensators. For H it is known, for instance, that this is not 
possible: the minimum over all stabilizing linear compensators of the 
closed-loop H ^ norm is equal to the minimum over all stabilizing 
(possibly) nonlinear compensators (see [6|). 

In 17] it has been shown that, although for the L\ optimal control 
problem optimal and near optimal linear slate feedback compensators 
are in general dynamic, there always exists static nonlinear compen¬ 
sators which achieve the same or better performance. Moreover, in 
[4] it was shown that nonlinear controllers which are differentiable 
in the origin cannot do better than linear controllers. Via an example, 
it was shown in that paper that nonlinear controller can do better for 
individual disturbances ir. But, for this example, the worst-case L\ 
norm could not be improved via nonlinear controllers. The objective 
of this paper is to show that we can achieve smaller L\ norm if we 
allow for nonlinear, continuous controllers. This will be shown by 
means of a very simple static example. Hence the example applies 
equally well to continuous and discrete lime. For ease of exposition 
we will concentrate on continuous-time systems only. 

The paper has the following structure. In the next section wc will 
give a problem formulation. Then we will present our example and 
we conclude with some final remarks. 

11. Problem Formulation 

We will consider systems of the form 

r.r = Aj‘ + Bu + Eu\ 

if = f'i.r 4- D\[U + Divu\ (2.1) 

Th\ " + D'i >n\ 
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We will assume that .i\ m, h\ y , and ~ take values in finite¬ 
dimensional vector spaces .r(f) € R”, u(t) £ R u \ ir(t) £ R\ 
y(t) £ R* P and -(f) £ R'\ A special case are static systems where 
.i- is absent (// = 0) and we just have 

Vi Uj = D M u + Djair. 

(4 = D'l\U + D'£/U\ 

For a vector in R" we define the L -norm by 

Ill'll » ="Wl>ll'.| 

where i> = (i>i . /'_•. i>„)'. We define ihe function space L * as 

the class o! time-signals / for which the norm 

ii/iu = s "i> \\ m\u 

is finite. This norm will be referred to as the I. K norm. Wc define 
the L x -induced operator norm of an operator {} mapping tr to : by 


( 2 , 2 ) 


(2.3) 


IICIIi = -«P 

n ( / 


This norm is also referred to as the L\ norm of (7. This is due to the 
fact that for operators C, described by a linear time invariant system 
of the form 



F.r -I- Gu\ 
Ha + J u 


(2.4) 


we find that the -induced operator norm is equal to the L i norm 
of the impulse response which is defined by 


phi - r p<')ii'« 

./o 

where lor a matrix Al = {M,,} we have 


p/||-.uax£|,)/„|. 

I 

Clearly, this interpretation only holds for linear time-invariant sys¬ 
tems and in particular does not hold for the closed-loop system we 
obtain by applying a nonlinear controller to (2.1). Nevertheless, since 
the L x -induced operator norm yields a natural extension of the L i 
norm to nonlinear systems, we will often refer to the -induced 
operator norm of a nonlinear system as the L\ norm. 

Note that for a static, time-invariant system like (2.2), first of all, it 
is obviously of no use to consider dynamic compensators. Hence wc 
only consider static, time invariant but possibly nonlinear controllers. 
Then the output al time f of the closed-loop system is only affected 
by the disturbance at time f and the input at time /, and we obtain <> 
finite dimensional optimization problem. Find a function K from R' 
to R m such that I - Du K is invertible and the closed-loop opcratoi 
6\ i from R / to R'' defined by 


0\i D n + D>uK([ - DuKY 'Dvi 
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has minimal L x -induced operator norm where the L^ norm of the 
input and output vectors arc defined by (2 3) 

111 I XAMPLT 

In this section wc will study the following very simple example 

f V ~ u 


n\ 

i 


0 


0 

i 

n 4 


-1 j 

17 - i 

i 


-3 

0 

0 

Id 



l) 

0 



We claim that foi this static example the minimal achievable / i 
noun is 3 The latter can only be achieved however via a nonlinear 
controller Via a linear controller the minimal achievable 1 1 norm 
is equal to 3 75 

A L imai ( ontwller \ 

We will siudy the optimal inpui u loi lour specific tlisluibances 



Lach ol these distuibances has norm oik Hence to aclutvc an 1 \ 
norm equal lo i we have to ha\e ilial lor n and Iht corresponding 
contml input u we obtain an output with norm less than or equal 
lo n For a i n and u « wc hnd that || || v =- 1 4 ||n || For i 
wc have || i|| v — niri\((j- ii \ a \ - 1} Since we onl\ allow linear 
controlleis and u \ - a \ - it - u t we must have //, = */i - u - u i 
We hnd that to make || i|| s In then cither //| u oi u < must 

be larger then 0 75 and hence i oi \ must have norm largci 
than 3 75 I his implies that we can never make ihe 1 \ norm less 
than 1 75 b\ hncai controlleis Moieover the linear coniioller 


yields the closed loop system 


0 7 j 

2 27 

(1 7 r » 

-0 7) 

0 7) 

-2 27 

-2 27 

-0 7 j 

0 77 

0 77 

-0 77 

- 2 2> 

and it clearly achieves a dosed 

loop 1 | 

noun of 3 75 


B North neat Conti ollas 

To find an optimal nonlinear controllei for the system (3 I) we 
again look al the four disturbances from (12) Since we have 
|| ,|| v ^ *+ II it i s obvious that wc Ldnnot make the 
induced operator norm less than three We will now construct an 
optimal nonlinear controller which achieves an /1 norm equal lo 
three Consider the eight corners of the cube ||n IK =1 Besides the 
tour corners given by (3 2) we have 




We find that the optimal input lor each comer rs u t = 0, u> — 0, 
«i=0 «i = i i/ — 0, in ~ 0 u ~ 0, iih — -i This yields an 
output with norm three tor each comer We next extend oui function 
to each face of the cube For any u with ||n || x - 1 we hnd 

it =- \| it j + \^n 4- \i« , -f Ai ii j 

lor some A > 0 with \j H- r \i 4 Ai -= 1 The choice of A is 
not unique bui it is straightforward to derive a continuous selection 
iulc We choose ihe corresponding input as 

ii — \ i u L f \ a 4 \ i ij j 4 A i // j 
and the resulting output is 

= \i , 4 \ 4 \ i , \ i , 

Finally we note that 

II II <Ai|| ,11 +V II Ik +\.|| ,1k +A.II ,1k 
- ^ -'ll" Ik 

In this wa\ we ha\t defined a continuous 1 unction J on the cube 
||m || x. = J Wc can extend ihts function to the whole U i by 

and the closed loop system resulting fiom u = j(u ) can be easily 
checked to have l induced operator norm equal to thiee Wt have 
already seen that this feedback is therefoie optimal 

IV Concusion 

In previous papers the hi si siep was to use the Youla paiametcn^a- 
hon to bring the closed loop opeiator in Ihe form 7 i 4 7 QF \ where 
(} is iht design variable which should be a stable system and which 
determines umqucl) the untesponding controller In [4| it has been 
shown that for 7 —l nonlinear contiollers can not achieve a smaller 
1 i norm This paper gives an example where /7 - / and we can 
do bttlei by nonlinear contiollers T — I is connected lo estimation 
problems while T t /is connected to control problems I his shows 
a dear lack of duality and the best we can hope for is a kind of 
sepaiation structure for I \ controllers which will be of the form of 
a linear cslimatoi interconnected to a nonlinear static state feedback 
Nonlinear static slate feedbacks were already studied in 17] 

This papei basically only tells us that we should study nonlinear 
compensators if we want lo obtain optimal contiollers 
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Simultaneous Observation of Linear Systems 

Y. X. Yao, M. Darouach, and J. Schaefers 


Abstract —This note considers the problem of designing an observer 
which can observe the states for each of a given set of plants. The 
concept of the simultaneous observation is introduced. The simultaneous 
observation is achieved using coprime factorization techniques. Necessary 
and sufficient conditions for the existence of simultaneous observation are 
derived. The observer design is discussed, and an example is given. 


I. Introduction 

In this note we are interested in the problem of designing an 
observer which can observe the states for each of a given set of 
plants. Specifically, suppose that Gu ( a ). G\ (#),■■■. G„ (.s) are a 
finite number of given plants, we would like to know whether or 
not there exists a common observer for this given set of plants. This 
is referred to as the simultaneous observation problem in this note. 

There are two reasons at least to study the simultaneous observation 
problem like the simultaneous stabilization problem [1] in which a 
compensator stabilizes each of a given set of plants. First, because the 
structure of the plant is often subjected to unexpected change, such 
as components fault or variation in operating condition, we wish 
to design an observer which can continue to observe the states of 
the plant in the presence of the plant change. Specifically, Gn(.s) 
corresponds to the nominal plant description to be observed, while 
Cti( s),Cvi(*).■ • ■. (i u (s) correspond to the plant description after 
some sort of structural change. The problem lo design an observer 
which can continue to observe the states of the plant after this sort 
of structural change is equivalent to find an observer which can 
observe the states for each of the plants Gn(*).G'j (a). • - ■. G rf (*). 
Secondly, it may arise in the observer design of a nonlinear system at 
various operating conditions. The nonlinear system is often linearized 
at the various operating points, and some of linearized models can 
be obtained. If the states of each of these linearized models can be 
observed using a common observer, then a fixed observer can be 
used for the nonlinear system. 
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The proposed problem is motivated by the simultaneous control 
system design problem, particularly the simultaneous stabilization 
problem which was fiTst posed in |2] and [3|. For a given set of 
plants, one would like to know whether or not there exists a common 
controller such that all plants are simultaneously stabilized in the 
simultaneous stabilization problem. It is also natural that one would 
like to know whether or not there exists a common observer such 
that the states for each of given plants are simultaneously observed. 

This note focuses on the existence of simultaneous observation. 
The observer design is also discussed. 

II. Problem Description and Preliminaries 
Consider a set of the linear time-invariant systems described by 

f(t) = n,u(t) (I) 

fi(t) = C,.i(/)+ D,u(t). i ^ 0.1, •■■,#! (2) 

where j*(/) f 7?" is the state vector, </(/) € IV‘ is the input vector 
and 1 / E 7? ,n is the observation vector, and A,. 7?,, (\ and D, 
are constant matrices of the appropriate dimensions, faking Laplace 
transformation of (1) and (2), gives 


with 

G, (.s) = ( ,(.s7 4,) 1 B, f /),. 

It is desired to design an observer which will produce an estimation 
for 

--(0 = /->(/) (4) 

with L\ E /?**". An observer for (lie systems (I>—(3) is expressed 
as a dynamic system 

/■(*)-- F{s)tt(s) + (5) 

with the property that the estimation error satisfies 


Inn (77,.r(/) - / (/)) = 0 (6) 


for all where F'{s) and 77 (/s) are TUI x matrices, i.c., they are 
stable transfer function malt ices. If such an observer exists, wc say 
that Gu{b). (m (.s). • ■ ■, G„ (m) in (3) arc simultaneously observable. 

Our main problem lo be solved is: given n + 1 plants as (1), 
(2), or (3), we would like lo know whether or not there exists a 
common observer for this given sel of plants. This is referred to as 
the simultaneous observation problem. 

The right-coprime and left-coprime factorization ol the /th plant 
G\ arc 

GAs) = .Y,(*MC '(*) = .V/,' '(*(AM.*) (7) 


respectively, and ,Y,(»). .V,(s) are right-coprime /7/M matrices, 
M,(m) and V,(j») are lefl-coprime /7/M matrices. For the double¬ 
coprime factorization of TV, (s) there exist /7/M transfer functions 
V,(*). A',(s). V’,(.h) and A\(s) such that 


Y,(s) Y,(s) 

-Y,(s) AMs) 


’MM -A',(.s) 

A'.(s) V,(s) 


' MM 
-A\U) 
I O' 

0 1 ' 


-Y,(s) 

r,(s) 


>',(*) 

A'M) 


-Y,(« 

M,[* 


(8) 
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Each matrix in Bezout identity (8) can be calculated by using the 
existing algorithms given in [I]. The state-space construction of the 
right-coprime factors NAs) and MM is given in the following 

]'!tM = Cii(*I-.A ki r'Bi + D 1 (9) 

MA*) = KM-Akir'Di+J (10) 

where K, is matrix such that A kl - A, -f B t K, is stable and 
C k l = C, 4- Dih’i. 

Define the purtiaLstate f(s) as follows 

Mi{s)((s) - ll(s). 

Then from (7), (3) can be written as 

3/<(a)£(a) = u.(s) (11) 

Hi*) - \\(>)£U) (12) 

and the variable E, .r( s ) according to (4) can be expressed by 

M = PMU«) (13) 

where P, ( s ) can be calculated as follows. 

From (7), the righl-coprime factorization of the /th plant G, can 
be written as 

or - A,)-'D, + D, = ,Y.(«M/,-'(*). 

Using (9) and (10) we have 

(si - A,r'n, = («/ - A k ,r'n,.ur\») <i4> 

which leads to 

.r(.s) = {si - A ,)“’ B,u(s) = {si - Aki)~ l B,Z(x). 

Thus, the variable :( s) in (13) is obtained with BU ^ matrices 

p,(») = E,(si-A h r'n,. (i5) 

Then the problem of designing a common observer for the systems 
(I), (2), or (3) can be stated as finding /?// v -matrices F(s) and 
//(a) such that for all u [ s ) 

~{») - r(s) = 0. (16) 

Condition (16) is satisfied, i.e., E,.r{t) can be observed if and only 
if the following condition holds |5) 

F(s)M,(s) + H(s).X t (s)= PM- (17) 

When only one plant is considered in (1), (2). or (3), the param¬ 
eterization of the observer (5) has been given in [4], |6] using the 
factorization approach 111. The result is given in the following lemma. 
Lemma I ]6]: The set of BH^ matrices F(.s). H{s) that satisfy 

F(s)M(H) + m*)M*)= P(») (18) 

is given by 

F( S ) = P(s)Y(s)-Q{»)X(s). (19) 

H(s) = F(s)\(s) + Q(s)M(s). Q(s) e IIH IX (20) 

and, for every PIL^ matrix Q(s) of appropriate dimensions, F{s) 

and H(s) satisfying (19) and (20) also fulfill the existence condition 
(18) of the observer. 

Based on this parameterization, we now mainly consider the 
problem for the existence of the simultaneous observation. We would 
like to know whether or not there exists a common observer with 
the form (5) satisfying the observer existence condition (17) for the 
given set of plants (I), (2), or (3), and if this observer exists, how 
to design it? 


BL Main Results 

First, we study the problem of simultaneously observing the states 
of two plants which can be slated as follows: Given two plants (7o(») 
and Cm ( s), when does there exist a common observer such that they 
are both observable? 

Theorem I: Given two plants Ga{s) and G i(*)„ define 

Ms) = Yo(s)MAs) 4- XMXM- (21) 

B(s)=-N 0 (s)Mi(s) + Mo(s)Si (s). (22) 

Then C/o(.s') and G\(s) can be simultaneously observed if and only 
if there exists an 7?(.s) E B 77 x such that 

/()(« )A{S) + J?(S) D ( .V ) = P x ( s) (23) 

holds. 

Proof: From Lemma 1, the sets of observers that can observe 
Gn(s) and Gy(s) are given by 

!■((»()) — {Fu ( s ) = P 0 {s)\o(s) — Qn( s)Ao(s ),//o( s ) 

= P> (*)-Vii ( s) -f- Qu (.s )\lo (.s); Q 0 ( s ) 6 RH ^}, (24a) 
HO I ) = { F , (.s) = f\{s)Y ] (s) - QMSM.Hi(s) 

= P\ ( * ) Vi (s) 4- Q i (« )3/i (a): Q j ( s ) E BH x }. (24b) 

Hence G<t(s) and G'i(.v) can be simultaneously observed if and 
only if there exist Qof.s). Q\(s) e 7? 77* such that 

PiMYtM - QMSuis) = Pi(s)YM - QMSM- (25) 
KMXM + QMm*) - lMs)X\(«) + Qi(*)M\(s)- (26) 

Now (25), (26) can be rewritten as 


= [/’,(.'■) 


s) -Yj( 
.s) M ] i 

s) 

s) 

(27) 

which is equivalent to 






[rots) cm*)} _ 

) o ( s ) A o ( 

-Vo (■*') 3 7,) (a 

) 

J/,(*) 
•V) (s) 

-A, (A) 

V, (a) 


= [/',(•• QAs 

')] 




(28) 

from (8). or 






[Pn(s) (Vo(s)] 

’Ms) S(s)' 
IK*) T(s) 

= 

= [Pl(s) 

QM] 

(29) 

where 






4(s) S(.s) 

) o( a) Ai )(s 


-l/|(s) 

•V|(s) 

-Xy(s 

(30) 

B{s) T(s) 

Vo ( a ) Alois 

) 

Y[{s 


Thus Go ( a ) and G'i( a) can be simultaneously observed if and only 
if there exists Qo(s). Qi(s) E BH-* such that (29) holds. Therefore 
Theorem 1 is proved if we can establish that (29) holds if and only 
if ?o( a) A (a) 4 - B(s)B(s) = P\(s) for some B(s) E RH X . 

Suppose (29) holds for suitable Qo(a). Q } (s) E /?// x , then 
Po(s)A(s)+Q 0 {s)B(s) = P\ (s). Conversely, suppose Po(.*)A(a)4- 
B(s)D(s) = P\ ( a ) for some B\s) E B , then (29) holds with 
CM*) = B(s ), Qi(s) = (S(s) + Qu(s)T(s)). This completes the 
proof. □ 

r rheorem 1 shows that the problem of simultaneously observing 
two plants can be reduced to that of observing a single plant using 
a common observer. 

It should be pointed out that the proof of Theorem 1 closely follows 
the corresponding proof of the simultaneous stabilization problem for 
two plants (e.g., see ( 1 , Theorem 5.4.2]). In addition, it is seen that 
4 (.h) and B(s) in Theorem 1 are the same as those of Theorem 
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14.2 in [1], It is shown in [1] that A{«) and £(#) are right coprime. 
Theorem 5,4.2 in [1] states that two systems Go(*) and Gi(#) can 
be simultaneously stabilized if and only if the associated system 
G(#) — iJ(#)A(«)“' is strongly stabilizable, i.e., it can be stabilized 
using, a stable compensator. 

Corollary /; If condition (23) is satisfied for two given plants 
Gq(#) and Gi(*) in Theorem 1, then 

Fo(s)Mi(s) + H 0 (s)N\(fi) = Pi(n). (31) 

Proof: This is a direct result from Theorem 1; its proof is 
omitted here. □ 

This corollary further indicates that the observer of the first plant 
Go(*) can be used to observe the second plant Gj (*). Therefore two 
plants can be observed by a common observer. It is a more intuitive 
result in the following. 

Corollary 2. Suppose Go (s) is stable and Gj ( s) is arbitrary. Then 
Go(-s) and G i(.s) can be simultaneously observed if and only if 
G](w) - Go(s) can be observed. 

Proof: Suppose M 0 " 1 ( s ) Ao( n ) is a left-coprime factorization of 
Go(«) and N\ {»)M { 1 (a) is a right-coprime factorization of G\(,s). 
If Go(») is stable, then Ao(a) =■ Go(*)< Mo(±) — /, In (a) = 1 
and A’o(tf) — 0. From 

G(.) = D(*U(»)'' 

= (A',<*)-Co(s)A/i(s))A/.(*r 
= Gj (s) — Go( x ) (32) 

it can be seen that (A'j {h)-Gq{s)Mi{x))Mi (.s)" 1 is a right-coprime 
factorization of G i(s-) - Go(.s). From Theorem 1 it follows that 

Pn{*)My(s) + 7?(-*>)(iVi(.s) - G 0 (*)Jl/, (*)) = Fj (s) (33) 

which can be written as 

(Po(M-/?(.s)G 0 (^)M/ 1 (.s)+ fl(«)A T i(*) = fi(*) (34) 

define 

F(s) = (Po(s) - /?(.s)Go(«)) (35) 

Hi*) = m*) (36) 

then (34) shows lhat r(*) = F(,s)w(.s) + J/(a)j/(.s) can observe the 
plant Gj (*). It also can be proved that 

(Po(.s) - H(h)Go(s))M 0 (s) 4- R(*)No(s) = Po(s). (37) 

That is r(s) = F(s)v(*) 4- H(s)y(s) can also observe the plant 
G'n(>0. Hence Go (a) and Gj(h) can be simultaneously observed. □ 
It can be seen that from (19) and (20), in fact, r(«) = F(s)u(s) + 
the observer of the first plant Go (*) with Qo ( a ) = *) * 

Next we turn to the simultaneous observation problem with more 
than two plants. We would like to know whether or not there exists 
an observer that can observe the states of each one of a given set of 
plants Go(«).Gi(#),-",G„(,s). In the following theorem, we give 
necessary and sufficient condition to observe simultaneously n + 1 
plants using a common observer. 

Theorem 2: Suppose Go, Gj. ■ • ■, G r , are given plants, define 

A(*) = Yo(s)M,(») + X q (m)NAs)< (38) 

B,(s) = -JVoG)AJ,(*) 4- AIo <*)JV,(a), is 1,2, ■ ■ ■, n. (39) 

Then Go (*),Gi (*),-■• ,G„(js) can be simultaneously observed if 
and only if there exists a R(s) £ RH<^> such that 

A(«)A,(*) + i?(#)J3 4 (s) = P,(«) (40) 

holds for ? * 1«2, 


Proof: The proof closely parallels that of Theorem 1; it is 
omitted here, □ 

It is shown in Theorem 2 that the problem of simultaneously 
observing n + 1 plants is equivalent to that of simultaneously 
observing n plants with a common observer. 

Now suppose /?(*)€ Pi?™. exists which can observe the auxiliary 
plant G(s), then from (5), (19), and (20), the common observer for 
the plants Go(«) and Gj(#) is just 

r(») = [Pu(*)VoU) - J?(.h)Ao(a)]ii(M 

+ [Po(«)A' 0 (.s) + R(s)\Jo(s)]y(s). (41) 

In the assumption of Corollary 2, the plant Gn{s) is stable, then from 
(35) and (36), the observer has the following form 

r(s) = (Po(s) - R(s)Go(s))u(m) 4 R(s)y(x). R(s) £ RH^ 

(42) 

where R(s) can be obtained from (23). 


IV. Design Examplf 
Consider the system 

0 1 

(l 2 1 d'jj 

,V = [2 7 ].c(#) 

- - (1 - 1 }-i(t). 


•!■(/) + 


»(0 


(43) 

(44) 

(45) 


The parameter values (riji - «jj) for the two operating conditions arc. 

1) (flai,fl 2 i) = (-1.-2): 

2) = (-2. -3). 

For these conditions, we get 


cu.i-t-T’ 


- + 2 * + r f " 1 ' 1 ' +.i„ +"-i' 


(46) 


Because the plant 6'n(») is stable, the related elements ol double 
coprime lactonzation lor the first plant are 

A'o(*) - , ~ I t = CA,(.s). 

.s- + 2« + 1 

Mo ( s ) = 1. A'o(s) = 0. !./(•/)- 1. 

= T 

and the related elements of double-coprime factorization for the 


,, . , + 3.s -|-2 v 2.s - S 

M,(x) = - A,(s) = 

«- + 2 s + J 

r> , , » + 5 

I>(H)= ^T27ry 


r + 2-s + .V 


From (21) and (22) in Theorem 1, we have 

A(s) = Yo(s)M\{s) + A’o( s) <V](.s) = Vt' V 

s 1 4- 2.s -1- 3 

B(s) = — Ao( f* ) Mi ( a) 4 * 3/o( s )Ai ( s) 

= -(7n 2 + 9^ + 2) 

(k 2 4* 2« -|- 3)( js 2 4-2.s 4- 1)‘ 

From (23), we get 

R(") = ^^5- e 

IK + 2 

Then from (42), the observer is given by 


r* = 


9 -.s +1 

-=**(*)+ -=—~rir(«). 


(47) 


(48) 


(49) 


(50) 


"« + 2 ” ' 7* + 2 ‘ 

The observer (50) can observe simultaneously the two plants in 
(46). 
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V. Conclusion 

In this note, a class of new observation problem, called simultane¬ 
ous observation problem, is introduced and studied. This is achieved 
by using the coprimc factorization approach. Necessary and sufficient 
conditions for the existence of simultaneous observation are derived. 
It was shown that a general solution for the simultaneous observation 
problem reduces the problem of observing n plants using a common 
observer to one of observing n — 1 plants using a common observer. 
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An Algorithm for Computing the Mask Value of the 
Supremal Normal Sublanguage of a Legal Language 

Michel Barheau, Guy Custeau, and Richard Si-Dents 


Abstract —We consider the problem of finding the musk value of the 
supremal normal sublanguage L i> of some given language L. We describe 
a straightforward algorithmic solution that can be applied to existing off¬ 
line procedures for determining the supremal controllable and normal 
sublanguage of L and that does nut require an explicit calculation of 
Lit. This problem is fundamental because it is related to the supervisory 
control problem under partial observation. Our algorithm applies only 
to closed languages. 

I. Introduction 

One of the basic problems in supervisory control is to design a 
controller whose task is to enable and disable the controllable events 
of a discrete-event system (DES) such that the control requirements 
expressed as a legal language L arc satisfied. A unifying theory 
has been developed by Ramadge and Wonham [111 to define and 
solve this problem. Lin and Wonham (10] and Cicsluk et al 16] have 
studied, in the framework of Ramadge and Wonham, the supervisory 
control problem under partial observation. Following the theory of 
Ramadge and Wonham, the uncontrolled DES is represented by a 
generator, which is a deterministic automaton G = (Q. E. b. qo.Q „ t ), 
where Q is a set of states, E a finite set of events, A: E x Q — > Q 
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a transition function, r / 0 6 Q an initial state, and Q m C Q a 
set of marked states. Let T » {0.1}^ be the set of all binary 
assignments to the elements of E, and let -j 6 F and rr e E, If 
=* ] then rr is enabled; otherwise, er is disabled. Let E t and 
£ t# be fixed disjoint subsets of E denoting the sets of controllable 
and uncontrollable events respectively, such that E = >3, U E, f . A 
controller it> a pair C - where S - (V. A,c. ytuY r „) is 

a deterministic automaton and t;>: V —♦ V is a feedback function 
satisfying: i) <>(//)(rr) = 1 , if rr £ E„; and li) 0 {y){tr) 6 { 0 . 1 }, 
otherwise. 

To control a DES. Ramadge and Wonham introduced a transition 
function b t : T x E x Q -*■ Q defined by 

<\ (■). q) 

b((r,q). if b(tr.q) is defined and ->(or) = 1 
undefined, if b{fr.q) is undefined or ^{(t) = 0 . 

The controlled discrete-event system (CDES) is then the generator 
G, = {Q< F 7 o, Q m ) obtained from C7 by specifying the sets 

E, and E„. 

Following the extensions proposed by Cieslak et al. [ 6 ], we 
consider the case in which C observes all the events of G, through 
a mask or observation function M that maps each event in E into 
an observed event in A U {-). The events in M '“ 1 (-) are those that 
cannot be seen by C. If C cannot distinguish between tr { and 0 * 3 , 
then .U(<T| ) — If \1 is the identity function, then A = E 

and all the original events are observed by C’. The special case in 
which M simply erases some of the events in E occurs frequently 
and is called a natural projection (12] or natural musk [14]. 

Finally, the CDES and controller are embodied in a closed- 
loop system to constitute a supervised discrete-event system (SDES) 
('/G ,, which is defined to be the generator (V x Q, E. (c, o M) x 
, f/(i).V VIJ x (},„ ). where the function 

(C 0 M) :'Jxl'x(^Vxy 

is defined as (U o U) x b, )(<r, y.q) = (c,(A/(rr). 1 /), b t (0{y). tr 7 q)), 
if b(cr.q) and t,( M (rr), y) are defined, and o(tj){rr) ], and is 
undefined, otherwise. 

Let A* C L C L{G) C_ E* and E' C E. We recall that a language 
K is closed il K - 7\\ the prefix closure of K. In this paper, 
we assume that all languages arc closed and denote by s the prefix 
closure of a string s £ E\ Language K is (E\ L[G ))-controllable 
if (V.s € A')(Vrr 6 E')[si 7 £ L(G) sa £ J\]. Language K is 
(UA(<7))-nonnal if (V* € A')(V.s' £ L(G))[M{s) = M(s) =* 
■s' 6 A’]. Finally, A is (E\ A( (7)) -observable if (V.s,*' £ A')(Vrr £ 
E')[( Mis ) — M(s) A .s rr Cl A A .s'rr £ TAG)) => s'rr £ A’]. 

The supervisory control problem under partial observation is 
formally stated as follows. Given a CDES G ( , its legal behavior L C 
L{G), and a mask function M, find a controller C = (5. 0 ) such that 
L[C/G t ) is the largest sublanguage of L that is closed, (E„, L[G))~ 
controllable and (3A L(G)) -normal. This problem is slightly different 
from the supervisory control and observation problem (SCOP) first 
formulated by Lin and Wonham [10], in which M is a natural 
mask und L{C/G .) is constrained to contain a minimal acceptable 
language. 

Let us consider a simple example. Fig. 1 depicts generator G of 
the language of a plant and automaton /? of its legal language L. The 
observation function is shown in Table I. Note that the illegal event 
(‘ of the plant is observed as the legal event b. Note also that the 
words odf f and h<h f of the plant are observed as the legal words 
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adr and hdc, respectively, but that adef and hdt f are not admitted. 
The largest normal sublanguage of L clearly is: Lh = To 

make R an automaton for Lh, the transition from ,ru to . 1*1 on event 
b must be removed as well as state j*:t with the incoming transition 
from j *2 • In this way, indistinguishable behaviors of the plant are all 
permitted by the controller if they are all legal, or all rejected if at 
least one of them is illegal. 

Cieslak etal. [ 6 ] proved, under the assumption that M~ [ ( M (E„) 
- {c[) C E (m that Kit* the supremal controllable and nonnal 
language contained in Z, can be computed as lollows: 

Step I) Compute Lh , the largest ( M* L (6 )(-normal sublanguage 
of L. 

Step 2) Compute M(Lh ). M(L(G)), and A,, = ) — { = }. 

Step 3) Compute A', the largest (A„, ,U( Z(G')))-controllable 
sublanguage of A7(Z/,-), by using the algorithm developed 
by Ramadge and Wonham [15]. 

Step 4) Compute Kr = \I~ } (K)C\ L(G). 

This paper presents an alternate procedure for computing the 
supremal controllable and normal sublanguage Kr, in which Step 
1) above is omitted. That is to say, our procedure does not require 
an intermediate representation of Lh. It includes a new algorithm for 
computing directly the mask value M(Lh) of the supremal normal 
sublanguage Lh . Indeed, Z(G), M(Lh ), and M(L(G)) are the only 
languages required for completing the last two steps of the procedure. 
Compared with the approach of Cieslak et al. [ 6 ], our procedure 
provides a simpler and more straightforward algorithmic solution. 

This paper is organized as follows. Section II presents a review 
of known related algorithms. Section 111 gives a description of 
our algorithm and a proof of its correctness. Section IV provides 
a comparison with existing algorithms and contains concluding 
remarks. 

11. Related Work 

The supervisory control problem under partial observation has 
received much attention, and several algorithms for computing the 
supremal controllable and normal sublanguage of a given language 
have been proposed in the literature [4|-[6], [14]. They can all 
be used for the off-line derivation of a controller, but the worst- 
case computational complexity of all these algorithms is theoretically 
exponential because there is no polynomial-time algorithm for the 
SCOP unless P = NP |13|. The exponential-time complexity 


TABLE l 

The Observation Function M 


a 

a 

b 

c 

d 

e 

f 

M(o) 

a 

b 

~ 

£ 

e 

€ 


results from the construction of one or more deterministic automata 
from nondeterministic automata. But, it should be noted that in 
practice the worst-case occurs rarely [1]. An on-line approach was, 
however, suggested by Heymann and Lin [7] for improving this 
bound. They consider the required control action to be calculated 
at each step of the actual execution of the dosed-loop system. The 
basic idea behind their algorithm is that a controller that has been 
designed for operation under full observation can be modified to 
operate under partial observation. The computational complexity at 
each step is polynomial in the product of the number of states in 
automaton R for the legal language L and the number of states in G . 
Furthermore, the closed-loop behavior thus achieved is a controllable 
and observable sublanguage larger than Lhe supremal controllable and 
normal sublanguage. It should be noted, however, that the normality 
and observability properties are equivalent under the assumptions that 
the language is controllable and all controllable events are observable 
1 % 

Nevertheless, the off-line approach is useful when there is a need 
to design a full controller because the plant is highly time-critical 
or the closed-loop system must be verilied prior to its execution. In 
addition to the off-line procedure of Cieslak et al. | 0 |, Brandt et al. 
[4], [ 14J propose lhe algebraic formula L - M “ 1 (,U( L ( G) - L)) for 
the off-line computation of Lh- These operations can be interpreted as 
follows. Take the set of illegal words, that is. L(G)-L. Compute how 
the illegal words are observed, that is, \I( TAG) - L). Compute the 
set of words, legal or illegal, that are observed as illegal words, that 
is, M 1 (^J(L(G) - L)). Reject from Z the legal words for which 
there arc illegal words observed the same way. This formula can be 
effectively computed with the TCT tool [141 under the hypothesis 
that M is a natural mask. The computation of Lu includes, however, 
the construction of six intermediate automata. 

Cho and Marcus J5] give two algorithms for computing Lr. The 
first one is based on a graphical characterization of lhe notion of 
normal language. Indeed, they prove, under the assumption that R is 
a strict-subautomaton of G, that a language L is nonnal if and only 
if 7* is a subautomaton of 7, where T and T\ are the deterministic- 
automata for M(L(G)) and ,U(Z), respectively. The main step of 
this algorithm consists of the elimination, from 7\, of slates and edges 
so that the resulting automaton 7, is the largest subautomaton of T. 
Language Lh equals LDM~ [ {L{ 7\)). In comparison, our algorithm 
constructs a single intermediate nondeterministic finite automaton, 
that is, the one for P. Needless to say, elimination of states and 
edges is performed according to u different procedure. 

The second algorithm provided by Cho and Marcus [5) constructs 
a nondeterministic automaton for the language Lit directly from 6\ 
i?, and T, under the stronger assumption that G has a property called 
M -recognizable. If this assumption is satisfied, the cardinality of the 
state set of T is less than or equal to the cardinality of the state 
set of G. Unfortunately, G does not always exhibit this structural 
property, and the computational effort required for obtaining an M- 
recognizable nondeterministic automaton from G is equivalent to that 
of their first algorithm. 

III. The Algorithm eor Computing M(Lh) 

In this section, an algorithm is presented for going from A, an 
automaton for the legal language Z, to C*, an automaton for the 
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projection M(Ln ), with respect to a plant G and an observation 
function M. The algorithm comprises two steps. 

In the first step, a deterministic automaton C\ is constructed by 
applying the observation function 1/ to every transition label of R 
and translating the result into a deterministic automaton. The states of 
C'i are subsets of the states of /J, and L[C \) = J \f(L ). The algorithm 
of Lewis and Papadimitriou [ 8 ] can be used for this purpose. 

Fig. 2 shows the automaton lor language \l{ L) and automaton C'i . 
for the automaton R introduced in Section 1 and illustrated in Fig. I. 

Before describing the second step which computes M{Lr) from 
(V. R and A/, let us state some easy-to-satisly properties that G and 
R must have. 

Let T i = ( Y i. L. f \, .rin, A i ) and 7_> = (A D. fj. ./'uj, A y ) 
be two deterministic automata of regular languages f j and I.>, 
respectively, with L\ C L,>, Wc say that T\ refines T> if 

(Vs. / 6 L i )[j] (s. .i m ) = /i (/. .cm ) 

=* /*(■'’.■»■ uj) = /.»(L ■*■«.»)]. 

If /'i refines 7 V then it can be shown [ 6 | that there exists a unique 
correspondence function h :A i —► X> such that 

(Vs £ / i )[/> o /[ (s. oi ) = /_»(*. j'oj )]■ 

The correspondence function h can be intuitively interpreted as 
follows. Given a state .r of T\ reached after the occurrence of string 
s. /#(.!■) yields the state in which 7j would be after encountering the 
same string. 

For example, in Fig. I, R refines G, and the correspondence 
function from the states of R to the states of G is 

/i(Ji) - </,(/ =0.1.2.3). 

In the sequel , 1 let G = [(f. r/u. Q). R — 
and A/: £ —► A U {"}■ We assume, without loss of generality, that 
R refines G. Moreover, let h: X —► (} denote the correspondence 
function, and C\ - OV A. u, j/o. Vi) with Vi C 2 X . 

A. The Concept of Normal Transition 

A transition of C i, from a stale // to a stale //, labeled with an 
event A £ (A U {;}), is normal if there is no state .r, £ y and event 
r £ H such that: 

• in state />(.<*,) of the plant, event t is active, that is, r, /i(.r, ))! 2 ; 

• event t is observed as A, that is, A/(t) =■ A; and 

• event r is not admitted from .r #i that is, t(r,.r,) is undefined. 

1 In this paper, every state is marked and every language is closed. 

2 *(r, h(.r ,))! means that r, /#( **»)) * defined. 


Intuitively, C i is an intermediate structure that mirrors the legal 
language as observed through the mask function. A transition of C\ 
labeled A, from state y to state y\ is on the path leading to the 
acceptance of a word of the form u Ac, where </. r £ A*. Such a 
word corresponds to a word in the legal language of the form 
with £ D\r 6 £, A/(/i) = i/. U(r) = A, and A/(f) sr r. 
If there exists another word a'tV in 746 ’) but not in L. with 
A/(s') = i/, A7(r') = A, and A/(f f ) = i\ this word is illegal and 
observed the same way as the legal word ttrf. A normal transition is 
one for which such a word s'rV does not exist. A normal transition 
labeled A is called a normal A-transition. 

In the second step of the algorithm, generator C'i is obtained from 
f'i by pruning states and transitions. Pruned states are those for which 
nonnormal . -transitions arc active, and pruned transitions are those 
that do not have the normal property. The language of C 2 is M{Ln). 

The concept of normal transition is slated in an operational fashion 
as lollows (let Ea denote the scl of events in H observed as A, and 
H € V. C 2 s ) 

function normal (i/. A) 

{precondition: t,, (A. i/)!} 

forall .i, £ y do 

forall r £ V \ do 

if ^(r, //(.?■,))! and not £(r, .r, )!thon 
return false 

return true 

B. The Compulation oj Cy 

In this section, wc present an algorithm for deriving a deterministic 
automaton (\ from C\ such that a word t in the language of i) 
is ihe projection of some word » in L\ and ii) any word «' in L(G) 
for which the projection is also /, is in L as well. More formally, the 
following two results will be proved (Lemmata 1 , 2. 3, and Theorem 
1 ) 

I) L(C.) = |A/(.s) : > 6 L A (V.s' 6 ,i)(W' £ /.((',')) 
[\Iis")=U(,s' )=>*"€/.]} 

and 


2 )UC.>= \riL,t). 

The algorithm consists of two loops. The first one inspects every 
slate y £ Vj of (\ . By convention, the transition Cj (r. y ) is always 
defined. If this -transition is normal, then state y is copied into 
sel Vj. Otherwise, the -transition is disabled by rejecting state y , 
because it is impossible to act on nonobservable transitions. 

The second loop inspects every event A £ A that is active from 
a state y £ V 2 . If the transition on that event is defined in C\, is 
normal, and leads to a stale in IV then it is also defined in Cj. 

If the initial state go of C\ is in Vj. then the algorithm returns 
an automaton C> = OVA. Cj . yu .!■»)■ Otherwise, it returns <£, the 
empty automaton for the empty language, which has as its set of 
slates the empty set. The computation of Cj is performed by the 
following procedure 

function stcp 2 ( A/, G\ R.hX\ ) 

{Selection of states with normal -transition} 

lv-0 

forall y £ V] do 
if normal (y. ') then 

{Selection of normal A-trunsitions, with A £ A, from 
states in } 
if go £ V 2 then 
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TABLE II 

Comparison op the Aiooriihms 


Alg 

Authors 

Restrictions 

Number of 

intermediate 

automata foi 
producing Lr 

Number of 

intermediate 

automata for 
producing M(Ln) 

Ease Of 
understanding 

1 

Cieslak et al 


4 

4 

Low 

2 

Brandt et al 

M is a natural projection 

6 

7 

High 

3 

Gho and Marcus (1) 


4 

3 

Intermediate 

4 

Cho and Marcus (2) 


3 

4 

Internif dialt 

5 

Our algorithm 


J 

2 

Intermediate 



hig 3 Automaton C 2 


forall (A 0 ) £ ( \ x do 
if (,1 (A, ?/)’ and normal^/ A) and c ,1 (\, 0 ) £ >j then 
(,i(A v) is equal to c,i(A y) 

else 

<,,>( A, </) is undefined 
return O 2 V o Vo ) J 
else 

return # 
end 

We hrst discuss an application of our algorithm, using the example 
presented in Figs 1 and 2 then formally demonstrate its correctness 
Example Referring to Figs 1 and 2 it is easy to see that 
normal! v„ ) returns true tor / = 0 1 This is not the case for t — 2 
however, since we have M{f) = , h{i ,)) defined (that is, 

Hf* </i) f ) and £(/ 1 1 ) undefined Hence state y 2 must be eliminated 
as well as the incoming transition on event r 
Similarly, normal! 1/0 n) is true The result of normal! j/n h) is false 
however because we have M{i ) - M(h) (S(r h{m)) defined (thal 
is, f>(f qoY) and £(r 1 ») undefined Thus the transition from i/ ( 
to y\ on event h must be removed The resulting automaton ( is 
shown in Fig 3 Another application of our algorithm can be found 
in Barbeau el al |3] 

C Demonstration oj Cot redness 

We now formally demonstrate the correctness of our algorithm We 
hrst introduce the following two facts 
fait I) By construction of C 1 given 1 / € 3 1 1 € \ and 

Ci( \/(s) 1 / 0 ) = 1 / A r £ 1 / 

=> Of €£•)[£(/ m)- 1 A A(f rin) 

= //(«,) A W(f)= U(s)] 

ratt2) Given s 6 / (that is £! s jo) is defined) such that 
(W 6 / (<7))[ ’V/!) = 3/m) =* s' £ l ) 
if C(s 7 (jo) is defined and V(s') = U(s) then £(s\ j„) 
is defined 

Lemma 1 Given s e /„ 

(Vi' 6 I(G))[A7(i') = M(s) =W 6 7] 

=> noimal((j(Ai(s), 0 o) c ) 

Ptoof Since O is a deterministic automaton for \J(L) then, 
for an * in L, Ci (A/(i). 00 ) is defined (it ts denoted by y in the 
sequel) 


Let 1 6 1 / and r £ 1 such that />(r /i(i )) is defined and 

\[(t) - From Fact 1) Of £ ^*) [£(/ i 0 ) = i,A «S(/ r/o) 
- h(i ) AU(f) = U!s)] Thus, if we let s' = tr ^(s' </ 0 ) =r 
^(f r »/)) = (7- «*>(/ r/o)) - f»(r //(1 )) is defined furthermore 
W(s') - U(/r) = W(f)U(r) - \/(f) - 17(*) 

It follows from the hypothesis of Lemma 1 and had 2) that 
£(s' < 0 ) is defined Therefore £(r 1 ) = £(r £(f 1 ( )) - £(fr 
1 u) = t(s' id) is defined and by definition normaKO ( I7(s) <A ) 

) is true CJ 

Lemma 2 Given s £ L 

(Vs £ sj(Vs" £ 7(G))[Y7(s") - 3/(s') => s" £ 7] 

<£> l/(s) £ 7(C ) 

The pioof is by induction on the length |s| ol s 

Ptoof of -=t> (Ba m \fr/>)Let s be such that |s| — () Then s - 
By Lemma 1 normal!cj ( W( s) i/ f) ) ) is true Since M( )= and 

Ol Vo) “ Vo then normal! 1/0 ) is tiue Bv construction ol C 

1/0 £ )_ Therelore, U( ) £ I {( ) 

(India non \Up) Let s be such that |s| = //-!-] with n 0 We 
may write s us a <1 foi some u £ L* and some n £ N with |</| — n 
By hypothesis s is such that (Vs' £ s)(Vs' 6 7(G)) [ \7( s' ) 
U(s') => s" £ /] In particular it is such that (Vn' £ //)(V<i ' £ 
7(G))[W(r/") = U(i/') ==> 1 / £ 7] By the induction hypoth 

csis W(</) £ I {(- ) Thereloic (,J)7(n) //o) is defined Foi 
(, 2 ! \7(//rr) 1 / 0 ) to be defined we have to show 

1 ) that normal!c, 1 ( \J(tnr) r/n) ) is true and 

2 ) that Ihe transition from state (,1 ( 3/( 1 /) 1 / 0 ) to state 
C,i 07 (on) i/n) labeled W(n) is normal 

The first assertion follows from 1 emma I 1 he second assertion 
is verified by using an argument similar to the one developed 
in Lemma I Let 1 £ c, 1 ( 37( 1 /) tjo) and r £ L such thal 

f*>( rh(i )) is defined and U(r) =- M(a) From Fact I) 
(3/ £ Z* )[£(f 10 ) = • r A^(f r/n) - //(1 ) A \7(f) = 17( 1 /)] 1 hen 
if WC let s — It b{ s' r/o) = />(/r r/o) = <S( 7 f)( f r/ 0 )) — A ( t /? ( 1 )) 
is defined Furthermore, U(s') = U(fr) = U(f)Ufr) = 
U(/i)U(rr) = \7(i/rr) = 1/( s) It follows from the hypothesis 
and Fact 2) thal £(s' ; 0 ) is defined Therefore £(r 1 ) = 

£(r £(f /(>)) = £(/r 10 ) - £(s' i 0 ) is defined and, by definition 
normal(t, 1 ( W(u), 1 / 0 ) \l{rr)) is true By construction of C\ 
C,i(V(imt) 1 / 0 ) £ h and ^(M(rr) t/o)) is defined 

Therefore U(s) £ 7 (( 2 ) 

Proof of <= (fle/ws vre//) Let 1 be such that |s| = 0 Then 
w = «=:(»<) and M{») = Let us suppose that there 

exists an 1 " £ /(G) such that A7(s") = W(s) Wc may write 
= r, 72 r„ and W(ri) = U(r 2 ) = = l/(r„ ) = 

Sirtce ^/(«) £ /(C 2 ), then nonnal(//o ) is true, by constiuction 
of C 2 Furthermore, since A(r t . go) is defined and V/(ri) = , then 
t(r u to) is defined and £(r t r 0 ) = h 1 ((n r/o)) £ 00 In the 
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same way 

^(r 2 ^(ri,:^o)) 6 yo,...,-C(T fn ,£(...^(r l ,ic 0 )*'-)) £ yo< 

Therefore, £(//\.t*o) is defined and *" £ X. 

(Induction step) Let s £ X, with |«| = n + 1 (n > 0), and 
A/($) € X(CV). Since X and X(Oi) are closed, for all proper prefixes 
s of s. s' £ X, M(a') £ HC 2 ), and the conclusion is immediate by 
the induction hypothesis. It remains to be shown that the conclusion 
holds for fi f = *. 

We may write u as ta for some t £ £* and some a £ S. with 
|/| = n. Assume that there is an s” £ L(G) t with Mis”) = A/(>*-). 
There are two cases: A/(<r) = 5 and V/(<t) ^ 

If M{a ) = f, then hr is observed as t which is a proper prefix of 
s and the conclusion is immediate. 

Consider the case where A/(<r) ^ We may write s” = t n rii ", 
where t",u" £ r € S, A/(/") = A/(f), M(t) = XI(<r), 
and M{u") — r. Since L(G) is closed, /" € X(6 7 ) and by 
the induction hypothesis t" £ L. Let y = G(A/(f).,(/o). By 
construction of C 2 , £U'\.r 0 ) 6 y. Since < 2 (A/(to), t/ 0 ) is defined, 
then normal( 1 /, M(o)) is true, which implies that £(f"r, ,r 0 ) is 
defined, and normaI(C 2 ( Min), y), ■:) is true, which implies that 
.to)) is defined. Consequently, «" = t”ru " £ L. CD 

Lemma 3: 

(Vf € 5T )[f £ L(C 2 ) =► (3.s e X)[A/(.s) = /]]. 

Proof: Trivially true since C> is derived from C\, a determinis¬ 
tic automaton for M{ L), solely by pruning states and transitions. □ 

Theorem 1: If Lu is the largest (XI, L(G)) -normal sublanguage 
of X, then M(Ln) = X(C 2 ). 

Proof: We must show that: 

1) if h £ Lu, then Mis) £ 7,(0?). that is. M{Lu) C X(C 2 ); 
and conversely. 

2) if t £ X(C 2 ). then there exists an s £ Lu such that M(s) = /. 
that is, L (C>) C M[Lu), 

Proof of I: Let s £ Lu. Since Lu is closed and (XI. L{G))~ 
nornuil, we have that s £ L and (V.s' £ *)(Vs" € L(G))[M(s”) = 
A/(.s') => .s" € X], From Lemma 2, we may conclude that 
A/f.s) 6 X(C 2 ). 

Proof of 2: Let / E L (C 2 ). From Lemma 3, there exists an 
s £ L such that X!(s) ~ t, and, from Lemma 2. (V.s' £ s)(Vs" £ 
L(G))[M(s”) A/ (.s') .s" € I] (that is, M~ ] M{*’) 0 L{G) C 

X). From Proposition 3.3 of Cieslak *7 a/. [6], we may conclude that 
s£Lu. □ 

IV. Conclusion 

Table II gives the results of an analysis, conducted in [2], compar¬ 
ing the characteristics of our algorithm with those of the four other 
off-line algorithms mentioned in Section II. 

As mentioned in the Introduction, the computation of (M.L(G)b 
normal languages is significant because it is related to the problem 
of supervisory control under partial observation. The compulation of 
Lu is an intermediate step in the solution of this problem. 

Any of the algorithms listed in Table II for computing Lu can 
be used in combination with either of the two main algorithms for 
computing a controller C. namely, that of Won ham and Ramadge 
|15] and Cho and Marcus [5]. 

The algorithm in [15] is used in [6] for computing the largest 
(X, M(L(G)) [-controllable sublanguage of XI{Lu) and works 
under the assumption that Ai~ A (A/(£ u ) ~ {^}) £ ■£„. That is, 
no controllable event can be observed as an uncontrollable event. 
Furthermore, it takes as input the projection of Lu, that is, A/(Zw) 
rather than Lu. Therefore, the most suitable algorithm in that 
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context is the one which produces M(Lu) with less effort. This is 
Algorithm 5 because it computes an automaton for M(Lft) through 
the construction of only two automata. 

The algorithm in Cho and Marcus [5] is more general than 
that in Wonham and Ramadge [15], Indeed, the assumption that 
A/"" 1 (A/(E„) - {-}) C is not necessary. The former is based 
on the application in alternation of the algorithm of Wonham and 
Ramadge and an algorithm for computing an automaton that generates 
Lit. This process is therefore iterative, and generality is obtained 
at the price of a longer run-time. Any of the algorithms listed in 
Table II can be used for computing Lu. The most suitable one is 
Algorithm 4, however, because it does not require reconstruction of 
any intermediate automaton after the first iteration [5]. 
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Decentralized Robust Control of Uncertain 
Interconnected Systems with Prescribed 
Degree of Exponential Convergence 

Zhiming Gong 

Abstract —This note considers the problem of decentralised control of 
an uncertain Interconnected time-varying system which does not satisfy 
the so-called matching conditions. The uncertainties, which may be 
decomposed into two portions, a matched portion and a mismatched 
portion, may appear in the subsystems and also in the interconnections 
between the subsystems. A robust decentralized control scheme fs pro¬ 
posed which guarantees the controlled system to converge exponentially, 
with a prescribed degree, to a residue set with a prescribed bound when 
the mismatched portknt of the uncertainties satisfies a certain bound 
condition. 

I. Introduction 

In recent years, robusl control of uncertain systems has attracted 
great attention. Since uncertainties reside inevitably in real inter¬ 
connected systems, decentralized control of uncertain interconnected 
systems is of great practical and theoretical interest 11J—[10]. 

In robust control of uncertain systems, the so-called matching 
conditions [11] play an important role, it is well known that an 
uncertain system is stabilizable when the matching conditions are 
satisfied (see, e.g., [II ]—[ 14]). Many efforts have been made to 
relax the matching conditions on the uncertainties, and a number 
of sufficient conditions for existence of stabilizing controllers for 
uncertain systems have been obtained. Some of them arc stated in 
terms of bounds of mismatched uncertainties [7], [15], while others 
are stated in terms of existence of positive definite solutions ot certain 
Riccati equations (16]. 

It is interesting that the matching conditions also play an important 
role in decentralized control. It has been shown that when the 
interconnections between the subsystems of an interconnected system 
are through the input matrices of each subsystem, i.e., the matching 
condition is satisfied, the interconnected system is decentrally stabi¬ 
lizable [4], [5]. This remains true when the subsystems (including 
the input matrices) and the interconnections are assumed to have 
uncertainties [10] and even when the interconnections have higher- 
order uncertainties [9]. For the case where the matching conditions are 
not satisfied, Siljak [1] considered a class of interconnected systems 
with uncertainties in the interconnections and, by using the concept 
of vector Lyapunov functions and the properties of Metzler malrices, 
obtained a sufficient condition for stability in terms of bounds of 
uncertainties. Chen [8] took uncertainties in the subsystems into 
account and obtained a sufficient condition for stability in terms ot 
existence of positive definite solutions of certain Riccati equations. 

In this note, we consider lime-varying interconnected systems 
with uncertainties which do not satisfy the matching conditions. The 
uncertainties are possibly nonlinear and fast time-varying and may 
appear in the subsystems (including the input matrices) and also 
in the interconnections among the subsystems. By decomposing the 
uncertainties into two portions, a matched portion and a mismatched 
portion, and based on the bounds of the uncertainties, we propose a 
robust decentralized control scheme which guarantees the controlled 
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system to converge exponentially, with a prescribed degree, to a 
residue set with a prescribed bound when the mismatched portion 
of the uncertainties satisfies a certain bound condition. 

II. Problem Statement 

Consider an uncertain large-scale system which is composed of N 
interconnected subsystems described by 

•r.(t) = A,(f).r,(f) + B'(t)uAt) + w,(t,.rAt), u,(f)) 

N 

+ ^ ( 2 . 1 ) 

where /' = 1,2, ■ ■ ■, N\ j\(f) E R>"' and </,(/) E R'"‘ are, 
respectively, the state and control input of the /th subsystem, and 
A ( (f) and B t (t), which represent the nominal part of the sub¬ 
system, are matrices of appropriate dimensions and with possibly 
time-varying elements. The unknown functions «*,(/,.!•,(/). u,(t)) 
represent internal uncertainties in the subsystems stemming from 
parameter uncertainties and input disturbances, and the unknown 
functions #,,(/, .r ; (f)) represent uncertain interconnections between 
the subsystems. It is assumed that .4,(/), 7?,(f), " ( (L■' AO* "AO) 
and fulfill the conditions for existence of the solutions 

of (2.1). 

Equation (2.1) can be rewritten routinely in the following compact 
form 

Jr(t) = A(t)jrlt) + D(t)u(t)+ (t).u(t)) + */(/,.i (0) (2.2) 

where .r(t) = [./• / (f),.t \ (/).■ ■ •. a\ (f)]' E R M and //(f) = [uf 
(f),i/j (/),-•-, i/\ (f)] / E R"‘ are the state and control input of 
the overall system; .4(f) and Bit) are block-diagonal matrices with 
elements .4,(f) and /J,(f), respectively; and ir[t..r(t). u(t)) and 
r/(f, .#■(/■)) are the corresponding functions determined by (2.1). 

Note that in system (2.1) ihe uncertainties represented by 
ir,), //,(f)) and //jj(/..f a j(f)) do not satisfy the so-called 
matching condition, i.e., they may not be in the range of the input 
matrices ZMf). Without loss of generality, it is assumed that the 
uncertainties can be decomposed into two portions, a matched portion 
and a mismatched portion, so that 

ir,(f, .i\(f), i/,(f)) = B,(t)d* (f,.i\(f), a At)) 

+ (1At"i‘At).uAt)) (2 3a) 

r/ ( ,(/..r,(f))=: BAt)fi*,)(t< 't'j (i)) 4- // / y (f. .i ; (f)) (2.3b) 

where rC(-,-,-), rf,(-,-,-), //*,(■,■), and //,,(>.•) are unknown func¬ 
tions. Clearly, this decomposition is nonumque. Generally speaking, 
it is desirable to keep the mismatched uncertainties r/, (■. ■, -) and 
/),,(-,-) as small as possible by an appropriate decomposition, since 
the matched uncertainties do not impose big difficulty in the stabi¬ 
lization of the system, as shown later in this note. 

The following further assumptions are made on system (2.1). 

Assumption J; The pair (-4,(/), BAt )) of each subsystem is uni¬ 
formly completely controllable. 

Assumption 2: For each t<j E {1,2, ■ ■ ■, A’}, there exists non¬ 
negative constants f,,, £* . CmC- 9 ,,, and r/,V such that 

l|fc.jU.^(*))||< UlMOll + »/„ (2.4a) 

\K^Am<c\UAm+v:j (2.4b) 

llrf.d. j,( 0 ,«.( 7 ))|| < iuWr.it )|| + cikmil + (2.40 

l|r/;(f. J -,(f), u ,(r))|| < CiIku)ll + Clkmil +»»:, qm * 
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With 

C < 1 (2 4e) 

where || || denotes the standaid Euclidean norm 
It should he noted that Assumption 1 above specifies a condition on 
the nominal part of each subsystem while Assumption 2 specifies the 
bounds of the internal uncertainties of each subsystem and the bounds 
of uncertain interconnections Condition (2 4e) can be mierprcied as 
that the uncertainties associated with u ( t) does noi overt ide totally 
the nominal control effect of </ (/) to each subsystem 
To state the problem studied in this note the following definitions 
are required 

Definition 1 A closed set S C R is called a (global uniform) 
final attractor of a system with the slate i (f) C R ill tor each initial 
condition t (t u ) = i 0 there exists a finite constant l(t ( S) such that 

fit) £ S V/ > /( +T(fo S) (2 5) 

Furthermore it tht final attractor of the system is a ball D = {i t- 
R II r || S 1 1 *l le radius / ol D is called a final bound ol the 
system 

Definition 2 Suppose that a system with the stale i (/) £ R 
has a hnal attractor B The system is said to possess a dcgiec of 
exponential conveigence n toward the hnal amactor tlf lor all initial 
conditions /(to) = / C R \B 

||/(/)||< /(k)* Vi(f)€R \B Vf >/, (26) 


Theorem / Consider the case where the decentralized controller 
0 T) is applied to system (2 1), which satisfies the conditions given in 
Assumptions I and 2 Choose y lor the decentralized control (4 4) 
so that 

' > > (* 4 • + £ c ‘ ) (3 4) 
and 

C +9 < 1 (4 5) 

whcie (f and aie positive numbers Then if there exist positive 
numbers h and u, ,(/ y - 1 2 A ) such that 

the resultanl dosed loop system has a hnal altiactor B, and possesses 
a degiee of exponential conveigence n given by 

<\ - niiujn 1 n n \ } (4 7 ) 

toward B The final bound / of the closed loop system is given by 

1 — - (<t y ^ “F ^ OK) 

where h a and 1 aie positive numbers given by 

i^ — 111111 {A | <s fr \ } a — ^2 1 1 


where n is a positive constant and f(i 1 ) is a positive constant 
possibly depending on / 

The pioblcni studied in this notL can now be stated as follows 
Tind a decentralized state feedback contiol law foi ihe system (2 I) 
so that the resultant dosed loop system 

1 ) has a final attractor B with a prescribed final bound / and 
n) possesses a presuibed degree of exponential convergence 1 
toward the hnal attiactor B 

111 Dk tnirai 1/1 n Rom si C oNiRoi Dimc.n 
F or system (2 1) consider first the following mairx Riccati equa 
lions 

—P (f) — { 1 UH 'I / )'/’(» + P(M(4 (f) + « I) 

- P V(t)BM)Ti' (M/MM+ 1 j C'D 

wheie J denote the n xn dimensional identity matrices and n /> 
and i arc positive numbers Under the assumption that the pairs 
(4 (f) B [t)) are uniformly completely contiollable the Riccati 
equations (4 I) have positive symmetrical solutions P {t t f L) on 
[0 t f ] corresponding to a terminal condition P{t ( ) - I ^ 0 117| 
Let P (t) = lini/^-x PU U 0) ll is known that this limit exists 
and F,(t) are positive symmetrical solutions of (4 I) 117J. | IK) Also 
there exist positive constants // and o such that [18, Lemmas 5 and 
6 ] 

// r < P(t) < //1 02) 

ft is dear that the minimum and the maximum eigenvalues ol P,(t) 
over all time t can scive as the positive constants /v, and v, 

Now consider the following decentralized contioller 

«,(#)■= (t) =12 (14) 

for system (2 1), where i, are positive numbers The main result of 
this note can then be stated m the following theorem 


• -£'(£"*, 

1 \ 1 > 

Pi oof of Tht oh m l Consider a Lyapunov function candidate 

\ 

1 (/ ' > - £ > y p(0» (19) 

1 

foi the ovciall closed loop system ll is deal from (4 2) and (* 9) lhat 

i'\\>\\- £i if *)<HMr (no) 

where /< 111111 {/r 1 // /1 \ } and v = max{/>i rj v\ } 

The total time derivative of \ (t /) along the closed loop system 
is given by 

\ 

it//) = £(('/’(/>» + i r P{t)i + 1 ' P(t)i ) 

I 

\ 

- -£{.*« 1 / r 1 +U' -t>) 

l 

X t 1 I'D D 1 Pi + J 1 ' I J 

+ £21 J ,(/ 1 , 11 >+ ^ >j)| 

I l J-\ Hti J 

oil) 

By (2 3) and Assumption 2 

£2i'p|h (t 1 11 ) + ^ 7 ,(/ r ,)1 

i ( i-i 1 * J 

r £- > ll^'.ll|c ih.c'r. 11 + + »/,)J 

+ £ 2 \\D' P ,,,II jo Ih, d! p,, 11 + jjc ,IMI + ) J 

(3 12) 
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By using the inequality 


2 ab < - a 2 4 ch 2 
“ r 


Enp = *IMI 


for any real numbers a , h and < and the fact that 

as P, are symmetrical matrices, it follows that 

a ( \ ^ 

«,(f. r,.«,)+ £ 

I y— I J 

^ 12^n'C 2 ^ll , .|| 2 + 26 * 0 . 11 ^ 7 i’-r.ll 2 

isrl l ' 

+ ]C (— + II 1 > ll J + 2, m 


2 + ^'ill , /l| 2 + 2l) t/ V ||j, 


+ E P, r,|| 2 + £ (c* P.' P, r,||' 

.=1 l J- J 


it follows that 


1 (/,j)<-2 nl (#, I )-/>||-'I| 2 +2<t||i|| + <, 

= -2M (t. i) - (|| 11 | - i )(A|| r|| 

+ \JaT+Ti - u) ( 120 ) 

where a,rt. n\ ^ and > are positive numbers as given in (3 7) and 
(3 8 ) From (3 20), it is clear that when || 1 1| > t , we have 


and therefore 


!(/.;)< -2r*\ (t 0 


1 It r) < l (f 0 


Hence, by (3 10), we have 


CIKIl' + ^p/i^ll) 


Noticing the fact that 

EX>.,+c) ii r, r=E E (*, + o ii *. ii' 

t~ I /-I » = 1 y -1 

substituting (3 15) into (3 11) gives 

\ 

1 (f, r) < — ^ 2o, i ! P, i 

i-i 

-E -Ec} 

x||P ; P, II'-£{/.-£(*, + 0 

<=i 1 ; i 

+ EE( 2 , ^ , ll''ll + 2| /-*P/r,, 1 ||) (i i6) 

By the choices of and 0, as given in (3 4) and (3 5) and by the 
assumption (3 6), we have 

1U1)<-E j 2a <!P>'+ ,||iJ/P,j,|| J + M|i.|| 2 

-E(2»/w^ll'.ll + 2^||P'poll'll (117) 

Since 

E(- .Wtir.'.r + phaB!r,',\\^ 

=| (3,8) 

and 

EE , ^' /, ii ,, ii - E 


'-ii- oil. 


for all /(f) E R"\B Equaiion (3 23) implies thal for each initial 
condition i(/«) = iu» there exists a finite constant T{ in B ), given 
by 

Tit, B ) = - In 1 / " 11 (124) 
/' 1 

when / o E R' \B and I ( t 0 B ) = 0 when / o E B such that 

/(/) E B Vf ^ tu 4 7(1(1 B ) (3 25) 

According to Definitions I and 2 (3 21) and (3 25) imply that ihc 
system has a hnal attractor B with the final hound t and possess the 
degree of exponential convergence n toward the hnal attractor B 
This completes the proof of Theorem 1 

Remark I If the uncertainties of the subsystems and the 
uncertain interconnections satisfy the matching conditions, i e 
<1,(t i ,{t) u [t)) and h ,(1 in (2 3) arc zero and therefore 

and //, arc zero, then (3 6) can always be satisfied In 
this case, the proposed decentralized control scheme can always be 
successful and guarantees the contioiled system having a prescribed 
degree of exponential convergence and a prescribed final bound For 
the general case, however, theic is no guarantee that the condition 
(3 6) is satisfied, as the mcieasing of the numbers 1, may also cause 
the increasing of v which are associated with the solutions of 
the Riccati equations (3 I) This reflects the fact that Assumptions 
1 and 2 along can not guarantee the existence of a decentralized 
controller which stabilizes the system This is true even in the case 
of linear time-invariant systems without uncertainties [J9]-[21] 
Remark 2 When the free portion of the uncertainties bounded by 
ihj and ij* f in Assumption 2, are zero, as assumed by many authors 
[ 11-[9|, a zero hnal hound is achieved and the system is exponentially 
stabilized (assuming the proposed control scheme is successful) If 
some numbers j} j and //*, are not zeros, however, there exists no 
control scheme which guarantees the controlled system to have a zero 
final bound, since such uncertainties can always drive the system off 
the nominal equilibrium at any time In this case, large numbers -), 
and i f may be chosen so that the final bound is satisfactorily small 
Remark ? From (3 20), it can be seen that there exists a number 
ri < r such that the derivative of 1 (f /) is still negative when 
n < IMI < r Therefore, the closed-loop system has a final 
contractor B, j The system may not possess the degree of exponential 
convergence o toward the final attractor B,, 
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Based on Theorem J, a design procedure of the decentralized robust 
control scheme is given as follows 

Step I) Verit) Assumptions 1 and 2 

Step 2) For a given degree of exponential convergence n choose 
numbers n p and i , and solve the matrix Riccati 
equations (3 1) 

Step According to (1 4) and (1 5) choose the numbers - and 
9 and check the condition (1 6) with different sets of the 
positive numbers ^ l; 

Step 4) If in Steps 2 and 3 a set of positive numbers 
o ,/> i, * and -t , is found such that (16) is 
satisfied, go to next step Otherwise, repeat Steps 2 and 
3 with different set of numbers n />, i 9 and a , 
It no set of the numbers <\ p I - 9 and ^ , can 
be found such that 0 6) is satisfied this decentralized 
control scheme is not applicable to the system and this 
system is probably not decentralized stabilizable 

Step Use (3 8) to estimate the final bound / of the closed loop 
system and construct the decentralized feedback controllci 
(3 3) 

It should be noted that m the computing of i the largest num 
bers and b which satisfy 0 4) and 0 6) should be used lo 
estimate the smallest final bound of the closed loop system Also 
the design pioetduic given above may be repeated with ihe numbus 
t\ p i 9 and ^ adjusted to obtain a beltei control law and 
a better estimation of the hnal bound 

IV CONfllSION 

In this note a robust decentralized control scheme tot uncertain 
nmt varying interconnected systems is pioposcd Genual uncertain 
tics arc considered which do not satisfy the matching conditions and 
may appeal in the subsystems and m the interconnections between 
the subsystems as well The proposed contiol scheme is simple in 
the structuie and yet it guarantees the controlled systems to have a 
presuibed degree of exponential convugtncL and a predictable final 
bound when the uncertainties satisfy a certain bound condition 
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Hedging-Point Production Control 
with Multiple Failure Modes 

Paul Glasseiman 


Abstract —We consider Ihe control of a production facility subject to 
multiple failure modes Motivated by work ot Akdla and Kumar |1] and 
Bielccki and Kumar |5] on single-failure-mode models, we study hedging- 
point policies, in which production is controlled to its maximum rate 
whenever inventory is below a critical level and set lo zero whenever 
inventory is above that level 1 he maximum production rate varies with 
the state of the machine Assuming that the machine state is governed by a 
semi-Markov process, we evaluate average and discounted inventory costs 
for any hedging point, thus providing a simple mechanism foi identifying 
optimal hedging points Our most explicit results require that intervals 
in which demand exceeds production are exponentially distributed. We 
drop the exponential assumption at Ihe expense ol obtaining asymptotics 
rather than exact results 


I In iRODiic riON 

We consider the control of a production Utility subject to various 
types ol failures The facility oi machine can be in any of n states, 
labeled 1 n In state / the machine has maximum production 
rate i the production rale can be controlled to any level not 
exceeding ihe maximum late in the current state The durations of 
visits to each state are stochastic and mutually independent those 
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to state i have distribution F,, Upon the completion of a holding 
time in state r, the machine state becomes j with probability R tJ 
independent of everything else. Demands arrive at a constant rate 

Production in excess of demand accumulates as inventory; when 
the demand rate exceeds the production rate, the inventory Level 
decreases. Negative inventories reflect unfilled demands. Broadly, the 
objective is to minimize costs associated with (positive) inventory 
and unmet demands. 

Motivated by work of Akella and Kumar [IJ, Bielecki and Kumar 
[5], and Kimemia and Gershwin [10], we analyze the performance 
of a simple class of control rules based on a critical number 
sometimes called a hedging point: when the inventory level is below 

produce at the maximum possible rate; when the inventory level 
is at c, produce at the maximum possible rate or at the demand 
rate, whichever is smaller; when the inventory level is above 
do not produce. Our main result (Theorem 1, below) characterizes 
average and discounted costs associated with this class of policies 
under relatively mild assumptions. 

To state the result, we need to introduce some further notation. 
Let r, = d - r, be the net rate of decrease of inventory in state 
/, i.e., the demand rate minus the maximum production rate in that 
state. To rule out trivial cases, we always assume that there is at least 
one strictly positive r t . We assume throughout that the matrix 77 of 
machine-state transitions is irreducible; this implies the existence of 
limiting probabilities (zri ,• ■ ■. ) for the machine states. For each 

> 0, define an n x n matrix-valued function ‘l' 1 (■) with entries 

= [ VX[ >[(/-,<9--,).!•] JF,(.!■). (1) 

Jo 

Each has positive entries; when these are finite, the Per- 

ron-Frobenius Theorem (see, e.g., [151) asserts that $*M0) has a 
real eigenvalue /M0) equal to its spectral radius. Denote by A, the 
strictly positive solution (whenever it exists) to 

/M A-,) = l. V,(A,)<x. (2) 

Suppose that cost>accrues at rate /(.r) whenever the inventory 
level is .i\ with / mapping (-tl.x) to [0. *x,). Denote by A', 
the inventory level at time t under policy parameter *, and suppose 
A‘ n * = Let 

VU) = Inn f 1 / f{Xi)dt 
1 - x J\ j 

be the average cost under this policy, and let 

Jo 

be the corresponding expected i -discounted cost starling in machine- 
state i. Part of the content of our main result is that V(~) is well 
defined. 

Call state / a deficit state if r, > 0. We characterize \ ’ and 1 under 
the assumption that holding times in deficit slates are exponentially 
distributed. Later, wc drop the exponential requirement at the expense 
of obtaining asymptotics and bounds, rather than exact results. 

Theorem l ■ Suppose that F, is an exponential distribution for all 
i with r, > 0. 

i) If *tr, < 0 and if An > 0 solves (2) at ^ = 0, then there 
is a Co € (0.1) such that 

r(i)*(l-Co)/(;) + Cn /' til. (3) 

Jo 

ii) For any -) > 0, if A^ > 0 solves (2), then there are constants 
CY,, i — 1 , - • ■, n, such that 

Ji) 


In proving this result, we give expressions for the constants Co and 
C v ,; these are most easily evaluated when there is a unique deficit 
state. We also argue that the existence of A^ >0 in (2) should be 
considered typical. 

A consequence of Theorem l is that optimal hedging points can be 
identified by minimizing the expressions in (3) and (4). In a case of 
particular interest, costs for inventory and unfilled demand are linear; 
i.e., there are constants r+,c_ > 0 for which 

/(.* ) =• r+ max].r,0| + r_ niax{— j\ 0}. (5) 


Write log+ j for niax{0,log.r}. Using (5) in (3) and (4) and 
minimizing over : gives the following corollary. 

Corollary 1: Suppose the assumptions of Theorem J hold and 
that / has the form in (5), then 
i) under the average-cost criterion, the optimal hedging point is 
♦ 1 ( Co(r> + < ) \ ^ 

■ - {— -j ® 


ii) under the ^-discounted criterion, the optimal hedging point 
from initial machine state / is 


'i 1 


. . 


(7) 


Akella and Kumar 11] and Bielecki and Kumai |5| considei a two- 
state version of this model with exponentially distributed holding 
times and the linear cosi function in (5) In that setting, they prove that 
the class of policies considered here includes the optimal policy under 
discounted and average cost criteria, respectively. They also present 
counterparts of (6) and (7). We do not prove (or even suggest) that 
these policies remain optimal in our more general setting; howevei, 
their simplicity and relative tractability makes them appealing. 

The key to our analysis is a link between the mvenior) process 
and a time-reversed, continuous-time random walk in a semi-Markov 
environment. Through this connection, we are able to draw on results 
for random walks, in particular the work of Asmussen |4|. For two- 
state models, there is an equivalence with queues, as pointed out by 
Hu and Xiang |8|: see Chen and Yao [6] and Kella and Whitt [9] 
for related observations. None of these related references considers 
discounted quantities. The matrix <I>o(0) and its leading eigenvalue 
/>n(0) play an important, related role in the large deviations theory for 
Markov additive processes, as developed by Ney and Nummelin [ 14). 

We prove Theorem 1 in Sections 11 and III. Section JV presents 
asymptotically optimal hedging points when holding times in deficit 
states have general distributions. 


II. Avlrac.f-Cosi Analysis 

Let us take the hedging point : to be fixed and write X, = X, for 
the inventory level at time /. with Ao = ;. Let ,7 f denote the state ot 
the machine at time t. Our analysis of the inventory level is simplified 
if we work with the process 11 = i - AY t > 0, recording the deficit 
in the inventory level. The process V has the advantage that its law 
doevS not depend on :: during intervals in which J ( — /, 11 increases 
at rate r<, unless 11 = 0 and r, < 0, in which case 11 remains at 
zero. This description reveals that 1’ is the image under reflection at 
the origin of a free process evolving according to the same rule but 
without the constraint at zero. More precisely, let So = 0 and let St 
increase at rate r t whenever .7/ = i. Then 

11 = sup (S t 5 a ). (8) 

Uv t 

(ThisType of representation is standard in queueing theory.) From this 
formulation, we will evaluate V(*) by evaluating the expectation of 
/(- - 11) with respect to the stationary distribution of 11. 
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Now let (Sf.J#) have the law of the time-reversal of (S/,J f ). 
defined as follows, The process J is a semi-Markov process on 
' N }» with the same holding-time distributions Fj, • ■ ■«F„ 
as J. and with embedded transition probabihtes U* f = 
where (i'i, ■ ■ *. ) arc the stationary probabilities associated with the 

matrix 7?; i.e., ///?=: /✓. (See, c.g., [3, Section II.51 for background 
on time-reversal.) In particular, then J has the same stationary 
distribution as .7, given by 7r* = iv” t )* ' = L 

with m # the mean of F, [7, p. 3421. The process S t starts at zero 
and increases at rale r, throughout intervals in which J, = /. Wc 
take .7 to be right continuous. 

If we give J 0 the stationary distribution tt, then, for each t > 0, 
we obtain a stationary version of < u < ^} by .setting 

•7„ — 0 < i/ < t. Similarly, we can couple S to 5 by setting 

S„ ■— 5? — St 0 < i/ < /. With this construction, from (8) we get 

Y t ■■ sup (St - S u ) - sup (5i - St „) = sup s 

(Kii' t O' II : / ()• / 

Therefore, for any .r > 0 and any /,.y 6 {(), 1.* the events 

{V/ < .r.Jn = E «7 ( = ,/| and {sup ll# t S„ < = j.,7, ~ 

/} coincide, and so have the same probability. Defining Mi — 
s1l P<j. 11 't S tl and using the stationarily of.7 and ,7, we conclude that 

7rj P, {1) < ,r. 7, — ./) = 7r ; r ; ( U, < a. .Jt =• /) (9) 

where ihc subscripts on P indicate the initial machine slate (.7 ( > on 
the left, ./n on the right). By summing over /, we gel 

r **7i - y) = x,p,(Mi < .»*: dO) 

Next, we consider the limit as / increases. Because Mi is almost 
surely increasing in t , the limit M = Inn/— x ]// exists with 
probability one; moreover, under the drift condition ]T ( 7r,r, < 0 
in Theorem 1-i), M is almost surd) finite (2, p. 309]. Also, since 
[Yf.Jt) is evidently regenerative, it has a limiting distribution not 
depending on the initial state; let (V v .,7 x ) have (hat limiting 
distribution. Then from (10) we get 

PO\ < .»../* = j) = lim V < ./. ,7/ - ./) 

t — Z—r 

lim n f Pf(Mt < .i) 

= it j Pf( 3/ < .r). (II 

Summing over ,y, we have proved the following (compare [3, Propo¬ 
sition 2.21). 

Lemma P Under ihe assumptions m Theorem 1-i), PO ^ < 
.#*) = Yl, TT/F y (37 < r). Consequently, J'U) = E[/(: - T x )] *= 
Ej ' r ,K / (.f(i - -W)]. 

In light of Lemma 1, to prove Theorem 1-i) it suffices to evaluate 
the distributions P,{M < ■), j - l.-.u and to show lhat the 
expression in the lemma coincides with (3). The properties we 
need can in principle be obtained from the analytical results of 
12], Instead, we use a change-of-measure argument based on similar 
techniques in [4|, 113], 117]. Wc work with the discrete-time process 
{(St u . Jr„)*n — 0. L- *1, where r„ is the epoch of the nth jump 
of J. 

For fixed *, > 0. let A n be us in (2). We need two preliminary 
results. They are easily verified by induction and similar to results 
in [4] so we omit their proofs. 

Lemma 2: The quantity E ; [*'xj>( A-S T|| - i r„);,7 rn = /] is the 
.//-entry of (4^(A-))". 

By the Perron-Frobenius theorem, <I r ' (A-,) has a strictly positive 
right-eigenvector h (depending on o) associated with the maximal 
eigenvalue p-,(AJ = 1; i.e., ^(A-> )h = h. We have the following 
lemma. 


Lemma 3: E y [rxp(A n S T(< - ~t u )h(J Tn )//*(>)) = 1, for all j and 
n. 

A consequence of Lemma 3 is that, with ./ ( , = j . rxplA^St,, - 
' t„ )h(Jr n )/h{ j ) defines a change of measure. More explicitly, with 

tM) ‘7!0T " FM "' (,2) 

and Ph(j) = Y,, H'th(j)' K Y a probability distribution function. 
Let P and E denote probability and expectation, respectively, when 
the holding times in state i have distribution F t . i = l, * ■ -. /?, These 
are related to the original process through a version of Wald’s identity. 

Lemma 4: For any : > U, suppose there is a A n > 0 solving 
P-, (A-) = 1. If A T is a slopping time for the process {( S T „. J Ttl ). n > 
0} and if the event A is measurable with respect to {(Sr M . .7 r „ ),0 < 
n < A ], then P,{ A: X < x ) = E,[oxi>{~( \-,S TN - it\ )\h(t)/h 

[Jry): V < -x]. 

Similar results are proved in |4,| 113], and [17]. The proof is 
standard, so wc mint it. 

Under the new measure, S has positive drift. 

Lemma 5: When the holding times have distributions F,, i = 
1.*.//, lim*— ^ 1 S, >0 (and is independent of the initial state). 

Proof: By Lemma 5.3 of 114], Inm-rx t 1 S, = (A- ( ). From 

115, Theorem 3.7). we know that p, (*) is convex. We now separate 
the cases - - l) and - > 0. For the former, we find that $°(0) = /?, 
a stochastic matrix, so p u (0) — L Moreover, pJj(O) is the drift of S 
under the original measure and is therefore negative, by hypothesis. 
Thus, the convexity ol p (J implies that if A f , > 0 exists, then po is 
incieasing at A () . In the case ' > t), wc see that ^{()) is strictly 
substochastic and therefore p- (0) <r 1. Again, convexity implies that 
P- must be increasing at A. □ 

Remark. The convexity of the I unci ions p-,(), A > (), indicates 
that the existence ot A- solving (2) is typical, at least if the deficit- 
stale holding-time distributions have exponential tails. Indeed, the 
only alternative then to p- (A,) - 1 is the existence of a 0 > 0 for 
which p- (0- ) 1 and />-, (0+) = x . Such cases must be considered 

exceptional; see [2| for a treatment of this case. The second part of 
(2) makes the new drift finite. 

We can now evaluate the distribution of M from any initial slate. 
Lei V, = inf {n > 0 : S, M > .r} be the index of the first jump epoch 
at which S exceeds .r. (In a simplifying abuse of notation, we write 
S\, and 7\ f lor .S r% and .7 r% , and we write J\ t - for the state 
of the machine when S first exceeds j .) Under the original (negative- 
drift) measure, X, may be infinite, but under the new (positive-drift) 
measure. X, is almost surely finite for all .r > 0. By Lemma 4 

P,(M > .r) = P,( Y. < x) 

- E;[oxi>(-Au S\ r )h(j)/h(,J\ t )\ 

= « 1>{J )Ej<‘vp(-A 0 [5\ J - .»■])//» (.7 v r )]. 


To evaluate the expectation on the right, we condition on J\ T ^, 
which is necessarily a deficit stale. Given J\ T ~ = k\ the overshoot 
S\, - a and the next state are independent. Moreover, F* 
exponential implies F* exponential, so the overshoot is exponentially 
distributed with mean rum*, where 


an ~ 


my r 

HI') L 


rr V "' Fj ( d.r ) 


an 

1 - AdTa an 


(13) 


is the mean of Fa. With .7o = k\ S Tl also has the exponential 
distribution Fa . Thus, we have shown that 


F,(3/ >,r) 

= A, - *fr)g*[«c|»(-AnS ri )]£ t [l/A(.A, t )]«-*<" 



= *) 


_ J. _ 

T+AofniTi* 


mr'(k),- Xn '. 


(14) 
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From (13) we find that 1/(1 + Aounu) = 1t Aorfcuu. So, setting 
Gi = = *')(! “ X Q rknn)R^ 1 (k) 

J A 


Lemma 6: Ej[<? lT *] = 6 7 r'“ A ’ y,, i where 
t>j =h(j)Y,PAJN'- = m + h-n\,)nn)Rh-\k). 

k 


we find from ( 11 ) and (14) that P(Y „ c > .r) = Cac~ x ° J t for all 
<r > 0 . Since V(s) = E[/(- - V«u)], this completes the proof of 
part (i) of Theorem 1 . 

The only unknown terms in our expression for Co are the proba¬ 
bilities Pj( Jn,_ = k). If there is just one deficit state—call it state 
1 —then 

Co = ^2^jh(j)(l - \ 0 rimi)Rfr } (l). (15) 

j 

Each of the terms appearing in this expression is easily computed 
through matrix calculations if the number of machine states is not too 
large—not more than 30, say. Choosing h so that nh = 1 eliminates 
the first factor on the right in (15). 

111. Discounted-Cost Analysis 
We now turn to the evaluation of Vp (c), 7 > 0. As in the previous 
section, we take : to be fixed and suppress it as an argument. Let 
£ be an exponentially distributed random variable with mean 1 / 7 , 
independent of everything else. Then 

-)V: = ^e, f •~' l f(x,)dt = z,[f(.x,)}. 

J 0 

Thus, to evaluate V, 1 , it suffices to find the distribution of the 
inventory level at the random time L. As in the previous section, 
we work with the process V, = r - X t rather than the inventory 
level itself, and now seek to evaluate 

lV? = E,[/(;-»,)]. (16) 

From (9) we get 

PAY, >-r) = ^Yl <?/=') 

J 

= — y ' < L,J, = 1 ) (17) 

JT, 

I 

where = inf{f >0:5/ > .r} is the first time S reaches level 
,i\ Letting V = £ - T,, we get 


Proof: Because TV < tn t , a.s., and because the evolution of 
St is deterministic between jumps, 7V is measurable with respect 
to 0 <n< 7 v a }: so, we may evaluate the ex¬ 

pectation of rxp(— 7 Tj) by applying a measure transformation to 
{{S Tn .Jr n ),0 < n < Nj] as follows 

E A<-' U ] = h(j)/h(J Nl )] 

= r" h(j )E J [P ( ' , "' V ~ ><r "“ )] (19) 

where we have written r \ J ,_ tor r i N (the prevailing net rate when 
5 first crosses ,r) and used the fact that (5% a -.r) = r\ x -( r\ T -7,), 
a.s. Now we condition on .7 a x -, which is necessarily a deficit state. 
The corresponding holding-time distribution is therefore exponential; 
and given J 1 \ lt , the next state J\ t is independent of r \ r - T,. Thus 

■El[l /h(J T , )] 

-A) 

k 

mr l (h) 

1 - (7 - n A- )nu ' 

= r ' '/.(j)^/’ < (./v l = *) 

A 

x (1 + (7 — r*A- )/?// l (A ), 

= 6, ’ 


where we have used 


Rh{h) [' ( A Jj r / , 

MA’) ./o 


nn 

1 + (1 - A -,ik)"n 


□ 


If we set 


P,(Tr <L.Ji = 1) — P /(Tj < L)Pj(Ji = /| T <£) 

= E,[< j "' )7j ]Pj(.7r,+/' = ' I r, < fc). 





»*r|T. <L)PUJ, =-) 


( 20 ) 


Given {T, < L}, £' is exponentially distributed with mean 1 / 7 . 
Thus, we get 

PAT , < L.Jf = 1 ) = = A-1 37. < L) 

k 

x = >)■ 

We conclude from (17) that 

PAY, >-r)=^5>, E >- 17 '] 

J 

x = * I t j < L)P k (J, = i). (18) 

k 

We evaluate E, [r “ 713 ] by adapting a change-of-measure argument 
due to Glynn (personal communication) and Kollman (11 ]. Let -5 >0 
be fixed and let h denote a strictly positive Perron-Frobenius right- 
eigenvector for $ 7 ( Xf ), so that $ 7 (A-, )/* = h. Via Lemma 4, define 
new holding-time distributions as specified by (12). Throughout this 
section P and E refer to probability and expectation based on these 
distributions. Let N, = inf{« > 0 ; S Til > t], as before. 


then from (18) and Lemma 6, we find lhat P,(Yi > j*) = C\ ,t ’ ^*. 
Using this in (16) concludes the proof of part 11 ) ot Theorem 1. 

As in the average-cost setting, a more explicit expression for C 1 , 
is available if we assume that only state 1 is a deficit state. With this 
assumption, Ji\ x - = 1, a.s., and the sums over k in (18) and (20) 
collapse, resulting in 

< 0 ,. = + (7 - ) 

x mr l (\)PAJ, =/). (2D 

To conclude the evaluation of this constant, we need to find g, {j) = 
P}(Jl = 0 at j = 1. For 7 > 0, let be the diagonal matrix 
with entries 

IM'.') = l-5>?,(0). 

Let e, be the ?i-dimensional vector (0,■ • •,0,1,0, • ■ ■,0)' with ilh 
component equal to one. 
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Lemma 7 <j, (7 - 4> (())) l £V, 

Paw/ Notice that 

= =/)= )E, "'lj,, ,>r/f 

The result now follows by combining Proposition 2 20 and (5 14) o( 
Chapter 10 oi Cinlar |71, noting that the Q, defined in his (2 10) 
coincides with our (0) □ 

Remat h 

l) As * l 0, Pj( 7/ ~ /) —♦ t lor all / ) and we expeci that 
X-, —*► \o Thus C —* Co lor all /, and (T) and (4) are 
consistent as - decreases to zero 
n) Under the discounted cost criterion the optimal hedging point 
depends on the initial state, denote by the optimal hedging 
point stalling in machine stale / An alternative policy controls 
the system to hedging point , whenever the machine state is 
i Such a policy is guaranteed to result in lower cost than fol 
lowing a single ^ The optimal policy of Akella and Kumar 
(IJ in a two state model is precisely ol this form, however 
since m then model the hedging point toi the breakdown slate 
is always greater than that lot the functional stale the foimer 
plays no role, and the policy is indistinguishable Irom ont 
with a single hedging point See Kimcmia and Geishwm 110| 
Malhamd 112| and Shanlma [ 16) lor moie on state dependent 
hedging points 

IV AsYMProncs Kir Cif nfrm Dmnr Inurwls 
Wl now drop the requuement that Ihe holding time distributions loi 
deficit stales be exponential at the expense of obtaining asymptotics 
rather than exact results I or the asvrnpiolics we restrict attention to 
the lineai cost structure in (5) and consider optimal hedging points 
as ( —* x lhe case ol large penalties toi unfilled demands is ol 

practical as well as theoretical intercsl 

For the lollowing recall lhal a distnhution is nonlatlicc if it is noi 
concenlialcd on any set ol the form { - 2A -r s 0 < s 2 s } 

1 ht oh m 2 Suppose that T is nonlaltiu whenever / >0 

i) It ]jT T 1 < H and if V) > 0 solves (2/ at — 0 then there 

is a constant C o t (0 1) such lhal as < —* \ 

('“! r ,_> ) + " (11 (22) 

ii) Suppose theic is a unique deficit slate For any ^ ^ U 

\ > 0 solves (2) then th^rc die constants C / - l 

such that as / —» x 

1 <M —- 41 >(i > (2?) 

' 4 - 

Proof The argument used for Ihcorem 1 still applies, except 
that S \“ i and r\ - l no longer have exponential distributions 
In the average cost setting the argument leading to (IS) now shows 
that 

/»(K -> 0 = 52^*'/)r,[ ’/M/n )]« (24) 

Set /ij = S\ J - i and T, = f\ and consider the process {(/3 
r ( ) i > 0}, with i playing the role of time parameter This 
process is regenerative, the icgeneiation points are those i tor which 
6\, = i and /\ 3 = / for some fixed / It follows (undei our 
nonlatticc condition) that this process has a limiting distribution not 
depending on the initial state, let (/K ) have that distubution 

Then, by the very definition ol convergence in distribution, the limit 

lim F,[r ‘ ] /hUs,)] = E[* ' ,,H '//.(r,)] 


exists We have thus shown that with 

r„ = £>,/.(, je[, VoH v/»(r^)] 


we have 

/*(K>i)~C h . v °‘ (25i 

the symbol ~ indicating lhal the ratio of the two expressions 
conveiges to one as i x see also |2) So long as the distribution 
of 1 ^ is continuous on (0, x ), straightforward minimization of 
M -) = m f [( l[(K- ) H ] shows that the opumal 

hedging point satisfies 

(26) 

The lequired continuity follows Irom the nonlatticc assumption But 
then (25) implies that 



which implies (22) The pi oof of (2^) is similar but starls from the 
icpiesentation 

POi > ')= 1 Y,r,h{Oiy ' ' ,<r ' 7 IHJ\ )] 

l 

x P,(h - / | 7 W ), ' 

When state I is the only deficit stale, this reduces to 

/mi, ^ <)= -T-,i,(i)nh '(in,!, 11 x 

7T ' • • 

X /*,</,- ,), ' (27) 

much as in Section 111 We claim that - A i, £ 0 Otherwise, 
all iow sums ol <I> ( ) would be strictly less than one and since 

the maximum row sum is an upper bound on the spectral radius 
wc would arrive at the contradiction p ( \ ) < l By essentially the 
same regenerative argument used above foi ( o wc may define 

( -= '(1) 

7 T 

I 

x r,( i, -,) i,m e,[i' v ,Mr '' 11 

and (.(include that /’,(),> i) ~ C , < v The resl of the proof 
is the same as part i) □ 

II is possible that part n) of this result extends to multiple deficit 
states Such an extension would require conveigence ol P,( 7\ — * | 
r I) as / x and a guarantee that - - V-, i <0 whenever 
; "> 0 


V CONtHIDING Rl MARKS 


i) Foi simplicity, we have assumed throughout that the lime spent 
in a machine state is independent of the next state visited One 
can easily imagine settings in which it would be desirable to 
relax this requirement For example, a quick repair might restore 
the machine to operation soonei than a thorough repair, but 
the lesulling machine states might differ in then susceptibility 
to further failure To incorporate such phenomena, one could 
define holding-time distnbutions r n corresponding to current- 
state i and next-state j The matrix (^) is defined just as in 
(1) hut with F, replaced by F n The analysis goes through as 
befoie, except that in expressions like (14) 


1 

1 4- Aourn* 







becomes 

^ RLik~ l {i)/i 1. + AtiffclWir,.) 

; with miu themean offY». v 

ii) (t also seems possible to carry out an extension in which 
the hedging point changes with the machine state, with one 
important modification. Let Z/ be the hedging point at time 
a function of J t . it is possible that upon a change in 
the machine state the inventory level A'/ exceeds the new 
hedging point Zt. Under the usual operation of the system, 
A\ would then decrease linearly at the rate of demand until 
it reaches Z t . Suppose we modify the system so that X t is 
instantaneously reduced to Z t whenever a change in machine 
state results in A", > Z t \ physically, this corresponds to 
discarding excess inventory rather than waiting to sell it. With 
this modification, the process I# = Z ( -X f becomes the image 
under the reflection mapping of the free process Z ( - St, and 
its distribution can be analyzed through time reversal. 
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A Globally Optimal Minimax Solution for 
Spectral Overbounding and Factorization 

Robert E. Scheid and David S. Bayard 

Abstract —In this paper, an algorithm is introduced to And a minimum 
phase transfer function of specified order whose magnitude “tightly” 
overhounds a specified real-valued nonparametric function of frequency. 
This method has direct application to transforming nonparametric uncer¬ 
tainty bounds (available from system identification experiments and/or 
plant modeling) into parametric representations required for modern 
robust control design software (i.e., a minimum-phase transfer function 
multiplied by a norm-bounded perturbation). 

I. Introduction 

Assume that a discrete-time plant P(~~ l ) is estimated as P ( 1 ), 
and let L denote the uncertainty in the estimate. For example, 
three common characterizations of plant uncertainty are L -additive 
uncertainty, Lr -input multiplicative uncertainty, and Lo -output mul¬ 
tiplicative uncertainty, where [17, p. 224] 

La = P - P 

L, = P~ I [P-P) (I) 

Lo = (P - P)P" 1 . 

Note that multiplicative representations require a square plant. Let L 
denote any one of the above three quantities. Suppose, a nonpara¬ 
metric overbound ((*•) on I, is known such that 

({j > <r{L(e~ J '*‘‘ 1 )) for all ^ 6 [0. n/T] (2) 

where T is the sampling period and rr(L) is the maximum singular 
value of L. Various methods are available to find fU ) from raw data 
(cf., |3J, [11], 113], 116]). / (-j), however, is a nonparametric function 
of frequency and cannot be used directly in modern robust control 
software packages such as the Matlab Robust Control Toolbox |7] and 
p synthesis software |2|. Instead, the uncertainty must be represented 
as a minimum phase transfer function matrix WU“ 1 ) of a specified 
order such that 

Lit ) = AW(i (3) 

where A is norm-bounded, i.e., 

IIAIU < 1- 

The choice of W in (3) can be structured or unstructured. For present 
purposes, the simplest choice is to use a scalar matrix representation 

W = W ■ / (4) 

where W is a single-input single-output rational function. 

Manuscript received December 21, 1993.. This work was supported in pari 
by the Jet Propulsion Laboratory, California Institute of Technology, under 
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To incorporate the uncertainty bound (3) mio a robuM control 
design, a systematic method for determining the weighting U in 
(4) is needed Ideally, any approach to find U should satisfy the 
following prpperties 

PI) II must overbound the uncertainly / F ie 

|H )| > (U) tor all u. £ [0 k/T] 

to ensure the existence of some ||A|| v 1 satisfying (3) 
P2) II should be as tight an overbound as possible to avoid 
conservatism in the final robust control design 
P3) W should be ol specified ordei (in fait as low order as 
possible) since it will be mcorpoiated as a weighting and 
increase the final controller order 
P4) II should be stable and minimum phase 
These requuements on ll rule out using several powerful methods 
from the complex analysis hteiatuic f or example interpolation meih 
ods such as Nevanlmna-Pick theoiy for finding rational mtcrpolanls 
to complex valued data [201 do nol addicss the real valued data 
case and do not satisfy the above piopcrtiex For ihe same reasons 
the extension of interpolator methods to the noisy data case (cf 
[13] [141 M2| (6| 1181 etc ) aic nol diiectlv applicable lo the 
present problem L ven if mtcrpolatory methods could be appropriately 
modified to overbound teal valued data sets (i e PI)) arid do so in 
a minimax optimal sense (i c P2)) the final rational fits tend to be 
ol very high ordei (eg on the oidci of the numbei of data points) 
and would not satisfy properties P3) and P4) in geneial 
In this papei a mathematical piogrammmg approach is used to find 
a II which satisfies all ol the piopcrties PI) P4) Ihe main idea is 
lo leloimulate the problem so as to fmd a spectrally lacton/ablc 
rational lunction II Ml whose magnitude tightly ovcibounds the 
squared data t U) This is done in Section II by posing a minima* 
nonlincui optimization pioblem lo ensure tightness of fit of a lational 
(unction having specified ordei wiih side constraints to cnsuie that 
the data is oveibounded and that ihe solution admits a spectral 
lacton/jtion A key iesult m Section III is that the nonlinear 
optimization problem can be solved by a sequence ol icweighted 
constrained lincai problems In particular a globally convergent linear 
programming spectral ovci hounding and tactouzation (l PSOf) algo 
lithm is piesenkd based on solving a sequence of lineai programming 
pioblems [P>] Hie I PSOl algorithm piovidts a globally optimal 
solution lo the nonlinear pioblem 

11 PRUBItM FoHMIIAIION 

In this section a nonlinear constrained optimization is posed to 
compute a minimum phase transier function 11 ol order m such 
that |U | is a tight ovci bound on /U) for all ^ With this iesult 
the uncertainty can be wnlten in standard foim I =■ All where 
||A|| x < 1 Forming the quantity 11 ( )II ( 1 ) and evaluating on 

the umf circle gives an expression of the form 

11 Ml = l{ " (S) 

where 

1(+) = lo + It fos(^T) + + 1, < 0 S(nial) (6a) 

o(vt) = 1-F oi <os(vu7”) 4* +iii ros(maT) (6b) 

It is noted that n(*) is defined as monic without loss of generality 

A Const)amt\ for Os e) bounding 
The requirement that |IT | be an overbound on fU») is equivalent 
to the requirement that |H I"* is an overbound on ( J and tan be 


exptessed as 

>(\l) for all u. € |0 zr/7 J (7) 

B C unstnunts foi light 0\ a bound) m* 

The requirement that |H | 1 be a tight’ overbound can be expressed 
as 

linn b (8) 

where 

*■ tUT! ‘ ru) ) q1 u ’ j toral1 " € 10 k/t ^ (9) 

Here the criterion minimizes a worst case ciror b which is frequency 
weighted by the quantity #/ ! U) 

C C oust)amts fo) Spu tral / ai toiizabihn 

The requirement lhai the overhound 1/n admits a spectral tactor- 
i/ation can be satisfied by ensunng that (Astrom [![) 

f(t)/n(a)>0 for all - € [0 t/ 7 ] (10a) 

nU) 0 for all a F [0 n/l] (10b) 

Nolt that condition (H)a) is implied bv (7) and condition (10b) can 
be cnfoiccd explicitly by the constiaint 

^ U tot all * e [0 t/ 7] (Ha) 

tor some small n I or technical leasons it will be convenient to 
enforce a similar constraint on i as 

i( ) * f n o Ioi all ^ € [0 ~/7] (I lb) 

for sonic small 1_ 

In summary it is desued to solve the optimization problem (8), 
(9) loi o f subject to constraints (7) and (10a) (l()b) 

III Tnr LPSOF Alc.oriihm 

In this section the IP spectral oveibounding and lactorization 
(LPSOF) algorithm is introduced which sohes the constrained non¬ 
linear optimization problem ol Section 11 on grid ol points V = 

{a i x [ Modifications to extend these results to all u- € 
[0 ~/T\ will also be discussed 

The constiaincd optimization problem restricted to points of the 
sel \ can be wntlen as 

mm b (12) 

subjecl to 

)- rt- )n(w ) >0 (Ha) 

iU f - ( U )"U ) < N/U )t*(w ) (13b) 

IU ) >J_ nU.) > n (13c) 

foi all i = 1, , n 

where n( „) and fU) are defined by (6a), (6b) A key observation 
from (12), (13) is that for fixed b the optimization over a, I is 
simply a linear piogrammmg problem to find a feasible solution 
for the coefficients n 1, Hence, the joint optimization problem 
can be solved by a nested search procedure where an outei loop 
systematically decreases <*>, while an inner loop finds feasible solutions 
in the variables n and 1 for fixed b The procedure terminates 
when the smallest ^ is found, which admits a feasible solution This 
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approach is denoted as the LP spectral overbounding and factorization 
(LPSOF) algorithm. 

To solve problems (12) and (13), one must begin with upper and 
lower bounds for the optimal value />. For example, one can choose 
the low,er bound b- = 0 and let the upper bound />+ be derive^ 
from some starting feasible suboptimal solution (an obvious choice 
is <i = 1, ji = max*,, f 2 (*•,)). Then h = (/»+ + £-)/2 becomes 
an updated value for 6+ or depending on whether or not the 
inequalities (13) can be satisfied for b = h (i.e., the bisection method 
(8|). In this way, the LPSOF algorithm converges to the optimal value 
of A geometrically (i.e., as a power of 1/2). 

This process can be further accelerated by effectively linearizing 
about the candidate value b. Thus, given K one can solve the 
linearized problem 

max u (14) 

"•'V'Lf 

subject to 

JU',)-f 2 U’ f )o(u',)>0 (15a) 

f a (^, )nU’ ( ) - < -</ (15b) 

d(^,)> i a (15c) 

for all -j,,/ = 1. * • •. n. 

Then b provides an update for b or A+ according to whether the 
solution if is negative or nonnegative. In the latter case a sharper 
a-posteriori estimate for i*+ is derived via 

^ + = 1 ^{(^"' 2u ' , ) / ' u) } (,6) 

where oU 1 ) and fU') are the solutions derived from (14). (15). Il is 
also worth noting that setting b — 0 and solving (14M16) provides 
an excellent starting value for b H to initialize the algorithm. 

Remark 1. It is noted that the weighting q may be chosen as 
certain functions of the unknown polynomials n and i without 

violating the linear fdrm of the constraints. Generally q U’) may be 
taken in the form 

f/U 1 ) = + (17) 

where qu (-j) and rjiU>) ure positive for jjfA and specified before 
hand. 

Remark 2: By the fundamental properties of linear programming 
[9], [10], the LPSOF algorithm is globally convergent and achieves 
a globally optimal solution to the discrete problem (12), (13). An 
excellent algorithm for spectrally factorizing polynomials 1 and o 
without solving for roots can be found in Kuccra [21]. 

Remark 3: For each fixed value of b the semi* infinite linear 
programming problem (7), (9), and (11) (i.e., for continuous-valued 
u> £ [0. tt/T]) can be solved as a sequence of discretized linear pro¬ 
gramming problems of the form (13) in the limit as the mesh becomes 
sufficiently line (cf. [15, Section 7.21). Thus, one can recover the 
solution to the semi-infinite spectral overbounding and factorization 
problem (7), (8), (9), and (11) using the LPSOF approach, by solving 
a sequence of linear programs with decreasing mesh spacing for each 
fixed value of b. Decreasing the mesh spacing, however, may not be 
de&irable in practice, and alternative methods for approximating the 
solution to the semi-infinite problem will be presented in Section IV. 

IV, Modifications of the LPSOF Algorithm 

Strictly speaking, the LPSOF algorithm only enforces inequalities 
(7), (10a), (10b) at the points in the grid A. Hence, the inequalities 
may be violated in between grid points, and the solution may not 


be a true overbound, and/or may nor admit a spectral factor. If this 
happens in practice, the simplest solution, generally, is to choose a 
denser grid (see Remark 3) and/or increase lower bounds o, ij_ in 
(13c). There may be certain cases, however, where these approaches 
are not desirable. Hence, in this section, systematic modifications of 
the LPSOF algorithm are presented to overcome this problem. 

For convenience to subsequent discussion, we make the following 
assumptions. 

Assumption /: Let ( 2 U ) be a linear spline inierpolant to the 
points I ~(*>,) defined on the grid A = , ■ ■ ■, } with piecewise 

linear segments having maximum slope n, and maximum grid size 
h = max, 1 — a.’,}. 

Assumption 2: The set of vectors {[cos(A-^i 7’), - ■ •, 

< os{ku,'„T)] 6 ft", A* = (),■■•, m } are linearly independent. 

Modifications of the LPSOF algorithm to ensure proper behavior 
between grid points, fall into two categories, a-priori and a-posteriori. 
These methods will be discussed separately below. 

A. A-Priori Modifications 

The basic idea behind the a-priori modifications is to enforce 
additional linear constraints in the LPSOF algorithm so that the 
derivatives n'(-j) =• ^(n(u,’)), ( *U’)) and ( i/n)' = 

~ ( V'O are suitably hounded for all € [0. tt/ 7’]. This clearly 
restricts the excursions of o. i, and f/n in between grid points, so 
that under Assumption 1, and specification of lower bounds n. t in 
(13c), the desired inequalities (7). (10a), (10b) can be satisfied. 

Some useful definitions are in order: A function .r(o,') defined on 
the interval € il is said to be uniformly bounded from above il 
|.rU')| < C < v lor all a. 1 € IL Here, the quantity ( ' is denoted as 
the uniform upper bound. Similarly, the function r(^) is said to be 
uniformly bounded from below if |.i U’)| > c > 0 for all 6 il. The 
quantity e is denoted as the uniform lower bound. 

A method to uniformly bound the aforementioned derivatives 
is now- introduced. For some positive m.A.A',,, and A’i, let the 
following linear constraints be imposed on the grid A 

n(a' f )<n (18a) 

iU,) < /w>U’,) (1 Hh) 

VU',)| < /i„|n(a',)| (18c) 

J'U)| < K,\ 1U',)|. (18d) 

If linear constraints (18a)- (18d) arc used to augment linear constraints 
(l3aMI3c) of the LPSOF algorithm, it can be shown that 

n < ) < n (19a) 

2 < *U) < An (19b) 

|n / (^', )| < A,,a (19c) 

\.fU)\ < A\A'n (19d) 

,l <K{K,, + K„) ( 1 %) 

where (19e) follows from 



Inequalities (!9aMI9e) imply that n. L J/r* and their derivatives 
are bounded on the grid A. Under Assumption 2, these bounds 
at the grid points impose bounds on the coefficients (this 

is because the matrix of trigonometric functions which determines 
these - coefficients has a bounded inverse, (cf. [8|)), This implies 
that M(ai)|'|o'(uj)| and |J'U')| are uniformly bounded from 

above since they are bounded functions of the bounded coefficients 
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If the uniform upper bounds on |o'(,j)| and |/j'U')| are 
sufficiently small, constraints (13c) on the grid points imply the 
existence of uniform (nonzero) lower bounds on |n(yt»)|, | J(a/)|, 
Hence, by systematically decreasing the values of A j and A,, 
(with the other constraints fixed), one can make |n'(-L>)| and |/J'U')| 
arbitrarily small, and there will always be some point at which 
inequalities (10a), (10b) are satisfied uniformly in jj. Given that (10a), 
(10b) is satisfied in this manner, it follows from (20) that |(<i/ f| 1 , | 
is bounded uniformly from above. Using (13a) and Assumption J, 
it follows that (7) is satisfied when this uniform upper bound on 
|M/rt) f | fttIH below the value of a* defined in Assumption 1. 

In summary, by augmenting the linear constraints (13a)—(13c) of 
the LPSOF algorithm by linear constraints (18a)-(1Kd), and solving 
a sequence of problems where A’ y and A\, are systematically 
decreased, there will be a point at which (7) (l()a), (10b) are satisfied, 
ensuring that ,ift\ is an overbound on f 2 for all ^ E [0, n/T] and 
spectrally factorizable. 

B. A-Posteriori Modi fit ations 

The basic idea behind a-posterion modifications is to slightly 
perturb the unmodified LPSOF solution so that inequalities (7), (Ida), 
(l()b) are satisfied uniformly in -c. For grid A sufficiently fine, a small 
perturbation can always be found which does the job. To see this, note 
that the solution to problems (12), (13) will always satisfy bounds ol 
the form (10), if the quantities n.A. A,,, and A y are compulcd 
thpostciioii . As noted in the previous discussion, Assumption 2 
guarantees corresponding bounds for the coefficients which 

in turn impose uniform upper bounds on |nU)|.| fU*)|. |n'(-j)| and 
| f'( j‘)|. Then, for a sufficiently small grid si/e /». uniform constraints 
(7), (10a), (l()b) can be satisfied by means of an (){h) perturbation 
of oU>). f(a’), and <\ 

The construction above implicitly assumes that o.A, A... and 
h, are reasonably sized. If not, these quantities can he explicitly 
constrained a-pnon. as done earlier. 

V. NUMLRICAL liXAMPIF 

In this section, the LPSOF algorithm is used lo determine spectrally 
factorizable overhounds on additive uncertainty estimates obtained 
from large space structure identification experiments (5|. Raw ad¬ 
ditive uncertainty data f"U\) adapted from |5| is shown in Fig. 1, 
depicted by the symbol on the grid A — {u:, *=. m/{ 12ST). i ~ 
1. ■ • ■. 12b}, where T ~ .03 seconds. In addition to the raw dala set, 
the envelope ( 2 is depicted in Fig. 1. The envelope f 2 is a smoothed 
nonparametric overbound on the raw data ( 2 . 

In all subsequent examples, the LPSOF algorithm of Section 111 
is used in sequential linearized form (14), (15), with underbounds 
u = J = 0 and frequency weighting q U») - <y fl 4- <yi 
where q\ = 1 and <jo remains to be specified (see (17)). The dual 
rather than primal form of the underlying LP problems [9|, [101 
is implemented lo considerably reduce the number of constraint 
equations. In all runs, 10 to 15 iterations were sufficient to ensure 
convergence of h to six significant digits. 

A . Overhounding Raw Data 

In this section, the raw dala i 2 of Fig. 1 is overbounded using the 
LPSOF algorithm of Section 111. 

The first set of runs is generated by fixing m = 4 and q\ = 1 
and varying weighting factor qo as qa =- 0, .01, . 1. 1. The results 
are summarized in Fig. 2, where the raw data ( 2 is depicted by the 
symbol \ It is seen that the overbound associated with </n = 0 does 
a reasonably good job of overbounding the data, but tends to sucrifiec 
some accuracy in the peaks for accuracy in the troughs (e.g., there 



-40 

101 10° I0‘ 

Frequency -1 U 

Fig. I Raw additive uncertainty data t d (denoted by '*') and envelope data 
(- (denoted by ‘ ’) defined on 128 point grid. 



Fig 2 Spectrally lactori/able ovcibounds l/n on law data ( J (denoted by 
**’) using unmodified LPSOF algorithm in = i = l). ryj s l, nt — 4) 
obtained by vaiying weighting tactor r/n 0, .01, 1,1 

is about an 8 db overshoot of the main peak). Methods to improve 
the results become apparent. 

Method I Decmphasize the weighting of trough data by increas¬ 
ing qu. 

Method 2 Reduce oscillatory behavior of the data by overbound- 
tng the envelope ( 2 rather than raw data 

Method 1 motivates the remaining runs of Fig. 2 which are 
generated by increasing ryo. (Method 2 motivates using envelope data 
rather than raw data, which will be discussed later.) It is seen from 
Fig. 2. that increasing qo from 0 lo .01 improves the accuracy of the 
overbound by several db in the vicinity of the peaks at the cost of 
accuracy in the troughs. Increasing <yu further to .1 and I continues 
this trend Hence, the user can choose from the family of curves in 
Fig. 2, to trade-off accuracy in the peaks for accuracy in the troughs. 

The second set of runs is generated by fixing q\ =1, qu = 1 and 
varying the order w as w =2, 4, 6. The results are summarized in 
Fig. 3. It is seen that the successive overbounds improve uniformly 
as the order is increased. In particular, the peaks are tilted reasonably 
well by bounds of alt orders (this is a consequence of using qo = 1 
for all runs) while most of the improvement from increasing order 
comes from fitting the troughs. 
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Fig 3 Spectrally factonzable overbounds i/n on raw data I 2 (denoted by 
'*’) using unmodified LPSOF algorithm (« = = 0, qu = 1, q\ = 1) 

obtained by varying bound older m = 2, 4, 6 _ 



Fig 4 Spectrally factonzable overbounds i/a on envelope data P (denoted 
by '*') using unmodified LPSOF algonthm (a = _f = 0 <jq ~ 0, q\ — 1) 
obtained by varying bound order in = 2, 4. 6 


B Over hounding En\ elope Data 

In this study (in accordance with Method 2 outlined above), 
oscillatory data is avoided by overbounding the envelope data P 
rather than raw data (see Fig 1). 

The runs of Fig 4 are generated by fixing r/o — 0 , q i = 1 and 
varying the order m as w = 2 , -LG. The envelope data P is depicted 
by the symbol ’ It is seen that the overbounds for w = 4 and 
n/ = 6 are excellent, and in fact the latter nearly interpolates the 
data. 


VL Conclusion 

A systematic method, denoted as the LPSOF algonthm, has been 
developed for finding a minimum-phase transfer function of specified 
order whose magnitude “tightly” overbounds a specified nonpara- 
metric real-valued function of frequency. The main idea is to find a 
spectrally factonzable rational function which tightly overbounds the 
data “squared/’ This leads to a nonlinear constrained optimization 
problem which can be solved by a sequence of linear programming 
problems. 


The original motivation behind the development of the LPSOF 
algonthm was to systematically replace the graphical overbounding 
method used in [4] for determining robust control weightings. The 
algonthm is, however, generally useful for determining spectral 
factors from raw PSD data and can be useful in such applications as 
deconvolution, disturbance identification, blind channel equalization, 
and estimation of noise colonng filters. 
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A Lower Bound for Limiting Time Delay for 
Closed-Loop Stability of an Arbitrary SISO Plant 

R. Devanathan 

Abstract —This correspondence is concerned with the time delay margin 
Tor closed-loop stability of a single input, single output plant with time de¬ 
lay uncertainty. Since no restriction is placed on the stabilizing controller, 
the limiting value of time delay depends on the plant transfer function 
only. The approach, using a modified form of classical Ncvanlinna-Pick 
interpolation theory, provides a lower hound for the limiting time delay 
given the plant open-loop poles and zeros only. 

Notation 

C = {Complex planef 
A — {* E C.Rc(.s) > oy 
A = {.s e C.Ro(s) > 0} 

.4 = iu{\| 

I. Introduction 

The closed-loop stability of single-input single-output (SISO) sys¬ 
tems with time delay uncertainly is considered in this paper. Walton 
and Marshall [IJ have given a direct method for stability analysis of 
time delay systems. Consider the closed-loop system of Fig. 1. Let 
the process transfer function be given by 

7'(*) - pn(.s)exp(- 7,/.s), T ({ > (I (1.1) 

where ^o(s) corresponds to the nominal transfer junction and I,i 
corresponds to the lime delay uncertainty. Walton and Marshall 
[IJ consider the following problem. Given ;>o(.s) and th>* controller 
transfer function r (*), what is the limiting (minimum) value of lime 
delay l,i below which the closed-loop system of Fig. 1 is stable? 
F.l-Sakkary [2J has given a partial solution to this problem. 

Suppose we now ask: Consider the closed-loop system of l ; ig. 1 
where the plant j>{ s ) can be any transfer function of the form of (1.1). 
What is the largest value of the limiting (minimum) time delay, call u 
T,i , llrlv , below which there exists a controller stabilising the closed- 
loop system ol Fig. I? This is known as the optimum time delay 
margin problem. In this paper, we artempt to characterize a lower 
bound for T,/ 

II. Background 

Let the nominal real rational transfer function j> n(.s) correspond 
to m finite zeros, — 1,2, -ni, multiplicities included, and // 
poles, j — 1.2. • »/, multiplicities included, in the right half 
of the complex plane including the imaginary axis., i.e., in ,4. The 
problem is well posed 131 when pu (*) is strictly proper. 

Let the zero at infinity of 7 >u(.s) be denoted by cj^ , of multiplicity 
(n - m). Let E{s) be such that 

E(s) = ]H)(.s) r-( Sf )/{l +/>o(-s) '■( *)}• (2.1) 

Given E( .s), (2.1) can be used to lind r( .s). The closed-loop system of 
Fig. 1 corresponding to the nominal system is stable if and only if 14] 

i) E{s) is analytic in A 

Manuscript received June 2, 1993; revised June 14. 1994. 

Th t . author is wiih the School of Electrical and Electronic Engineering, 
Nanyung Technological University, Nanyang Avenue. Singapore 2263. 
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Fig. I, Closed-loop system. 


ii) ZT(fy,) = 0. / = L 2. ■ ■ ■ m, multiplicities included 

iii) E((j ^ ) ■= 0, multiplicities included 

iv) E{<i t ) =.- ], j =r 1.2. ■ ■ ■ n, multiplicities included. 

(2.2) 

Consider the conformal mapping |5| as shown in Fig. 2. f f is the 
closed unit disk while V corresponds to the open unit disk. G can 
be any simply connected region in fU | containing the points 
0 and I. 

o is a mapping such that 

0\ G - V (2.3) 

and 

c’i(O) = 0. (2.4) 

The conditions ii), iii), and ivl of (2.2) correspond to 

i) E{ W(/r ) ~ 0. / — 1.2. ■ in, multiplicities included 

ii) E{ ) - 0, multiplicities included 

iii) E( . ln ) = d>( 1), j = 1,2. ■ ■ ■ n, multiplicities included 

(2.5) 

where 

E: C —+ E (2.6) 

i •: A -4 T (2.7) 

= C(^). / - 1,2, - m (2.8) 

-i/% = ) (2.9) 

=, r(p,). j = L2.---U. (2.10) 

Following 151, consider the mapping E of (2.6) such that 

fc(u) = o ft/, ( = J, 2. - ■ ■, 2n, n > 0 (2.11) 

where 

h — , (/f . f = 1.2,*■■//!. and / = 1, 2, * ■ ■ w , respectively, 

(2.12) 

w = : ol ( = in + l,"Mi (2.13) 

U — = n + 1, • ■ ■, 2r# and j = 1,2. ■ ■ ■. n, respectively, 

(2.14) 

h = (U = 1,2, -n (2.15) 

b ( = 1, ( =- it + 1, - ■ -. 2n (2.16) 

E exists if and only if 

O < o 1IU ,v (2.17) 

where n lllrtH is an invariant defined on the interpolation data } 

specified as in (2.I2H2.16) above. Further, the mapping E{n) given 
in Fig. 2 (where G is any arbitrary simply connected region) satisfying 
the condition given by (2.2) exists, if and only if 

|0(1)| < <w- (2.18) 


0018-9286/95504,00 © 1995 IEEE 
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Fig. 2 . Conformal mapping relationships. 



Fig. 3. 



//-plane 


III. Time Delay Uncertainty 

Consider the characteristic equation for the system of Fig. 1, viz. 

1 + r(a) rxp( —T',/.s) = 0. s E A. (3.1) 

Using (2.J), (3.1) can be rearranged as 

tf{s) = //{£(*)} = {i - E(*)}/E(s) - -rxi>(-7».* e A. 

(3.2) 

The function // maps the fT-plane into the //-plane. For s = jjj. 
u.' > 0, the right-hand side (RHS) of (3.2) maps the ju>axis in the 
//•plane onto the unit circle centre origin, in the //-plane as shown 

in Fig. 3. 

Notice that 


1 /( 1 ) = 0 

(3.3) 

»/(<)) = 1C. 

(3.4) 


Consider a simply connected region G (containing the points 0 
and 1) in the E'-planc such that its image G' (containing points xj 
and 0, respectively) under the mapping // corresponds to the //-plane 
minus the hatched arc as shown in Fig. 3. Using (3.2), if we define 
the mapping ^ such that 

*{'/(■«)! = 1/{1 4*//(*)} =•£*(*) (3.5) 

then G can be obtained by mapping G* under If G' can now be 
conformally mapped 16] in to the open unit disk l r by the mapping 
0 f such that 

e'(*) = 0 (3.6) 

then the mapping <p\G V of Fig. 2 will have been implemented 
(compare (2,4) with (3.4) and (3.6)) such that 

Q = <!>’ • '/• (3.7) 

Let the mapping 0' be given by the function 

= { >/{<)/ - (?)/<* - </'H «*Xl>(!n) - <’XI>(j->/2)} 

- V)/(v - </')} exp(j >)+ oxp (-j 7 /2)} 



x [«-xi>{j( 7 T - )/2}] (3.8) 

where 


V = '/{£(") t 


7 and <{ in the //-plane are conjugates of each other and 

<1 = -3)1 (3.9) 

1 is the angle as shown in Fig. 3. Also, from (3.3) and (3.7) 

o(l) -= </(()). (3.10) 

The image of G ' in Fig. 3 under the mapping u is as shown in Fig. 
4. The hatched portion in the //-plane is mapped onto the unit circle 
in the o'-plane. The arc 7 <i"q in the //-plane is transformed into the 
arc u'bu in the r/-plane. Also, the origin in the //-plane is mapped 
on to the point v in the (/-plane. Simple calculation shows that 

0(1) = t'/(U) - or - sm(^/2). 0 < ^ < tt (3.11) 

and that 


Lei 


oh = tan(* /l). 


(3.12) 


E 


E ■ / 1 . 


(3.13) 


The function E(s) can be computed from Fig. 2, (3.7) and (3.13) as 
E|.) = £(.>..)- = „, 4| 

[sin('/ 2 )fj£’(A)} J - 1 ] 

and the corresponding controller <•() can be determined using (2 I) as 

]>v(*)UE(*)) - siii(-/ 2 )] 

The following result can now be slated. 

Theorem 1 ■ Consider the conformal mapping of Fig. 2. Let the 
simply connected region G be such that i / maps G on to G f where 
G' corresponds to the //-plane minus the hatched arc on the unit 
circle defined by angle “(0 < ^ < tt) as shown in Fig. 3 such that 
the mapping (t> is defined in terms of ■> through (3.8) and (3.9). The 
mapping E satisfying the condition (2.2), then exists if and only if 


7 < 2 sill '(X.nax). 


(3.16) 


The function E(s) is given by (3.14). 

Proof: From the result of [5], (2.18) follows. Since sin b/2) is 
positive for 0 < 7 < 7r, it then follows from (3.11) and (2.18) that 

sin( 7 / 2 ) <x IMMX , 

Hence (3.16) follows. This ensures that &(*} exists as per |5] based 
on the modified Nevanlinna- Pick theory. The procedure for finding 
E(h) is well known [7|, [ 8 | through the formation of the Fenyves 
array corresponding to the mapping U —► V, E(s) is found from 
(3.13), Hence £(*) is given by (3.14). Hence the result. 
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Substituting for f\ (yj) from (4.11), (4.J4) can be expressed as 


^ > wJ o = 


i 

U -F) 



-(1+A’*) 2 . 


(4.15) 


Wa can now state the following result. 

Theorem 2. Assume that the condition of Theorem 1 is met and 
that initially the nominal system is stable. Then the closed-loop 
system is stable under the time delay uncertainty provided T,i satisfies 
the inequality 


Fig. 5. Detailed conformal mappings. 


T,i < ,/*■ o 


(4.16) 


IV. Time Delay Margin 
Consider the transformation 

c’i(») = r = {*■(* - *)}/(*+ t). 0 < k < 1. <1 > () (4.1) 

such that 

»V..4 - V. (4.2) 

Further consider the transformation 


n 2 (r) = £ - (f - k)/( 1 - vk) 

(4.3) 

such that 


- r. 

(4.4) 

Let 


r = 1 s ■ mi 

(4.5) 

such that 


r:A - 

(4.6) 

The ./^'- axis of the complex plane Q together with the point at 
infinity is mapped by 17 on to the circle (call it cl) centre origin and 
radius /. which in turn is mapped by i/j onto the circle (call it < 2 ) 
on a diameter whose extreme points correspond to { — 2A-/( 1 -f k 1 )) 
and the origin. Notice that 

<’(>-) = = 0 . 

(4.7) 

Referring to Fig. 2, it the zero at infinity of E is of order 
ni' {=11 - m) then so is the zero of F at the origin. Since F is 
analytic in L\ it follows by Schwarz's lemma I^| that 

l|i’(:)|}< l-n'. !-l < 1 - 

(4.8) 

Fig. 5 shows the detailed mapping as discused so far. 
Representing an arbitrary point on the circle cl as 


r = <>j - k ex)i(jft)* 0 < H < 

(4.M) 

Consideration of the mapping 17 will show that 


tan( 0 / 2 ) =• 1 /*l\ 

(4.10) 

Further, considering the mapping < it can be shown that 


i-i -1 - i 1 = /1 (-*■■> 

2k i _ ^ 

/d- j-w + n + frdMC’ 1 ' - ’ 

(4.11) 

where 


-1 = ^’2 (/ ' 1 )■ 

(4.12) 

Define 


i>h)» 

(4.13) 

Let uf be such that 


/iM < f>{ 7)- 

(4.14) 


where ^-o is given by the RHS of (4.15). 

Proof: Following generally the approach due to Walton and Mar¬ 
shall [3], since the control system of Fig. 1 is nominally stable, under 
time delay uncertainty, the closed-loop poles first need to cross the 
imaginary axis in the complex plane before the system becomes 
unstable. The zeros of the charecterislic (3.1) are also necessarily 
solutions of 

Pn(j^) )UdjjL') = 1 , jj > () (4.17) 

since (3.1) is real. The number of -a's (they are all finite) which 
correspond to the solution of (4.17) are finite in number. Let the 
largest of them be denoted by u.\„. 

A necessary condition for the roots of the characteristic (3.1) 
to cross the imaginary axis is when the gain condition of ( 3 . 2 ) 
(equivalent to (3.1)) is satisfied lor s = j *■. The RHS of (3.2), 
when s = jjl' corresponds to points on the unit circle in the //- 
plane. Since the points on the hatched portion of the unit circle are 
left out by design from the //-mapping, solutions to ( 3 . 2 ) can only 
exist corresponding to points on the arc r/r/\/ in the //-plane (see 
Fig. 3). The mapping u maps the arc i/r/V on to the arc u'hu (sec 
Fig. 4) as explained in Section III. The smallest distance from the 
origin to points on the arc u'hu corresponds to the length of the line 
segment ob (see Fig. 4) which is given by (3.12). 

Considering the mapping of the ,/.*'-axis via the transformation 
{F ■ r) (see Fig. 2) onto the open unit disk V and using (4.8) 
through (4.12), it follows that 

- (/.Mr. (4.i8) 

Equation (4.14) then ensures that the mapping of the j^-axis via the 
transformation {E ■ 1 /) on to the unit open disk V (see Fig. 4) falls 
short ol the arc u'hu, from magnitude considerations alone for all 
u. > jju where u ,- 0 is given by the RHS of (4.15). Since the arc u'hu 
is the image under the mapping o of the arc rp/V corresponding 
to feasible ^.'-solutions of (3.2), it follows that (3.2) cannot have a 
solution for ^ > «l'o. It then follows that 

u,'u > uJ ln . 

Since the angle subtended by the circular arc between any point on the 
arc rj (/y and the (- 1 ) point in Fig. 3 at the origin, can be considered 
(see RHS of (3.2) as corresponding to the product {jjTj ), the smallest 
such angle corresponds to the point ry, the angle being 7 as shown 
in Fig. 3. Since no solution to (3.2) exists for > *? 0 , the same 
is true for 

T<t < 7 /ut'n 

and hence the closed-loop stability is assured. Hence the result. 

Remark I: The analysis leading to Theorem 2 is based on the 
conformal mapping 1.7 (s) (see (4.1)) for 0 < k < 1. The strict 
inequality, viz,, k < 1 is necessary to include the case of zeros of 
Pol^), lying on the ,/a>-axis and at infinity, being mapped to zero 
in our analysis. This is due to the well-known result [10] that the 
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M&bm transformation [represented by (4.3) in oar easel can only 
map points on the circumference on the unit circle on to itself. A 
provision to the effect of A’ < 1 is also assumed while considering 
the boundary interpolation case in theorem 1.5 of [51. Moreover, m 
our case, as will be shown in Theorem 3, a lower bound For T t imn\ 
is optimized in a certain sense over A', 0 < A < 1. 

Remark 2: The ratio 7 /u/o represents a lower bound for T f / ro%H 

"/•*''0 ^ Tinu^> 

For a given ,1 and A\ the ratio ( 7 /-J 0 ) is maximized by choosing 
7 as large as possible subject to the constraint of inequality (3.16). 
Now it remains to optimize (maximize) the lower bound, in a certain 
sen&e, by choosing k appropriately. This is taken up next. 

In the sequel. A* is treated as a variable, 0 < A < 1 , and all the 
parameters, such as, o mnx , 7 . «l\) x p and others are all dependent on A. 

Theorem 3 Consider the lower bound, viz., the ratio ( 7 /-J 0 ) tor 
the time delay margin 7j JllJIH as given in Theorem 2. For l) < A’ < 1 . 
a maximum exists for this ratio if 

p(k) < (l/u){2A-/(l + A-’)}, t) < A* < 1,ri > 1 (4.19) 

and this maximum corresponds to the value of A- such that the 
following conditions are satisfied 

(rf- fdk ),.{ I * 2 - (1 + A - 2 )V I + (dp/dl. ) i -A ’ 

-■i-)kp\() +k 2 )/(\- k 2 )}{l- p 2 ). (><* <1 

(4.20) 

(l/^o )(d 2 ^fdk 2 ) > (\/*,){,1^,/dk 2 ) (4.21) 

where 

d^'o/dk 1 - H[h){B(h) + Uk)(di>/dk) 

+ D(k)(dp/dk) 2 + F(k )(d J /i/dk 2 )} (4.22) 

H{b) = [4 If {[I - A- 2 )’/ }][(4A"V) - <1 + A 2 ) 2 ] 1/2 

(4.23) 

U{k) = - 1 2k '(1 - A 2 )V + h * 1 (3 + A 2 )p 2 
+ 3A(1-A- 1 )V 

- (1 + (>A 2 4- 3GA- 4 4- ISA*' 4- 3k' t )p > 


+ (1 + A 2 )‘(1 + 3A- 2 )(>' (4.24) 

L(k) = (1 - A J )[12A'(1 - A 2 ) + ,.(1A'*)( 1 - 3A 2 ) 

- ). 2 (1 + A- 2 ) 2 (l — A- J )(3A' 2 ) 

4- ,.'(! +*•-')*( 1 +7A-')} (4.25) 

D(k)= -(1 — A- 2 ) 2 ( 1A’’) (4.26) 

F(A) = -{4A- 2 -/) 2 (1 +A 2 ) 2 }(1 - k 2 ) 2 k 2 p (4.27) 

and for clarity, the argument k is dropped from p( A) and written as 
/) only. 

Proof Using (4.19), the RHS of (4.15) can be written as 

= [{A( 1 + A' 2 )}/(1 — A -2 )]{\/« 2 — 1}. (4.28) 


Due to (3.16) and (4.30) 

fituns U and 0 0, as k -+ 0 . (4.31) 

Hence using (4.28), (4.29) and (4.31) 

h/±'o) < h A'i) -+ 0, as k - 0. (4.32) 

Ajlso, as A* —> 1, u,'\ -*■ "v. With 0 < 7 < n 

h/^o) < (* /*>i) -*• o, us A* —*■ 1. (4.33) 

Notice that u/n given by RHS of (4.15) is such that > 0, 
0 < A cl since p < 1. That < 1 can be seen from the inequalities 
(4.34) and (4.35) below 

tan(-,/l) < mu(-,/2) < n, mx < {2A/(1 + A- 2 


m > 1.0 < ^ < 7 r 

(4.34) 

= {t,ui(74)|'/'"' < 2A/( 1 + A 2 ) < 1. 


0 < A < 1. 

(4.35) 


Wilh 7 > 0, then the ratio b/^’n) > 0. (4.32) and (4.33) thus reveal 
thai al least a maximum value lor the ratio (* /-mi) exists in the range 

0 < A- < 1. 

Differentiating the ratio ( m /^) with respect to A (with uj 0 given 
by RHS of (4.15)) and equating it to zero yields, alter simplification, 
(4.20). To ensure that (4.20) corresponds to a maximum condition, 
one needs to check whether 

d 2 ('l^ 0 )/dk 2 < 0. (4 36) 

Direct differentiation of the RHS of inequality (4 16) with respect 
to A yields (4 21) Equation (4 22M4 27) are obtained by some 
simplification of the direct differentiation of -to given by the RHS of 
inequality (4.15) with respeci to A Hence ihc resull ol Theoiem 3 
Remark 3 That the condition given in the inequality (4 19) is only 
a mild one can be seen by compaung the constiaint on f>{k ) due to 
the inequalities (4.19) and (4.35). 

V Laampli 
Let 

Poh) ” 1/f * - 2) 
i = 1 , m f = 1 . 

Assuming k as a vauable, using (4.1) and (4.3) 
n(2) = i, = k/:\ 
t i ( a ) - 1 / = A 
and 

■-I = < ->(».) = -2A7(3-A 2 ) 

-i = rjrj) = (>. 

For the interpolation data ol (:i.l) and (.<*.()) for the mapping 

r - i 


Since >i > 0 and a > 1 are constants, it follows that 

•m — tW-i > 0. A- - 0. (4,29) 

Also, for k in the range 0 < A- < 1, since the interpolation data 
= n + L■ * ■ 2n, n ^ 0 (specified in (2.14) and (2.16)) is 
confined to the region enclosed by the circle (2 in V (the image of 
A under the mapping t), it follows by Schwarz’s Lemma (applied 
to the analytical map E given by ( 2 . 11 )) that necessarily 


Since sin( 7 / 2 ) can be arbitrarily close to put 

sin( 7 / 2 ) = x mnK . 

=2sii,-'{2A/(3-A- 2 )} 

P = tail(-)/ 0 - (1 - \/T- OtiL, }/ 'TC.n..^ 
= {(3-A- 2 ) - v/(9-^)(1-PT}/(2A-) 
dp/dk = (3 + k 2 )p/{ky/i9-k*)(\^W)) 


ft*., < { 2 fr/(l + fc J )} m , 


»r > 1 . 


(4.30) 
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Equation (4.20) ib satisfied in the range 0 < k < 1, for k = 0.781 
only, Also, using (4.22M4.27), (4.21) has been verified to be true at 
k = 0.781, The ratio 'y/wo is maximized for k s 0.781 and a lower 
bound for Tj is obtained as 

r, h „x > *>M> = 1.424519519/9.942603337 
= 0.143273434. 

VI. Conclusion 

Given a plant transfer function with time delay uncertainty, the 
question arises as to the maximum limiting time delay below which 
a controller exists such that the closed-loop system remains stable. 
Since no restriction is placed on the controller, it is expected that 
the maximum limiting time delay should be a function of the plant 
transfer function only. This paper has provided a solution to this 
problem in terms of a lower bound for the limiting time delay. The 
result will be helpful in determining the time delay margin below 
which the closed-loop stability of a given plant is guaranteed with 
the existence of a stabilising controller which can be determined in 
a straight forward way. 
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A New Reduction Technique for a Class of 
Singularly Perturbed Optimal Control Problems 

Vladimir Gaitsgory 


Abstract —We consider a class of singularly perturbed optimal control 
problems which may not be approximated by the reduced problems 
constructed via the formal replacement of the fast variables by the states 
of equilibrium of the “fast” subsystem considered with frozen” slow 
variables and controls. We construct a reduced optimal control problem 
which provides a true approximation for the problems under consider¬ 
ation and write down necessary and sufficient optimality conditions for 
this reduced problem. 
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I. Introduction 

Singularly perturbed optimal control problems have been inten¬ 
sively studied in the literature (see the overviews in LI), [7]~[9] and 
also more recent works [5], |6], [10H121). As follows from the 
quoted overviews, a common approach to singularly perturbed opti¬ 
mal control problems is based on their replacement by some reduced 
order problems obtained via equating of the singular perturbation 
parameter to zero. 

Recently in [5], [6], [111, and [12], it was shown that there are 
classes of problems which do not allow such an approach. That is, 
equating the parameter to zero leads, of course, to a simpler problem 
but the solution of this problem does not approximate the solution 
of the original singularly perturbed one with an arbitrarily small but 
nonzero value of the parameter. 

In [ 11J and [12], it was noticed that such a situation may appear in 
if “■-optimal control problems. In [5) and [6|, it was established that 
it may appear in the standard optimal control setting if the system is 
nonlinear and/or the cost function is nonconvex. 

Our consideration in this paper is connected with results in [5] 
about an approximation of singularly perturbed optimal control prob¬ 
lems by problems of optimization of some differential inclusions. In 
contrast to [5], where the main goal was to construct an approximating 
problem, in this paper we are interested in how this problem can 
be solved. We study here an interesting special case where the 
approximating optimization problem allows a simple representation 
as a “classic” optimal control problem which we call an extended 
reduced problem. We construct necessary and sufficient optimality 
conditions for this problem and describe how it is related to the 
reduced problem obtained in a traditional way (that is, via equating 
the small parameter to zero). 

II. Extended Reduced Problem 

We shall restrict ourselves to consideration of a class of singularly 
perturbed (SP) optimal control problems written in the form 


minimize G(z( 1)) (1) 

subject to 

i(t) = Q(z(t)) + A(^t))y(y(1).n(t)), (2) 

- /0/UL m( f)), 0(0) = y (3) 

iVf € [0,1], (4) 


Here r is a small parameter. Q(-), <;(■), /(*) are vector functions 
taking their values in R k , R l and R v , respectively, and A( ■) is a k x l 
matrix function. The functions G(:), QU). and .4(~) are supposed 
to satisfy local Lipshitz conditions in ~ and the functions g(y,u), 
f(y.u) are supposed to be continuous and satisfy local Lipshitz 
conditions in y with a constant which does not depend on u € V. The 
minimization is executed on the set of admissible controls which is 
defined as the set of measurable functions u (/) satisfying the inclusion 
(4), where V is a compact subset of R UI . 

We shall suppose that subsystem (3) being written in the 
“stretched” time scale r = fp* 1 

?/(t) = /(i/(r),u(r)) (5) 

satisfies the following two assumptions. 
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Assumption 1: There exists a compact set P C J? T ‘ and a function 

ft(r) 

liiii ri(r) s= 0, / ti(t)(1t < oc 

Jo 

such that far any y* £ P t * = 1.2 and for any admissible control 
«(r): »(*),«/') - jf(r, i/(-L/)|l < »/(t)||» i - /||, where 

p(T*it(‘)iy) stands for the solution of (5) obtained with the control 
u(r) and with the initial values 2 /( 0 ) = y. 

Assumption 2: There exists a compact set il C P such that for 
any admissible control u(t) and for any initial values y £ U, the 
solution of (5) does not leave P. 

Along with these two we also need an assumption about the 
trajectories of the SP system (2H4). 

Assumption 3: There exist compact sets D and W„ D c U r C 
JT, such that for any admissible control u(t) and for any initial 
values y) £ D x il the trajectories of (2M4) do not leave \V xP. 

Under Assumptions 1-3 from Theorems 3.2 and 4.1 in (5) it 
follows that the - -components of the trajectories of (2M4) are 
approximated by the solutions of the differential inclusion with the 
right-hand side defined for all 2 € I V as the closure clfy(;:) + 
A(-)!>}, where the set \ ], C R 1 is the union 

v P = N (jl r 1 / 0 <y( r )' tf < r )W T } w 

7 so „( ) l Jo J 

taken over all admissible controls h[t) defined on the interval [0. T], 
and also over all T > 0, with jy(r) being the solution of (5) obtained 
with the control u(r) and satisfying the periodicity condition 

</(()) = y(T) £ P- (7) 

Since ol{(<)U) -f .4(~)l],} — Q(~) j- .4(^)rl{V # ,} and since the 
Closure cl{r i} }, which we denote as T\ is compact and convex |5. 
Theorems 3.1 and 3.2], the set of the solutions of the above mentioned 
differential inclusion coincides with the set of trajectories of the 
system 

:(>) = yU('))+.4(:(/))r(f) :((>)=' (8) 

with the admissible controls r(t) being defined as the measurable 
functions satisfying the inclusion 

i>(t)£Vs Vf £ [0,1). (9) 

The problem of minimization of the cost function (1) on the trajecto¬ 
ries of system (8) with the admissible controls defined as above will 
be called the extended reduced (ER) problem. 

An application of Theorem 4.1 of [5] leads now to the following 
statements about the connections between the SP and ER problems: 
There exists a function p(t) tending to zero as r tends to zero such 
that: 

1) The optimal value function \\\ (a, y) of the SP problem and 

the optimal value function of the ER problem (that 

is, the optimal values of the cost achieved in the SP and ER 
problems subject to that the motions begin from the moment 
s £ [0,1] with the initial values (-(*),*/(*)) = (-.g) and 
l{s) = respectively) satisfy the inequality 

\WA«.z t y) - < //(r), 

V(k,;, y) 6 [CJ, 1] x D x S2. 

2) Let b > 0 be an arbitrary number and /(f) be an admissible 
control in the ER problem such that 

C?(r*(l))<W(0,;) + A 


where /(f) is the trajectory of (8) obtained with the control 
/(f) and with initial values /(0) s= £ D t that is, /(f) is 
A-suboptimal in the ER problem. Then using this control one 
may find a control /(f) which is asymptotically f>-suboptimal 
in the SP problem. That is, it satisfies (4) and when using in (2), 
(3) with the initial values (/(()),./(())) = U.j/), y £ U it 
generates the trajectory (/(f)./(f)) satisfying the inequality 

(7(/m)<H;<0, :,//) + <■> + OUi(t)). 

Notice that in a general case the ER problem is not equivalent to 
the reduced (/?) problem obtained via equating < to zero in (l)-(4). 
The latter, as can be easily shown on the basis of Filipov’s Lemma 
[3], is equivalent to the problem of minimization of cost function (1) 
on the trajectories of system (8) with admissible controls v(t ) being 
measurable functions satisfying the inclusion 

i'(f)€Ui (11) 

where Ki = (J M ^f {tfO'OO- «)} and ^(w) is the root of the equation: 
f(y.u) = 0. Since, by Theorem 3.2 [5], convlL C \ and this 
inclusion can be proper, all the trajectories which are admissible in 
the /? problem are also admissible in the ER problem but not vice 
versa. 

111. Necessary and Sufficient Conditions 
oh Optimality for the ER Problem 

Let us introduce the following periodic optimization PO problem 

sup jr 1 £ \ l 4(-.)f/(,(/(-). i/(r))Jr| -//(;. A). (12) 

Here sup is sought over the length 7’ of the time interval, over 
the admissible controls u(t) £ V defined on [0.7] and over the 
corresponding solutions y{r) of system (5) satisfying periodicity con¬ 
ditions (7). The optimal value of the problem depends on the values 
of the parameters c, A and the function defining this dependence is 
denoted as /»U, A). 

Let (/(f),/(f)) be an optimal control and the corresponding 
trajectory of the ER problem. Since, by definition, /(f) (E V. V/ £ 
[0,1] and V is the closure of the set 1 ], introduced in (6), correspond¬ 
ing to each t £ (0,1] there exist a sequence of positive numbers 7/, 
a sequence of controls u\(r) £ V defined on the intervals [0,7/] 
and a sequence of the solutions /(r) ol system (5) satisfying the 
periodicity conditions: //(()) = y\[T*) £ f\ 1 = 1,2, ■ - ■, such that 

/(f) = lim (7 1 /) ‘ 1 f \){ / (r). u\[t))(Jt. (13) 

Theorem I Let Assumptions 1-3 hold and let G{i), Q{~), 
and /1(;) have continuous partial derivatives with respect to the 
components of :. Let A(f) be the solution of the system 

\(1) = -FU~"(t).c"U))M1)- A(l) = -6'V(l)) (14) 

where 


F(:,i') = Q(--) + A(:)r, 



Fi(c,r), 1 ~ ],■■■.£ are the components of the vector function 
0 ). Then for almost all t £ [0.1] 

11111.(1’/)-' / X 1 (< )j4( i°(f )).■/(!//( t). «J(r))rfr 

Jo 

afc(i°(f),A(#)). 


( 10 ) 


(16) 
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That is, the sequence of triplets {T 1 /, w!(r),gj(r)} defined as that 
satisfying (13) is maximizing in PO problem (12) with - = ~°(t) 
and A =s A (t). 

Proof of the theorem is obtained via the direct application of 
Pontnagin’s maximum principle to the ER problem. 

Let us note that if for all t from some subset ^ of {0.1] there exist 
T 1 > 0, a control i/ f (r) E f\ Vr E [0. T% and the solution y l (r) 
of system (5) satisfying periodicity conditions' y*( 0) = ^{T 1 ) E P 
such that 

f P 

r°(f) = (r') 1 / Sf(j/(T)V(r))r/r (17) 

Jo 

that is, / ,0 (f) E V/ E then for almost all such f the triple 
{T', u f (r). is the solution of PO problem (12) with : = z°(f), 
A = A (t). 

Let us denote by {T x , u x (r). y ' A (r)) a solution of PO prob¬ 
lem (12) with given and A and define the function r(;.A) as 

(T ') 1 / //(« V).» '(T))f/T. (18) 

./() 

Assuming that (17) takes place lor all / E [0,1], one may come to the 
conclusion that the optimal control r l, (/) can be presented in the form 

i"(f) - V{ "{t). A(/)) (19) 

where *, lJ (f), A(f) are determined as a solution ot the following 
boundary-value problem 

- Q( ■-"(!))+ •(()) ■= - 

Mt) - -r'(:"(M.c(i"(M.A(/)))A(f). 

A(1) — -C <_°(1)). 

Notice that the verification of (17) and the construction oi the 
family ol the solutions of (12) depending on , and A seem to be very 
problematic in the general case. The described procedure, however, 
maybe also applied without such a verification and with the use of 
a family of approximate solutions of (12) instead ol the family of 
precise ones. In Section IV below we shall consider a special case 
when an estimation ol “suboptimality” of the control (19) via the 
estimation ol the approximation of the solutions of (12) is possible. 

Sufficient optimality conditions for the ER problem can be con¬ 
structed on the basis of the Hamilton-Jacobi -Bellman (HJB) equation 
written for this problem in the form 

+ miii{(lC(s.:))' {(}(-.) + .4(;).-)} = 0. (20) 

l 

Using again the fact that T is the closure ol 1 ), and also the definition 
or //(-.A) as the optimal value of the PO problem (12), one may 
rewrite (20) in the following form 

n\'(.s. .> + («"(*. .))'()(■.)- M-.-irV -.)) = o. (2D 

HJB equation (21) provides a characterization of the optimal value 
function ol the ER problem (and, thus, an approximate characteri¬ 
zation of the optimal value function of the SP problem). Namely, 
if the former is differentiable at some point (*.:.) E ff* * l , then it 
satisfies (29) at this point [4, Chapter IV, Theorem 4,1]. On the other 
hand, given some admissible trajectory ~ u (f) of the ER problem (thai 
is, a trajectory of (8) obtained with some admissible control r u (f)k 
its optimality can be verified on the basis of the following sufficient 
optimality conditions. 


Theorem 2: If under Assumptions 1-3 there exists a function 
defined and having continuous partial derivatives in some 
(Mube, r<s around ~°(f) 

r* - {(/,;) 1 1 e [o.i],||i -i°(()|| < a}, d>o 

and also satisfying there (21) with the following boundary conditions 

U71. .) = <?(:). V: £ | II?' — -°(1)|| < b}\ 

U'(0,t°(0))> G(:°(l)) 

then (i >0 (/),:"(/)) is a local solution of the ER problem, that is, for 
all admissible trajectories ■(/) such that (f, z(1)) E T* 

G‘U°(1))<GU(1)). 

ProoJ: The theorem is a reformulation of Proposition 3*7.2 in 
[2J and, thus, it is implied by the proof of this proposition. 

In a general case the optimal value function of the ER problem 
is not differentiable at all points, although it can be shown to 
satisfy local Lipschitz conditions and. thus, it is differentiable almost 
everywhere [4, Chapter IV, Theorem 4.2J. Similarly to Theorem 2, 
sufficient (and also necessary) conditions for the admissible trajectory 
of the ER problem to be locally optimal can be constructed on the 
basis of some generalized version of the GJB equation introduced in 
|2). This is 

min {ft + i 1 Q( -) - /#( - (n, 1) E i)\Y[s< : )} 

where d\Y(s .:) is the generalized Clarke gradient. The formulation 
and the proof of the corresponding statement can be adapted from [2, 
Theorem 3.7.6], □ 

IV. asymptotic Lineari/ahon or the SP Problem 

Let us considci a class of SP problems which are linear in the slow 
variables. That is, assume that cost function (I) and slow subsystem 
(2) arc written as 

mf -'i(l) (22) 

and 

Ut) =■ Q -(f) 4- Agiy(t). «(/)), -(0) = : (23) 

respectively, where Q(k x k) and A(k xl) are constant matrices. In 
this case the system (8) becomes linear 

Ut)=.QL(t) + Av(t) ~(0) = c (24) 

and, thus, by Theorem 1, nonlinear SP problem (22), (23), (3), (4) is 
approximated by linear ER problem (22), (24), (9). 

The linearity of the ER problem makes Pontnagin’s maximum 
principle not only a necessary but also a sufficient condition of 
optimality in this problem. More than that, the following statement 
is known to be true. 

Let A(/) be defined as the solution of system (14) which, by (22) 
and (24) takes the form 

= Mi). A(l) = -<• A(f) = —(25) 

and lei an admissible control satisfies the inequality 

A 7 (f).4i-*(f)>ft(A(f)) -/;((). »/(')><> 

for almost all t E [0,1]. Here 

//(A) = maxjA* Ai eT 7 } (26) 

and //(f) is some integrablc function. 

Then the trajectory z*(t) of (24) obtained when using this control 
satisfies (10) with <*> = ^ //(f)rff. That is, r*(1) is (‘Ksuboptimal in 
the ER problem. 
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The validity of this statement follows from the fact that the value 
of the cost function (22) obtained when applying the control v x (t) 
on the interval [*, 1] with the initial values z(») = t is equal to 



whereas the optical cost function value is equal to 

W(s.;) = -A’ h(\(t))dt. (27) 

Notice that 

max^A 7 !' | i» 6 1”) = Mip{A / u | v £ X^} 

and, thus, h( A) coincides with the optimal value of the PO problem 
(12)-, with the dependence on disappearing since the matrix ,4(i) is 
supposed to be constant. This allows us to reformulate the statement 
above an follows. 

Let T x be a positive number, u x (r) £ t r be a control defined on 
[(I,T A ] and y x {r) be the solution of (5) corresponding to this control 
satisfying the periodicity condition: y x ( 0) = y x {T x ) € P such that 

r r x 

( T x l" 1 / X 1 A(j(y x (r),u x (T))dT 

Jo 

> /i(A) — r/(A), T)( A) > 0 

and let 

<'(A ) = (T a / D(i/V),, A (r|)rfr. 

Jo 

Suppose that the family of triplets {T x , u x { r), y x { r )} and, thus, also 
v(X) arc defined for all A £ A, with A containing A6 [0,1]. 
Then the control c x {t) = r(A(f)) is ^-suboptimal in the ER problem 
with h = f ( j t/(X(t))dt. 

V. Connections Between the ER and R Problems 

As we have already noticed, the sets of controls and trajectories 
admissible in the ER problem are in the general case more rich 
than those in the B one and, hence, the optimal value functions of 
these two problems, \Y(s, z) and, respectively, ^(.s, :), satisfy the 
inequality 

F(». •-) > 6 [0.1] x R k . 

In case the SP problem is linear in «, that is (22), (23) are valid, 
the difference between F(s,z) and W(s.z) allows an explicit 
representation. 

Let us define h Ht (\) as the optimal value of the steady slate 
optimization SSO problem corresponding to PO problem (12) 

/u(A) = nmx{\ J Ay(i,'(ii), u) | u € 

= max{A 7 Ay(y, u ) | //(//, u ) = 0, u £ U. y € P\ 

where again, as above, the dependence of the optimal value on ^ 
disappears since .4 is not dependent on 

Theorem 3: Suppose that Assumptions 1-3 and (22), (23) are 
true. Then 

F(*,i) - ir(s,;) = / (/i(A(/))- h«(X(t)))dt (28) 

J H 

where A(f) is defined in (25). 

Proof: In the case under consideration the f? problem takes the 
form (22), (24), (II). That is, it is linear and, hence, its optimal value 
function is given by the expression 

r) = —A 7 (.s)~ — f niaxfA 7 {t )Av \ v £ 1*1 }rff. 


Comparing this with (27) and taking into account that 
max{A 7 .4t» | r £ V B <} = h^( A) 

one obtains (28). □ 

Corollary: Under the assumptions of Theorem 3 the optimal value 
function of the SP problem converges to the optimal value function 
of th& R problem 

lini W, (#,;,jf) — F{s,z) V(.s. 1 , y) £ [0,1] x D x S2 (29) 

if and only if for almost all t £ [0, l] 

h(X(t)) = h H \ (A(/)). (30) 

Notice that in a general case h( A) > /; _< ( A ) and a number of tests 
allowing one to verify whether this inequality is the strict one or it 
takes the form of the equality were developed in periodic optimization 
theory (see references in [5], [61). Notice also that if this inequality 
takes the form of the equality for all A (that is, (30) is true for all 
A and not only for A = A(/)). then as it follows from Theorems 
3.2 and 4.2 in [51, the convergence (29) can be guaranteed for SP 
problems of a much more general structure than that we are dealing 
with in this paper. 

Vi. Conclusion 

Wc have proposed a new method to deul with the class of SP 
problems described. The method consists of an approximation of the 
solution of the SP problem by the solution of the ER problem and 
of characterization of the latter by necessary and sufficient optimality 
conditions. For SP problems linear in the slow variables the method 
allows the complete answer to the question about the efficiency of 
the traditional reduction technique. 
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Pole Assignment with Robust Stability 

Mark E. Haipcm, Robin J, Evans, and Robin D. Hill 

Abstract —This paper uses convex analysis for the pole assignment de¬ 
sign of discrete-time SISO systems incorporating robust stability against 
norm bounded parametric perturbations In the plant transfer function. 
The method involves designing an overparameterized pole assignment 
controller for the nominal plant with the overparameterization chosen to 
reduce the size of changes in the closed-loop characteristic equation which 
result from plant perturbations. Sufficiency bounds on the /,, norm of the 
perturbation guaranteeing stability are obtained. 

I. Introduction 

For most real control problems, the behavior of the plant does not 
match exactly that of an assumed linite order linear time-invariant 
dynamic model. This is because the true plant usually has complicated 
dynamics which may be nonlinear and of high order. 

This motivates the design of robust controllers, that is compen¬ 
sators which achieve acceptable performance in some sense for 
a range of plants. This then can allow for uncertainties in the 
plant model. Uncertainties are usually considered to lit into two 
broad categories, namely structured and unstructured uncertainties 
depending on the specificity of the knowledge of the perturbations 
with respect to the plant model structure. For example, uneertainlies 
m the irequency response of a plant are unstructured because ihcy 
relate to the plant as a whole. On the other hand, uncertainties in a 
parametric description of a plant are structured. The two main aspects 
of robustness are stability robustness and performance robustness. In 
this work we focus on design for stability robustness in the presence 
ol structured parametric uncertainty in the plant. 

Previous related work is described in [l]-|9|. Dahlch 11) used the 
small gain theorem to obtain necessary and sufficient conditions for 
closed-loop stability in the presence of / x induced norm bounded 
coprime factor perturbations to the plant. In [2|, Soh and Evans 
applied Kharitonov results |3| to characterize allowable perturbations 
of an interval plant m closed loop with a pole placement compensator. 
Kharitonov-type extreme point results do not hold, however, for a 
stability region which is a circle at the origin of the complex plane 
so they arc not easily used for testing discrete-time stability so a 
bilinear transform was used in [2J. 

An approach which handles general stability regions and. thus 
allows the treatment of discrete-time systems, involves treating the 
vector of coefficients of an nth order closed-loop characteristic 
equation as a point in n -dimensional parameter space. Fam and 
Meditch |4| have determined the stability boundary for discrete¬ 
time systems in this space of coefficients and have shown it to be 
bounded by three hypersurfaces. Soh ct al . [5] have found the largest 
hypersphere centered at a specified stable point and containing the 
coefficients of only stable polynomials. 

Using this type of framework, Bicmacki ef al. [6], and Keel cr al. 
[7] have proposed continuous-time compensator synthesis approaches 
for uncertain multivariable plants. They regard the compensator as a 
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mapping from plant parameter space to closed-loop coefficient space 
and design the compensator to restrict the closed-loop coefficients to 
the interior of a stable region in this space. The compensator design 
involves a nonconvex numerical optimization. 

Berger [8], 19] has proposed an approach for the synthesis of robust 
control systems using overparametenzed pole assignment controllers. 
In that work, the overparameterization is used to minimize the 
variance of the closed-loop characteristic polynomial due to plant 
parameter uncertainties described by the covariance matrix of the 
plant parameter estimates as obtained by a recursive least squares 
estimator. 

In this paper wc present an approach to the design of overparame¬ 
terized pole assignment compensators for SISO discrete-time systems 
which maintain stability in the presence of norm bounded structured 
parametric uncertainty in the plant model. The work is in a similar 
spirit to that in [71, but our use of norms is different and we design for 
nominal pole assignment. This gives a design which uses a convex 
optimization and allows some closed form results to be obtained. 

II. Problem Formulation 

The : transform, /», of a sequence, {//,},'is defined by h = 
/u- ' With this definition, a stable transfer function has all of 
its poles inside the unit disc of the complex plane. The symbol z~ l 
denotes the unit delay and polynomial, A'(»), of order n, is given 

by A'U) = E"V r -- '• 

In this work wc consider the vector of coefficients of a polynomial 
as an element in an normed space. We denote the vector of 
coefficients of X(z) as ,r, with the I,, norm of a vector .r = {.r, 
denoted as ||.r|| 7 , defined as 

I 1 //' 

■<ii, = |X>i"l 

for 1 < p < x with 

: sup |.r, 

Consider a SISO discrete-time plant given by 


where fj n is the plant output. is its input, and B( -) and A (:) are 
coprime polynomials with ho — 0 and ao = 1. The plant has n„ 
finite poles t \. t 2 . ■ ■ ■. ^ 0. The plant is in a feedback control 

system with </,, given by 

ttU) 

(I -:-')Fhy w 

where ir is the reference input and (7(c) and F(c) are compensator 
polynomials. Note that for convenience a one degree of freedom (1- 
DOF) compensator is considered here, however the robustness results 
obtained apply equally for 2-DOF systems. 

The system closed-loop poles are the roots of I'(c) — 0, where 

M-Hl - ~-')F(-.) + D(z)C!(-.) = 1’U). (I) 

For the usual pole-assignment controller design, the designer specifies 
V(:) with all its roots stable and then solves (1) for the compensator 
polynomials, F(c) and (7(c). The F(c) and G{~) which satisfy (l) 
are not unique but are parameterized as follows. If F°( --) and <7°( -) 
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Rtt a particular solution to (1)>then all F(z) and G(z) which give 
the same \ U) are given by 

F(z) = F°{~) + B(.)\{~) (2) 


This yields a key lesult which is a sufficiency test for stability if 
for some p q € [1, x) satisfying 1/p + I/q = 1 we have 

II AMI I < (6) 


and 

6‘(-) = G°( )-(1 -_-')4(.)\(.) (3) 

where YU) is an arbitrary polynomial 
Usually, F°U) and Cr°(~) aie chosen to be the minimum order 
unique pol>nomiah satisfying (1), and \ U) is incorporated to give 
some sort of performance improvement, ihrough overpaiameteriza- 
tKWi of the compensator Such a procedure gives proper controllers 
In the work presented here, YU) is chosen to reduce the effects 
of parametric plant perturbations on \ (*) We consider the plant as 
a nominal plant modified by perturbation polynomials A2?U) 
with coefficient vector Ah = (0 Abj Ah ) and A 4( ) with 

coefficient vector An = (0 Ar/|, An, A ) so that the perturbed 
plant is 

D( ) _ B'\ ) + AB(.) 

4(-) 4 U (-) + A 4( _) 

The orders of A7?U) and A 4( ) may be larger than those of 
&°( + ) which has order ii/u and ^4° (-) with order n , > The order of 
the perturbed plant polynomials is 

Hi — ina\[n^ n±i] 

and 

ii, — ma\[n, u it a ,] 

If the perturbed planl is used in conjunciion with a compensalor 
designed to give characteristic polynomial 1 °( ) with the nominal 
plant /}°(~)/4 (, ( ), the closed-loop characteristic equation will be 

1 V) + A1 (-) = () 


where from (1) 

Al (.) = 6(-)AD( ) + (1 - V( )A4(.) (4) 

Our approach involves designing the compensator to reduce the effect 
of A7J(,) and A \( ) on the si/e of A\ ( ) thereby allowing larger 
plant perturbations while preserving stability 

III Robustnlss Against a Ciassof Pi ant Pcrfurbations 

We consider the coefficient vectors of AB{ ) and A 4U) as fixed 
but unknown elements in an l, normed space 
Initially, we examine the case with Al(.) = l) In this case (4) 
reduces to 


At ( ) = GU)AflU) 

In terms of coefficients we then have 

~ y^</jAh,- ; , / = 1 2 n v + rt^/ 

so that Hblder’s Inequality gives 

ll-Mk SNUIAML (5) 

tor all p<q E [1 x>) satisfying I/p + \/q - 1 
We define t to be the largest number such that 

||An|| x < 7 => \ °U) -h A\ (-) is stable 

A sufficient condition tor stability of the perturbed system is 
||A?'|| ^ t A sufficient condition for || Ai |k < - is that for some 
p, q € [h r <) satisfying 1/p + \/q = 1 we have ||</||, ||Ab|| , < 


theft the perturbed system remains stable 
If “degree dropping,’ where the perturbations cause certain of the 
h , to become zero, occurs this does not invalidate (6) 

We may interpret ** geometncally as half the length of side of the 
largest hypercube centered at i°, the vector of coefficients of the 
nominal closed loop characteristic polynomial, and containing only 
stable polynomials The value of ^ depends only on / 11 and on n the 
order ot the perturbed closed-loop characteristic polynomial given by 

n — niA\[uj, -f n i u i + n f -f 1] 

Vicino |10| has shown how to calculate - Por the particular case 
where the nominal system is deadbeat with order n we have from 
Soh [11] that - = 1/n 

Thus from (6) we can design a controller with robust stability 
againsl / v perturbations to the plant numerator by minimizing ||fy||, 
This minimization may be performed in several ways One way is 
by calculating C»( ) = G°( ) - (1 - 1 ) t f, ( )\( ) in terms of 

the unknown coefficients of \ ( ) and then minimizing ||r/|| ; with 
respect to these coefficients If p = 1 or p — x the minimization is 
a linear program It p = 2 there is a closed form solution 

Alternatively the minimization ot ||r/||, may be formulated as 
the minimization of a norm sub|ect to a set of linear interpolation 
constraints as follows 


(7) 


subject to 


and to 


72 *'/ 


i_Vi) 

B (I (M 


/ - 1 2 n 


I> - 


mi 

«'(1) 


(K) 


(9) 


where (9) is toi an integrator pole Here the / s arc the plant poles 
which are all known They may be stable or unstable 
Writing the minimization in this form (7) (9) yields the insight 
thai “adding’ more poles to the plant icstncis the feasible set 
tor ihe minimization and thus increases the value ot (IMI/) n It . 
thereby tending to decrease the robustness to uncertainty in the plant 
numerator In (his respect the integrator imposed for performance, 
has ihe same kind ot effect as an extra plant pole 
As the ordei ot the system is increased through overparametenza 
tion ot the compensator the values of both - and ot (IML ) miI 
become smallei It is thus necessary to check that their quotient 
increases with overparametenzation 
Another benefit of expressing the minimization as in (7) <9) is that 
it enables the construction of a dual maximization problem, which wc 
may use in some special cases to obtain an analytical solution to the 
minimization (7)-(9) 

The maximization, dual to the primal minimization ot (7M9), is 
given by 




= uut\ > <\, —-—- 

. f- 'mt,) 


+ <* 


b°(D 


subject to 


11/11/ < 1 
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where 

dt = 5Z a J f 7' + 0’n*+i; i = 

A. Closed Form Solutions 

In this section we derive analytical expressions for (|| for 

a nominal deadbeat closed-loop system where the plant has only one 
pole 1 1 € (-oc,oc) with fi ^ 0 and where there is no integrator 
in the loop. 

For the problem of minimizing the l p norm of a vector .r € t p 
subject to one linear constraint <vr, = jm (which we write as 
p.j’ = w), Holder’s Inequality yields 


B. Selection of Norms 

Performing the design requires choosing the l p norm of the 
compensator coefficient vector to be minimized. This should be done 
based on knowledge of the distribution of perturbation coefficient 
magnitudes to allow larger perturbations without losing a guarantee 
of stability. We do this by incorporating a scaling factor c into the 
perturbation so that 

AD(z) = c*(z). 

This does not uniquely define c but if we keep /)(£) constant, we 
may usefully try to make c large. We then define r*( 7 ) according to 

= (|| A6|| (/ )m ax ^ c “ '•(«,) 


mill ||.r||,, = 


( 10 ) 


S.t. C..T — Ml 


where p, q 6 [1, x,) and 1/p + !/</ = X. 

Applying this result to the minimization problem of (7) gives 

(llvll/ 1 ) min |23« (/, )| 


where 


= U 


Since the closed-loop system is deadbeat, we have 


1 




1 


so that 


n )ib + na 


<»-».).. 


With q =r 1,(11) gives 


(||A/|||| ),rm« = 

With (j = 2, (II) gives 


- |A U (M1 ( 1 -|h| ( "* +l > ) 


(in, + »,/) ( j — Ki r) 


(iiiHiii. = 

Wh + Ht, 


1/2 


(ID 


( 12 ) 


(13) 


With q = (11) gives 

if |fi | > 1, 


(||Aft|l U = 


jgVij j 

lib 4- II7 


and if |fi| < 1, 


(||A/>|M nij ,x = 


lib + 11 q 


(14) 


(15) 


Here with no overparameterization, w 7 = 0. 

Equation (II) indicates that if the plant is stable, that is if (|fi | < 
1), then the nominal deadbeat system can be made closed-loop stable 
against arbitrarily large perturbations of arbitrarily high order in the 
plant numerator by overparameterizing. We can also see this by 
inspecting (8), noting that the coefficients on the left-hand side of the 
constraint (8) may be made arbitrarily large by overparameterizing. 
This clearly holds for a plant with any number of only stable poles. 

These closed form solutions (11H15) for (||Afo||) rim * are each 
proportional to B°(t i). A value of = 0 indicates a plant 

pole-zero cancellation. An unstable root common to B°(z) and A 0 U) 
clearly means that the nominal plant is not stabilizable but a stable 
common root has no such effect. The closed form solutions are 
obtained on the basis of deadbeat pole placement, and this is not 
possible if there is a plant pole-zero cancellation, 


and then attempt to find the value of q which maximizes 
We consider only l \. h, and /** norms because these are the easiest 
to calculate. The results extend for the more general case of l q norms 
with q 6 [1, oo). 

An important result behind the selection is that 

(ll^ll'X’)tnm S (||</||2 )nun S ( Hr/Hl )mm- 

From (6) then we have that 

(||A2j||i ) mrtX > (||A6|U >nw>* > (||AI)||oc )iimv (16) 

We illustrate the norm selection process by considering two ex¬ 
treme distributions of perturbation coefficient magnitudes. The first 
arises from a perturbation with only one nonzero coefficient, that is 
AB{z) = r- where k is an unknown integer in the range [1, 

We wish to choose tbe /,, norm such that minimizing \\g\\ p gives 
the largest value of \c\ with guaranteed stability. For this perturbation 

l|Aft||, = ||A6||a = IIAftlk =M 


so that 

||Af>||] = (||A2>||]) inftx Ou “ (l|A6||i) max * 

\\±b\U = (IIAMI'i)...-, f ( ,) = (||Afc|| a )„ lax 

and 

II AMI x = (||A6|| x)mnx ^ = (||Ah||^)„i«. 


Equation (16) implies that we obtain the largest value of |<*j by 
choosing q = 1 and p = nc t that is we design by minimizing ||<7||^o. 

At the other extreme, if the perturbation is such that |A/>,| = c 
for i = 1,then ||A2j||l = ||Af >||2 — Cy/n^t and 

||A6||r<. ■= c so that we have 


||A/;||, =(||A6||i) mM =»c fJ) = 
||Afe|| 2 = (||A6|| 2 )iiiax => t\2) = 


(llAblhUx 
II Ab 

(II^Mh)max 

y/nAb 


and 


||A6|U = (||A6|U) ul „ =*r (oc) = (||Ab|| rx /max ■ 

For the simple case where the plant has only one pole 1 1 6 
(—00, sc) with 1 1 / 0 and where the nominal closed-loop pole set is 
deadbeat, we can substitute for (||Ab||q), lul x from (7H9) to obtain 

f . iB o (Mi (1 - m~ (, i » +i) ) 1 

U) (»(,+«■;) (l-|fl| -1 ) 7JAI,' 

_ |J?°Ui)| 1 -f~ (2, '» +a) 1 

f(2) (»6 + n B ) 1 - 
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TABLE I 

Effect of Ovbrfarameterization of Compensators 


Outer nfft(z) 

at). 

r 


1 

4 08654 

m 

0 08 J 6 

2 

172476 

0 25 

0 144$ 

1 

1.10385 

02 

0 1812 

4 

07871 

1/6 

02117 


and 

, _ jtfUjl 

{rx ' } (nu + n*)' 

For > 2 ,n g > 1 and |fi| > 1, we have > r (2) > r (l) 
and therefore can guarantee closed-loop stability for a larger value 
of r if we minimize ||. 7 ||i. 

C Example 

In this example we demonstrate how overparameterrzing the com¬ 
pensator can increase the value of (||A 6 ||i )m*x. 

The nominal plant is 

fl°U) _ 0.133:k^-0.GG67^' 2 

.4° (*) 'T-066G7c" 

and so has a delay, a nonminimum phase zero and a stable pole. 
The nominal closed-loop pole set is to be deadbeat, and there 
is an integrator in the loop so (11) does not apply. The order 
n&b of the perturbation is chosen to be n±t, = 2 so that m, — 
max(n f/ o,rtAt») = max( 2 , 2 ) = 2 , Increasing the value ol n^i, 
reduces the value of 7 by increasing the Value of /i/, in the formula 
7 = l/(iih + n y ) for a nominal deadbeat system. Table I shows 
the effect of overparametenzation for this example. This table shows 
how the /j norm of the guaranteed safe perturbation on the plant 
numerator increases as the compensator is overparameterized. 

D. Other Plant Perturbations 

A similar approach can be taken for plant perturbations with 
Afl(-) = 0 and AA(i) ^ 0. 

For both A*4(~), AJR(:) ^ 0 we may prefer on the basis of the 
distribution of perturbation coefficient magnitudes to consider and 
An to be elements of different /,, normed spaces. We take A/> € 1 VU 
and A a € Ip 4 where pa-ph € [1. x). We then have, by a simple 
extension of (5) that 

ll-MU <II/II ( .,JI^II.m + IIsII,.„II^IU„ 

for all P 4 '<i 4 .pn.qb € [ 1 ,oc) satisfying ]//>., + 1 /r;, = 1 and 
t/PH + 1/<zh = 1. 

Here, for convenience of notation, / is the coefficient vector of 
(1 - z~')F(~). 

A sufficient condition for stability of the perturbed system is that 

II/IIp JIAoIIva + Wpr 11*611 (17) 

for some € [l.x) satisfying 1 /p\ + 1/qa = 1 and 

1/vh + 1/7 b = 1 . 

The design approach then is to make \\y\\ and ||/|j small in order 
to reduce the “gain" from each perturbation to Ai». 

We cannot minimize both ||#|| and ||/|| independently with respect 
to -V(;) since the same value of X(z) must be incorporated in 
both G(z) and F(z) as in (2) and (3). To design a compensator 


to reduce the “gain" from each perturbation to Ac, we need to share 
the minimization effort between reducing \]g\\ and ||/||. 

There are a number of ways of doing this. One way is to formulate 
the compensator design as the minimization of a weighted sum of |j/|| 
and ||£|| with respect to .V(;) subject to the set of linear equality 
constraints obtained by equating like powers of z in 

,4°U)(1 -;" 1 )F°(i) + Z) n U)G 0 (;)=: 1(18) 

Another way is to specify a numerical upper bound for either \\(j\\ 
or ||/1| (of course the value specified must be large enough to be 
achievable) and then to formulate the design as the minimization of an 
Ip norm of the other coefficient vector, subject to the constraints (18) 
and to that on the specified value of the norm of the first coefficient 
vector. 

If only /i and / v norms are used in the previously mentioned 
schemes, the optimizations are linear programs. 

By whatever means the minimization effort is apportioned, the 
perturbed system will be stable if (17) holds. 

If either of the perturbations is of high order, it will reduce 
7 thereby reducing the maximum allowable value of the other 
perturbation. 

IV. Rkmark 

In this paper we have calculated allowable bounds on the /,, norms 
of plant parameter perturbations such that ||Ar||^ < 7 where 7 has 
been chosen to guarantee stability. We could have selected a smaller 
value of sufficient to ensure that the perturbed closed-loop poles do 
not cross chosen regions surrounding the nominal closed loop poles. 
The approach could then be considered as design for robusl pole 
assignment. Of course reducing the value of ** 1 educes allowable 
bounds on plant perturbations. 

V. Conclusion 

We have presented a framework which uses convex optimization to 
design overparametenzed pole assignment compensators with guaran¬ 
teed stability against Ip norm bounded plant parameter perturbations. 
The framework is flexible and a numbci ol design problems may 
be formulated within it. depending on the nature of the parametric 
perturbations. 
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The Carathtodory-Fejlr Problem and Tt^/¥\ 
Identification: A Time Domain Approach 

Jic Chen and Carl N. Nett 


Abstract —In this paper we study a worse-case, robust control ori¬ 
ented identification problem. This problem is in the framework of Ti v 
Identification but the formulation here Is more general. The available 
a priori information in our problem consists of a lower bound on the 
relative stability of the plant, an upper bound on a certain gain associated 
with the plant, and an upper bound on the noise level. The plant 
to be identified is assumed to lie in a certain subset in the space of 
Ti x , characterized by the assumed a priori information. The available 
experimental Information consists of a corrupt finite output time series 
obtained in response to a known nonzero but otherwise arbitrary input. 
Our objective is to identify from the given a priori and experimental 
information an uncertain model which consists of a nominal model in 
Ti^ and a bound on the modeling error measured in norm. We 
present both an identification algorithm und several explicit lower and 
upper bounds on the identification error. The proposed algorithm is in 
the class of interpolator algorithms which are known to possess desirable 
optimality properties in reducing the identification error. This algorithm 
is obtained by solving an extended rarathfridorv-KejGr problem via 
standard convex programming methods. Both the algorithm and error 
hounds can be applied to 1 1 identification problems as well. 

I. Introduction 

Recently, there has been a considerable amount of interest in sys¬ 
tem identification from a worst-case robust control oriented approach 
(sec, e.g., 114]. [341 and the references therein). A specific example is 
the H . identification problem initiated in [14]. In this problem, one 
typically needs to apply a sequence of complex sinusoidal inputs 
to estimate a sequence of point frequency samples of the plant 
transfer function, and one then seeks to identify, from the estimated 
frequency samples and certain assumed a priori information, an 
uncertain model. This problem has been studied extensively (cf. [14], 
[I1J, 1121, [26], 1271, [2], [21], [6]), and much of the research has 
been focused on identifying from a given sequence of corrupted point 
frequency samples a model consistent with control design. 

Another problem lhat is closely related to that of the Ti v identifica¬ 
tion is more concerned with the task of identifying an uncertain model 
compatible with the (i design methodology. This problem has been 
generally referred lo as identification in (i and has been studied in, 
c.g., 151,116], 117|, [20], |7], [32], [291. In this problem, one typically 
assumes certain prior knowledge of the plant impulse response, and 
the identification procedure then proceeds with a sequence of lime 
domain data. 
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In the present paper, we consider a variant to the Ti^ and f i 
identification problems alluded to above. This problem is formulated 
in such a way that the assumed plant a priori information is identical 
to that in the framework of {14], while the assumed experimental 
information is more similar to those utilized in [51, 116], [20|. Our 
objective is to identify an uncertain model in v which includes a 
nominal model in and a bound on the modeling error measured 
in Ti x norm. As such, this problem may be considered as a hybrid 
of the Ti^ and identilication problems or merely a more general 
formulation of Ti v identification problems. We shall present both an 
interpolatory identification algorithm |31), |5|, |6] and several explicit 
lower and upper bounds on identification error for this problem. 
Though our results are derived in the Ti^_ framework, they can be 
applied equally well to i\ identification problems. 

The main difference between our approach and that initiated in 
[ 14] lies in that the identilication here is conducted entirely in time 
domain. This has a number of implications. First, unlike in [14J* 
there is no need in the present approach to transform the time 
response data to frequency samples. A more important merit with 
this approach is lhat the proposed interpolatory algorithm has a 
guaranteed optimality property for the overall identification problem. 
In contrast, the previously known interpolatory algorithms [6], [13] in 
the framework of 1141 are suboptimal with respect to only the step of 
identification from point frequency samples; the overall optimality 
properties, however, are lost due to the transformation from the 
time response data to the frequency samples. Moreover, the main 
difference between this work and that of (i identification [5], [J6]. 

120), 132] lies in the assumed plant a priori information. Unlike that 
in (i identification, the plant a priori information in this approach 
is assumed for the plant transfer function. Though it is possible 
to transform this a prion information to one on the plant impulse 
response (e.g., via Cauchy estimates) and hence to apply the (y 
identification approach, the transformation will necessarily lead to 
undesirable conservatism and thus result in a loss of optimality 
propenies. From these perspectives, it appears that the algorithm 
presented in this paper is the first algorithm with a guaranteed 
oplimality property for the problem of identification. Finally, 
though our technique in deriving the algorithm is very similar to that 
of [28] (see also |33J), the two problems are clearly different in that 
the goal in this paper is to identify an uncertain model from certain 
given a priori information and experimental data, while the problem 
in [28J amounts to verifying the consistency of a given uncertain 
model to data. 

A preliminary version of this paper has appeared previously as |4], 
A. Organization 

In Section II, wc formulate the problem and provide a brief review 
on relevant notions in information-based complexity (IBC) [31). In 
Section III, we present our identification algorithm. This algorithm 
relics on the solution of a related problem termed the consistency 
between the duta and a priori information, which is concerned 
with the issue of validating the assumed a priori information and 
is reduced to a convex optimization problem. The construction of 
the algorithm together with the consistency problem amounts to 
solving an extended Carath6odory-Fej<5r [18] interpolation problem. 
Finally, explicit lower and upper bounds on the identification error are 
derived in Section IV, and the convergence properties of the proposed 
identification algorithm are established. The paper is concluded in 
Section V. 
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B. Notation 

We shall use the following notations throughout this paper. Let 
Z denote the set of integers, Z+ := {k € Z : k > 0), and 
Z +.„ {k € Z+ : ft < n[. Let R denote the set of real 
numbers, and R" the space of n dimensional real vectors. Moreover, 
Jet R~f := {,r >E R : t > ()}. The symbol rr(A) denotes the 
largest singular value of the matrix A E R t,x, \ and if ,4 is a 
symmetric matrix, XI A) denotes its largest eigenvalue. The transpose 
of A € R MXM is denoted by A 1 , For any positive p E R+, let 
"D P {r E C : |c| < p) and define the normed space 

:= j/: D,, C | / analytic in D p and 

WfW^n := sup |/U)| < oo >. 

r£D p J 

Furthermore, when p = I, we shall replace Dt by D, and H -v j by 
and so on. Note that the union of Hr* for all p > 1 constitutes 
a linear space. We denote this linear space by ‘H+ := ft ,p C 
H-x.. Recall also the normed spaces and () (see, e.g., | 8 ]). We 
shall use the symbol || ■ |] to denote the norm associated with each of 
the spaces and \ however, each of these uses should be clear 
from context. For each of the normed spaces (A\ ||*||a ) given above, 
we shall define BXlM) := {./■ £ A": ||,t||a < M }. Finally, for each 
m 6 Z+,» > 1 , wc define a projection operator by /V (* —> C * 1 
such that {Pnf)k := fk for all k E Z+ „. Furthermore, we define 
the operator T: C" C' 4 *" by 


l»C„ - t ('Tl —‘l ’ * ’ 

where [r 0 ■■■ r„„i] E R'\ Often, to simplify the notation, 
for a vector denoted by a lowercase letter, we shall use its capital 
version to denote the corresponding lower triangular Toeplitz matrix 
generated by T, i.e., C T(c 0 , ■ ■, c „-)). 

11. Problem Statement and Preliminary Background 

We consider the class of causal, linear, shift-invanani, exponen- 
dally stable, distributed parameter systems. This class of systems is 
identified with the linear space 7Y+. The plant to be identified is 
represented by an h E ft+ which is the standard :-Transform of the 
plant impulse response evaluated at 1/: 

h(z):=Y j h k -. k . (2.1) 

The plant a priori information consists of two positive constants 
M > 0 and p > 1, and the system to be identified is assumed to 
be an element of BH^ ,,(AJ). Note that the plant set DH* n (M) 
is precisely the same as that considered in [14], [11 ], [12], [26), and 
[6], and interpretations on M and p may be found in [ 14). 

The experimental procedure considered herein is similar to those 
in 116], (20J, [51. Specifically, the experimental data is obtained 
by applying an arbitrary nonzero input u E to the plant /i 6 
5ft*.,»(A/), which generates an output h * ii as the convolution 
product of h and u . The output measurement is assumed to be 
corrupted by an additive bounded noise i\ and the corrupted output 
is observed over a finite duration ri. This experiment can be precisely 
described using the experiment operator 

En(h,v) := Pn[(h*v) + v) 

where v G and f > 0 is a prescribed bound, and it 

represents the fiill a prion information on the output noise. Denote 


U := T'(uo, • • ■*w n xj)> Then, each data record y € R", where 
yk := (E n (h*v))i l, can be expressed as y » l T P n h + The set 
of all possible data which may be generated from the experiment is 

y {y G R A : y = VP„ h + P 1t r for some 
4 lh,v)£DH^ n (M)xD? o,(f)}. 

For each y G y, we define the set of indistinguishable systems (cf. 
[31], [22], [6]) by 

P{y) := {/* G BHnc ,.IM ): y = t r P„h + P* v for some 
v G Bl*. (0). 

Given the above a priori and experimental information, our pri¬ 
mary goal in this paper is to develop an uncertain model that is 
consistent with the ft ^ design framework. This goal will be achieved 
by constructing an identification algorithm and by quantifying a 
corresponding identification error. An identification algorithm is a 
mapping 1 A„: R" -+ ft+ which operates on the data record and 
produces an element A n (y) € ft+. The local and global identification 
errors associated with an algorithm A„ are defined, as usual (cf. [31], 
[6]). by 

e{A n ;p.ALt:y) := sup ||/i - .4„(i/)||, 

/'<- H i/) 

and 

c(A„\P' A/, f) := sup f ( A u i p- A/, r: //) 

respectively. Note that the global error provides a universal bound 
on the modeling error regardless of data records. Additionally, as in 
[ 14]. [ 111, [ 12], [26], it will be used to define convergence property of 
identification algorithms. An Algorithm A„ is said to be convergent 
if it satisfies the properly 

lim ((A„:p* A/. *) = (). 

,1 . H.I ’ 

. —.(J 

We shall be particularly interested in a class of algorithms which 
are termed interpolator algorithms (cf. [31J, [23], [5), [6]). An 
algorithm A u is said to be interpolator if lor any y E J. A„(y) G 
Ply). Algorithms of interpolator class are known to possess many 
desirable properties, among which is its optimality properties in 
reducing the identification error. To precisely state these properties, 
we introduce the concepts of the radius and diameter [311 for the set 
P(y), defined by 

r(P(u)) := inf sup ||//-/»| 
and 

dlP(y))’'= \\h )>\ 

ft pCP ( y ) 

respectively. More specifically, the quantities r(P(//)) and <HPly)) 
are called local radius and diameter, and their global counterparts 
are defined by »■* ;= Mip ^ y r(P(y)) and (P := snp v6i y r/( Ply)), 
respectively. It is well known that the radius rlP(y)) coincides with 
the minimal possible, or optimal identification error |31], [22], [51, 
[6], It is also well known that if A n (y) E Ply), then 

rlPly)) < c{A„\p, A/, r, y) < <t(P(y)) < 2 r(P(y)). (2.2) 

1 In a broader sense, an identification algorithm acts on the input-output 
data pair (u.y). To simplify the notation, wc omit the dependence of ull 
the relevant notions (e.g., V{y), -4„(//), etc.) on the input and the a priori 
information. 
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In this sense, an interpolatory algorithm is said to be oplimal to 
within a factor of two. Finally, let 

•■= Mip ||/i(~)lk = Slip ||/»(I)|U. (2.3) 

/it />er(o) 

\\i'r u 

Then, as in | 6 J. [31], it can be shown that the global radius and 
diameter satisfy the inequalities 

**<r'<rf* = 2«*. (2.4) 

Note that the above quantity .s* (defined with respect to H norm) 
constitutes also u lower bound for the global radius even when it is 
measured in f \ norm. 

III. An Identification Algorithm 


A. Consistency of Data and a priori Information 

In the preceding definition of identification error, we have assumed 
implicitly that the set P{y) is nonempty; otherwise, the identification 
error will become unbounded. Whether this assumption holds is 
concerned with the issue whether the given data record and the 
assumed a priori information are consistent. Precisely, ihe notion of 
the consistency between the data and a priori information is defined 
as follows. 

Definition 3 ./: Given the plant a priori information (p. 3/) 6 
[1, x ) x [0. x ) and the noise a prion information r £ [0, x). A 
data record // £ R" is said to be consistent with the plant and noise 
a prion information (p, .V, 0 if P{y) / 0. 

In other words, the consistency between y and (/#, 3/. O is equiv¬ 
alent to the existence of a model-noise pair (//, < ) £ R7Y X ,/.(3/) x 
B( *.(«■) such that lor this pair the data y £ R" may be generated 
from the experiment F„{h.r). As such, the consistency problem 
amounts to establishing the existence ol an analytic function h £ 
,.( U) whose first n Taylor coefficients satisfy the relation 
y — 1' P„h + F 3 „r for some r £ j)( x (#). This characteristic bears 
a close resemblance to that of the Carath£odory-Fej6r problem (ct. 
[9], 1181) and thus suggests the following necessary and sufficient 
condition. 

Lemma 3.1 ■ Let (p. M ) £ [1, v) x [(). \ ) and f £ [0, x). Then, 
y £ R is consistent with (p.M.t) it and only if there exists a 
r £ B(^ (t) such that 

(l'-nir-V)' < M-’rn -u 1 o.d 


where 77 := di;ig( 1 1 > ■ • • ft" 1 ). 

Proof: Lei /(:) := (\/M)li(,n). Then, f € 77X x (1) it unci 
only if h £ BH^ Note that for any : £ D, h ( p:) — 

E^-n Hence. /(-.) can be expanded as f{z) = h 

with /a - [\/M)p k ln for all k £ Z + . It follows that P v f — 
(1 /M)HP v ln and that // = (’/’„/* + P,,/’ if and only if y = 
MI Ft 1 P„J+Pn /’■ Therefore, P(y) ^ 0 if and only if there exists a 
function / £ 7JH * (1) such that y = MP /?' 1 P„ / + P„ c for some 
r £ The result then follows from utilizing an extended 

Caralh6odory~Fcjdr theorem given in [30, p. 26]. □ 

Based upon Lemma 3.1, we give below two equivalent necessary 
and sufficient conditions that can be readily computed. To simplify 
our presentation, we shall assume, with no loss of generality, that 
tin 0. Under this assumption, V is invertible and its inverse is also 
a lower triangular Toeplitz matrix [5], 

Theorem 3.1: Let (p. AF) £ [l,x) x [0, x) and f £ [0. x ). 
Assume also that i/o ^ 0. Then, y £ R” is consistent with (p, 37, f) 
if and only if 


mill ff(J?r'(V-n)<A/. 


Proof: Since V is invertible, condition (3.1) is equivalent to then 
-Y))[Rl T '(V-D ) 7 < M 2 I. (3.3) 


This, however, is equivalent to that H(IU'~ l (Y — V)) < M. Hence, 
the proof is completed. □ 

Similar to the results given in [ 6 ], [281, Theorem 3.1 shows that 
the consistency problem can be solved as a convex program. This 
is clear by noting the fact that the largest singular value <t(.4) is a 
convex function of .4, and that the matrix IU r ~ A (V - V) is an affine 
function of /’a. k £ Z„ +. As in | 6 ], it is easy to derive necessary or 
sufficient conditions from this result. 

Theorem 3.2' Let (p. AI) £ [l,x) x [(Lx) and f £ [0,x). 
Furthermore, denote 


■M' , nr i v' -(Y-v 
(V - Y) 1 


Then, y £ R" is consistent with (p, AT f) if and only if 


min A(-4(i')) < 0 . (3.4) 

( ( |r) 


Proof: By Lemma 3.1 and a well-known fact concerning the 
Schur complement (see, e.g.. 115]), we assert that (3.1) is equivalent 
to the condition 


ru-r/? '-'r' (V-r 

(V-V)' r 


> 0 . 


The latter condition, however, is equivalent to that 4( v) < 0. Hence, 
the proof is completed. □ 

Minimization problem (3.4) can also be solved as a convex 
program. This follows from the fact that the largest eigenvalue A(,4) 
of a symmetric matrix A is a convex function of .4 (sec, e.g., [ISI), 
and that the matrix 4(r) is an affine function of (’*,/»• £ Z N| +. 
Each of problems (3.2) and (3.4) belongs to the class of constrained 
nnndifferentiable convex programs. Numerically, problem (3.4) is 
moic amenable to implementation, as it possesses the standard form 
for this class of problems, for which the techniques developed in 
e.g., (31, [25] can be directly applied. Note also that in this result, 
the assumption i#« ^ 0 is dropped. 

Finally, as in |5], | 6 ], we comment on the problems of computing 
the minimal a priori information levels that are required for the a 
priori information and data record to be consistent. The utility of 
these minimal information levels lies in that ihey help build up one's 
confidence in the assumed a priori information, particularly when the 
computed levels are significantly lower than those assumed. Toward 
this end, one problem is compute, given p and f 

AP min{3/: P(g) ]. (3.5) 


Clearly, as a by-product of the data consistency problem, 3/* is 
solved as 

3/* = miu fr{Ftl T '(V - D). 

(.) 

Another problem is to compute, given 3/ and p 

C - min | r: P(y) j- 0}. (3.6) 

This quantity may be computed by solving a convex program 
analogous to (3.4) 

P = miu{e; <4(r) < 0 , |i’a|<*. A* £ Z 4 . ,,}. 

Alternatively, it can be solved by computing P =- tuin{^: f(t) < 
\f}\ where /( ):R+ —► R+ is defined by 

/(O := min #r ( 1 (V - V)), 


(3.2) 
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As in |6], it can be readily shown that /(•) is continuous and 
oonincreasing* Hence, f* may be computed easily by using, e.g., 
the bisection method, given the fact that /(f) may be solved for each 
* £ 11+< Finally, it is also of interest to compute, given M and e 

/ := max{/»: P(u) £ (3.7) 

which can be solved by computing p* = max{p: g(p) < M\, where 
g(-): [1,oc) —*■ R+ i» defined by 

q{p) max <t{RV~ x (V - V)). 

Clearly, //(■) is continuous. It is also easy to show that r/(-) is nonde- 
creasing. Indeed, let pi < p2 , and denote i?i := <liag( 1 py • • • p” ~ 1 ) 
and JI 2 := diag(lpu ). Then, for any r £ B(^{f) 

»(R\V~\Y - V)) = - D) 

<S[R,Rt l )»{R 9 r-\Y-V)) 

= <r(BiU-'(Y -V)). 

This implies that g[p\) < g(pi). Since <7( ) is continuous and 
nondecreasing, problem (3.7) may be solved by using the bisection 
method as well. 

B. An Imerpolatory Algorithm 

As an immediate consequence, the solution of the data consistency 
problem yields as a by-product an imerpolatory algorithm. Indeed, 
once the consistency problem is solved, we may construct immedi¬ 
ately an “interpolating” function h £ BH^ ,AM) using the Schur 
algorithm [9] or a procedure of [1] (see also [18]), such that its first 
n Taylor coefficients satisfy the relation y — l'P v h + P n r* with 
v* being the solution of the consistency problem. Clearly, such an 
interpolating function lies in the set P(y ), and it can be constructed 
using the following imerpolatory algorithm. 

Algorithm 3.1' 

• Step I: Solve minimization problem (3.4). Let the minimizer be 
denoted by 


and the minimum be denoted by A\ Accordingly, write V* := 
T( /•(*. )• If A* > 0, stop. The data and a prion 

information are not consistent. Otherwise go to Step 2. 

• Step 2: Compute the matrix H* .*= {\/M)RU (Y - T *). 
If A* < 0, go to Step 3a. Otherwise compute the rank of 
I - H*H* 1 and go to Step 3b. 

• Step 3a: Compute the polynomials 

/>(;):= [0 0 1 ]T(J.• '* 1 )(/- H'tt" f‘ 

nn 


.1 

9(-}~{0 ■ 0 l • • • .pn {)) 

roi 

X (f- H'fT 1 )-' (3.9) 


where ju(-) ■= St o for all € Z+,„, and P„h 4 := 

- Pt,r'). The proposed algorithm .4,, is given by 


/*»(*)(:):* A/ 


(*)"!>(*)<•+(p'-'rt?) 


where |r| < 1 js an arbitrary real constant. 


Step 3b: Let rank (/ - H*H* l ) ~ m, Also, let / be the 
(m + 1) x (in -f 1) identity matrix and 


ff -[r o )H* 


T 

0 


Solve a vector c 1 ~ [ru ■■■<*,„] from the equation (/ - 
HH l )c = 0. The proposed algorithm A n is given by 


-*.,(*)(:) := A/ 


Cop +C’| P 

r» t p w + c, lf - i p u, ~ ] 


• + v 


_ni 

fit - 


+ ‘ ■ ■ + <\) 


The above algorithm, as those developed in [6], consists of 
two parts which solve a consistency problem first and a standard 
Carath£odory-Fej6r problem subsequently. The key to the algorithm 
is clearly Slep 1, which determines whether the measured data and 
the assumed a priori information are consistent. When A* > 0, 
the consistency problem admits no solution and the algorithm stops. 
When A* <0, the data and a priori information are consistent and the 
algorithm proceeds to find an interpolating function / £ BH * (1). 
This function is found in either Step 3a or Step 3b and is then 
transformed to an interpolating function h £ BH ■* P (M) via the 
transformation /?(-) := Mf( m ~/p). When A* =0, the interpolating 
function is unique |18] and is found in Step 3b. The algorithm 
produces a real rational transfer function with an order of m. When 
A* < 0, the interpolating function is nonunique [18) and can be 
parameterized by an analytic function in BH^il). Step 3a gives 
one specific choice of such functions by selecting the free parameter 
as a real constant r\ which yields a real rational transler function with 
an order of n. The construction in Step 3a and Step 3b follows Irom 
a modified procedure based on [1], Note that the algorithm may be 
numerically ill-conditioned when I- II* H*‘ is nearly singular. This, 
however, should not pose a serious problem for a well-conditioned 
algorithm may be obtained by resorting to an alternative procedure 
given in [ L]. 


IV. Bounds on Idijsitihcation Error 

In this section we provide several explicit lower and upper bounds 
for the identification error associated with the interpolator algorithm 
proposed above. These bounds arc derived for the global radius 
and diameter of the set P{y)\ therefore, they are applicable to any 
interpolator algorithms. Our first result is a lower bound upon the 
global radius of P(//). As noted at the close of Section 11, this result 
also serves as a lower bound for the corresponding (i identification 
problem. 

Theorem 4 1 Let ||P„i/|| ^ 0. Then 

r'i «»{.!/. pflj-}. (4.1) 

Eurthermore, if e/||P„ v\\^ < A/, then 

> (W) + ('/ll p »« 

H-UAW/*")(r/||/*„«iU) - v ■ 

Proof: To establish Part i), consider the function 

Clearly, h £ BH^ P (A F). Note also that \\UP„h || * < max*^ M 
(|*U|/|| PnU ||cx )f = Hence h £ P(0). The result then follows by 
using (2.4). Suppose now that (r/||P„</||^ ) < M and consider the 
function 

n ., = M (> 7/P) + (^/llPn»IU)(iAU) 

' I + /p ,l )(f/\\P,n\\^ )(1/Af)’ 

Since (+/\\Pn u|U ) < M*h is analytic in D P . Furthermore, we have 
\h( ;)| < M for any : 6 D fJ . The latter assertion follows from the 
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well-known fact that for any a, ft € D the inequality (see, e.g., [10, 
p. 4]) 


n ~ b <- M + l ft l <■ , 

1 -ha “ 1 + \o\ ■ \b\ ~ ' 


(4.3) 


holds. As a result, h E B7ir^ tfJ (M). Note now that h can be 
expanded in Taylor series as 

,,(i) = phu + w ( 1_ (irp^iir) )r~"' 

It follows that ||t'P„//||rx = niaxAez +>T ,(|?a|/||/nw||tx;)f = t. 
Hence again, h E P(0). To complete the proof, we claim that 


it follows that 


<»er( o) 


= max{||/i||i: P„h = V~ l P„v, 

IM < A//p\ k € Z+, ii g JWnoU)} 


= max 


M 


ax 11|M'/ 5 ,. I'll 1 »'i—I'lj < 


k E Z-f,n, r E 


mo}+££ 

n. o ( >=o ’ ) k-„ r 


(Af//.") + (f/||P„u|| nu ) 
l + (l/p")('/||/ , .,«|U)(l/A/)‘ 


= 'ED"'-i + £ 


M 

,»i 


Indeed, according to (4.3), we have 


ii"'^ /ii-* jr 


(M/p" ) + {f/\\P u »1|mJ 

i + (i//i-)MiftMik)(i7Ai)' 


The upper bound, however, is attained at - = 1. The proof for Part 
ii) is now completed by using (2.4). □ 

The significance of the lower bound (4.1) lies in that it provides 
an irreducible level of identification error in terms of the quantity 
c/\\P„ </|| v , which, as in [14], may be interpreted as a noise-to-signal 
ratio. The lower bound (4.2) provides an estimate tighter than (4.1) 
when {f/\\P„ </|U ) < Note that in the noise free case (r = 0), 
this lower hound reduces to r* > Af/p ", which together with a 
result of [24] yields 


P" 


< r* 


2 M 

< </* < — ■ 
P" 


This inequality and (4.2) suggest that the global identification error 
will be inherently large when M is large and p is close to one. 

Wc now present an upper bound for the global diameter of the 
set P( y). As in |14], 16], this bound is also useful for establishing 
the convergence property of Algorithm 3.1 in Furthermore, it 
shows that the algorithm is convergent in ( i, and when combined 
with the error bound, it leads to an uncertain model consistent 
with (] design as well. We shall assume that i/o ^ 0. Hence, we 
may write I V := T(wn, ■ ■ ■, iv„-i ) = t T ~\ Note that the vector 
w := [ii’o ■ • ■ ir n _i ] ; can be calculated explicitly from V ~ x . 

Theorem 4.2: Assume that u o ^ 0. Let / E Z+ be the largest 
integer k such that 


= f +1 — k)\u'k i+ 

k=Q 


Af 

p i '(p-iy 


By a well-known fact [8], however, ||/i|| (X , < ||A||i. Consequently, 
the upper bound (4.5) may be established by using (2.4), To verify 
the convergence property of interpolator algorithms, we show that 
(i* —► 0 as n —► 0 and c —► 0. Indeed, it follows from the above 
proof that 


\k-A) l «-0 f ) k^n 


M 
f> k 


< 2[ r 


M 


+ 


p l *(p- v 


where the last upper bound converges to zero when /* —► oq. Since 
/* = min]/, n}, and since \ w >\ > M//" 1 " 1 * if follows that 

l* —► nc as n —> -x> and f —► 0. Therefore, d m —► 0 as n —> oc and 
f —• 0. This completes our proof. □ 

In the remainder of this section we present an additional upper 
bound for the global diameter d *. This result may be combined with 
Theorem 4.2 to yield an improved estimate of the global identification 
error in the H ^ (but not in (\) case. Note that if »o ^ Q and f/|vo| > 
Al , then d w can always be bounded by d* < 2 M < 2(r/|«#o|)- This 
follows by examining the quantity h* defined by (2.3), and by noting 
the fact that ** < sup /l€iJW ^ ^ (A/) \\h\\ x = M. Hence, it suffices to 
consider the case f/|uo| < M. The following lemma, given in [6], 
is useful in our derivation. 

Lemma 4.1: Let / : [0,1] x [0,1] —♦ R.^ be defined by 


/(■r..v) 


i±a_ 

I + -rg' 


■ti-i ¥■ (4 - 4) 

Denote further that 1* := min{/,n}. Then 

,r < +1 - a-)|,..,| + (4.5) 

In addition, any interpolatory algorithm is convergent. 

Proof: For any h E P(0), it can be expressed as h = 
hk , where hk satisfies the condition UP n h = P u v for some 
v E BC^ie). It follows that h k “ ^ or & € Z+.«. 

Additionally, by Cauchy’s estimates (see, e.g., 119], p. 137), we have 
|/ u | < M/p k for all k E Z+, since h E Therefore, 


Then, f(.i\ y) is monotonically increasing with respect to both .r an4 
.V- 

Theorem 4.3: Assume that i/o ^ 0 and f/|uo| < M. Then 


d* < 2 


(f/|uo|) -f M(~)/p) 
1 + [f/M\uo\)(-)/p) 


(4.6) 


where 


: ="f;miu{ru.l}+ 1 


P"~ 2 (P~ 1) 

and for k E Z+.n-i.a*, can be computed iteratively as 


<u 


, = l K| + ( f /|«o|)£L t » t -.£U K| 

M 2 - (f/l«ol) 2 



m 
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Proof Consider any h € P(0) Then, h e D'H nap (M) and 
It satisfies the condition In — 5Z*=o an-.'.- * € Z+t» for some 
t € Let / be defined "by /(.) = (1/Jl t)h(p„) It 

follows that f e Bh n c(l) and /* = (1 /U)p k ln tor k t Z 4 „ 
Since /* interpolates //*,* € Z+ „ in P (M), f interpolates 

€ Z+ „ in JBWdt(l) Furthermore, since f/|uo| < Jl/, we 
have |/ 0 | = (1/A/)|M = (l/A/)|«o«o| = (l/V)M/|«o| < 
f/(Jl/|wo|) < 1 Therefore, by Schur’s algorithm (see, eg, [9]), 
/ may be expressed as 

f( \ — /q + ) 

71 l + /o~tf(0 


where g G satisfies the condition 


/m i + /o 53^-1 4a- /» 

1 - l/ol 2 


A € Z-, 


By inequality (41) it follows that tor any c G D |/U)| < 
(|/o| + \«(i(.)\)/(l 4- |/o-flf(-)|) This implies that lor any ~ G D,, 


V Conclusion 

We have presented an identification algorithm foi a worst-case 7f ^ 
identification problem The main feature about this algorithm is that it 
is an interpolator algorithm constructed based on time domain data 
A direct consequence of this feature is that the algorithm is optimal to 
within a factor of two This property is not shared by the previously 
available algorithms for H x identification problems Note also that 
the algorithm is interpolator for the corresponding (i identification 
problem as well 

A main disadvantage of this algorithm is that it requires solving 
a convex optimization problem with a size equal to the number of 
data This may be computationally demanding By trading off the 
optimality properties, it is possible to lessen this computational com 
plexity How to achieve a judicious trade-off between the optimality 
and complexity is a difficult issue and appears to be a worthy research 
topic Future problems also exist in the aspects of, e g derivation 
of lower order models, identification tor multivariable systems and 
issues related to input design and sample complexity [7] [29J [ 171 


|M-)|= U\ft-/P)\< M 


j/nj_+_l/H Hzlfl 1 

l + l/ol \-/p\ |?(»/ p)\ 


Rfferencfs 


Therefore, by Lemma 4 1, we have 


** = s np ||fi(-)|k 
fer(o) 

(«-/| l i/</|)|)+ (l/p)Mll),g« 1|</(~) || x- 
1 + (r/|Muo|)(l/p)hup jei | ||*7(-)||> 


where g(z) = r/( /p) € BH-* ,,(1) and SI = {ry e D p 
(1) 0k = Qk/p 1, k e Z+ , i i € D /> (0} As a result, for any 
g € we have 


k 1 / A x A + l + /) 53 L 1 f 

1 -rrjftF- 

_ p\fh k¥ \ + h n £*_, 9k h 

SO - |M* 


It is clear that | c/o | < f»o Suppose thai |</, | £ n for / 6 Z, k 
Then, we have 

, ^'E^<>.i + ('/ki>e,‘ ,k .ie, >,i 

ws '-- 

^ l + WKDEL.o*- |n,| 

= 'U 


Therefore, we conclude that k | < ru for all A G Z 4 , _ j It follows 
as m the proof of Theorem 4 2 that 


-■up||'/(-)lk 

yen 


< '^uunfksl 4} 

A—ft ' * ' 




< - 


Hence again, by Lemma 4 1, we conclude that 

* < If/Ml+ j£h/£) 

S - 1 + {t/\\Uu>\)(-)/f}) 

The proof may now be completed by applying (2 4) □ 

Note that the upper bound in (4 6) also indicates that any intcrpola 
tory algorithm is convergent Indeed, a technique similar to the proof 
of Theorem 4 2 may be employed to show that approaches to zero 
when n -♦ do and f ~+ 0 This upper bound is derived by combining 
the Schur’s iterative algorithm |9] and the technique used in the proof 
of Theorem 4 2, which essentially involves approximating a function 
h G HO) via a first order rational function 
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Pole Assignment for Linear Periodic 
Systems by Memoryless Output Feedback 

Dirk Acyels and Jacques L. Willems 


Abstract —We consider linear periodic discrete-time systems. We are 
interested In the problem of placing the poles of the rnonodromy map by 
means of periodic output feedback of the same or multiple period. It is 
well known that, in general, the poles of time-invariant systems cannot he 
assigned by constant output feedback. This is in contrast with what can 
be obtained in the context of lime-\ariant systems. The main contribution 
of this paper is that periodic output feedback suffices for pole placement 
of periodic systems. 

I. Stabilization and Poll Assignment ior Linear Systems 

The purpose of closed-loop control is to monitor the behavior of 
a dynamical system by means of a well-chosen feedback to achieve 
a prescribed goal. The design of appropriate feedback laws to obtain 
stabiJily or a preassigned location of the poles, as well us the study 
of the limitations imposed by constraints on the possible feedback 
strategies, represent major research areas in control theory. For 

Manuscript received October 13, 1994. This work was supported in pari by 
EC-Sciencc Project SCl-0433-C(A). 

The authors arc with the Faculty of Engineering (Department of Systems 
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linear lime-invariant systems, the closed-loop poles can be assigned 
arbitrarily 1 by state feedback, provided the system is controllable. On 
the other hand, controllability and observability imply the existence 
of a state rcconstructor, such that the poles of the closed-loop system 
can be assigned arbitrarily by dynamic output feedback. 

It is well known, however, that in general pole assignment is 
not possible by means of linear time-invariant memoryless out¬ 
put feedback. In some recent papers [2|, [3] the authors have 
shown that—under some technical conditions—for controllable and 
observable linear time-invariant discrete-time systems, exact pole 
assignment is possible hy means of memoryless linear periodic output 
feedback. It seems natural lo examine the possibilities of periodic 
feedback control when the original system itself is periodic. A first 
result, concerned with low-order systems, has appeared in [4]. In the 
prcsenl paper we extend these results to the general case of discrete- 
time systems. To our knowledge the problem has only received 
limited attention in the literature except for a recent paper lhal 
addresses the same question |7]. Our results are essentially different, 
e.g., there is no restriction on the dimension of the system with respect 
to the number of inputs and outputs. 

II. Memory less Periodic Output Feedback 
Control for Periodic l inear Syst ems 

A. Problem Statement 

Consider a linear periodic discrete-lime system with scalar input 
and scalar output 

•r » f i - A, j\ -I- p , = rvr, (I) 

where £ /?", a, £ H, y, £ 7?. / £ 7. The matrices A,, />,, r, 
have appropriate dimensions. The system is assumed to be periodic 
with period 7 

A / + , =■ A,. hi fi rj + , = r, 

The fundamental question that arises is lo what extent the pole 
assignment problem can be solved by iniroducing time-varying (but 
memory less) ouipul feedback. In particular we assume lhal the time 
dependent feedback gain is periodic with the same period as the 
system. Consider ihe system (I) with periodic feedback 

</,/+, = I 4 , 

with /■ £ 7 and t £ (0.1. ■ • •. T - 1 }. With this periodic feedback 
the closed-loop system is also a periodic linear system of period T, 
This system can be considered as a time-invariant system over a time 
interval equal lo the period T 

-hi t = (A / _i + kj-ihi i*7 — i) 

X (A / -2 -f kf .ihi -jej 2 ) - ■ • (An + k\)boc o ).r, r. 

(2) 

The eigenvalues of the system matrix 

4,i = (A/ - i + kj _ i bi _iO . \) 

X (A / -j +■ A7 -jbi 2t‘j —2 ) ■ ( An 4 - frofttif'n ) 

of (2)—from now on also called the poles of the periodic sys¬ 
tem—determine the dynamics of the periodic system. We recall here 
that the eigenvalues of A,i are invariant under cyclic permutation 

1 Obviously the complex poles must occur in complex conjugate pairs. 
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of its defining factors: this implies that the poles are identical for 
every system matrix that describes the closed-loop system considered 
over a period T. The original problem of pole assignment can then 
be restated as selecting feedback gains kr~u"‘ *h such that the 
eigenvalues of the closed-loop system matrix A,\ are the roots of a 
given polynomial’ 

r*(z) ss Z 1 * + O n -l c" ■ + rt I C 4* 0[). (3) 

8. Reformulation of the Problem 
After introducing the notation 

II = (.4 r -2 4- h' ~ 2 ftr- an - 2 )•■ • (Ao + kaboco ) 

Aeq = A/ 1 - ill. C<>q = rr-in, = bj — 1 

system ( 2 ) is represented as 

£i+l — 4 b^v , , 1 , = r‘,.i|Cr (4) 

with £, = J-.r and v, = A 7 - 1 - 1 . The question is whether the 
eigenvalues of 

-4,1 = A**q "f k / (5) 

can be arbitrarily assigned by the feedback gains. This question is 
treated in the Appendix. The analysis leads to the result stated and 
discussed in the next sections. 

C. Main Result 

The analysis of the appendixes yields the pole assignment property 
for systems with period T equal to n +1. Only this particular case is 
considered in the present section. The general case is referred to the 
discussion in Section II-D. For the sake of the presentation a number 
of assumptions are formulated before the theorem statement is given. 
Assumption 1: The matrices A, for i = 0, * • •. ?», are nonsingular. 
Assumption 2: The it vectors 

<'1 

cuAi 

‘r» -1 A„ .2 A n « | ' • * Ai 

An -1 A,,-? ■ ■ * Ai 

are linearly independent. 

Assumption 3. The n vectors 

f, « A ( , 1 

n 

r, *A\ 

r n-\ A, r - 2 An - '1 ■ ■' A | 

are linearly independent. 

Assumption 4: 

rank[K tl .4,A, , '~ l &„ t| ] = n (6) 

for the values of the gains k, , for / = 1 1, given by (18). 

and for ko equal to k[{ = -l/r () A^ l b 0 . 

Assumption 5: 

rank\ba\ A*.qfvq ■■■ A t . (J h f . () Aq bn ] = n (7) 

for the values of the gaias k ,, for / = 1 , • • ■, n - 1 , given by (18), 
and for kn equal to kfi = — 1 /r 0 A t 7 1 
Assumption 6: The parameters of system (1) satisfy 

r ( j4 n ’fto ^ 0 . 

The denominators of the gains k{\ for i = 1 ,■■■,/*- 1 , defined by 
( 18 ), are nonzero 

1 -f A 7 1 b, ^0 for / = - 1 . 


Theorem 1: Consider the periodic single input single output 
system (1) with period n + 1. Then under Assumptions 1M>), the 
poles of the system can arbitrarily be assigned except at the origin 
by a periodic output feedback strategy with period equal to n + 1. 

0. Discussion 

• A noteworthy aspect of Theorem 1 is that the feedback law has 
the same period n + 1 as the system itself. Hence the system 
structure is not altered by the feedback, in contrast with the result 
for time-invariant systems [3], where the time-invariant system 
is transformed to a periodic system by the feedback. Systems 
with period T not equal to it + 1 can readily be dealt with: 

—If the period T is larger than n + 1, then the result can 
still be used. Indeed if, e.g.. at rT + h r € Z , the input 
is set equal to zero, then 

.J\ / +2 = A 1 An.r ,r 4 - A\h [} u, \. 

This reduces the period by one, considering rJ\ rT + 
2, rT -f X ■ * •. as sampling instants, A 1 A 0 as the new 
plant matrix, and Ajfin as the new input matrix at rT. 
Repeating this procedure (if necessary), one finally obtains 
a system of period // + 1 to which Theorem 2 can be 
applied. This periodic feedback actually n + 1 corresponds 
to a feedback of period T in the original system. 

Another approach would be to consider more variables k, 
in the analysis of the appendixes, such that some of the 
variables can be chosen (almost) arbitrarily. 

—If the period T is less than /* 4- 1, then the system can 
also be seen as a periodic system with period 27. 37, or 
in general kT, with k e Z. In this way a period equal to 
n 4- 1 or larger can easily be obtained, and the problem 
is reduced to either the case considered above, or the case 
considered in Theorem 1 itself. 

• We briefly discuss the assumptions of the theorem. Assumption 
1 on the nonsingularity of the system matrices is necessary 
for the analysis in the Appendix. This assumption, if not true, 
may be realized by an a priori output feedback; this feedback 
may be combined with the periodic output feedback obtained 
from the analysis of ihe Appendix resulting in a periodic 
output feedback that solves the pole assignment problem. If, 
however, Assumption 1 cannot be realized in this way, system 
(2) has a zero eigenvalue for all output feedback coefficients 
and pole assignment is certainly not possible by linear periodic 
output feedback. Assumptions 2-3 correspond to observability 
of the system. Assumptions 4-6 are weak conditions that are a 
consequence of the technical approach. It is also assumed that 
the desired pole pattern does not contain one or more closed- 
loop poles at the origin. This restriction is again a consequence 
of the technique and can most probably be waived. Some of 
the restrictions implied by the assumptions of Theorem I can 
be avoided by a cyclic permutation of the time indexes 0. I, 
2,- ■ -, T - 1. Also it can readily be seen that Assumptions 1-6 
reduce to the assumptions obtained in [3] in case the original 
system is lime-invariant. 

• The result of Theorem 1 can be generalized to MIMO systems; 
see [3] for details. 

• It seems important to emphasize that the result of Theorem 1 
is more than a statement on the genericity of pole assignability 
by periodic memoryless output feedback. The theorem provides 
explicit sufficient conditions on the parameters under which 
the pole assignability problem is solvable. These conditions are 
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clearly generically satisfied since they require that the parameters 
do not satisfy a set of algebraic equations, 

• The problem of pole assignment is basically one of solving a set 
of nonlinear algebraic equations. These equations are explicitly 
derived in Appendix B. We have given a number of verifiable 
conditions under which this problem is solvable. 

• An important feature of the result of Theorem I is that no 
minimum number of inputs or outputs is imposed, distinguishing 
it from the result of Yan and Bitmead |7] who relate the number 
of inputs and outputs to the order of the system- Notice also 
that we are considering exact pole assignment, not almost pole 
assignment. 

E. Illustrative Example 

As an example, consider a periodic system of second order with 
period 3. The system data are 


= n == r > — [ — 1 1 

We want to find output feedback gams such that for the system 
considered over one period the closed-loop poles are — 0.3 ± 0.5 j 
It is directly verified that the assumptions of the main theorem arc 
satisfied. By the techniques explained m the appendixes the gains 
An and A 1 , must be determined from (13) and h, follows from 
Ackermann’s formula. For this specific example (13) is 

‘ 2(1 \ — 1)(A'o — 3)(2A*i An H- JAo ■+ 10) -F (A'i A'n -I- An — A - i -F 1) = (). 

The nominal choice for Ao is here An — 3. Following the perturbation 
analysis we choose An -- 2, and we obtain A | = 1.0013. From 
Ackcrmann’s lormula we obtain A-,. - -3. 1GG5 T his leads to 

A, i — [ l j ■+ kihj< j][A i + A'i b | r \][An -F A^jftor n] 

3.001 o G.09I5 
" [-2.1085 -1.0015 

with the required eigenvalues. 

III. Conclusion 

After a discussion of the pole assignment problem for time- 
invariant discrete-time systems by memoryless output feedback we 
have posed the same problem for periodic systems. The main part 
of the paper is concerned with establishing the following result: it is 
possible in general to assign the poles of a linear periodic discrete- 
time system by linear periodic output feedback of the same or multiple 
period. 

Appendix A 

Van DbR WOUDE'S Lemma 

In Section 11-B it was shown that the problem is reduced to finding 
the feedback gains such that the eigenvalues of (5) arc the roots of 
a given characteristic polynomial (3). A result derived by Van der 
Woude [6J is well suited to analyze this condition. It is formulated 
below applied to the problem formulated in Section 11-B. 

As a general rule we will indicate in the course of the technical 
developments where the assumptions of the main theorem come in. 
This is done by explicitly making reference to them. 

Lemma 1: If the pair A, <r h, q is controllable (Assumption 4), 
there exists an output feedback 


for system (4) such that the polynomial o(given by (3)* is the 
characteristic polynomial of (5), if and only if 

(\{A^,)kt r(c,',) Cim[fct., A iU h ,,, (9) 

The problem is hence to ensure, if possible, that (9) can be satisfied 
by a suitable choice of real A* / - j, • • ■ * An. Notice that both sides of (9) 
depend on Ary. j, ■ ■ *. An through A e >,, and r,.,,. Once A* /. vs ■ ■ ■«A‘<> 
have been computed, it must be checked that the pair A vtv h^ is 
controllable. The control, given by (8), must then be determined 
such that 

n( .) “ rh‘t[:7 - (A,.,, - A■/ _i K %l c vli )]. 

The existence of the control follows from Lemma 1. One approach 
for determining Jn-i is to use Ackermann's formula [I], [51. The 
original problem is then reduced to ensure that the feedback constants 
can be chosen such that (9) holds and such that the controllability 
matrix has full rank 

rank[/> (l , (10) 

The following appendixes are dedicated to an in depth analysis of 
relations (9) and (10). The main line of the analysis is as follows. 
First (9) is reformulated as an algebraic equation in A*/ ■ ■ ■. A’o. 
This equation is explicitly solved in Appendix B for a special choice 
of A*o which is denoted by Aft; the obtained solutions are denoted 
as A '/ 2 , ■ ■ ■, A-','. These solutions are not acceptable, however, since 
the choice of Ao violates one of the conditions imposed. Therefore a 
perturbation analysis is perlormed in Appendix C to show that for 
close to A 1 ft the equation can also he solved and the solutions can be 
accepted. The analysis requires some weak restrictions on the system 
data, which are introduced as the discussion proceeds. 

Appendix B 

Tut Determinaiion of the Nominal Feedback Gains 

In the analysis of the present and the next appendix, the period T is 
set equal to n + 1. This is in accordance with the results obtained for 
the time-invariant case |3|. In the sequel we assume that all matrices 
A, are nonsingular (Assumption 1). 

'I’o solve (9), the equation is transformed into a more tractable 
expression under the additional assumption that the matrices A, + 
A, b,c t , lor i 0. •■•./! — 1, are nonsingular. Then 

kw(r,,) = kcr(r„n) = II 'kcr(r„) (11) 

and (9) is equivalent with 

o(A (l ,)Il -1 kcr (r„) C iin[fi„ A u Ub n {A„ \\) n ^b t , ]. 

(12) 

Of course it must he checked a posteriori that the nonsingularity 
conditions arc satisfied for the obtained solutions. 

Assume that the controllability condition (10) is satisfied and let 
r + be a row vector (dependent on A,) perpendicular to the n — 1 
independent vectors 

Lei rj , * ^, ■ - -, be a basis of column vectors spanning the null 
space of r„ (i.e., perpendicular to r „). Then (12) is equivalent with 
the set of n — 1 equations in the n variables Ao, A*i, ■, k H -1 

r + n(,-l,. 1| )irV — 0 

for i — 1, - • ■. n - 1. Explicitly these equations are 

i’ ^ (Aft,, 1 A„ + ■ ■ * n j A n )c+ + no(Ao + ) 1 

(A a „ i -F A’u _ i b f , | ff, _ j) (' t = (). 


(8) 


(13) 
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The Woodbury formula 15] yields 

(.4+ M<r 1 

= (J + fer-A -1 *) -1 [(1 -I- -AvTOvT 1 ]. 

Then (13) is. equivalent with 
(1 4 A*oruj4i)T , &o)i’ + 

x (.4",, *A„ 4- n„_i A,", ^A,, 4- • ■ ■ 4- A„ )r* 

4* oor"^[(l 4 Ao'oA 0 l bu)A (J * — AoA 0 1 buto A n l ] 

X (A i 4' k] h \n ) 1 • ■ ■ (.4,,_ j 4-A‘„, ift„_ ir„ . i) 1 r l + = 0 

(14) 

for i — 1. •' % u — 1. Recall that the desired gains A\ must be such that 
the matrices .4, -4 A*,b,r t , with r = 0, - ■ ■, u — 1, are nonsingular. Noie 
that (14) consists of n - 1 equations with n variables A*,: hence in 
general one of the gains may be chosen and the others computed from 
the equations. We assume that ro A„" 1 b ( , does not vanish (Assumption 
6). Although we have required that the matrices 4, 4- A,b,c, arc 
nonsingular, we first solve (14) under the assumption that Ao is equal 
to -*l/<o4 ( 7 , f>u* denoted as A’d. Appendix C is devoted to solving 
(14) for Ao different from that value. 

The gains A\ satisfying (14) for the particular choice of Ao are 
called the nominal solutions; they are denoted by A’!'. It is assumed 
that oo is nonzero. This implies that the desired closed-loop system 
has no poles at the origin. Furthermore we assume that r + A 0 1 bn ^ (L 
This condition holds due to Assumptions 4 and 5. Then (14) yields 


Appendix C 

Perturbation Analysis 

In Appendix B we obtained a set of expressions (18) determining 
the constants A !' (i = 1 ,»■■,«) satisfying relation (12) for a choice 
of Ao equal to A-,". This value cannot be accepted as was explained in 
Appendix B. Therefore, relation (12) is reconsidered for Ao =. A^’-ff, 
with f small. It is shown below that for this choice of Ao there exist 
solutions A’ f (? — 1, ■ • •, n - 1) of (12) or equivalently (14). 

The application of the Woodbury formula to all matrices (.4, 4 
A \b,c ,)" 1 in (14) yields 

(1 4- A’ornAo 1 bn )(1 4- A-j <o .4 | 1 b \) 

• * ■ (1 4 A*,, _ i r„ - j A„!, b„ _ i) 
x (A'^ 1 .4,i 4- n „ -1.4;; 1 A „ 4- * ■ ■ + o 1 -4,1 )rj 
4- on/ [(1 4- A*oro-4 0 1 b fl M () 1 — Ao 4 0 1 bo^o-4 0 *] ■ * ■ 
x [(1 4 A 1 ,, _ 1 r„ iA„L\b n i)A„_!| 

~ Ao_ 1 4 i ; , J ft l( - 1 r H -,A ( 7i l ]r ( f = 0 (19) 

for / — 1, • * ■, n - 1. The vectors < ^ constitute a basis of ker( ^ „). 
We take the following particular choice. Consider the set of vectors 

ff, = r. .4 1 4, ’ • ■ 4„_, 

for / = I. ■■■,/». These vectors are independent because of the 
observability assumption introduced in Appendix B. Notice that 


for / = 1. — 1, and 


f‘oA n 1 (.4] 4 A-|bi< 1 ) 1 (.4,, ! 4-A'„_|ft (l -ir„_i) 

-0. (15) 

This is equivalent with 

r o -47 1 4 < 0 ( A„- 1 4- Ao -jft„~i* 0 - 1 )* ■ ■ (A| 4 k\h\r { ) (16) 

where the s*ymbol 4 stands for proportionality. 

Although the above set of equations for the gains A j. ■ • . Ao _ 1 is 
nonlinear, it is shown below that it can brought in a triangular form 
and explicitly solved. Assume that the vectors 

<0 < n • ill - I .4/1 -2 • ' ■ A I 


are linearly independent (Assumption 2). This is actually an observ¬ 
ability condition, requiring that the state can be reconstructed from u 
successive output measurements. Then r„ 4 0 1 can be expressed as 

0 A () 1 = ci 11 j 4 nj i*2 4* * ■ • + u,, i' T ,. (17) 

The constant a„ is assumed to be nonzero (this is Assumption 3); 
it determines the proportionality factor in (lb). The solutions A*, lor 
i = 1, • ■ ■. n - 1 can readily be calculated from (16); the solution A\ 
is expressed as a function of A7+,, ■ •, A-;; < 

n t 

ftj,r n (A, t -1 + A*J|_ , b f ,_ j o„ _])■•■ (. 4 ^ 1 4 A'I ' 4 , b,-f 1 r,+i )b , 

where the denominators are assumed to be nonzero. More explicitly 
for i — 1 1 




The vectors c* arc chosen such that < * is orthogonal to all n for 
/ / 1. Each vector is unambiguously defined up to a scalar 
In an obvious notation each equalion in (19) is wntten as 

(An. • ■ ■. Ao 1 ) 4 no //1 (An. ■ ■ ■. A,, 1 ) = 0 (20) 


lot / - 1. ■ ■ ■. n - J. Let A " denote the vector of the nominal gains 
An. ■ ■. Ao_i. Below it is shown that the Jacobian matrix 

4 no//,) 

Ok, uu 

with /. j = 1, — 1, evaluated at A" is nonsingulai. This 

establishes by the implicit function theorem that there exists a 
solution to (19) also for Ao Ao 4 r with f sufficiently small. The 
corresponding solutions are smoothly dependent on < and equal to 
A7 lor f equal to zero. Assume (Assumption 6) 

A-JV, .4* ] h, -fc — 1 for / = 1. • • ■, n — 1. (22) 


Then the solutions A\ also satisfy 1 4 A‘,r, ,4 ’ ^ 0 for small *. 

We now prove the nonsingularity of the Jacobian matrix. First 
notice that evaluated at k*\ we have 


i){A\ 4 ri 0 //,) 

6kT 



(23) 


for /. j = 1, * ■ •, 11 — 1, since each <I\ contains the factor 1 4 
A’ocoAq which vanishes at A". For Ao — A*d the lunclion ij, can 
be written as follows 

~ f 

O) Ad 1 bf) 


with 

» i-oA7 1 [(l 4 A-jrj A\ ’b|).47 1 - A-, A7' b, r, l ] 
x [(l 4 Ao . \ f'„~ i A n L l h t) _ [)A„i, 

A'/. - 1 A„ _ | bn - 1 (‘n — 1 A 1t « I ] 1 


<•„ .4., - I * ■ ■ + • ■ ■ + (1,4-1 


(18) 
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From (15) ii follows that /, vanishes at k°. Therefore 


9g, _ i' + i4p 'ftp df, 
Okj rtl ’ ro-Vho 9k 


(24) 


Notice that with the particular choice of the vectors cf the function 
/i has the following fonn 


/j = (l + A’2^2-4. 2 ] l>2) ■ ■ * (l 4- ) 

x roA 0 1 [/tj 1 (1 4* kiC] A, l b \) 

- fr.,1; ViV] W... 

With the particular choice of rf and invoking (17) one obtains 
foAu A] ,,, A JJ _ J r/ = uic\A\ 1 A 2 j ■ ■ ■ 


Hence 

f\ =■ (l -1- fcof-2 A 2 j ^ 2 ) ■ • ■ (l + k v -ic„-] A^ljb,, ..j) 

x [a 1 (l + A*i n A, 1 b \) — hi ct) A n J A] 1 fti] 
x r, A, 1 ■ ■ ■ A ” 1 ■ 


With 


One immediately verifies that 


It ;"'" 1 - 0 


for j > 2. In general it may be verified that 


Of, 

dkj 


tn = o 


for j > i. 

As for the second diagonal element of the Jacobian matrix one 
obtains 


= -(1 + KnATXm + 

■ ■ • (l 4- - 1 ) 

x r-aA.J 1 ■ • ■ A T 7i,r‘J(fl*fN 4- n^Ai 
+ • ■ • + ti 71 f "h A,i_i A T |-2 ■ ■ ■ A3 )b ’2 ■ 


co 4 (J 1 = n 1 f'i 4- Ai 4- ■ ■ ■ 4- Oji c ,1 A„ -1 ■ ■ • A] (25) 
one obtains 

f\ = (1 4- A’jroA-j ’/>■.))■■■ (l 4- A’„ ic„ 1 A„ J j /#„— 1 ) 
x n A| 1 ■ ■ ■ A (l i, i‘l [a 1 - A*i (a2<‘2 4- ujc.i A^ 

4 ■ ■ A Am 1 An —2 ■ ’ ■ A 2 )b\], 

From the explicit expressions (18) of k] J we conclude that all elements 
(except for the diagonal element) of the first row of the Jacobian 
matrix (21) vanish when they are evaluated at k° 

^-(F ) = n 

Ok, 

for j > 1. 

Consider the first diagonal element: one obtains 

= -( 14 - kljr-j A2 1 h ) • • ■ (1 4 - k][ -1 r*»i * 1 A„ 1 , b„ _ 1) 

X r 1 A^ J ■ ■ A_!_ j r / (r 2 4- a a n 1 A 2 
4 ■ ■ ■ 4* tl n ('11 A„- 1 An -2 ' ' A '2 )b\ . 


This expression is different from zero since (22) holds and the 
nominal A\ J exist (i.e., the denominators are not equal to zero). 

Similar conclusions are obtained for the other diagonal elements 
of the Jacobian matrix. Therefore the Jacobian matrix, evaluated at 
A", is nonsingular if 

• (Assumption 6) the nominal gains A*!', for / = 1, - 1, 

exist and satisfy (22), 

• c + Ao % ^ 0, cciAo^o / 0 (Assumption 6) and a 0 ^ 0. 
Note that i ,+ Aq J bn ^ 0 is equivalent to (Assumption 5) 

rauk[h. q Ai l{ K tl • ■ • A"~' 2 K H A^b 0 ] = u (26) 
if the controllability condition (10) is satisfied at k°. 
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This expression is different from zero since (22) holds and the 
nominal A-'/ exist (i.e., the denominators are not equal to zero). 

Consider now the second row of the Jacobian matrix (21): the 
expression of fi is rewritten, using the definition of rj" 

/j = (14- A'-ir.i 4) -■■(14- A’n - j c„_j A ri _* ] h n _ j) 

x CdA 0 1 [(1 4- A'i r\ A] 1 b\ )A t 1 — A’i A ( l b\ t“i A, *] 
x [(1 + A ■araA^&ajAi J - hA^ ^ AJ 1 ] 
x A-, 1 - ■ • A M A a A,,.!.| rj . 

Invoking (17) and the definition of rj, one obtains 

fi = (l 4* A 1 1 r-jA ;l 1 bn) ■ • * (l 4- A-,i-ir„-i A m !i) 

X [(l 4- A’i r*i A, 4- A > 2 ^ 2 A 2 l &2)u‘2 

— (14-A-lCjA] 1 b )) A’2roA 0 1 Aj 1 A 2 J ^2 
4- A* i k‘2 Co Aq 1 Ai 1 bic\A\ 1 A-J 1 
X A>2 A ;j • ■ • A (JI _tA ?1 _ ict . 
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Fault Detection and Isolation for Unstable Linear Systems 
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Abstract —fki the design of a residual generator for an unstable plant, 
(he unstable modes are made unobservable from the residual vector. We 
point out that this cannot be achieved in practice due to the discrepancy 
between the estimated model and the “true” plant model. Hence, we 
deduce that It Is not possible to design a residual generator for an 
unstable plant, unless It Is stabilized by an adequate controller. As a by¬ 
product, we provide expressions which can be used to quantify the effect 
of modeling uncertainties on the performance of a residual generator The 
factorization approach to control theory is used for the developments. 

I Introduction 

There arc many approaches to perform the synthesis of a residual 
generator for automatic fault detection on linear time invariant 
systems the geometric approach [1]-|2| the factorization approach 
|3]—151, the parity space approach [6H8] and the approach based on 
eigensmicture assignment |9J [10] The stability of the plant never 
appears as a necessary condition for the design oi a residual generator 
in these works We show here that this is due to the assumption that 
the estimated plant model exactly coincides with the “true'* plant 
model By introducing a mismatch between both models we prove 
that it is not possible to design a residual generator for unstable plants 
when they are not stabilized by an adequate controller Beside its 
theoretical interest, this remark might be useful when one considers 
unstable processes running during a finite time period 01 processes 
containing an integrating action 

This note is organized as follows In the second part the so called 
extension of the fundamental problem ol residual generation (EFPRG) 
[1] is stated In Section III its solution using the factorization ap 
proach is reviewed, and the effect of the modeling error is exhibited 
In the fourth section, the EFPRG is solved for an unstable plant 
Controlled by a stabilizing controller, and the influence of modeling 
uncertainties is analyzed 

II The Extension of the Fundamental 
PROBLFM OF RESIDUAl GFNFRATION 

For the sake of simplicity and to clearly show the influence of 
the initial state ol the system we first state the problem in the time 
domain 

We consider the class of systems described by 

i(f)- 4i(f) + i3u(fH Fftt) (1) 

y(t)^-Ci(t) + Du(t) + rf(t) (2) 

where MO 6 U n is the state vector, u{t) 6 J? f is the actuator 
command vector, //(/) 6 /?' is the measurement vector and f(t) 6 
/?' is a vector made of arbitrary functions of time referred to as the 
failure modes 

Our aim is to design a linear time invariant (LTI) system with 
inputs u(t) and y{t), and with q outputs, namely the q components 
of a vector r(t ), called the residual vector so that the map from /(f) 
to r(t) fulfills specific requirements More precisely a nonzero ?th 
component of f{t) / ( (f), must induce a nonzero ah component of 
r(f), i it), and it cannot influence the other components of r{t) The 
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most general form for such a LTI system is 

r, (O = 4, I, (f) + D u(t) + r y(t) 
r(/) = Ci (/) + D,u(t) + F,y(1) 
Combining OH4L we obtain 


■'(f) ■ 


"4 O' 

' •«) ' 

l 

D 

,1,(0. 


F,C A, 


T 

D -{-ED 


E 

r f 


fit) 


mo = [r r t ] 

+ F J7(0 


MO 

* (O 


+ (F D + D )i/(0 


O) 

(4) 


0) 

( 6 ) 


The objectives can now be stated as follows 
01) In the absence of failure / (O decays asymptotically to zero 
/ = I 2 q 

02) In the presence of the /th failure mode (/ (/) ^ 0) r ,(t) 
decays asymptotically to zero / = 1 2 /-I / + 1 q 

and i (0 depends on f (f) in a sense made precise below in 
R2) This should be verihcd for / - 1 2 q 


This yields the following requirements on the system described by 
(5) and (6) 

Rl) All the observable modes of the pan 
4 O' 

1 ( I 


(if 


C C 


asymptotically 


stable and the map (rom u(t) to / (O is equal to zero 
R2) The map from /(f) to MM corresponds lo a diagonal transler 
matrix with nonzero diagonal elements 


Rtmaik 1 


—Although the failure signals /(f) / - 1 q aie arbitrary wc 
do not consider the situation where the system initial condition 
and the signal f (M # 0 arc such that /(f) 0 due to 

a transmission zero The probability of the occurrence of this 
phenomenon is zero for all practical purpose 
The observability condition in Rl) guarantees that the effecl 
on /(f) of a nonzcio initial state of the process oi the filtu 
asymptotically vanishes 

—The FFPRG is staled in a moie geneial fiamewoik in 111 Indeed 
a failure can be modeled by seveial nonzero components in 
/(f) In most practical situations however a single nonzero 
entry suffices to model a failure Hence wc only consider ihis 
situation here 

—The diagonal fault response (i e hypothesis R2)) is a paiOcular 
case Yet any other kind of response can be derived from it by 
an additional transformation [8] 

It is now straightforward to lianslatc the problem in the frequency 
domain Taking into account the transient due to a nonzero initial 
state i° the Laplace transform of (I) and (2) yields 

,/(s) = G( s)i/( s) + 6 / (s)/(*) + Go( s) i° (7) 

where (r(s) = f (s I- 4) ‘if-t-D G/(s) = C(s/ — L) 1 1 4■ / 
and 6n(s) = C (s/ - 4) 1 

Now as only the observable pari of the filter is useful wc assume 
without loss of generality that (C 4 ) is observable Hence lo fulfil 
Rl), 4 must be a Hurwitz matrix Thus we can take the Laplace 
transform of (T)-(4) and neglect the transient due to the initial state 
of the filter, as it decays asymptotically to zero This yields 


M s) = + A(v)«(v) (8) 
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where K (s) = B,+D,., L(«) = C,.(*/-4r) _ 1 E, + 

F r and £(*), A'(*) € /?/?:». Here, and in the sequel, /?//« denotes 
the set of stable proper rational matrices, 

Substituting y(s) by (7) in ( 8 ) yields 

r(») ± (Hs)G(s)+K(s))n{«) + L(s)G,(.«)f(s) 

+ I(^' )G{) (s ).r°. (9) 

From (9), we deduce that Rl) and R2) arc equivalent to the following. 
Rl’) L{x)G{fi) 4- K(m) = 0 L(a)./v'(a) and L(s)Gu{$) belong 
to fl.IT*>. 

R2’) Hs)Gf(n) is a diagonal transfer matrix with nonzero diag¬ 
onal entries. 


Ill. Effect of Unstable Modes in the Pl ant 

We first review the class of all the solutions to EFPRG. For the 
sake of simplicity, we assume that the number of measurements, p t is 
the same as the number of failures, q. The more general case where 
p > q can be handled as in 15]. 

The following lemma is stated slightly differently in [5J. Indeed, 
here, to simplify the equations, we have used the fact that there exist 


matrices N (.« 

*), A 7 / (a), and A'o(a) such that 



G(s) = M(sf'X(s) 

(10a) 


€/{.•<) = N,(s) 

(10b) 


G 0 (h) = M(s)-'.X (l (s) 

(10c) 


are left coprime factorizations of G{s), Gf(s ), and (7n( a) over 
7? // x - 

Lemma [5]: Assume that Gj {s) is invertible, and let (10a)—(10b) 
be left coprime factorizations of G[s ) and Gj {s) over RH. X . Then 
the class of all stable proper filters which solve the extension of the 
fundamental problem of residual generation (i.e., which fulfill Rl) 
and R2)), for the system (7), is parameterized as follows 

f(.s-) = P\ r (s)(M(s)f/{s) - Ms)u(s)) 

where 


Ps f («) = Q(»)S.v f (s)Xj(s)- (12) 

with Q(s), an arbitrary q x q diagonal RH x matrix with nonzero 
diagonal entries 

Sx r (*) - fling P37 —= diag *,•(*) ( 13 ) 

where j = 1. are the unstable poles of the /th row of 

1 , o, is an arbitrary positive real number and p, is detennined 

so that 


Ion s,(s)[N f (sV 


(14) 


is a nonzero row vector. Here [.Y],. denotes the /th row of A'. 

To show what goes wrong with an unstable plant, we now introduce 
a discrepancy between the “true” plant model (7), and the estimated 

plant model, denoted by hats (.), which is used to design the 

residual generator. 

In this framework, (9) can be written 

r{s) = (L(s)G(s) + K(s))u{s) + IAs)Gj{s)f(s) 

+ i(*)G , n(«).r 0 (15) 


where 

= T); A'(.h) = (16) 

Here /*#(*) is obtained by substituting A'/(») by .V/(«) in 
(12H14). 

Now, using (10c), one can rewrite the map from ,r" to r as 

L(h)G 0 (s) = P» t l*maW(*r l -Vo(.v). (17) 


From (17), one concludes that, due to the modeling error, wjjeh 
the plant is unstable, the effect of the initial conditions will not 
vanish, On the contrary a nonzero .r° will yield an “explosion" in 
the residual. From (15MI7) it is clear that whatever I(s) may be, 
it is impossible to avoid instability when M[s) ^ M{$). Notice that 
it is also impossible to assure that the effect of the Initial condition 
on the residual vanishes for a plant with a pole at the origin. Hie 
residual will be affected by a bias whose magnitude depends on the 
initial state, in this case. 

We now investigate the situation where a controller stabilizes the 
unstable plant. 

TV. The EFPRG for an Unstable 
Process Stabilized in Closed Loop 

Lei us consider a stabilizing controller 

tf(*) = C(*)(ir(*)- T,(.s)). (18) 

For the plant (7), and let N r {s)M c (s )~ 1 be a right coprime factor¬ 
ization of C(s) over RH^, In (18), n s) is the Laplace transform 
of the reference vector. 

We can now state the following theorem. 

Theorem: Consider the stable closed-loop system (7), (18), and 
assume that Gf(s) is invertible. Let G(s) — M(s)~ [ N(s\ G /(a) = 
M (a ) ~ 1 Nj ( s ) , and £?«(*) = M{ *)“ 1 A o( a) be left coprime factor¬ 
izations of the transfer matrices of the estimated plant model. Then 
the class of filters parametrized by 

r{*) = L(s)y(s) + k{»)u(») (19) 

where L{s ) and I\(s) are given in (16) fulfills 

liin fir[s) = r(20) 

in the absence of failure, for a step-like reference signal ir(.$) = 
ti'i) € R p . Here T is a constant real matrix. 

Moreover, in the presence of step-like failure /(a) ■= /o/«, 
fo 6 R' 1 . and reference //'(a) wo/h 

liin »r ( s) — Tuw + 0 /o ( 21 ) 

where (-) is a constant real matrix. Besides, when G and Gf, 
respectively, lend to G and 67 , T tends to zero, and 0 tends to 
a diagonal matrix. 

Proof: In the sequel, the Laplace variable a is omitted to shorten 
the expressions. To find an expression of / (.s) in terms of ir(*), /(*), 
and .r°, we first write down the equations of the closed-loop systems 
(7) and (18), where we substitute the different transfer functions by 
their factorization. This yields 

y = MrDjXjf + At e D«'XN,-M7 l w + M-Dj N 0 x° (22) 

where D, \ = MM, 4 - A r A r ,. As the controller assures the internal 
stability of the closed-loop system by hypothesis, D< i is a unimodular 
matrix 1II). Substituting y(s) and i/(.s) by (22) and (18), respectively, 
in (19) of the residual generator* we obtain 

r = (1.1/, - I\N r )D~ t ',X(f + ( LAI, - KN v )D«Xt>*° 

+ ( LM r D~i 1 NN.M7 1 +KN.D- l 'M)w. (23) 

As t{is) and f\{a) are asymptotically stable, so are the transfer 
matrices between /(*), ,r° t /r(«), and r(s). This is obvious, except 
for the map from w(s) to r(.s). Notice, however, that 


(24) 
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where we introduce a left coprime and a right coprime factorization, 
respectively, for the transfer matrices <?, (*) and G(s) as follows [11] 

N,-M ~ 1 = TC'W *. and V K N e + =1 

M~'N = WJT~' and N~N + MU = 1 

2?, 1 = F,.F + F [ 7\r , 

(24) dearly shows that the transfer matrix from w to r also belongs 
to B Hex. . For a step-like reference and a failure of the same nature, 
we deduce 

nut *r(») = [(LM - KN, )D~i‘Nj] g ~ofo 

+ [LM,D; l i NN,M7' + KN,D ~, 1 

(25) 

where indicates that the transfer matrix A r («) is evaluated for 

. ir'as 0. This proves (20) and (21), by identifying T and (-) with the 
corresponding bracketed expression in (25). 

Finally, we consider each term of (25) when G —► G and 
—► Of . We obtain successively 

liiii {LMrD^ NNrM;^ + KNcD'^ M) 
o—o 

= lini QS$ Gj'tM+D^ NNM ~ 1 - GN'-D^ 1 M) 

(26) 

= lini QSfi,&J l lG - G)C(I + GCr x = 0. (27) 


Hence F —> 0 as 6 —► G. Equation (26) is easily deduced from (12) 
and (16), while (27) is obtained by straightforward computations. 
On the other hand, using (16), we obtain 

: lira (tM r - KNOD^Nf 

G—O 

= lira l\ M[I + GC){I + GC)~ l Gr (28) 

■ o — o * f 

f -u f 

= lini P$ MG/ + P K r M(G - G)C(I + GC)~ ] Gj 

O—O * ' f 



+ P* f M(G - G)C(I + GCy 'G r ) (30) 

where (30) is deduced from (29) by using (12). The first term in (30) 
is obviously a diagonal matrix, which concludes the proof. □ 

Note that from (19) and (16), (20) and (21) could intuitively be 
expected. Indeed, as the plant is stabilized, the residual generator 
needs to be BIBO stable for (20) and (21) to be fulfilled. Since it is 
designed using the plant model exclusively, model inaccuracies do 
not affect its stability. 

From (27) and (30), we can expect that, when the discrepancy 
between the estimated and the “true'* plant model is sufficiently small, 
the matrices F and 0 will respectively be sufficiently close to zero 
and to a diagonal matrix to guarantee a satisfactory operation of the 
residual generator. 

V. Conclusion 

We have analyzed the problem of residual generation for an 
unstable plant, and we have proved that it cannot be solved for such 
a plant (as soon as there is a slight modeling uncertainty), unless the 


plant is stabilized by an adequate controller. This is due to the fact 
that the unstable modes of the plant cannot be made unobservable 
from the residual when the model is not exactly known. As a by¬ 
product, we have obtained expressions such as (27) and (30), which 
could be used to quantify the effect of modeling uncertainties on the 
performance of a residual generator. 

For the sake of simplicity, the results are stated an proved for 
the extension of the fundamental problem of residual generation, 
and when the number of failures, </, is equal to the number of 
measurements, p. In the presence of modeling uncertainties, however, 
the asymptotic stability of the plant model is also required to be able 
to solve the fundamental problem of residual generation defined in 
[8]. As this is the basic problem for the design of a residual generator 
for any coding set, one deduces that the above stability requirement 
is also necessary when q > p, Yet, in this case one cannot isolate 
simultaneous failures [8]. 
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Zeros of Discretized Continuous Systems Expressed 
in the Euler Operator—An Asymptotic Analysis 

Addisu Tesfaye and Masayoshi Tomizuka 

Abstract —The structure of the zeros of SISO continuous-rime systems, 
which are discretized via a zero-order hold and expressed in the Euler 
operator, is studied. In particular, It will be shown that when state-space 
descriptions of linear SISO continuous-time systems with relative degree 
>2 are discretized, then the zero dynamics of the resulting discrete system 
is singularly perturbed and Nhows a separation of time scale. The part 
of the zero dynamics associated with the fast time scale is shown to 
correspond to the zeros introduced by the sampling process (sampling 
zeros). An asymptotic formula for this part of the zero dynamics is 
given, and implications of the result to control design based on pole zero 
cancellation is discussed. 

1. Introduction 

In the design of linear control sysiems, the existence of unstable 
zeros makes it difficult to construct some control procedures, such 
as inverse systems, model matching systems, and model reference 
adaptive controllers [6], Furthermore, it restricts the control ability 
of robust controllers [18]. When we use conventional digital control 
(i.e., \ operatoi) with zero-order hold (ZOH) input, zeros appear 
in the discrete time model even though the continuous-tune system 
is minimal phase (1). These zeros are commonly referred to in the 
literature as sampling zeros because they arise from the sampling 
process. Whether these zeros are stable (minimum phase) or unstable 
(nonminimum phase) depends mainly on the sampling interval T. 
Generally at last sampling rates, the sampling zeros appear outside or 
on the unit circle while at slower sampling speeds they are stable. This 
poses a paradox for control engineers since generally a sufficiently 
small sampling interval T is desired from the control point of view, 
while the sampling zeros tend to become unstable as 7 becomes 
smaller. 

Many researchers have sought to address this problem by different 
techniques. Hagiwara and Araki |3| have proposed a mult irate 
sampling scheme which requires n limes sampling between a sample 
period where n is the order of the plant such that the resulting 
discrete-tune system has no finite zeros. A similar double-delay 
scheme proposed by Mila ct al [I0| where control input or output 
measurements are taken twice in one sampling itilerval also constructs 
a discrete-lime model with no finite zeros. Such controllers inevitably 
require intcrsample control or measurement which makes the design 
of the control systems comparatively difficult or requires additional 
hardware. Kabamba [5] has proposed the idea of using generalized 
sampled data hold functions (GSHF) where the input matrix ot the 
discrete-lime model can be chosen freely and hence can avoid the 
unstable sampling zero problem. While this idea has potential advan¬ 
tages, it seems difficult to generate continuous-time hold functions by 
a digital computer, although they can be approximated by fast input 
samplers. Other researchers have sought instead to cope with the 
unstable sampling zeros by compensating for their effects. Notable 
among such works is the zero phase error tracking control (ZPBTC) 
proposed by Tomizuka 116J. For constructing inverse systems such as 
in feedforward applications or model reference control, the essential 
idea behind ZPETC is direct cancellation of all stable zeros and 
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compensation for the effects of unstable zeros by phase cancellation 
over all frequencies while making the overall gain approximately 
equal to one over a desired frequency range. 

Recently, one simple and more direct way of overcoming the 
unstable sampling zero problem has been developed by Middleton 
and Goodwin |2]» |9], They propose to use an operator called the 
delta operator (b = ^ where q is the shift operator in the time 
domain). In the frequency domain an equivalent operator called the 
Euler operator can be similarly defined as f = ^ where ; is the 
conventional : operator for discrete-time models written in difference 
equation form. One major advantage of using these operators i$ 
that the dichotomy between results obtained using continuous and 
discrete-time control laws are resolved especially in regards to the 
limiting properties as T —► 0 |9). Furthermore, the zeros arising from 
sampling are easily distinguished if these operators are utilized. In (2] 
these authors recommend that the sampling zeros which are unstable 
for the T selected be ignored if the sampling interval T is sufficiently 
small. Although ignoring the zeros introduces an error in the discrete¬ 
time model, experimental tests on electromechanical sysiems [21 
and clectrohydraulic sysiems [4] indicate that the resulting adaptive 
control law give excellent performances. Tesfaye and Tomizuka 
have also applied this procedure successfully in the model reference 
tracking control of a single axis NSK direct drive robot arm [ 14], 115]. 

In this paper we seek to address the structure of the zeros of 
discretized linear continuous sysiems modeled by b(f) operators. In 
particular it will he shown that, under sampling of a continuous linear 
singlc-input/single-output (SISO) time system of relative degree >2, 
the resulting ^ model system can be regarded as a regular perturbation 
ol the underlying continuous-time system if the sampling interval is 
considered as a parameter but that the zero dynamics is singularly 
perturbed i.e., exhibits two-time scales. The conclusion is that the 
finite sampling zeros become infinite as T 0 while the rest of 
the zeros tend as a set to the zeros of the underlying continuous- 
time system. An example depicting the behavior of the zeros for a 
hypothetical continuous-time system is included to substantiate our 
results. 


11 . SISO Continuous-Time System 
We investigate linear SISO systems of the form 
.r = 4, ,r + b, a 

«-■ * (l) 

if - r.r. .r 6 ft . i/, f/ 6 ft. 

Assume (I) is minimal (both controllable and observable) and has 
relative degree " > 2 i.e., 

eb, = cA t h, “ = cA? 2 h ( = 0 

r.4?-'fc. *0. (2) 

Remark The case where *> > 2 is considered because continuous- 
time systems give rise to discrete-time models having nonminimum 
phase zeros only when the relative degree is greater than one [ 1 ]. 
To characterize the zero dynamics of (1) we transform it into normal 
form by means of the following change of coordinates 

[k. : I/.]' = [r'.lcAl'.-.i-r'l' : (3) 

where t,. € ft '. //, £ ft " - * 1 and H must be selected so that L, is 
nonsingular. For simplicity we will choose H such that Ifb, = 0. 
The system description (I) can then be rewritten in terms of the new 
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coordinates as 

Ci ^ Ca - Ci “ Cf.*« *''' C(-) — i) 52 Co 

C -) s= Jff<c + S’tV* + ^4? - 1 h, u (4) 


•it = AC + 

where R l J € JP.S* € * M “\A € and Or, 6 

u cC + S„, /t = r.4?r, Q„i,, + /*<,(,. = J7A,.r. 
Equation (4) expresses (1) in normal form, and it is well known that 
Q tf is a companion matrix whOvse n - -) eigenvalues are precisely 
the zeros of system (1). 

111. Sampled Data System in a Form 


The sampled ZOH representation of system (I) can be written in 
A (A = ^ where q is the shift operator) form as 


b.r{k) = A*.vtt)+hu(k) 
y = r.r(A') 


(5) 


where A* = .4/ 4- T\t,, 6/, = A, + Tb\ 

yi —2 


J = s 4 ' + k A ‘*" + 


■(i 




A! 

r* 


4f + ■ 


(* + 1 )! 


At + 



Notice that if we regard the sampling time, 7\ as a perturbation 
parameter, .4/,,6/s can be considered as regular perturbations of 4, 
and b ,, respectively. Further based on the assumption of minimality 
of (I) we conclude that (5) remains minimal for T small. Consider 
a transformation of (5) to normal coordinates 


where 


AT l b,. 


2 ) 

•■[A, + 7vii ) 2 (i>. + n,) = r At [i + n;)\ 

= r~ 2 n 2l +T-'- , n i 2 + T 1 {-} 

where 


Oj| - 




h-D! 


(■> - 2 )! 


<‘(-■1. + r At)'(h, + Tbt) 

--. -i; (/+ n; 

= V 'n,t +r- < ‘ + ,, a 1J + r^{-} 


where 


0 + U (i + D n i, 

rk, i = -——— , , 4, t ), . n =r -- ( 4/ fi, 

(--/ + D! (-> — /)! 

- 1 ) 

4, + T 4, )■*-' (6. + Tbt ) -- . 1/ 1 ffc, + if A, ft, j 


(- - 1 ) 
<( 


-f oir - 1 


Also 


[Ci-))' = 


[r f .(r(A, +n l )) l .-.( 1 (.l +r+,>~ V ■ fl'j'-r . 

/ 

( 6 ) 

Again notice that matrix L ol Ihe above equation is a regular 
perturbation of matrix L t of (3) and that det( L ) / 0 for T sufficiently 
small. Lei 


where 


i.c., 


4 ; 




a, = 4 , 4 ;. h,=h;h, 


r K - 2 


1 , , t . 1 

v A ' + u A ' + 

l A .1^ , ... + rL _ V' 

+ 3: ' + + A*! 


+ ■■■ 
+ ■■ 


. 4 ; - h\ 

A, +TAy = A, (7 + TftI) 

ft, +rft, = (J + n;)6,. 


In the subsequent analysis wc will repeatedly use the properties given 
by (2) of the underlying continuous system (1). Considering the 
transformation of 6/, we have for each row ot Lb* 


• 0 ) 


r(fc, + Tbt) — r 1_, (io + r J {•( 


where 

on = -rf-4? 

• I) 

r(A, + TA, )(b, + Tbi ) = cA, (1 + TblYh, 

ar- J rt „ + r^a^ + r 1 !-} 


H(h, +76,) 


fH\. +(HT 2 ) 


The expression 7 “ (■} denotes some lunction which we drop because 
it will not affect the asymptotic solutions to be developed. Even 
after dropping the expression 1~\\ wc note, from the above, that 
the first “) — 2 rows ol Lb * have coefficients which aie at least ot 
oidei (){T'). At this stage, in view ol facilitating an asymptotic 
analysis which: 1) captures the overall behavior without sacrificing 
the structure of Lb and 2) avoids a cumbersome analysis involving 
a multiplicity of time scales, we replace the first - 2 rows ol T//* 
by bounding functions of the form 7n(,, T <\\, ■ ■ •, An' _ , where the 
o'’s are independent of 7 For example, a particular choice for the 
n'*s IS nj) = Oo.n'i = on -fn|j.n'_, = n ( ^ m + dj. 
The above simplification will not affect the final conclusions and 
is justified because we seek asymptotic solutions Taking this into 
consideration, a simplified expression which captures the behavior of 
Lb* (with respect to 7) can be expressed as 

r 7 "I* i 


Jo 


i 


Tn\. t 


(7) 


7 +r>4/“'6, 

<.l?-'( 6 , + ^A,h.) 

LHA, 

Note that if T = 0, the above matrix equals L, h ,, the transformed 
continuous-time input matrix (i.c., applying (2) and (3)). It is obvious 
from this last expression that tf we transform system (3) into normal 
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form by means of the transformation given by (6) the feedback law 

(/ = ■+ ).r (8) 

7a' 

makes 


g(*) = M*) = 0 


and that the zero dynamics subspace is 
1/ = {.!■: </ = r.r = 0} 

= {( 0 .C<;): V = [Cl Ca ■■■ C, i] 

€ ft" 1 ’>/ € ft" - '}. 

It should be noted that ( has been decomposed as (/ = [ij, (/] 
i.e., in the last expression 0 = r(J ( + 7.4i = '(.4, + 

7.4 1 ) j j\ ■■■,(,■>- i = r(/4, +7.4| The most important thing 

to observe from (7) is that all the entries of the matrix are multiplied 
by the sampling time 7 except lor the one next to the last row block. 
On the subspace o, the dynamics, under the feedback law (8), is 
given by 



\U) = 


- 0 


((A) 


process. The latter zeros are commonly referred to as sampling zeros. 
To establish the structure of the sampling zeros we note from (9) that 
the 7 dependent term corresponding to \\-i (A 1 ) (i.e„ last row of 
^C(A )) is of the order of 1 /T and thus is not a regular perturbation 
but rather a singular perturbation. This singular perturbation is due 
to the high-gain form of the feedback control (8) (division by 7), 
High-gain systems arising from small regular perturbations have been 
investigated by many researchers in particular in [I2J and [17] in 
continuous-time systems and by Rao and Naidu in discrete-time 
systems [11]. More recenlly Sasiry and Hauser [ 131 have investigated 
the appearance of such singular perturbations arising from regular 
penurbations in both linear and nonlinear systems. The following 
conclusion is analogous to Theorem 2.1 of [13]. 

Theorem / Consider a continuous-time system of the form (1) 
and its ZOH equivalent representation given by (5) which is expressed 
by means of the h operator. Then if system (J) has relative degree 

system (5) has // - 1 zeros which according to their asymptotic 
behavior as 7 —*■ 0 belong to two groups: 

• 1) The ^ - 1 sampling zeros tend to ~x asymptotically as 

/-1 < \tH 

\T ' 

• 2) The remaining n — * zeros tend to zeros of the continuous-time 
system given by (I). 

Pi oof- The prool is straightforward if we can cast (9) and (10) 
into a standard singular perturbation form This can be accomplished 
by rescaling (, in the following manner. Denote 


f n 

i l (/. +- -lL » ) 

-“ 

f01 


r i 


1 

i 


0 

L.s J 


'/(A’) 


fiifi A ) = 


[( 

«i,\ 

lt , _ j ji i ■ 


LV 

x / 



U A ) + A'). 


(9) 


( 10 ) 


In the above r 2 . /■». ■ ,)) , E ft. From the above 

expression, foi a sampling lime 7 ^ 0, wc observe that the ZOH 
equivalent representations of continuous-time systems expressed in 
the Euler (O operator will have relative degree one regardless of the 
relative degree of the underlying continuous-tune sysiem. This means 
that such representations always lead to n — 1 zeros. These zeros are 
composed of: 1) n - * zeros which are due to the continuous-time 
system, and 2) - 1 zeros which are a consequence of the sampling 


-i = Ci • .-j = 7 ^ 1 o« -i = 7'> 1 Ci 

.,=77^". ... =7" U,_,. 


This expresses a transformation (rescaling) of (9) and (10) which can 
be wntlcn in matrix form as 


where 



M 


M } 


U, Mi 

Mi Mi 

p o 
o TtH 


0 


Mi— 0 ( - ii<( ( ,- ; |, M\ — 0 (M ) x t -i n 


Mi — 7( lf —»)><(„ —»)■ 


( 11 ) 


r^Ai,(fr) = -7 -t!t4-i(0 + :s(*) 
T^c,<A-) = -.,(*•) + ()(T-^) 


,(*•) +OIT^T) 

T •> 1 h-.-,- | (A-) = -^-- ;i (A-) + r ■' 1-1 r-, i-,. i (A)+ 

n (] 

-r(— + ra)i.(fr) + T^rMAk) + • ■ • + r^Tr,_,,(*) + 


n(/ “) 
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Fail Subsystem Slow Subsystem 

Pig, I. Singularly perturbed (iwo-time scale) zero dynamics. 

For simplicity let us express (9) and (10) as 
^C(fr) = AiCU') + *4 j //(A-) 

M*) = A.C(*) + Air/(*). (12) 

Combining (II) and (12) we cun write (9) and (10) in terms of the 
new state variables as 

1 [MiAtMr 1 A/,.4,1 r ■.(*)! 


Mi AA/f 1 = 


■T ^ 


-rn 


+ r,J T ">• 1 r, T' >r, ••• r, J 

. r.4?-‘(h, + 4.4, M 

° " r«j, ‘ 

Equivalently we have the equation found at the bottom of the previous 
page. Note thai M\ Aj = T^ 1 s and (13) can be concisely written as 
T^bzik)] [Vr, 0 lf;(fr)l 

bulk) J [r gJ[»/(A)J 

+r ,p, 1+rr ,j (l4) 


l h L 

- - ~ 7- 

, () 

T o 


i 

0 0 


From the above, (14) is observed to be in the standard two-time 
scale form of |7]. Its right-hand side is regularly perturbed by the 
term 2 , while the matrix of the unperturbed part is block 

lower triangular. This unperturbed matrix is nonsingular as required 
for a standard two-time scale form. It follows that the eigenvalues 
of (9) are asymptotically 


<r^A(ir,))uA<g,,) 


Variation wrt sampling lime 



-6 -5 -4 -3 -2 -1 0 

della model 


Variation wit sampling time 



ShiH operator (2) model 


Fig. 2. Pole/zero variation with respect to sampling lime 


erf^xdv,)) 




(15) 


where (15) is the asymptotic expression of the *; — 1 sampling zeros 
which tend to x as T -+ 0. This completes the proof ot part 1) of the 
theorem. To prove part 2) observe that from (14) the remaining // — *} 
zeros tend as a set to the eigenvalues of (j,, which are the zeros of 
the underlying system. Following the standard two-lime scale form 
of [7] we can depict the behavior of the zero dynamics of system (5) 
as composed of a fast and a slow subsystem as follows (the external 
inputs characterize the effect of initial conditions). 

Remark: Theorem 1 states that — 1 sampling zeros become 
infinite as T —» 0. Moreover one can deduce that the sampling zeros 
move in specific directions as determined by (15). The consequence of 
Theorem 1 is that sampling zeros of discrete-time models expressed in 
the operator are located far off in the right- or left-half planes for 
sufficiently small sampling times and hence are slightly nonminimum 
phase. This supports the recommendation made by Goodwin et al. |2] 
to discard the unstable sampling zeros if T is sufficiently small. 
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The system has relative degree two with zeros located at (-0.5 ± 
O.SOGjf while the poles are at (P.dt2j,-1}. Fig. 1 portrays the 
influence of the sampling time, T, on the location of the poles and 
zeros for the delta and shift operator (:) models. We notice for the 
delta model that the sampling zero departs to -a. as T —► 0 while 
the rest of the poles and zeros converge to the continuous-time poles 
and zeros, respectively. This behavior supports Theorem 1 above 
which predicts that the sampling zero goes to ^ since in this case 
a[, = «o = ±cAy'b,. For the shift operator model we notice that 
the zero introduced by sampling remains fixed around -1 while the 
rest of the poles and zeros converge to one on the real axis irrespective 
of the locations of the poles and zeros of the underlying continuous¬ 
time system. To give an indication of the behavior of the sampling 
zero in the delta model we calculate for T =0.1, 0.07, 0.04, 0.01, 
and 0.001 secs, the location of the zero at -20. -28.6. -50, -200, and 
-2000, respectively. The numerical data validates Theorem 1 since 
the sampling zero goes to 

V. Conclusion 

The effect of the sampling time on the zero dynamics of continuous 
linear SISO systems expressed in the b operator has been investigated. 
It is shown that as the sampling time tends to zero that the - 1 zeros 
introduced by sampling (sampling zeros) migrate to negative infinity 
while the remaining ;/ - - zeros converge uniformly to that of the 
underlying continuous-time system. From this we can conclude that 
under fast sampling, i.e., as T —♦ 0, one can neglect the sampling 
zeros (which are easily distinguished). This conclusion is important 
for control systems constructed based on system inversion and/or 
model reference control systems especially when the dynamics of the 
plant are imperfectly known. 
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Further Theoretical Results on Direct Strain 
Feedback Control of Flexible Robot Arms 

Zhcng-Hua Luo and Baozhu Guo 


Abstract —This paper is concerned with stability analyses for some 
nonstandard second-order partial differential equations arising from 
direct strain feedback control of flexible robot arms. Exponential stability 
issues are addressed for three types or differential equations, one of 
which is in general abstract evolution equation form and the other two 
are in partial differential equation form. The obtained results are of 
especially theoretical interest because they reveal the essence of direct 
strain feedback control and demonstrate Its power In control of flexible 
arms. 

I. Introduction 

We consider the following initial-boundary problem 
r ir{t. jt) 4- u " /# (/,.r) 4- k.ni "(Ml) = 0 
<r(/.0) = ir'(f.O) = 0 

u"(fj) = u m [tj) = 0 ( 1 ) 

ur((). .v ) = «!(,(,r), u , (0,.r) = it , (,r). ■/'6 (0.0- 

Equation (1) represents a closed-loop dynamic equation of a single¬ 
link flexible arm with the so-called direct strain feedback (DSFB) 
control 19], 110]. The symbol «•(/,.r) denotes the arm’s bending 
vibration deflection at time t and position ,r; Tr and tv nn represent 
time and spacial derivatives of u\ respectively. The symbols ( and 
A > 0 arc, respectively, the length of the amt and strain feedback 
gain, u'o(-r) and ir i(.r) are respectively initial position and initial 
velocity of flexible arms. Originally, the dynamic model of a single¬ 
link flexible arm rotating in the horizontal plane by a control motor 
is given by the following partial differential equation 

w(t..v) + ir""(t,.v) = -.r0(f) 

together with the same initial and boundary conditions as in (1), 
where 6(t) represents motor rotating acceleration [16]. Note that here 
a nonlinear term ,rH(t) 2 w(t, ,r) which should appear on the right-hand 
side of the above equation, is neglected to obtain a linear model, as 
in most literature Til and 1161. The simple yet efficient DSFB control 
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is to control the motion of the motor in such a way that 0(f) is 
proportional to This objective is easily achieved by direct 

feedback of the bending moment /<>"(/. 0) which can be measured by 
cementing strain gauge foils at the root end of the arm. The interested 
reader is referred to f9] and [10] for details. 

We are also interested in an equation 

f w(t,.r) + -f* k.ru n (t, 0) = .r(A'jc(/) + k 2 r(f)) 

I e(t) + k,r(t) + h <•(/) = JmV'lf.O) 

= tr'(f.O) = n = u , '"(tj) = o K ’ 

[w(O.j-) = u'o(r), «'((). .r) = 

which arises from simultaneous vibration/motion control of flexible 
arms. More precisely, if we not only control vibration of the arm, 
but also control the motor position 0(f), then we can command the 
motor to move so chat 0(f) = -kic(t) - k 2 t (t) + Aw/*"(/,()), where 
p(f) (=s 0(f)-0,/) denotes difference between current motor position 
0(f) and a set-point position 0,/, and k\ > (LA*.* > () are servo 
feedback gain constants. 

Let B = L 2 (0,0 be the usual square integrable function space, 
with inner product (■. ■) and the induced norm || • ||. For later use, let 
us first introduce an operator A on ff as 

{ D(A)={w\w£B.tc" f ' £H, 

ir(0) = ti’ r (0) =■ u ,#/ (f) = — 0} 

Air — w nn . Vm 1 £ D(A) 

which is known to be positive definite with a compact inverse. Also 
let us define an operator II by 

fl w = .rir"(0). .r £ ((W),Vir £ D(U). 


It is obvious that operator II is neither self-adjoint nor positive 
definite. It is shown, however, that n is A-symmetric and 4-positive 
semi-definite [9|, 1101- So that II can be decomposed as II = (}A 
when restricted on D(A ), with Q being a bounded, positive semi- 
definite operator. By making use of these, properties of the above 
mentioned operators, (1) can be written as an abstract equation on 
fiT as follows 

fir(f) + frgAir(f) + -4ut(/) = 0 

1 ir(()) = if'o, ii* (0) = w\. 


We say that (1H3) are nonstandard since they possess a special 
damping term j‘ir"(f.0), or equivalently QAii{t), which does not 
appear elsewhere in the existing literature, although some special 
cases where Q = /, A~ !//2 , or A -1 have appeared in [3], [5], and 


[7]. 

Let B j = D[A l/2 ) x H and H 2 = D(A) x D(A ]/2 ). Then H , 
and ff -2 are Hilbert spaces when equipped with the following inner 
products and the corresponding induced norms 


([*»]■&])„ r ( *‘ 


V 


iii 

h-j 


h. 

‘ 

_ 


V 

]‘i _ 


Hi 


Ai„) + (A' /2 li i .A l/ ' i h i ). 


Now let i(f) = 


A [/2 w(t) 


l 


Then (3) can be rewritten as 


r i(f) = Az(t) 


> n v _ \A l '**'o 

; ,0|a [A 1 

on H i, where operator .4 is defined by 


D(A) 


hi 6 D(A' ,a ),A ,f *hi 


+ kQA'^hi 6 £>(4 ,/2 )j 


7 o 1 

0 I 

V 

o 

.4 1 / 2 kQA' /2 

fca. 


Vh 6 D(A). 


Si^h defined D(A) is dense in B \, and A is a closed operator. Thus 
the adjoint operator A* of A can be uniquely defined. Moreover, 
it can be shown that, for any h £ D(A) and h* £ D(A*), there 
hold (AhJi)rii < 0 and (A* h *, h*)n l < 0. Consequently, there 
exists a strongly continuous semigroup of contractions S(t ) generated 
by A [4], 114], and for initial conditions it'o and w\ satisfying 
iro,M’i £ D(A) and A<r 0 + kQAu'i £ D(A l/2 ) % there exists a 
unique solution to (3) which can be expressed as 


M) 
h (0 I 


= T[t) 


f/'l) 

J 


(4) 


where 


T(t) = 


A- J / J 

0 


0 

A-'A' 


S(t) 


A 1 /' 

0 


0 

A 1/2 


(3) 


is a strongly continuous semigroup on H> (this is easily verified). 
Also, it can be shown that the solution is asymptotically stable, if the 
operator Q is assumed to be positive definite. 

It is the purpose of this paper to present some newly obtained 
stability results for (1)—<3>. Specifically, in Section 11, we want to 
show exponential stability of the solution of (3), under the positivity 
assumption of Q. This assumption can always be satisfied in flexible 
arm control if we consider the effect of the Kelvin- Voigt damping in 
ami’s dynamics. The motivation of this section is to see whether 
the exponential stability result can be extended to the (3) which 
represents a wide class of systems, not just the DSFB controlled 
systems. For the specific (1), what is theoretically more interesting is 
whether we can show exponential stability without introducing any 
natural damping (internal damping in the material). This question 
is completely answered in Section III where wc show exponential 
stability of (I). The significance of the obtained results is that they 
reveal the essence of direct feedback control and demonstrate its 
power in control of flexible amis. Finally, we provide a sufficient 
condition on feedback gain k> to guarantee the stability of (2), since 
this is a hybrid system in the sense that its state space consists of an 
infinite dimensional part and a finite dimensional part, and it is very 
hard to find a Lyapunov function for the entire system. 


II. Exponential Stability of (3) 


In this section, we study the exponential stability of (3) under the 
assumption that the operator Q is positive definite. This assumption 
is always satisfied in flexible arm control, if we lake into account 
the effect of the Kelvin-Voigt viscous natural damping in arm's 
dynamics. An important reason that we want to do analyses here 
is that (3) may he considered as an abstract equation of a wide 
class of systems, not just flexible ami systems as described in 
[10], and there does not exist exponential results for such general 
abstract equations, although some results have been obtained for some 
specific boundary controlled partial differential equations [2], (12|. 
The energy multiplier method proposed by Chen [2] will be adopted. 

Let 


m=l 


,r(t) 




( 6 ) 


he an energy function for (3). Then its time derivative along solutions 
of (3) is given by 


E(t) - - k(QAib(t ), Ati'(t)) £ 0 (7) 


which means that the energy will be dissipating. Choose a positive 
constant 0 < r < 1 and define 


V(f) = 2(1 - s)tE{t)+ Jir(f)). 


( 8 ) 
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Since 

(«'(0, Aw{i)) < J(|M- , ^|| a ||.4 1/ -ii-(/)|| 2 + ||A,..(/)||') 
always holds, there exists a constant C such that 

(2(1 - ?)i - C)E(t) < V(t) < (2(1 - -)/ + C)E(t). (9) 

Obviously V(t) is a positive function for t > T\ := C r /2(1 - <■). 

Considering now the time derivative of V(f) along solutions of 
(3), we obtain 

V(/) = 2(l-;)E(t) + 2(l -c)tE{t) 

+ (ii'(t)<Au'(t)) + (ti'(f). Aii'(t)) 

= (1 - .r)||4 l/2 «'(<)|| 2 + (1 - = )||.4«-<f)|| a 
-2M1 - :)t(QAw(1).Au'(i)} 

- k(QAw(i).Aw(t)) — (Air[t). Air(t)) 

+ (ik{t).Aii(t)} 

= (2--)M ,/2 <r(f)|| J --||.4,r(/)|| / 

- 2A(1 - At((JA<rU).Aw[l)) 

- k(QAir{t).Aw{t)). 

Denote now (Q) = niax„.e/f((^n. »>) and = 

min„ f n ( Qu\ ii ). Then for an arbitrary constant n ± 0, it holds 

-(QAir(t).Air(t)) 

< A r „„(g)||.4H(0||||.lM'(/)|| 

< r,A, 11 ,, s (g)^-||.l l ;.(f)|| 2 +-i||.l«r(/)||^. 

Consequently 

11 

V(fK |(2- : + '^X,»*AQ) 


- 2A( 1 - -)t A 

min (U)|ll U'(t )|| 

- - 2 " A.,(( 1 ))^||.l.r(/)||- i . 

Since a is arbitrary, it can be chosen to be large enough so that 
- A ril , x (Q) > 1). Therefore, if we put 

7 = iir 

2 2A( 1 - 

then 

f(f> ^ 0. \/l > 7j. 


Since i?(f) is energy dissipating for t > 0 and V(f) is also energy 
dissipating for / > 7^, ii is easily verified that for t > T := 
iiiaxfTi.Ti}, E(t) can be estimated as 

V(T) „ [2(J - ')T + C']/T(0) 

£,#> - 2(1- -)f-r - 2(1-.-H-C " 

from which we obtain 




[2(i - .-jr + n^W 

[2(1 - -)t — (’]* 


It < TC. 


Noting that E{t) = £ 7V) 


and T(t) is a strongly con- 


Hj. 


tinuous semigroup on Hi. the above relation implies the exponential 
stability of solutions of (3), by the well-known equivalent property 
of V'(p > D-stability and exponential stability for a strongly 
continuous semigroup system 114, Theorem 4.1]. Thus we have 
proved the following theorem. 

Theorem /: The solution of (3) is exponentially stable, provided 
that Q is a bounded, self-adjoint, and positive definite operator. 


Remark I: It is seen that V(/> is a Lyapunov function for t > T. 

Remark 2: It should be noted that if a solution is exponentially' 
stable for t T, then it is exponentially stable for t > 0. To see 
this, suppose there exists a T > 0, independent of the system initial 
conditions, such that 


«■(/) 

«<V) 


< M( 


-Mi- t ) 


Then, obviously 


ir(t) 

w(t) 


< M<' *' 


.1 > r. M > l.A > 0. 


,t > 0 


where M - Mi 1 

Remark 3 It has been shown that under positivity assumption of 
Qn system (3) generates an analytic semigroup, and moreover all 
the eigenvalues of (3) lie either on the negative real axis or within 
a circle with center at (- ^ 0) and radius j K ^ Iq ) ' on * 

complex plane [6]. Hence the exponential stability is proved, from 
a different viewpoint. 


HI. Exponentiai Stability of (l) 

In this section, we investigate the exponential stability of (1) which 
docs not include any natural damping. We solve this problem, in this 
section, by showing that (1) can be equivalently transformed into a 
boundary control system well studied in the existing literature [2|. To 
this end, let us introduce a new variable y(t,r) = u n (t,.r). If the 
initial conditions associated with (1) are sufficiently smooth such that 
the solution admits continuous spacial derivatives up to sixth order, 
then taking spacial derivative ol both sides of (1) twice yields 

y(t:r) + y""(t,r) = 0. (10) 

Also, from the boundary conditions of (I), it is seen that tr(f.O) = 
u V.U) — 0. Therefore, we obtain the following boundary conditions 
on y(f.r) 

r t,(tJ) = - 0 

„UO= m n, (tj) = 0 

</"(Ml) = ir""(1A)) = - if(M)) - 0 

= ir # ""(L0) = -ir'(U)) - *ir"(U)) - -ky(t, 0). 

(ID 

At this stage, letting r = ( - r and noting that 

dy __ 0y Or _ Off 0 2 y _ 0 2 y 
Or Or Or Or' Or J Or- 
<>'u _0ji_ _ o'_£ 

;).!■' Or' ‘ 9j 1 dr' 

we get the following direct velocity feedback boundary control system 
\ij(1.r) + y""(t.r) = 0, r E (0J) 

//(/.()) = (). .i/(M)) = 0 
I y"(t. () — [) 

; </' #/ (f, 0 = ky(t.() 

which is well studied by Chen 12] and MorgUl [12], and the solution 
is known to be exponentially stable. Since the solution of (I) 
can be expressed as ir(f,.r> = JJ f ( * y(t< r)<lr<1x, n (f.r) is also 
exponentially stable. The transformation from tr to y indicates an 
important relation between direct strain feedback control and direct 
boundary velocity feedback control, which makes us possible to study 
sufficient conditions for stability of (2), as will be described below. 


IV. A SiJFFicTHN'i Condition for Stability of (2) 

This section is devoted to studying the stability of the hybrid (2). 
Since it consists of two subsystems highly coupled to each other, one 
possible method to prove stability of the entire system is to find an 
appropriate Lyapunov function. Unfortunately, it seems very hard to 
do this, and until now we have not found an appropriate one. So, we 
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G(») 

ff(t,r)-HT(*,r) = 0 

»((,0) = »'((, 0) = D ,(<) 

A 1,0 = 0 

*i(0 «a(0 0 = “i(0+ «>(') 


i(<) + Lje(l) + fcje(t) * Lt(l) 
«i(0 = -*l«(0 - *a«(<) 


Fig 1 Block diagram al the transformed closed loop system ol (2) 


again transform (2) into its corresponding boundary conti ol form and 
present a sufficient condition for the dosed loop stability 
Again let y(t i) = u "(L i) By the same way as in the previous 
section, we can transform (2) into the following form 
'vtf ’) + v""(i 0 = 0 

\u(t 0) = vV.O) ~ 0 

u"{tt)= 0 (13) 

i*'"(/,o* + n 

if(t) + h l f[t) + htU) = h(t 0 

Now, put 

Z(t) = ¥ (f, f) U| (0 -=■ -A,c(/) - A f (0 m,(/) = A~(M 


We obtain a block diagram as shown in Fig 1 It is interesting to 
observe that the strain feedback plays a role ol an inner loop controllei 
(see the dotted block in Fig I) while the motion control part acts 
as an outer loop controller Let the transfer function from - to m t be 
denoted by A(s) Then 


A (A i * + A 2 ) 


4- A j s 4 A 


(14) 


which is stable and its H norm ||A (s)||^ is easily calculated as 


I|/Vb)||v 


_ _ A/, ___ 

(Yv/SJ+’i*?*!+- ihj - 2Ar^-*) l/ - t 


(13) 


Now, took at the dotted block in Fig I whose transfer function will 
be denoted by G(s) (note that we do not need the expression of 
G{m)) From the discussion in Section III we know that G(v) is 
exponettally stable Hence its norm exists 
For direct boundary velocity feedback system 

rv(* o +v""u 0 = 0 

V(f 0) = !/'(/ ()) = () 

\v"V 0-0 (16) 

I ?/'"(* 0 = A y{t () 4 «i(/) 
v(M) 


there are no existing results on how to calculate the norm lor 
the tramfei function G(») from mj lo , Here we give an estimation 
of the upper bound of ||G|| v Taking the inner product of both sides 
of the first equation in (16) with </(f, 0 yields 

■ l Jt E(1) + k[i/(t I)} 2 = (17) 

where 


E{1) » | J ([V(f 1 )] 2 + W(1 > )] J )'G 

represents the total energy stored in (16) Integrating (17) with respect 


to f, from 0 to x we obtain 

MMOlla+ («■(*> -( 0>2 + £?(x>)-£?(0) = 0 (18) 

where (im(/> ~(0); = J 0 X mi(M (f)dG || = (0||i is the induced 
norm, and 

m ^ \ j‘ ([w(o n} 2 + [v''(o oHrfi 

is the total initial energy Note that in general F( x) is a function of 
«i however, £(x ) > 0 and for any \\u\ (f)||j < 1 the upper bound 
of F(x) can be estimated as 

F(x ) = £(()) - *||-M)||j-(«iM> (M>; 

< £*(()) - A||-(/)||^ + ||-(t)lU 

- /(0,+ Ta 

since the function /(|| (f)|| ) - -A|| (t)\\^ -f ||-(/)|U attains its 
maximum value fa at || (t )||„ = fa Thus E( x) is bounded So that 
when mi ranges in the unit ball the largest value || ( t)\\ satisfying 
(18) is always smaller than the laigest value ||~( 0||2 satisfying the 
following inequality 

Ml Mill? + (»></) (/)) - f (0) s 0 (19) 

In other words ||6(*)|| x is small than the largest value || (f)|| 
satisfying (19) which will be calculated below 
Let i/ t =■ -jpjp It is easy to show that for any mi satisfying 
||mj ||^ < 1 there holds {mi )„ > (m* ) since {mi ) > 

“||m 1 1| || || ^ “|| || =- (m, ) So that for any mi m the unit 

ball the largest value that || (/)||^ can achieve is given by tht positive 
root ol the following quadiatic algebraic equation in || || 

a|| (/)> —r<(»—/1| (on “|| (oil /(i))-i> 


which is evaluated as 


1 + v/i +ui( 0) 

ik 


From this we see that 


IK-<olU - «ip .f (/)l1 

,/i ll«i 


sup 


< > +.v2+_L“l"i (20) 

Since in evaluating transfer function of a linear system the initial 
conditions die always set to zero we obtain 

l|Ms)|U i (21) 

by setting E(0) = 0 in (20) From (1*5) and (21) we see that 


IIMs)IM|G(s)|u 

<- -- 1 _____ ( 22 ) 

“ (2/AJ + 2A,Aj-M 1 '-2lO-2A 1 * )'/’ 

If the value ol the right-hand side of above inequality is less than 
one then the closed loop stability is implied by the well known 
small gain theorem Unfortunately, this is not always the case To 
guarantee closed-loop stability however, it is unnecessary to have 
||A (*)|| v,||G(*)|| x < 1 which is quite conservative Instead, it is 
sufficient to have 


|A(;a)||G(^)|<l (23) 

for every u. € (-<x oc) In general, the low frequency gain of G( y *) 
is very small (near zero) since the output of system (16) is taken to 
be velocity of </(/ () On the other hand, the low frequency gain of 
A (j +) K approximately one Thus, if wc choose y/T^ m such a way 
that the resonance frequency of I\(j+) is much smaller than that of 
G{j +), then (23) holds for a large range of strain feedback gam A 
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It is interesting to note the upper bound ||A'(a)IU ||G(s)||>: of 
l/v 0V)||G'(j*’)| is independent of A?, which is particularly useful for 
feedback design. 

Remark 4: For the closed-loop system shown in Fig. 1, if \\G\\ 

OC ^ I * then the closed-loop system is internally exponentially 
stable [8, Lemma 2.3], since G and A' are internally exponentially 
stable [15]. Under the present condition |Ct(i^)||A'(,y^)| < 1, 
whether input-ouput stability implies internal stability remains a 
problem. 

V. Conclusion 

Some newly obtained stability results have been presented in this 
paper. Specifically, 

1) For a general abstract differential equation on a Hilbert space, 
exponential stability was shown under an additional condition 
(this condition is equivalent to the existence of a Voigt damping 
term in dynamic model in the case of control of flexible arms); 

2) For a DSFB controlled closed-loop equation in partial differen¬ 
tial equation form, the exponential stability is verified, which 
clearly indicates the power of direct strain feedback control 
of flexible arms and reveals an important relationship between 
the DSFB control and the energy dissipating direct boundary 
velocity feedback control; 

3) For a simultaneous vibration/motion controlled hybrid equa¬ 
tion, a sufficient condition for the entire system to be stable is 
given, which can be used as a guideline in choosing feedback 
gains in (2). 
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11 00 Optimal and Suboptimal Controllers 
for Infinite Dimensional SISO Plants 

Onur Toker and Hitay Ozbay 


Abstract —This paper presents a simple formula for optimal and 
suboptimal controllers Tor unstable SISO distributed plants, with rational 
weighting functions. The controller is expressed in terms of i) inner and 
outer parts of the plant, if) a finite dimensional spectral factor obtained 
from the weighting functions, and ill) a rational function satisfying 
certain interpolation conditions. Under certain generidty assumptions, 
this rational function is of dimension less than or equal to i»i -f- / —-1. 
(u j -f / in the suboptimal case)* where l is the number of unstable poles 
of the plant and u \ is the order of the sensitivity weighting function. There 
arc 2( n i + /) (2( n i 4 /+1,) in the suboptimal case) linear equations, which 
determine this rational function. These linear equations can be written 
directly from the structure of the controller. 

Notation 

The set of real numbers. 

The set of complex numbers. 

{*€ C:1M*)>0}. 

Open unit disc, {z € C : |:r| < 1 }. 

Unit circle, {< € C : |(| = 1}. 

Banach space of essentially bounded 
functions on JR . I. 

£ x functions which admit bounded 
analytical extensions to C+,D. 

Real rational functions in H x . 

Hilbert space of square integrate 
functions on j R, T. 

C 2 functions which admit analytical 
extensions to C+, D, 

The orthogonal projection onto a 
subspace M of C 2 . 

The orthogonal complement of Ft 2 in C 2 . 

The unit ball of H 
The reflection (or flip) operator on 

£ a (T), F/U) = 

The shift operator on >f 2 (D), S/( 
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r,. The Hankel operator with symbol 

v „ I\. a v. 

F|/ as 0 F(t>) = 0 whenever /( h) - 0. 

I. Introduction 

In this paper the mixed sensitivity minimization (two-bloek) prob¬ 
lem is considered for a class of infinite dimensional SrSO, LTI plants 
with finitely many unstable poles, with rational weighting functions. 
Tile purpose of this paper is to present a formula for optimal 
and suboptimal controllers. In the one block problem (sensitivity 
minimization) the W x optimal performance and controller can be 
computed by finding the largest singular value, and corresponding 
Singular vector, of an infinite rank Hankel (or “skew Toeplitz") 
operator. Earlier studies on the one-block problem, for stable infinite 
dimensional plants, have shown that the computation of singular 
values and vectors of this infinite rank operator reduces to finding 
a solution to a set of finitely many linear equations, [12], [5], In the 
two block problem, the operator we are dealing with is a “Hankel 
plus Toeplitz” type, with possibly infinite rank Hankel part. This 
problem can be reduced to a set of finitely many linear equations; 
see [7] for the finite dimensional case and [131 for infinite dimensional 
stable plants case. More recently, these results have been extended 
to the case where the infinite dimensional plant has finitely many 
unstable modes; see [81 and [2J for detailed reviews and further 
references. This paper extends these results to the suboptimal case 
and gives a simple formula for the optimal controller as well as 
all suboptimal controllers. Previously it was known (see, e.g., [10], 
[8] and references therein) how to compute the optimal controller 
“numerically,” by using Youla parameterization and computing the 
singular values and vectors of the associated infinite rank operator. 
We follow a similar technique to solve a more general problem. 
Moreover, we obtain a simple closed form expression for the H x 
controllers in terms of inner outer factors of the plant, a spectral factor 
of the weights, and a rational function to be determined. The sei of 
linear equations, which determines the optimal performance and this 
rational function, can be written directly from the controller structure. 

The rest of this paper is organized as follows. Two block 
optimal and suboptimal control problems arc defined in Section 11. 
Main results are presented in Section III. The proofs can be found in 
Section IV, and concluding remarks are made in Section V. 


II. H x Optimal and Suboptimal Mixed Sensitivity 


In this paper the plant and the controller are represented by their 
transfer functions P(s) and C'(.s), respectively. A system, whose 
transfer function is G(«), will be said to be Ntable if G E . 
Throughout the paper we consider “real” functions only, i.e., /?(*) = 
*(*)■ 

This paper deals with the mixed sensitivity reduction/minimization 
problem for a given plant P(n) and two rational weighting functions 
\\\(«) and 

Assumptions on the Problem Data: We assume that the plant ad¬ 
mits a coprime factorization of the form P(s) = in„(tt)N Q (s)/m l t(s) 
where — nl=j ni, E C+, are distinct, and 

N»„ E is inner (i.e., stable all-pass function, possibly infinite 
dimensional) and Aa € is outer (i.e., stable minimum phase, 
possibly infinite dimensional). Moreover, the weights are assumed 
to be rational with H'i(*) being nonconstant and H i, (H^V 0 ), 
( W '2 No ) “ 1 E H x ’; see [81 for a detailed discussion on these 
assumptions, and relations to other types of two block H* control 
problems. 

The optimal performance is defined by 


")u ;= inf 


H 


WxS' 

WiT 


where S = (l + PC)~ l and T = PC(\ + PC)~ l are the sensitivity 
and complementary sensitivity functions. The condition “C stabilizes 
P” means that closed-loop transfer functions S, CS, and PS belong 
to H**. The optimal controller, denoted by O,,,,, is the one 
which stabilizes the plant P, and yields 

H r ,d + pc,„,r 1 1 

The suboptimal control problem is to parameterize the set 


C P 


|c: C stabilizes P, 


H 'iT 



(I) 


for a given suboptimal performance level p > -)o. 


III. Optimal and Suboptimal Controllers 

The main results of the paper are stated in this section. Their proofs 
are given in Section IV. 

Let in . ■ ■ ■, i) T , A E Ci, mi > 1. be the poles of W] (- s); if //, 
has multiplicity (, then it is assumed to be repealed (, times in this 
list. The zeros of 

EM := - 1 j (2) 

are denoted by L Ml , and they arc assumed to be distinct 

for the given p. Then, if ,'s can be enumerated in such a way that 
, * ■ ■»i y are in C+, and = - f i,. Now define 


F,,(s) := 


- ~ 

*» + '/A 


(3) 


where G,, E is minimum phase and determined from the spectral 
factorization 


G»G,(-*) 

Hj (-.s)U'j (.s) ^ H".i(- .h)Hj(s) 

(4) 

Genencity Assumptions For the given p > > wc assume that 
it ,'s arc distinct and none of them coincide with any of the n/s. 
Furthermore, are such that m n (,i ,) ^ 0, for all / — 1, ■ ■ ■, 2/n. 
We also assume that > Timm where is the smallest p such 
that the right-hand side of (4) is greater or equal to zero for all -s = j*c. 

Now, to obtain a parameterization of C /} , pick an arbitrary real 
number a > 0. Then, we have the following formula. 

Theorem I: Consider the suboptimal H' control problem de¬ 
fined in Section 11, and suppose that the above gencricity assumptions 
hold. Then, C «ulinpl E C„ if and only if 


CViibuiii (.s) — En)m if( i s) t- 


iVo(.s) 'F,,(s)Li (s) 


1 + ni u (s)F t ,{ti)Li (s) 


(5) 


where 


Lr ( s ) = 


Lj(s) + L\(-*)t T {b) 

Li(s) + LA‘»)P(sV 


for some C E B 


and L\(s)<L'Afi) are polynomials of degree < in -f / satisfying 
interpolation conditions 


0 = Li ( ih) + in„ (fh )F P ( ,i k ) l, (ih ) k = 1, • ■ ■, n , (6) 

0 = Li(ru) + )P p (an )L‘A*\k ) fr — (7) 

0 = LA~>h ) + m„iih )F p (>k)h(-fh) k = 1. ■ • *. >M W 

0 = Iiff-ru) -f Wn(<H )Fp(<U (9) 

0 = l2(-n) + (Pp(n) 4* l)F fi (ti)Wn((i)L\(-a) (10) 

1 (ll) 
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In the above system of equations, a > 0 is arbitrary For different val 
ues of a one can obtain different parametenzations of the suboptimal 
controllers 

Remark 7 The gcnericity assumption > *, r E ’ is actually not 
needed for Theorem 1, but it will be necessary for Theorem 2 which 
deals with the optimum case Furthermore in the optimum case the 
genericity assumptions are assumed to hold for f> = ^ 

Remark 2 If IIi(s) = w(s)/r/(s) with n{s) <l(•*) £ R[a] 
(/i(s) — 1 Hi 11 j 1 £ then ^ ,(h) = n(*f)n(—s) 

andZLfs) = <I(h)(1(-s) are relatively prime If A ( h) - ^D ,(s) 
has a multiple root then at that point ^' - frD 1 - 0 and hence 
D , V — \ D' — 0 Bui D V — \ I)' is noi identically equal 

lo zero, and hence has only finitely many zeros so there aie at most 
finitely many i> values for which the generiuty assumption on i s is 
nol satisfied T he fact that D V - \ D, is not identically equal to 
zero can be shown as follows Let \ 1 be polynomials such that 

\ \ + ) t D I and assume lhat D \ \ D' = 0 Then 

we get \ t D ^ \ \ 1) = 0 D' \ \ + D I ,D = D 

and hence D divides D' which is absurd This contradiction proves 
that D \ ' — \ D, is not identically equal to zero and hence shows 
that there are at most finitely many i> value for which Ihe gcncriutv 
assumption on 1 s is nol satisfied 

Unonm2 Suppose that ( ,, is unique and j is the laigest 

singular value of the associated Hankel plus Tocplitz operator 
defined in the next section Moitovcr suppose that the above 
generiuty assumptions hold lor i Then 


r 


s) = / 


(s)m /(S) 


Ms) 
1 + m 


(s)/| 

(s)T 0 (slf(s) 


( 12 ) 


wheie f - I )L\ and I (s) l i(s) arc nonzero polynomials with 
degiees less than oi equal to (in + / - 1) satistymg (6> (9) with 

n = 

To hnd ( we (trsl find an upper hound eg by laking any stabi 
li/ing controller and evaluating its performance I hen we replace f 
with a new paiametti in (6)-(9) Note that (6)-(9) can be written 
as V — 0 where v > is a 2 (in -I- /) column vtclor lomted by the 
coefficients of I i(s) and / (s) and M is a square matrix which 
depends on the parameter Therelore n can be found by plotting 
the smallest singular values of \J as vanes between an upper and 
lowci bound The largest value of loi which the plot shows a zero is 
•)o see the below example In this sense the algorithm of finding 
is similar to standard iteration (| I| |6| [71) But here the size of the 
matrix whose singularity is to be determined is 2( n i -f /) x 2( n i + / ) 
even though the undeilymg system is infinite dimensional 

FxampU Let /’(-) = < ' 7(s-l) II- 2 (s + 1 )/(l(K+l) 
and 11.(s) =• (I2(* + 11) In this example in (s) — < 1 
m /(s) = (s-ni)/(s + m) and \ »(s) = l/(s 4- m ) where 
oi — l Note that we need to find F and/(s)=L (s)/7rfs) 
in the controller expression given by (12) Since ti\ = 1 and I — l 
polynomials 1 i(s) and 1 ,( •*) are both first order Therefore there 
are four unknown coefficients 'P = [7 m L 20 L n L 1] 7 where 
L](s)= /11 s -h / 10 and Ms) = Mft + Mi When h = 0 2 it 
can be shown that 0 2 < -0 < 1 3 see 1 4] and [9J 

In this case the largest value of -) which makes \t n singular is 
lo = 0 6819 it is obtained from Fig 1 where the smallest singular 
value of U-, is plotted Once 70 is obtained from this plot, a nonzero 
which satisfies M = 0 can be obtained easily The entries 
of $0 give r(*) = Ij(s)/7 i(s) which determine the optimal 
controllet via (12) For the above example /(*) and *>« can be 
computed as 

r s + U 2129 3 5 CC(^r -0 1 ) 

<> — 0 2129’ 10 («+12C5)(v + 0 788) 


75T 



Tor the suboptimal peitoimancc level (> — 0 7 we get /,(*) s 
—^ 777 ^ ) and if we choose a - 2 then 


T f (*) = L 


141 


.l(s) 


1+f (s)/D(-s) 
1 + D(s)l (s) 


paramelert/es all suboptimal controllers via (^) wheie 


i(‘ 

D( • 


% (s+ 1 W))(s + 0 2HG) 
(S + I 94)(s- 0 208) 
(s- 1 999)(s- 0 2146) 
(s + 1 9T)(x-0‘208) 


We see that foi this example rr, ,, (1/^) behaves numerically well, 
1 e it is easy to detect 0 from big 1 Also note that in the optimal 
case 7 (s) is all pass (but possibly unstable) and when ft is close to 
) D(s) is close 10 being inner and 7 „i, t i(s) and // H are 
close lo / (s) of the optimal case 


IV Pr(xti s ob THh Main Ri sul is 


Consider Oil Ji x contiol problems defined in Section II and 
define (r = \ w £ H' and Ct / = nn £ then 

P - (1 /(»/ is a coprime factorization Let \ \ £ be such 
that \r, + T Cm — 1 All stabilizing controllers are in the lorm 
(scl hid 




wheie (J is a free parametei in Following the notation of [9], 
dchne (> lo be such that H |U +U ,11 * - OO* and G“ l £ H v 
Let b\ be the hmte Blaschke product of minimal order such that 
llo MI !TI * G R^f x Then let H„ 6 - RM be such 

Ihdi \ =- e 7 r and vet (J, = ti V 0 Q + \ 

G, = II |H G 1 m — liii/i in 1 m 1 = b\m n It was shown 
that 1 9] the optimal control problem reduces to the computation of 


0 


inf 

Q\< 


I’ll ii - /»iHn - tnQi 

Ct, 


(14) 


and determination of C,? 1 and the suboptimal control problem 
reduces to 


|Gi € H' 


[Ho — m 1H0 - wQ 1 

G 


(IS) 


The optimal performance 0 is the norm of the so-called “Hankel 
plus Toeplitz’ operator [T /(TI| u 0 n(j) ^ 2 | wheie denotes 

the Toeplitz operatoi with symbol G [81 Note that II u H 0 m 
are stable and rational, and m j is inner and possibly infinite dimen¬ 
sional Now let / be an H** function such that f f* = * 2 
and 1 £ Note that by genericity assumptions 70 > "mm* 
and it is easy to show that -v mi „ = Define = Ho/A 
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ft, - Wo//v and #2 = Then 70 is the > maximum 

7 > 7 mm value such that one is a singular value of the Hankel 
operator F = F,,.^^^,^), (see, e,g., [ 8 ] and references therein), 
furthermore 

m*(n, 0 -roitf, 0 )-Qaupt = — (16) 

4'0 

where ao,|/o are nonzero Ji' 2 functions satisfying 

F.ro = f/u < F*yJ = *rj. (17) 


Note that (23) implies that myS e H J and (24) implies that 
m*(-ro - 777 ) G Hi. i-e.„ mVo - “£7 G Since 


Q\ + (u, - mill,)—- 
,s + 

— —(1/^ - Hllfi 


, . J * bm(a)\ 

(«*> - Mill/,) I w .r 0 --— 

V « +" / 

•H"''“- 7 + 7 ; 


Claim: When 0 > -)o we have 

m*(u, - nii u,) - = 


yS 4- £q4T 

j‘o + zoyoU 


. res (18) 


where J'o.j/o are H' 2 functions satisfying 


Jto=rjrn, J’o = , + r*yj. (19) 

a + <1 

Proof. Let V, (*) =■ — ni](*)ff,(*)) and *',(:) = 

r, (a) f for an arbitrary ti > U. Note that 5, = {/,Q 2 e : 
||1^ — < 1}. So, to find all such Q/s one can find all 

tfa G W*(D) such that ||o, - y 2 ||* < 1 and set («M<) = ^(—J). 
The set of all such <//s is given by [I] 

is - <12 = —J/ ■ f € 6(D) 

where y>,</ € W 2 (D) iuch that (I-Fr,.I7F)y> = 1, and q = Sr*F p 
(we use v = r, whenever the notation is clear from the context). 
Set r(z) = pU), yU) - ~ _1 f/U), then we have 

y — r*F*r, and . 1 * — 1 4- FT, y. 

By defining .r 0 («) * we obtain 

an equivalent set of equations in terms of ,r 0 . yo G H 2 

yS ~ r.ro, J‘o = + r^yo- (20) 

5 + r/ 

Finally, by setting C(s) = -f(;—), we obtain (18). □ 

In summary, we consider the following system of equations, to 
find C> P < and C p 

J’o.yo G H 2 , yj = r,r 0 , j-o = + PyJ. (21) 

s -j- r/ 

The suboptimal control problem is equivalent to (21) for = 1, i.e„ 
one has to find the unique nonzero j*q, y 0 satisfying (21), and then 
(18) can be used to obtain all Qi »,ii»., r .i. The optimal control pioblcm 
amounts to solving (21) for = 0, i.e., one has to find nonzero ,i*o. y 0 
satisfying (21) and then can be obtained from (18) by setting 
V * 0, 


A. Oh the Solution of .r 0 and y f) 

Let w, = 6 ,/Av m,i = A*/A where ft,, A^, .4 € R[#»], (/>,.&,) = 
1 . Define n = deg(fr, ) and / = dog(A). It is clear that ( 21 ) is 
equivalent to (22M24) 

•*‘ 0 , Jt/o-• 0\i G H 2 , (22) 

tto = w*(w, - mill, )xo + (23) 

.i‘o - ^ A + <p). (24) 

a x (I 


+ P W J HI ,/ 1 / , .I’o “ F W 2 m 1 U , ;- 

s + a 

we have 

/ , ^m(«)\ Pi i 

V, H”' ■ ro "7T7j = l“ 

U * ■- +■> P l 4 

P^WIrfl/vTO = III,/M, 4*0-jj 7 ~ 

where pi i,pi 2 G R[i»] with dog(pi 1 ) < /» - L degfpi 2 ) < / - 1. 
Hence we obtain 

Pj , 

o, - — + m f/ ii,4ci - f/-, —— 

A*, 4* s + n 

where yij ^ R[s] and dog (pi) < u + / — 1 Similarly, since 
r,; 2 = -P H 2 w{[ 1 * - w*u*)y f * 

= mi/!Jyn - P w . m f /«-Vo 

we have 

o * * 1 } 2 \ 

mu.//„ - -p- 

P H j m , 1 1 /’ 7o = m,/ii- i/n ^ 

for somep^ i.p_> 2 € R[.s] with dog(pj 1 ) < #/ — 1. d«*g(pj j) < /-l. 
Thus 

Pi . 

02 = —- + m ( /u,i/ 0 

for some p 2 G R[a] and dog(p 2 ) < 11 + / - 1. Now define 
PiM ~ (s -f- «)vm(^) - M-,(.s).4*(‘0//i(r0 and Pi (s) — (s -l- 
n)p 2 (s) + <^A’*(s).4(.s). Then, we have 

"■)*'i -r 'finy i 

*»o =- 7 ——-— —. (25) 

(.S 4- ir)7T-4 

A •rp.+miM'i 

*° = ■~+7)ir4~* (26) 

Pi ^ ~ A , 1 /, (A -) Pj + m 1 f;, Pi) 

Pi =- 7—;—;—777 -< (27) 

(.S 4 - fl)7T 4* A’, 

P^-A-;r,;(A‘;r,4mT^P2) 

- - U ZTT7TP- (28) 

where Pj and P 2 satisfy 

clog(Pi) < n 4- /. deg(P>) < n 4- / (29) 

A',Pi + mi^P,U = 0 (30) 

Pin — k*il*(k*Pi 4- Pi )| A* — I) (31) 

P 2 tt - Av/;(A*;P, + n.;6,P,)h = 0 (32) 

A^P, +m;t,Pj|, as -. (l =0 (33) 

Pa(-«) = -Mi, (34) 

Conversely, if (29>-(34) hold, we can define . 1 * 0 , yo-Oi. 4>‘i by 
(25H28). In this case (22M24) hold and hence (21) holds. 

If ft = 0 , (33) and (34) impily that Pi(-n) = Pjl-a) = 0. In this 
case (30M32) hold for P, replaced by PJ , / » 1,2 and (29) replaced 
by dog (Pi )< w + l-1, Furthermore, (33H34) are redundant and 
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(25H28) hold if P, replaced by P[ > * = 1,2 and the (* 4- a ) factor 
is cleared from the denominators. Since (16) depends only on j/o/j o, 
it is enough to solve ( 21 ) for f> - 0 and to set V = 0 in (18) to 
solve the optimal problem. 


B On the Structure of the Controller 

Let W\ = Bi/Ku Wa = B2/K2 with (Bi,K\) = ( B ;. 
AY) — l. Define Ac to be a stable polynomial such that 

Ac Ac = P,P;A* 2 A“.; + (35) 

Then, it is easy to see that G = Similarly define to be 

a stable polynomial such that 

b,b; = ~i‘ l An a a - Di D 2 n;n:. cw>) 

Then, we can define . So G-, defined in Section 11 is equal to 

= 2 ^ L ^ - O* 1 the other hand, since hi is the Blaschke product of 
minimal order such that TT 0 = h\G AV X W* = h x ^r-j^ € H v , 

^ *■ U ^ * l\ * 

we obtain fri = -4*. Furthermore, note that Mo = ■ / 4w-, ™ 
A, == B t B*,K;. k, = A', B, . Since *(.*) = b,h* -It, k; 
— (B\B\ --r A’|A7) .4r,-.4?,s and »»m = w„ ^• (30) implies that 
Pt = A a L> , P* = (37) 


In this case (30) reduces to 


l\ 4- w„F^ Lj\ f =0 (38) 

because i/-, — and hence (</-, - F-, )|k\. = 0 . 

Similarly, (31)—(32) reduce to 

L\ 4- in a F-, L > | a - — 0. (39) 

la + miF^A.h =0 (40) 


because AYu ) ~ ) 1 and hence T/-, (o 

Finally, (33)—(34) reduce to 


Li (-f») + ( E, la) A 1 IF, (n)m 1 ( (r/)/. t (-o) =r () (41) 

£,(-«) = 1- (42) 


In fact (34) reduces to L\ ( —a) — -h hut replacing this 

equation by (42) does not effect the ratios y/ ( */.ro and 
hence it does not effect the controller formulas. 

Note that, by [10] 

C - m,iG~ 1 A’q ' —r~ where (-) - 11 U 4- mdj 2 . (43) 

1 - m „G ~ l C) 

On the other hand, by (18) 


e = /, | 


‘ l "i 


nhi 


d + 

•i» + ~faVa f’ 


= mXr'irrvr, - 


*0 + 7+^UnP 


(44) 


By substituting (25), (26), (37), (44) into (43), and simplifying the 
resulting expression, we obtain 


G.„h 0 ,,i(*) = E^(s)nu(») 


A r o ] {x)E~ ! {h)Li {s) 
i + ncA^Fd^Lui*) 


(45) 


where 7 = p. and 

Lu(s) 


L'j(s) + L\(“s)U(s) 
L\(*) 4- /a‘ 2 (-*)r(.s)' 


and LuL'j are the unique nonzero polynomial? of degree at most 
»4 / satisfying (38M42). The controller structure is the same for 
the optimal case, except the free parameter V drops 


C opt 


Iff) = E lv (s)m, l (#) 


Nq 1 (b)F, 0 (h)L(*) 

1 4- /Mt 1 (,s)F >0 («)X(,si) 


(47) 


where L(s) = L-A*)/L\{s) and L \,A 2 are nonzero ploynomials of 
degree at most u 4 - / - 1 satisfying (38M40). 

V. Concluding Remarks 

In this paper, a simple expression is obtained for the optimal 
and suboptimal controllers for a class of unstable distributed 
plants, with rational weighting functions. We have shown that H 0 ® 
controllers are in the form (45). (47), where m t /. ui „. No are inner 
and outer parts of the plant P = in n No/ui ( r, is given in terms of 
the sensitivity weighting function W \: F- is computed from a spectral 
factorization which depends on 7. W\ and My. and polynomials Ly 
and L 2 are computed from a finite set of linear equations. These 
linear equations can be written directly as interpolation conditions on 
L\ and I 2 . that are expressed in terms of m n and F^ evaluated at 
the zeros of E 1 , and at the poles and zeros of m f i. The number of 
equations is 2 ( 7 #j +/4- 1 ) (in the optimal case it is 2 (m 4 - 0 ). which 
is equal to the number of unknown coefficients of L\ and Li. 
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Least Squares Type Algorithms for Identification 
In the Presence of Modeling Uncertainty 

Er-Wei Bai and Krishan M. Nagpal 


Abstract —The celebrated least squares and LMS (least-mean-squares) 
are system identification approaches that are easily imptementable, need 
minimal a priori assumptions, and have very nice Identification properties 
when the uncertainty In measurements is only due to noises and not due 
to unmodeled behavior of the system. When there is uncertainty present 
due to unmodeled part of the system as well, however, the performance 
of these algorithms can be poor. Here we propose a “modified” weighted 
least squares algorithm that is geared toward identification in the presence 
of both unmodeled dynamics and measurement disturbances. The algo¬ 
rithm uses very little a priori information and is easily implementable in 
a recursive fashion. Through an example we demonstrate the improved 
performance of the proposed approach. Motivated by a certain worst- 
case property of the LMS algorithm, an estimation algorithm Is 
also proposed for the same objective of Identification in the presence of 
modeling uncertainty. 


1. Introduction 

The least squares and the LMS (Icast-mean-squares) algorithms are 
perhaps the most widely used identification algorithms [4], [71, [91, 
(111, Their advantages lie in their simple recursive implementation 
and minimal use of a priori information. These algorithms perform 
very well in applications where the modeled behavior of the system 
closely approximates the behavior of the system to be identified. 
The performance of both least squares and LMS algorithms may 
become poor, however, if the assumed model structure is not rich 
enough to capture the dynamics of the system. Here we propose 
two algorithms that are motivated by the worst-case properties of the 
least squares and the normalized LMS approaches. While retaining 
most of the nice properties of the standard least squares (such as 
recursive implementation), these methods are also suitable for system 
identification in the presence of modeling inaccuracies. 

Recently there has been a strong emphasis in the control com¬ 
munity on “identification for control design.” The objective of 
system identification there is to obtain a good nominal plant together 
with some error bounds that are suitable for robust control design 
methodologies [2], [5], 16], [8], [10], [12], [13]. The approaches 
that have been proposed so far are, from a practical point of view, 
quite complicated and often result in nominal models of a very large 
order. There are other approaches to combine identification with 
control design [14], 1161- In the present approach we do not try to 
obtain nominal models with “hard” error bounds but instead sacrifice 
such rigid and precise objectives for optimality in a certain worst- 
case sense and case of implementation. They do not rely much on 
the a priori information and especially the “modified” least squares 
algorithm is implementable in a recursive manner. For clarity of 
explanations, we will consider single input/single output systems. The 
approach is immediately generalizable to multi-input/multi-output 
systems. 

The paper is organized as follows: in the next section we review 
some worst case properties of the standard least squares and the 
normalized LMS algorithms. Section 111 contains the precise problem 
formulation. Section IV the main results, and the Section V the proofs. 

Manuscript received April 12, 1994. 

The authors are with the Department of Electrical and Computer Engineer 
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II. Review of the Least Squares and 
LMS Algorithms 

Suppose we observe an output sequence y t of a system with an 
ARMA structure 

y t = p', 0 + £,, / == 0.1, - ■ ■. A T . (2.1) 


Here y, is the output at the ah time instant, is the measurement 
noise at the ith instant, p, € i?" is a known regressor vector 
composed of past inputs and outputs, and 0 € 7?" is the unknown 
vector to be identified that describes the system behavior. We will 
also use the following notation 

y := [i/o*I/i,-- , <I/\]\ * [pn. Pi, •■-,p\] / 

£:»[£n.£i.-■•■£*]'. (2.2) 


Let A be an identification algorithm. For the identifier An 00) will 
be used to denote the estimate of 0 obtained using measurements up 
to the stage /, j.e„ 0(0 = ^4(//u- //i < ■ ■ ■, y ,). 

The next lemma describes some well-known properties of the least 
squares algorithm that are pertinent for our later analysis [1], [9], 
Lemma 2.1 (Least Squares Properties )• For given data Y and f J\ 
the leasi squares estimate of 0 given by 0(A ) := (*'4>) -1 *'V is 
optimal for the following two cost functions 

a) 0(A T ) = argint{||£||‘: Y = *0 + 0 


b) 


0(A) = a rg 

o 


inf sup 

.40 j c 


||*0 - * 0 || J 

lkll J 


The first part of the above lemma states that the least squares 
estimate of 0 is that value of 0 which explains the data with minimum 
amount (in 0 norm sense) of noise. In the hlerature, this approach is 
sometimes referred to as total least squares |3). Since in this case the 
uncertainty is only due to noises, the least squares estimate would be 
the answer to the question “what is the 0 that explains the data with 
minimum amount of uncertainty?” In the next section we will ask 
this very question but also include the contribution from unmodeled 
behavior of the system in describing amount (norm) of uncertainty. To 
interpret the second part of the above lemma, consider the problem of 
estimating p'0 for / € [0, A ] using the measurements y {) to </\. The 
cost function to be minimized is the worst case amplification in the 
/j norm from noises ]L } to the estimation errors p'(0 - 0). This is a 
discrete-time H v smoothing (noncausal estimation) problem since 0 
is obtained using the entire measurement sequence. The second part 
of the above lemma states that if 0( V) is the least squares estimate 
of 0, then </,0(A') is the best worst case (in sense) smoother 
estimate of p'0. Thus the least squares estimate can be viewed as an 
optimal H ^ smoother estimate. 

Similar results can be said about the normalized LMS algorithm. 
For a given identifier A , we will define the filtered error at stage / 
as ( t , ■= p',(0 - 00)). 

Lemma 2 2 (Normalized LMS Property). For any given p > 0, 
consider the following normalized LMS algorithm 

(Hi + 1) := »(i)+ ■ - -(i/, + t - </,+,#,).»((» = ffu. 

1 + /I0, nt-'. + i 

Then the above estimator achieves the following optimal worst-case 
cost 

... .ii' j 

c 1 Hill 2 + - 0u|| 1 ' 


0018~9286/95$04.00 © 1995 IEEE 
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Fig, I. 

In the above, Bo represents the a prion estimate of B. Thus the 
normalized LMS algorithm minimizes the worst-case lj gain from 
the “uncertainty” (in terms of a priori knowledge of B and the 
disturbances) to the filtered error. The normalized LMS algorithm, 
however, is not the only algorithm that minimizes the above cost 
function |7). The cost criterion in the above lemma and part b) of 
Lemma 2.1 are similar m the sense that in both cases one wants 
identifier for B to give a good estimate for v'fl. In part b) of Lemma 
2.1, however, the estimate of B used in predicting o[B is based on 
the entire sequence of measurements available while in Lemma 2.2 
the estimate of B used in predicting <1>[B is the one based on only the 
measurements up to time instant /. 

III. The Probl em Formulation 

In this section we will formulate two approaches for identification 
in presence of both the measurement noises as well as unmodeled 
dynamics. It is worth reemphasizing that the objectives of the present 
approach are i) to make as few “hard” assumptions as possible (such 
as a bound on the noise, or an H * bound on the unmodelcd behavior), 
and ii) to come up with an algorithm that is easily implcmentablc. 

We will assume an additive uncertainty model (see Fig. 1) for 
describing the input-output behavior of the system 

,</ =* Gu -(- Au -I- (3.3) 

Here Cj is the nominal model that we wish to identify, £ is 
the measurement noise, and A is an unknown stable linear time 
invariant system that captures the unmodeled dynamics ol the system. 
Assuming an ARMA structure for the modeled part G tor the 
structure, one can write the above equation as 

\h - rt> + (Aii),+£ ( . ?=(). L--. S (3.4) 

where o ( , the regressor vector is composed of known inputs and 
outputs and B is the unknown parameter we wish to identify. Here 
we are also assuming that only a finite number of observations (.V) 
are available for identification. Let the (unknown) impulse response 
of the uncertainty A be <V ■ ■ ■ . We can then rewrite (3.4) more 
compactly as 

y = + (> + £ ( 3 . 5 ) 

where V, <F, and £ are defined in (2.2), and 

r<\) i o 

II \ 1/0 

A := r = (3.6) 


Motivated by part a) of Lemma 2.1, we now propose our first 
approach for identification of B. 

Problem J—Modified Weighted Least Squares Problem (MWLSP): 
Given are two positive definite weighting matrices H* = H7, and 
\V[ = H;. Find B so as to 

minimize + ||£||u ^ = b'Wsh + £ # H\C) 

subject to V = $B 4* l h + £. 

Before discussing the role of IF* and \\\ in the above problem, 
we will briefly discuss the underlying motivation behind the above 


problem formulation. The motivation behind the above problem is 
qualitatively very similar to the property a) of the standard least 
squares stated in the Lemma 2.1. Let 

J0W.Wc)-.= ||?v. + ||*||?v, • 

The quantity -7(ir*,H<) can be interpreted as the size (or norm) 
of uncertainty (due to both unmodeled behavior and the measure¬ 
ment noises). Thus in the modified weighted least squares problem 
(MWLSP) we want to “find B that explains the data with minimum 
amount of uncertainty.” In the standard least squares one asks this 
very question but there all the uncertainty is attributed to noises only. 

The weighting matrices IF* and U r t are chosen by designers 
to measure the individual contributions of the unmodeled behavior 
and the disturbances. This can be used to incorporate some prior 
knowledge. We discuss some cases here. 

Case l) Consider the following choice of weights 

UWA/, and \\\ = (1 - A)/. 0 < A < 1 

where 7 is the (JV + 1 )-dimensional identity matrix. The 
quantity A is chosen to reflect the relative contribution of 
unmodeled behavior and the disturbances. If A —► 0 then 
one expects that most of the uncertainty is due to <*> or 
the unmodeled behavior while A —» I corresponds to the 
belief that most of the uncertainty is due to noises and not 
due to the unmodelcd behavior. Indeed, as can be easily 
observed, the above method reduces to the standard least 
squares when IF.s = A I —► 7 and = (1 — A)/ —► 0. 

Case 2) The weightings H ’s can also be used to shape the accuracy 
requirements of identification with respect frequency. For 
example, consider a situation where it is desired to have 
a more accurate description of the system behavior at low 
frequencies. In this case, we would want the unmodeled 
dynamics (system A in Fig. 1), which is “consistent” with 
the identified model in explaining the data, to be small 
at low frequencies. Let {h ,) be the impulse response of 
a low pass filter. Consider now the following choice for 

h; 

ho U u (I 0 1 

h i ho i ho 

UV = 

L/i’ ho J L hs lis -I ho J 

With the above choice of IF. the norm of the unmodeled 
uncertainty = b'Wth) is given more weighting 

in the low frequency region. 

Case 3) The unknown system is assumed to be stable. Then we 
would probably also desire the nominal model obtained 
from identification to be stable, and thus so would be 
the unmodeled dynamics A. Consequently, the impulse 
response {A*} of A converges to zero as k -+ oc. 
This a priori knowledge that h —* 0 as k —♦ x can 
be incorporated in the identification process by suitable 
selection of H* and H<. We next describe one such 
choice for H* and \\\. The weightings H7, and \\\ 
should reflect the fact that the “tail” contribution of {<*>* } 
for large k is much smaller than that of the disturbance 
sequence {£^}. This can be easily incorporated into the 
weighting matrices by choosing them, for example, as 
follows 

Wh = diag(n, a 2 . • ■ *, n and IC t = rJ 
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where c is some positive real number, / the identity 
matrix of appropriate size and a > 1. With these 
choices of weightings, a unit ball of uncertainty defined 
as B =5 {(M): IMIwr* + ll£llw € < 1} would have h 
comparatively smaller than (k for large k. 

We shall observe in the next section that the optimal 0 for the 
Problem l is also optimal in a worst-case sense, a property very 
similar to property b) of Lemma 2.1. To be more precise, the optimal 
0 for the MWLSP also satisfies 


optimal 0 for MWLSP = arg inf sup 


||j>g - »0|| a 


II vi* 


+\m t 


(3.7) 


Property (3.7) has an obvious worst-case interpretation. Its usefulness 
Can be seen from another angle as well. From a good model one 
would expect to be able to predict the uncorrupted system output 
accurately. From (3,4) it follows that for a good identification scheme, 
d>[0 would be close to at, - £, for all i. Thus one meaningful criterion 
for system identification would be to find an identification algorithm 
that minimizes the worst-case gain from ‘'uncertainty” to the error in 
predicting the actual system output or to find 0 = .4(1') that achieves 


inf sup 


||($0 + ( r *)-$0|| 


AUi --- 

'--**>«* (IWvu + llcilU* 


(3.8) 


It turns out that the solution to the above problem is not easily 
implementable. As we show next, however, the MWLSP algorithm 
minimizes an upper bound for J. To see this note that 


<•« {IWI'f» t + lltlli» € }* ~V* {M&. + IKIIf,,}* 
+ «r,„ .^a/r'rrv*- 1 ) 


where M* is the square root of IV*, i.e., Jlf*A/* = IV* and <r„ mK 
denotes the largest singular value. The second term in the above 
inequality is independent of the algorithm, while the first one is 
precisely the quantity that is minimized by MWLSP [see (3.7)], Thus 
the MWLSP has an additional property of minimizing an upper bound 
for J , the worst case gain from “uncertainty” to the error in estimating 
the uncorrupted system output (output when there is no measurement 
noise), 

Next we present the second problem we will consider. The motiva¬ 
tion for the next problem comes from Lemma 2.2, a framework that 
follows from an immediate generalization of the worst-case property 
of the normalized LMS algorithm. Recall that for any given identifier 
*4, the filtered error at stage / is defined as t = 0',(0 - 0(0). 

Problem 2 — H ^ Filtering Identification Problem (HFIP): Given 
are two positive definite matrices IV* and \V$ and real numbers 
p > 0. Find the identification algorithm A so that 0 = A{ 1') achieves 


im, + 11^ + ^-^ 

where the supremum over and 0 is taken over all £,M that 
satisfy Y = $0 + Uh + £. 

Again the motivation is similar to before—we want to minimize 
the worst case amplification from “uncertainty” to the prediction error 
of the modeled part of the system. The previous discussions about 
the roles of IV* and \\\ also apply for this case. 


IV. Main Results 

In this section, we present the solution to MWLSP and HFIP 
problems. 



Theorem 4.1 (Solution of the MWLSP Problem). Consider the 
identification problem for a system whose input-output behavior 
is described by (3.5). For given input-output data Y , * and the 
weighting matrices IV* and IV t , if * is full column rank, then 
1) 

Lu-i. = (#'g$)" 1 $'gr (4.9) 

with Q= +nvr'L rl ) (4.10) 


solves the MWLSP problem. Moreover, the solution is unique 
and also admits a recursively implementable algorithm as 
shown in Lemma 4.2. 

2) 0r„u/» also achieves the following worst-case cost 


inf sup 
o -^( v ) e * 


||4*0 - *0|| J 

*iifi, + ii€ii?t 


The solution 0 tnw u to the MWLSP is basically a weighted least 
squares estimate of 0 with the weight Q. Unlike the standard weighted 
least squares problems, however, here the weighting matrix Q also 
depends on the input. Being a weighted least squares algorithm, it 
possesses all the nice properties of the least squares, such as recursive 
implementation as is discussed in the Lemma 4.2. 

Before presenting the next results, we present an example to 
demonstrate the performance of the MWLSP algorithm. The system 
to be identified is the following second-order system 


= — 1-i — 0. i —2 + I .ou,~ i + O.Gif # 2 + 6- 


For simulation, £, was assumed ro be an independent uniformly 
distributed random variable in [0, 0.6], input u, = mh(/ )+2 ros(2.3* 
/) and N = 50. The nominal model is assumed to be of the first 
order. Simulations were done for two choices of weights: in one 
case the weights were chosen as IV* =■ 0.97, IV t = 0.17 (from 
the discussion of the Case 1) in the previous section, this choice 
corresponds to an a priori assumption that measurement noises are 
more important/dominant than unmodeled dynamics) and in another 
case chosen as 14 * = 0.1/, IV$ = 0.9/ (similarly corresponds to an 
a priori belief that unmodeled dynamics are more important than 
noises). Fig. 2 shows the Nyquist plots of the true but unknown 
system (solid), the least squares estimate (dashdot) and the MWLSP 
estimates proposed in the paper with 14* = 0.07, \\\ = 0.1/(**) 
and IV* = 0.17, \l\ = 0.9/(++), respectively. As one might have 
expected, for both these choices of weights, the proposed algorithm 
outperforms the least squares algorithm. 

The calculation of 0 , i1w Ih in Theorem 4.1 can be made recursive. 
Before summarizing the recursive procedure, we describe some 
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notation used in the recursion. As defined previously, H7 and 
are (.V + 1) x (S 4-1) positive definite matrices where A denotes 
the number of data points. Let ir*(A) and II* (A) denote the upper 
left A x A submatrix of II * and ll'^ respectively, i.e., 


Wi = 


lit (A) 


, tr A = 


117(A) 


where * indicates the remaining portions of the respective matrices. 
Because of the symmetry of the weighting matrices H*(A) and 
11\(A), their inverses can be written as 




A(k- 1] 

t 

<<k 


ilk 

<n k 


\V~Uk):=\r; l (A n/; 1 (A] 


with .U"'(A) := 


ii(A-l) 0 

Al A* a 


being the Cholesky factor of 


II7" (A), where <ik a and M a are scalars, and m and bk are column 
vectors. The data is given by 



‘//II ' 


Vo' 

V(0“ 

.Vi 

, *(*):=- 

. 1 
( >>\ 


•f/k ■ 


i 

U>A J 


'u 0 

0 

0 ■ 


m 

Uu 0 



.•'k 

U A 1 ■■■ 

II n . 


\C{k- 1) 0 

<‘k i'k A 


with < a a being a scalar and < a a column vector. Define 


Z[k):*-l'[ A)W, 1 (A ) =- 


r(A- i )/](a - i) n 
/*( A - 1)4- »iiAa a 


Z(A- 


0 

•A A 


nr, '(Hrj + nJuirrNor'a-))"' 

= (.4U') + Z(fr)Z'(fr)) 1 = l.'(k)Uk\ 


wilh 


T.'(k):= 


rr'(k-i) o 


L(k - 1) 


no -- 


Lil'Ud-n 

The next lemma summarizes the recursive formula for obtaining 
0 IMtt \ h for the MWLSP stated in the Theorem 4.1. 

Lemma 4.2 (Recursive Algorithm for MWLSP): A recursive algo¬ 
rithm for obtaining 

6 ... ( k) - ( <t>' (k )(H k )*( k) r 1 *' (k )Q{ k )V( k) 

is given as follows. 

Given initial data //n- ^n. calculate 

1(0) = (.4(0) + Z(0)Z(0)ii(())B(0)) ,/J .l'(0) = L(« )./„. 

4(0) = U 0)0,',. 


Step 1 ) At Ath step, for any given new data i/;. i/a , and <?[, 
calculate 

Z(A) = r(A)Z?(A). /a = L(A - 1)K + Z(A - 1)-a) 


/* A - ^A.A + -l k + : A 


/7x - ~— (i/a —/j[r(A-l)), 

U A 

»1 = -L(oi 4«r-i)). 

»A.A 


Step 2 ) 


fimuiAk) = 0 rrM1 /*(A - 1) 


/'(A - Do* 


] -K^ P(k - 1)0A 


—(w -oi-1)) 


m>=^ m-i 


P(k - ] F'(k - 1) 
1 + o) P(k — l )Ok 


Step .1) Set 


1 




4(0 = 


4(*-l) 

Ok 


L(k- 1) 

1(0 = 

U’kUk-1) 

Step 4) Set A + 1 = k and go to Step 1. 

Note that h a is away from zero since II ((A) > 0 for all A. 

We next present the solution to the // x filtering identification prob¬ 
lem. The following result can he obtained by immediate application 
of the standard results lor the discrete-time /7 X filtering problem 
(see, lor example, [7|, |15|). Recall that for any given identifier A , 
the filtered error at stage / is defined as (f , “ <’/,(0 - 9(i )). 

theorem 4J (Solution to the It PIP): Given are a positive definite 
matrix IT* and a real number p > 0. There exists an identification 
algorithm A so that 0 -= .4(T) achieves 

J ‘~" tv ilt,. + W + £ll*-i 


i + O, -—/I/, >0. * = 1. • ■ •. ,Y (4.11) 


where 


ft 

o ir; 


h, ■= [o! u, 1/, — l ■ - • (/(l 0 

L = [o! I) ■ 0] 


and P(i) satisfy the following recursion 

P~'U 4 1) = P- [ U) + h\h, 


(4.12) 


Moreover, if (4.11) is satisfied, the following identifier achieves 

./ < V* 

0(/ + 1 '#(/) P(i + 1 )A:,l 

m > + i h) i + hniru + iw^ 



with [0(0). MU)] — [0 O .0]. 

From an implementation point of view, the above algorithm is not 
as nice as the MWLSP algorithm for two reasons—i) the order of the 
estimator is equal to the size of the unknown vector 0 plus the number 
of data points (while in recursive implementation of MWLSP, the 
order of estimator is just equal to the size of the unknown vector 0) 
and ii) to check whether (4.11) holds, one has to iterate with respect 
to The recursive computation of P(t) in (4.12) can be simplified, 
however, using the Matrix Inversion Lemma so that in obtaining 
P ’ 1 (J H- ]) from P-'IH one only needs to invert a 2x2 matrix, 
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V. Proofs 

fn this section, we will give a brief outline of the proofs for the 
Theorem 4.1 and the Lemma 4.2. 

In the proofs, we will use an observation that we present in the 
next lemma. Define a set 


of the unknown parameter and there were no uncertainties/noises 
present. From Lemma 5.1, thus for any A G C (the class of '‘correct 
algorithms”), the estimate 9 =* ,4(V) = 0*. Since from Lemma 5.1, 
our search is restricted to the class of “correct algorithms,” we obtain 
using (5.14M5.J6) that 


1’ = {A: A(Y) = 0 if/> = 0 and £ = ()}. (5.13) 

The set C is referred to as the set of correct algorithms which repre¬ 
sents all the identification algorithms that give the exact estimate 9 if 
both b and £ are absent. The following lemma is quite straightforward. 

Lemma 5.1: Let A be any identification algorithm .A(Y') = 9. 
Then, Mtp s> is bounded only if A € t\ 

Proof of Theorem 4.1: 

Part 1): From (3.5), £ can be written in terms of 0 and b as 
£ = V-M-TA. 

Thus 

Ji := IHI J w*+ll€ll?v e 

= fi'Wth + (Y - M - rb)'\i\(Y lb). 

Taking derivative of J\ with respect to 0 and <\ respectively, and 
setting them to zero yield 

0 = 2 \V*b - 2U'\\\(Y - $8 - l r b). 

Solving these two equation for 0, we have 

,. = (#'<?*)- | *'gv 

with 

q = n\ - + ?"ir £ n' 1 c\\\ 

*(H7‘ +nvr'U')-'. (5.14) 

The uniqueness follows from convexity. This completes the proof. 
Part 2) Let 

* = r'ilo' M’ (5.15) 


||<M(r>- M|| 


\Lt> 


(r -WWD'tDdW -H) _ e'e 

(»' -OYWD'diuDd'lUf -«) ~ C'QmC 

where ( = D\ 1 '([O' — H). Since Di\’l is nonsingular and 0* is 
arbitrary, C is arbitrary. Thus 


t. * - u d 


Md > _ 1 

ft nun (11 ) 


or that (5.17) holds. We now show (5.18). Note that 

= 0 + (4>'w+ £). 


and <I>(<l>'Q4*) '4>' = r, 


Or, 1 0 

0 0 


r:. 


Hence 


ll^nmk - ‘MU' = \\J.\rd Md 

in + Kir,,, Md 

where S - L ^ "j I’lQWM-'. ,1/; 1 ). Thus SS 1 = 

( ) - 1 (j 

r ^ ^ r( and from above it follows that 

||M mvvN -M|| J , 4 1 

sl11 ’ imi ~' "I ~iinrr~ S".m„v(Qn )- -—;■ 

This completes the proof. □ 

Proof of Lemma 42. Note that 

f) = (*'<*)0(*)*(*)) 

where Q(k) is symmetric and positive definite. Let Q[k) — 
L'{k)L(k), we have 


be the singular value decomposition of where U\ and \\ arc 
unitary matrices and D\ is a diagonal positive definite matrix. Let 
Q be defined as 


Q=*UiQV[ = 


Q \i 

0l2 


Qu 

(hi 


r;. 


<5 I ft) 


Also, Ah and Ah are defined as A A Ah = H * and M[ H 
Now, the proof is divided into two parts. In the first part, we will 
show that if 0 = A(Y) and A € C , then 


«mp||#0 - $0|| 

«* 


A/^l >_ l 


(5.17) 


o mui-(fr) = mill \\L(k)(Y(k) - fc(A-)0)|| 

= arj; nun ||V(A’) — <[>(A*)0|| 

where Y(k) = L(k)Y(k) and $(k) = L(k)$[k). From above, 
we see that 0„ m ih (k) can be obtained recursively (using the standard 
recursive least squares algorithm) if Y{k) and <b(k) can be calculated 
so. It can be easily verified that the algorithm shown in Lemma 4.2 
recursively estimates V(A ) and ${k) and then generates the estimate 
by implementing the recursive least squares approach on the new 
data. □ 


In the second part, wc will show that for the MWLSP estimate 0, nw ih 


References 


Mlp||W MI *|* “ $0|| 
€ * 


Ahb 

Mc£ 


0 linn (Ql 1 ) 


(5.18) 


We now show (5.17). Let 0 be the unknown vector to be identified 
and let 9 m be any vector of the same dimension. Let b and £ be as 
follows 


v 


117 1 

0 

'V 

£. 

2= 

0 

V J 

i 


Q$((f -0), 


For the above b and £, from (3.5) and (5.14), + ( = 

$9*. The resulting data Y and $ are as if 9* were the true value 
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Square-Rout Brvson-Frazier Smoothing Algorithms 

PooCiyeon Park and Thomas Kailath 


Abstract — Some new square-root algorithms for Bryson-Fra/ier 
smoothing formulas arc suggested: square-rool algorithms and a fasl 
square-root (or so-called Chandrasekhar type) algorithm. The new 
square-rool algorithms use square-root arrays composed of smoothed 
estimates and their error covariances. These algorithms provide many 
advantages over the conventional algorithms with respect to systolic 
array and parallel implementations as well as numerical stability 
and conditioning. For the case of constant-parameter systems, a fasl 
square-root algorithm is suggested, which requires less compulation 
than others. 


1. Introduction 

Square-root (or factorized, as they are sometimes called) algorithms 
for state-space estimation have been found to have several advan¬ 
tages over the conventional equation-based algorithms in numerical 
stability, conditioning, and amenability to parallel and systolic imple¬ 
mentation. While such algorithms for prediction and filtering have by 
now been studied quite extensively (see, e.g., [ 1 ]-18]), (he picture is 
not quite as complete for smoothed estimates. Square-root smoothing 
algorithms in the literature have various advantages and disadvantages 
(see, c.g., [4J, [9]-| 15)); all of them require certain inversion on back- 
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substitution steps, and none of them are particularly well suited for 
parallel implementation. 

In-another direction, there has been considerable effort in speeding 
up Kalman filtering [7|, [16J. While the conventional equation-based 
or square-root algorithms require C( n 2 ) flops pcf iteration, fait 
square-root (or Chandrasekhar type) algorithms for time-invariant 
systems [ 17]-[20] require only 0(n{pH-f/)) flops, where p and q are 
the number of outputs and the displacement rank of Riccati solutions 
(i.e., raukf/ 1 , * i|< - |), respectively. For smoothing, 

however, there seems to be no fast square-root algorithms in the 
literature. 

In our paper, we shall present some new square-root smoothing 
algorithms which arc more suitable for parallel implementation than 
any algorithm in the literature and also a fast square-root smoothing 
algorithm for constant-parameter systems. As a target, we shall 
consider the basic Rryson-Frazier smoothing formulas [21]. 

Section II reviews Kalman filtering and some recast versions [22), 
and Section III describes the Bryson-Frazier smoothing formulas, 
presents several square-rool versions, and briefly compares them on 
the basis of constraints, memory size, and computational speed. 

For convenience we first introduce some notational conventions. 

Square-Root Factors: Given a positive definite matrix A , -4 > 0, 
a square-root factor will be defined as any matrix, say A 1 /*, such 
that ,4 = (A‘/ J )( l‘^)\ where ihe denotes matrix transpose. 
Such square-root factors are clearly not unique. In most applications, 
the triangular form is preferred. For convenience we shall also write 
(A'^r - 4*/-\(.4 , /V l = A~ y '\(A '/V = A~*'\ 

Square-Root Forms: Given a matrix .4, assume that we apply a 
unitary operator (-) to the I so as to get some special form of a matrix 
D such as AH = B , where AA* = BB\ then we shall call the A 
a pre-array and the B a post-array. 

Matrix Inversion Lemma (A+ BC 'D)“ 1 = A' 1 - .4 “ 1 B(C~^ i -f 
DA 1 B) 7;.i \ 

State-Space Model: ,r M i = F,.r, -f (V, u,. tj, = H,.v, 4- v,.i > U, 
where F r £ II, £ G, £ and id, t'»} 

are independent zero-mean white Gaussian random variables with 
the covariances {1I„. Q,. /?,}. The matrices { F,, 6',, //,, Q,< /?,, II,,} 
arc assumed to be known. For simplicity, wc shall define j‘,|, = 
the linear least-squares estimate of .i\ given | llo- ■ • ■. P*\t ^ 

F[.r< - ,r, |, |[r, - -it| j 1 ^, the error covariance of the estimate ,\\\ Jt 
For compactness we shall write, unless necessary for emphasis or 
comparison. ,.P,\ t ^\ ^ P,. 

II. Squarf-Rmi Kalman Filtering 

Given the standard stale-space model described above, we have 
the following recursions: for measurement updates 

i,i, =+ k, r,|, = r, - k, ,u, ,k] , 

and for time updates 

=f,p.u =F,r,|,r 

where K, , = 1 ?,,!/?, + «, = y, - H. 

.i f , and the P, arc propagated via the Riccati difference equation: 

Pi 11 - r,p,r; + g,q,g:- F I A / .,tf,.,A;,F,\F 0 - n 0 . 

When n 0 is very large, it is often preferred to propagate F ~ 1 , 
which can be performed when ITo > 0, R, > 0, and the F, are 
invertible. The resulting so-called “Information Filter” formulas are 
as follows: for measurement updates 

= pr { *' + H:R; [ y l . i > 7 / = r; x + u;r; 1 h. 
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for time updates 

i&U;+» = ('/. ; + Frp^Ff i Gi(iG:r i (Frr^ ») 
p~ + \ = Frp^F- i \i -Gti.GtFrP^F- 1 ) 


wherp Qi — Qi-Q, G* P~+\ G, Q, . For convenience, we shall define 
Kyj 4 F,P,H"R~] t Sf.4 = K P 'iR'J?, A /, 4 K f .,R x J 2 . and 
\:Mt 4 FrP^F-'GiQ'/-. . 

Square-root algorithms for Kalman filtering can be nicely organized 
by combining what we call the covariance and information-form 
Updates 122]. Let us first consider the basic so-called square-root 
covariance filter (SRCF) algorithm for measurement updates. Form 
the pre-array A and block triangularize it by applying any unitary 
rotation 0 so that AB has the form 


R 


1/2 


A = 


H,py i i 

pi/ 2 


-nr®; 




Kpr ri i 


AS = 


X 

Y 


0 

Z 


We may regard this as saying that the rotation 0 sets up a 
conformal (i.e., a norm- and angle-preserving) mapping between the 
(block) rows of A and the rows of the post-array. An inner-product 
of the first row and a cross-product between the first and second rows 
yield the relations R t -f H,PjH* = XX* and H.P, — XY*> from 
which we can identify X = R { J* and Y — Kf From the second 
block rows we obtain P t = Kj.iK },, 4 - ZZ*, so that we can identify 
Z = P'\,' X ■ Forming a cross-product between the first row and the 
third row, we obtain the relation that y, - #»■' « = A'n, and verify 
that o — R~y 2 e,. Similarly, forming a cross-product between the 
second row and the third row gives us i\ = Vn -f Z3. Therefore, 
we can select Z as 

By inverting and transposing the hrst two blocks of the A , we can 
form another basic so-called square-root information filter (SR1F) 
algorithm for measurement updates and combine the SRCF and 
the SRIF into a nice array form shown in the measurement-update 
equation of the following algorithm. 

As far as time updates are concerned, we can form the following 
pre-array 

F,p \{ 2 GiQ ; /B 

>1= 0 Q\ n 

«W / * «» 

and, after identifying the post-array by performing inner- or cross- 
products of the pre-array and comparing them, we can obtain the 
result shown in the time-update equation below. 

Algorithm U A — Square-Root Covarianc e and Information Filtering 
(Two-Array Form); Assume that /?, > 0, IIo > 0, and the F, are 
invwtible. Given P () ~* /2 = n“* /2 , P 0 ,/a = £|J /2 , and Fp h x a = 0 , 
we can write the following. 

Measurement Updates; 


- • - 1. ■ 

oA 

R t py i ■ 

pi/ 2 


■ yy 

A/.; 

!M 

's. * 

w V 

-h;r~' ,x 

0 

1 i 

e = 

R~*/' 2 

o'. 

^R-'H.Plf 

p-»/2 

i .|« 

-y:R~ ,/2 

upry 



pW 2 


where 0 is any unitary rotation that zeros out the (1, 2) or (4, 1) 
en^ 


Time Updates: 


p p 1 / 2 

0 

GiQ'/ 2 ' 

cry 


r p j / 2 
QiG,p- + T 

0 

Q'/ 2 

V*P7* /a 

1 f 1 1 

r i 1 1|, 

0 

Q7' /2 

e = 

p~*/ 2 

1 i+\ 

0 

-K\, 

Q7* /2 

f* p-*/ 2 

0 


j'.* p-*/ 2 

L >4-4+1 * ,4-1 

—J ,j 4> j At, i . 


where 0 is any unitary rotation that zeros out the ( 1 , 2 ) or (4, 1 ) 
entry of the pre-array. 

Estimate Construction: The filtered estimates .r,+ i can be con¬ 
structed from the post-array of the time-update equation in Algorithm 
1U as .r,+i = (P}±i)(P~+/ 2 j-i+\ ) or from the post-array of 
the measurement-update equation in Algorithm II. 1 as ,r,+i = 

The following corollary shows the square-root covariance and 
information filtering (SRCIF) algorithm in a single-array form, which 
is useful when the input and output noises are correlated; see [ 22 ]. 
We shall use this single-array form in Section MI to implement the 
Bryson-Frazicr smoothing formula in square-root form. 

Corollary II. I — Square-Root Covariance and Information Filtering 
(Single-Array Form): Assume that R, > 0, ITo > 0, and the F, are 
invertible. Given P { f^ 2 = P^ 2 = n,‘/\ and P ( f^ 2 .r () = 0 

R\ n H.r! 1 2 0 ■ 

0 F, py- 

-F-'ii:n;' ri f;"p~’ /2 » 

g* f~‘h;r ;’ 12 -G’Fr'p ,"' 2 Q~' n 

. z?:’ 7 - ~p~~ r —. 


- w i/» 

0 

0 

A,'. 

f »-f 1 

0 

0 

P"“ */ 2 

J i+l 

-A',,., 

0 

0 

Q7' /J 

-e*R:* /2 

p~*/ 2 
•1 /-(■ 1 .T , 4 1 

-H+lRlK,. 


where 0 is any unitary rotation that either lower-triangularizes the 
first two (block) rows or upper-triangul arizes the third and fourth 
(block) rows of the pre-array. The filtered estimates ,i>+\ can be 
constructed via i’;+i = (/V+ZM^+'/^h-i ) or .iv+i - F,(j\)4- 
K v .AR~y\). m 

III. Bryson-Frazier Formulas and 
Their Square-Root Algorithms 

The fixed-interval smoothing problem is to find {.r,|.v)/-o:Y given 
output data }>=()■. <v- In 1963, Bryson and Frazier [21] suggested 

two smoothing formulas that use the {.?,} or the {i,|,} of Kalman 
filtering, and certain quantities defined by the recursions 

A i | ;V Fp. i A»-H | A *+■ R * R7.1 A A 4 . | | A' =0, (l) 

A , | a . = Pp , i A; .(-1 1 ; v Pp , 1 4- R * R r ,! Hi « A A r 4 - j | v = 0 (2) 

where F #(l , = F, - A ’ v jHi. (See the Appendix for the brief 
derivation.) 

Algorithm III. I—BF Smoothing Formulas: 

Case 1 ) (based on .r,) v = t'i -4- Pi^niv, 

P t | iV = Pi ~~ Pi A 1 1 1\ Pi ■ 

Case 2) (based on i,|y) 4- P t \ t Ff Y t +i\N, 

Pi\N = Pi |t - P t \,F* 
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Square-Root BF Smoothing Algorithms 

The coefficients F*\ t and H* R~~] of (1) can be computed simul¬ 
taneously when the Pi are computed via the conventional equation- 
based algorithm. Similarly, we can get some terms related to the 
coefficients when we use a square-root algorithm for propagating the 
Pi. Consider the following array, formed from the array of Corollary 
H.l extended with one more row 


■ R 

1/2 

H,ry 2 

0 

i 



0 

F,py 2 

G.Q'J 2 






0 


0 

1 

0 



K' ri 

i\: p;* /2 

0 




■ K r : 


0 

o ■ 


R i> .i 

i 

,1/2 

i+\ 

0 


X , 


r, 

(*) 



,/ ' r-H 

1 »+i 

(*). 


Cross-products between the first and the third rows and between the 
second and the third ones yield the equation X, = P, 1 /2 (P,H* 

v, = p; /J (F- k,,., nr p ,:;/ 2 = r ; /2 f;, pry/ 2 . 

Rewriting (1) using the {A,} and {V,} gives the formulas 

(p; / 2 a,| A .) = o;)(f,*/; j a, +I !.v) + (A',)(r?:! / V) 

which recursively propagates ( P '‘ 2 \ } instead of (A,|\ }. With 
the \ P~ 111 r , ] recursively found from the above array, we can use 
the Bryson-Frazier formulas to obtain 

F, | .v = (P; A '){(Fr' / ^,) + (^ /i! A,|..v)}. (3) 


It is impossible, however, to hnd the quantities {A,] ; v \ in (2) by 
directly using the {A’,} and {T,}. Therefore, we shall separate the 
{y^A.iv} into 1 P *^\‘/J ( and {A~I/"A,| N ( and fonn another 
array using the {AY(, {V, | and \P* /2 \](*} 

-v, (v,)(F; + / i J .v, , | L ; l , v )- 

«■:/?,-* /2 ) (A'+avA;;,^) J 

where 0 is any unitary operator that zeros out the (1, 2) entry of the 
pre-array. Inner- or cross-products of the array entries yield 

inr = -Y.A7 + (V, )(p* + / ;^\^ |(V ) (a;/;, v r,!/?)(>; i\ 

I> = U- 1 {(A\)(/?,:; /2 r,) + (V,)(A- | 1 / 1 ^A, +1 |.v)}. 


0 = 


Hence, the new array recursively provides \W ~ P'J 1 A 
[p = A^y j A t |,v }. so that the { P,\ ,v} can then be found as 


'V = (r! r2 )V - (^a:^)(a;^^ 1/2 )}(/V /j ). 


and 


(4) 


These results are summarized in the following proposition. 

Proposition III. I — Square-Root BF Smoothing (Case I): Assume 
that R, > 0. Perform Step I first, and then Step 2 and Step 3, together. 

Step J: For (P { 0 and (P 0 1/2 ) = propagate 

[P ™ 1 } and {Py 2 } using a forward recursion 


n 

y 2 h.p'g 

0 1 



0 FP. ,/2 

G,qy 2 





0 



0 / 

0 


[-v:nT* /a i?P“ /a 

0 . 




■ *y; j 

0 


0 


A',,., 

p 1 / 2 
1 «+i 


0 


p* /2 H*n;*, /u 

P*/-v* p- +/ 2 

1 1 r p,t I i+i 

(*) 


-gb;'/ 2 

:* p-*/'* 

•t V’f'J ■* i + 1 

(*)"': 


where 0 is any unitary operator that lower-triangularises the first and 
second rows of the pre-array and “(*)” indicates the redundant entiles. 
During this forward recursion, generate and save the following vari- 
able*: for Step 2, 

and for S>ep 3. (P* /2 }, (P, ' /2 .i-,}. 

Step 2: For (A^A**,,*) = 0 and (P^m.v-Vv+m.v) = «• 
propagate {Ajj- • A,,*} and {P'/ 2 \]( 2 . ] using a backward recursion 

\(r; f7 »: *r; /a ) (P^PPPV/ 2 ) (K ' 2 .\]^ s ) • 

(**::/*) (^m.vA-;/, 8 a : 

■ o ] 


0 


(A;,:VA 


- */'2\ 


(*)j 


where 0 is any unitary operator that zeros out the (1, 2) entry of 
the pre-array. 

Step 3: Calculate smoothed estimates and their error covariances 
in parallel with Step 2 via (3) and (4). • 

If we are interested only in smoothed estimates but not in their 
error covariances, then we can speed up the algorithm and reduce 
the amount of storage. 

Proposition 111.2 — Square-Root BF Smoothing (Estimates Only): 

Step I: For (P { ~ ]/i .r n ) — U and (P 0 1y/2 ) = Ilf 1 /", propagate 
{Pf ,/jJ .?,} and [P~ {/2 ] using the same array as in Step 1 
in Proposition 111.1. Save the following: {P^ 2 }, {P“ 1 ^,i , ,|, 

{P? /2 F; k ,P~+/' 2 \< {c, = 

Step 2: For !|.\-A.\>i|.v ) = 0, propagate {P* /2 A,| A -} using 

the following backward equation 


(P, A 


|v 


-vp: /2 F t :.,pr + 7 2 )ip:i 2 a, +1|A >+ 


Step 3: Calculate smoothed estimates as 

= (r! /2 ){(P~ ,/2 h) + tr*'* A,| v)}. 


Remark I: The main advantages of the square-root BF algorithm 
arc that there is neither matrix inversion nor back substitution, and 
all gains for forward and backward recursions can be computed 
via a forward pass only. Therefore, whereas the forward pass pro¬ 
cedure requires a large computation, the backward pass procedure 
requires relatively small computation. As disadvantages, the error 
covariance P, j,v might not be pOvSitive semi-definite due to round-off 
errors and, during the procedures, we should save { P*^ 2 H* 
{F‘ /2 F,:.,P,;*/ 2 }, {/?, ., [py 2 ). {Pf which requires 

memory of the order of (.Y + l)»p, (.V + l)n 2 , (A T + 1 )p, 
(N + l)n(n + l)/2, {N + 1)m, respectively. The major costs come 
from {Py 2 Fp.iPiy/ 2 ) and |P, I/2 }. 

If we are interested in j-v but not in P, \m and the F, are 
nonsingular, we can find another square-root fonn that compensates 
for the latter drawback. Let us think about a method for propagating 
j.r, | and {PA,|\ }. The first quantities can be composed from the 
array in Step 1 of Proposition Ill. 1 via (P' / ~)(P“ i/ ‘ i j' 1 ). 

For the second quantities, let us consider the following variation of 
(1): (P|A,|.\) F,,,(P, + iA, + i|.\ ) + iA'j.i)(H~y i e. i ). where F,.i 

denotes 

p.p;,p.V, = P .- 1 (/ - g,q.g:p,i\ ) = r~'{i - GQ\ l2 Kl,). 

Since the two equations in Algorithm II.I give { I\ \, t }, {n~y 2 t \}, 
(A’/,,,}, and {Q, 1 ^ 2 }, we can find {P,A,|.v} via 

(PA,|\) = (p-'){/ - G.(g: /2 )(A' ( :,)}(p, + i a 1+1 |,v) ; 

+ (/m, )(«;., l/2 r,). (5) 
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Proposition IHJ — Square-Root BF Smoothing (Case 2): Assume 
that Hi > 0 and the F, are nonsingular. 

Step!: For (P^ X ^H) - 0 and (P^ 2 ) — propagate 
[P~ 1 ^ 2 >i ‘,} and^P, 1 ^} using measurement- and time-update equa¬ 
tions. 

Measurement Updates: 


0 

pill 


' A’,. 

‘T 1 

-hir;’ 1 ' 

p-*/* 

0 = 

0 

p ~*/ 2 


*:rr /2 \ 



n-*/^ 

J 


where 0 is any unitary operator that zeros out the (2, 1) entry of the 
pie-array. Save {(A'/-*)(J?“ f l/a «,)}. 

Time Updates' 


r fp x/2 

G,g, 1/J ' 


pi/; 

' 14 1 

0 

0 

g, 1/a 


g.c;p,;V J 


p~* p-*/* 

0 

0= 

p~*fl 

1 /4-1 

-AT, 


0 


y,* p-*/'! 

L >lf >+i 1 t+i 

-f 

l' 


where 0 is any unitary operator that zeros out the (1.2) entry of the 
pre-array. Save the following: [(Py 2 )(Pf^ 2 S\)} t {(Q'J 2 )(Kl ,)}. 

Step 2: For (P \+\|\A v+h/v ) = 0, propagate (P,A,|\) via (5). 

Step 3: Calculate smoothed estimates via .i ,|\ =■ {(P, 1 ^) 

(i^r 1/a >. >}+(z 1 . a,| A >. ■ 

Remark 2 * An interesting feature of this algorithm is that the SRIF 
block in the measurement-update equation in Algorithm 11.1 is chosen 
for measurement updates, whereas the SRCF block in the ume-update 
equation in Algorithm 11.1 is chosen tor time updates. 

Remark 3 When the P, are time-variant, there is no difference 
of memory size between the two propositions if we save the /),, by 
writing over the F ,. Otherwise, the memory size is quite different. The 
latter case requires memory of the order of ( N +1) n , (A +1) n , (.V 4- 
1 )up corresponding to {( P] ,l )(P ~ 1 /2 u\)}, {( I\ f , )(H~ ‘ /2 t ,)} and 
,)t. respectively. 

Fast Square-Root BF Smoothing Algorithms 

Now, we shall derive a fast square-root (or so-called 
Chandrasekhar-type) form for the BF formulas. If the system matrices 
are constant, we can apply a fast square-root algorithm for forwards 
Kalman filtering, which will produce the normalized Kalman gain 
X t >, and the square-root innovation covanance H 1 / 2 . Recall that 
we generated some quantities from a forward pass using the array in 
Proposition III.3: {(K, .)(Z?, - , 1/2 *’.)} and {(Q’ /J )(A' ( T,)}, which 
are needed to form F,., = F 1 (/ - GC)'/ ' h ",) for the backward 
pass. Unfortunately, the term , cannot be found using the fast 
square-root covariance algorithm for forwards Kalman filtering, 
because this term contains P,~jl l , which is impossible lo obtain 
through the displacement of the P,. Therefore, we must employ 
one more fast square-root algorithm related lo the displacement 
of the Pf +\. 

Fast Square-Root Covariance Filtering. We first recall that the 
fast square-root form uses matrices {L,} or {.V,} defined as 
L,JlL * 4 Pi+i - P, or AV7/.A7 = P,+i,h-i - P,j,. Consider 
the following array 



where 0 is any ./-unitary operator that zeros out the (1, 2) entry of 
die pre-array. J-inner or J -Cross products of the array entries yield 
A-Y* a R, , +J , YX* = K„ 1+ , r:» + *. ZJi.Z" = P,+ 2 - P, +1 = 


L,+i . Therefore, we can identify X — R x /^,, Y — A’ P .+i, 

Z = P,+i. 

If we want to find the Kf, from this array, we should use N, 
rather than L ( , which obey the interesting formula A,+i0 = FAT, 
whefte 0 is some Ji -unitary operator. For a proof, note that, since 
Pi+i = FP { ^F**-FGQG* and thus L,+\Jf s L*+\ = P t +%~ P»+i = 
P(P ffJ | 1+1 - P,|,)F* “ FN t J] \*F* , therefore, the above array 
form can be rewritten by 



HFN,~\ 
FAV i 


0 = P 


\n [jl 

K\ t+i 



where / = / i F and 0 is any J-unilary operator that zeros out the 
(1,2) entry of the pre-array. Hence, unless the matrix F is singular, 
wc can generate the {(K f , 1 } from the fast square-root 

covariance algorithm. Now, to construct the F*,, we arc going to 
provide a fast square-root information algorithm. 

Fast Squat e-Root Information Filtering The objective of the al¬ 
gorithm is to find F* , = F~'(J — CQ\ /2 Kl, ), and indeed, 
{(Q l J 2 )(1 \’u . )}• Consider the following array 


Q;' /2 g*f-*m, 
AT, f~*m, 





where 0 is any ./-unitary operator that zeios out the (1,2) entry ol 
the pre-array, and * 4 P~ + \ - P~' Forming ./-inner or 

J-cross products of the ariay entnes yields 

xx* - y + ci *f *(P,+ 1 ,+ //*/? 'H)F q,^. 

V A* = F-‘{r, t \+H*R-' H)F 'G = K h ,, 

zJmz: = i>r+\ - rr+\ = w.uJu v;, 

Therefore, we can identify A" = Q,i ’('‘. 1 = hi, ,+ i. Z — 

Here we note that the minimal lank of the Ji is as same as that 
of the J\i. lo show this, consider initial conditions U f) and An. 
Then nole that since Pi — P () = LuJi L* {) - F \_ i ,Ji A * | F*, 
Pf'-K' =-Pn ] Lu(J,-FL; ) Pf i U) 1 F-n / n * (- MuJ\l MS )■ 
Based on these results, we can present a fast square-root smoothing 
algorithm. 

Proposition III 4 

Step 1 (Fast Square-Root Covanance Filter mg) 

' R'Jf h rx, ,i r/?, l/ 4i (| 

A f , F ’N i -1 0 — A f ,+1 i 

K'J 1 o J [n;:ii <*) 

where ./ = / Ji and 0 is any J-uniiary operator which zeros out 
the (1, 2) entry of the pre-array and 

LoJ] L() =P| — P b, Lo — FA r _i. 

R'Jn =(/? + f<, n = r u H r R;; /2 . 

For j-(, = 0, compute {//,} and {.r,} via //, = A"/,/?, - Hi ,) 

and ,r f+1 = F(.r, -|- //,), respectively, and save them. 

Step 2 (Fast Square-Root Information Filtering) 

'g; * /i cf- u,] o 

Kh , F * M, 0 = K f , i ^ i M, 11 

.q'J 1 o J Lw.+I (*) 

where ./ = 7 i Jm and 0 is any J-unitary operator that zeros out 
the (1, 2) entry of the pre-array and 

M 0 J\tMo =— Fo l Lo(Ji +L;,P u 'L n )-'LaP^', 

QZ' 1 * =(Q - QGPT'G'Q /u o = IT'GQQZ ’ /1 - 

During the procedure, save {A‘£,) (. For ( + 1 A v +, |,\ ) = 
0, propagate 

(P 1 A,|v) = F- | {/-G{(g t ,/ “)(A- t * 1 )n(P 1+l A 1+1 |„) + /.„ 
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TABLE l 
Constraints 


SR algorithms 

Error Cov. 

Fi 

System 

Prop. IIL1 

yes 



Prop. III.2 

no 



Prop. III.3 

no 

invertible 


Prop. III.4 

no 

invertible 

constant 


TABLE !I 

Comparisons 


SR algorithms 

Storage 

Speed 

Stability 

Prop. III.l 

0(i(M+iK) 

slowest 

Good 

Prop. III.2 

0(§(AM-l)n 2 ) 

slow 

Good 

Prop. III.3 

0{(N+l)np) 

slow 

Good 

Prop. III.4 

0({N+l)np) 

fastest 

(?) 


Step i (Smoothed Estimates}: Compute the smoothed estimates as 
= a+ (P,A i | A r). ■ 

Remark 4: This algorithm requires memory of order of (A T + 1 )»», 
(.V ■+■ 1 )n, ( V + 1)h.j» for the p,[, {/*,.}, and {(Q, l/J )]. 

Remarks (Comparisons): If error covariances need to be com¬ 
puted, only Proposition III. 1 is available. If the matrices F t are not 
nonsingular. Propositions 111. 1 and II1.2 arc possible, where using the 
latter is faster than using the former. If the F, are nonsingular and 
error covariances do not need to be computed. Propositions III.I, 
111.2, and 111.3 can be used, where Proposition 111.3 gives better 
performance than the other propositions because of less storage and 
computation. If, in addition, the system matrices are constant, all the 
propositions can be used. In this case, we recommend Proposition 
III.3 and Proposition III.4, depending on a trade-off between speed 
and stability—see Tables I and II. 

IV. Concluding Remarks 

The new square-root algorithms use arrays including tillered or 
smoothed estimates and their covariances and avoid inversion or 
back substitution, which are major advantages over previously known 
algorithms with respect to systolic array and parallel implementations 
as well as numerical stability and conditioning. Moreover, the square- 
root array form supplies flexibility in implementing the rotation in 
various ways. 

For constant-parameter systems with invertible F, a fast square- 
root (or Chandrasekhar-type) smoothing algorithm is suggested, 
which has advantages of small storage and fast computation over the 
others. Bqt the numerical properties of this algorithm need further 
investigation because it uses ,7-unitary rotations rather than unitary 
rotations. 

Appendix 

Brief Derivation of BF Formulas 

The fixed-interval smoothing problem is to find {.i\|A r ) f -n : .v 
given output data {;<//}/=o n. In this case, since the smoothed 
estimate can be rewritten with the innovations e 3 so that .i\|a = 
E(s r Cj )E~ ] (f je*)e n where “E(a‘) M denotes the expectation 
of «\ we have the following two expressions of .r^x 
• -1 x 

£ E(.r, r ;)E-'(r^ ) ej + 22 EU-.'^E-' (f»> 

J= 0 ./ = ' 

I s 

or '£2E('L'y j )E'~ l (ejej)Cj + E(Wj)E~'(ejrj)ej 

J -0 >=»+t 


m 

where E{* t r*)E~ l (v,e] )e, can be denoted as i\ arid 

EWo as After performing complicated 1 

calculation we can verify that, for / < j 

Eery) = = p i]t F7*;uj + dh; 

where 4*( /. i) — F lt J —i Ep % j —2 ’ * 1 FV } ” F* 

K V i ~ FMj.,. Therefore, defining X t \x as , 

gives the two recursive equations for the smoothed estimates jit 
Algorithm III. I. 

Since e, is orthogonal to r 7 for i ^ . the two recursive equations 
for the smoothed estimates in Algorithm Ill. 1 provide the relation 

£(•<', .\js>v*\ t \) = E{S m ,P i \,F*E(\ 1 +\\xX* +x \x)FiPi\j- 
= E{.v,K) + r,E{ a,|*a;„v)/v 

Using the fact that P, \\ = E(:r,.r*) - Ef.rq/v.rJ| lV ), FVj, 
E{.r,.r* )-£ , (.i i / |. r r-| 1 ), and P, = E(.r,.iv-)*), we can obtain 
the two recursive equations for the covariance matrices in Algorithm 
mi 
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Robust Hoo Control of Uncertain 
Nonlinear System via State Feedback 

Tielong Shen and Katsutoshi Tamura 


Abstract —This technical note presents a solution of robust control 
problem for an affine nonlinear system with gain bounded uncertainty. 
It Is shovto that the L 2 -Induced norm of a closed-loop system with the 
uncertainty is less than one if an extended nonlinear system with no 
imcCrtalnty Is dissipative with respect to a supply rate. In consequence of 
this mult, a state feedback law such that the closed-loop system has ft x 
robust performance Is derived based on the Hamilton-Jacobi inequality. 
It Is also shown that the existing results for linear systems are special 
cases of the presented results. 

I. Introduction 

A popular method to solve the standard control problem is 
the state-space approach developed in [1] which is based on the 
relation between the ff^ norm of linear system and the Riccati 
equation or inequality. Recently, attention was focused 10 robust ft Xl 
performance problem and the state-space approach has been extended 
to solve this problem for linear systems with parameter perturbation 
f6]-[9], In this literature, it was shown that a controller ensuring 
robust stability of the closed system with an Hr^ norm restriction can 
be obtained by solutions of the Riccati inequalities, The background 
of these results is that the robust stability of the system with parameter 
perturbation can be guaranteed by a positive definite solution of the 
Riccati equation (see 110|—f 12]). The state feedback problem ([6], 
[81) and the measurement feedback ([71, [9]) have been solved using 
the state-space approach. 

On the other hand, the state-space approach was extended to 
nonlinear ftno control problems [2| and [31 where the terminology 
of nonlinear ft,* control means that the 1 2 induced norm from 
input to output is less than one. This extension was possible based 
on the relation between the £2 induced norm of nonlinear system 
and the Hamilton-Jacobi equation or inequality. The relation is a 
generalization of the one between ft v norm and Riccati equation 
in linear systems. The classical notion of dissipativeness in [4J 
and [5] plays an important role to obtain this result. A nonlinear 
state feedback [2| and measurement feedback [3) approaches were 
developed, respectively, along this research line. 

In this note, the line of the research is kept to develop robust ft* 
suboptimal control of uncertain nonlinear systems. We will extend the 
approach given in [6] to solve the robust ft* performance problem 
for nonlinear systems with gain bounded uncertainty. In other words, 
a state feedback controller such that the £2 induced norm of the 
closed-loop system with the uncertainty is less than one can be 
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The authors are with the Department of Mechanical Engineering, Sophia 
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obtained by a smooth solution of Hamilton-Jacobi inequality. The 
approach developed in this note can be extended to output feedback 
case. 

The following notations are adopted here. For vector ; G 1?", ||i|| a 
and ||;||t denote z 1 z and {j'J z 7 (r)z(r)dr}i, respectively. For a 
differentiable function V : R u -♦ ft. —- (.r) is the row-vector of 
partial derivatives. £ 2 (0. T) denotes a set of vector-valued function 
z{t) satisfying ||c (/)||7 < tc. 

II. Preliminaries 

Consider a nonlinear system with a gain hounded perturbation 
given by 

= /(-»’) + A fU) + (/| (.!■)<*+ r/2(.l’)u (1) 

- = h 1 (,r) + k\2(’i')u (2) 

where .r is a state vector defined on a neighborhood .V of the origin in 
ft", u € ft" 1 denotes the control input, d e ft' the disturbance input. 
: £ ft* the penalty variable, /, ry 1 , <n . hi and A* y> are known smooth 
mappings, and A/ is a smooth uncertain mapping. It is assumed that 
/(()) - 0 and /? 1 (()) = 0 without loss of generality. 

We make the following assumption throughout this note. 

Assumption I: The uncertain mapping A f(.v) is described by 
A f{.r) = f (.r)<*(./’) with known smooth mapping < : ft" —* ft"*'' 1 
and unknown smooth mapping A ; /? fl —<* ft' n . A/ belongs to a 
bounded set defined by 

Si: = { A/(.r) | A(0) = 0, )|| J < ||« (.<')|| J . V .r £ .V} (3) 

where w : ft" —► ft''- is a given weighting mapping. 

Assumption 2‘ k 12 U)] = [0 I}. 

Assumption 2 is a nonlinear version of the standard assumption 
in |1| for simplicity. 

For the system (lM2), consider a nonlinear stale-feedback law 
11 = o(0) = 0. (4) 

Definition 1: Let - > 0 be given. The closed-loop system of (1) 
and (2) with (4) is said lo have locally robust disturbance attenuation 
performance in U C -V if for any A / € the free system with d = 0 
is locally asymptotically stable with domain of attraction containing 
M, and \\z\\ j < -||J||/ for every T > 0 and every d € P, where 
the set P is defined by 

P: - {d | d e £ 2 ((),£) s.l. j’(f) with .r(U) =■ 0 

remains in 14, v f < T}. (5) 

The nonlinear robust ft x control problem is now defined as 
follows. Given the plant (l)-(2), find a state feedback controller 
(4) such that the closed-loop system has local robust disturbance 
attenuation performance. Without loss of generality, let ■) = 1. 

Wc collect some preliminary results in the next Lemma (see [2] 
and 14] for proof) which are useful to prove our main result. 

Consider a smooth nonlinear system given by 

.;■ = /(.r ) + <;(.'■ W (6) 

- = M-r) (7) 

where the state .r is defined on .V. d € ft' and : £ R" denote the 
input and the output, respectively. /, y, and h are smooth mappings 
satisfying /(()) - 0 and h( 0) = 0. 


0018~9286/95$04.00 © 1995 IEEE 
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Lemma l: Assume that the system (6M7) is zero-slate observable 
[2J. The free system of ( 6 ) with d = 0 is asymptotically stable and 
\\~\\i < HIt for every T > 0, if system (6H7) is dissipative with 
respect to the supply rate [5] 

\ {|KI| a -||-l| a l (8) 


or equivalently, there exists a nonnegative definite smooth solution 
Y(r) > 0 satisfying the Hamilton-Jacobi inequality 


av 

Or 





h 1 


U)h(r)< 0 , 


V( 0 ) = 0 . 


(9) 


III. Main Result 

The main result is to give a solution of the robust H x control 
problem. We start with finding a sufficient condition for the nonlinear 
system having robust H x performance. 

Consider a nonlinear system with uncertainly described by 

x = /(.r) 4- A/(.r) + <i(r)d (10) 

: = /i(.r) ( 11 ) 


Rearranging some terms in (17), we have 

f-<*){/<*)+ A/U)} 

+ (.(•)-——(•') + ^h 1 (r)h(r] 

2 Or Or 2 

, ,, ,a'r, v i „ 

+ 23^(7) {||U ’ (J ' )|I ' -I|A(J ' )|I " I - H - ° 8) 

Since ||^(.r)||” < ||ir(.r)|| J for every .r 6 V and every A/(.r) € il, 
we obtain 


■<•){/(.»•)+ A/(.r)} 
c).r 

+1 (•<')//(■<•)'/ 7 ( j - > —< -»■) 

+ ) < 0. A/6S2. (19) 


where the uncertain nonlinear mapping A./V) satisfies Assumption 
1 . 

For system (!())--(11), define an extended nonlinear system by 

■r = /(■*’) + [//(.r) A(.r)<(.r)]»/' (12) 

h(A) 


where <!' (- R' f, L m J /T +r/ , and A(.r) > () is a given smooth scale 
function in ./* £ -V. 

TheoremF Suppose that the system (10) (II) is zero-state 
observable for every A/(.r) 6 $2. If there exists a smooth scale 
function A(.r) > I) such that the system ( 12 )-*( 13) with respect to the 
supply rale ‘ {IKl| 2 -|| : II"} ,s dissipative, then the nonlinear system 
( 10 )-(l I) has locally robust disturbance attenuation performance in 
14. 

Proof: Suppose that system (12)—(13) with respect to supply 
rale 7{||^ , || 2 — ||- / || J ( is dissipative, then there exists a nonnegative 
definite smooth function Y(x) > 0 (V( 0 ) = 0 ) such that 

VU)< f + (14, 

,/() - 

So that 

f-(.r){/(r) + [„(.r)Ar(.,>K} < - ||.'|L}. (13) 

From Theorem 2 in |2], there exists a \'(r) > 0(T’(()) = 0 ) such that 

1 TT"(■*■){//( '’)<J 1 (■'*) + A^(,r)f (r)e l (.r)} 

Or 2 Or 

— (•<)+ ij//(.<•)/,(.,•)+ p~»’'(.r).r(j)J < 0. 

(16) 


Adding and subtracting ~~(.r)A/(.i') in (16) 

dY. 


Or 


(,r){/(.r) +A/(.r)| 

+ ^).</().y f (■'*) —77—— (-r) + \h' (.»■)/!(.r) 
2 a.r Dr 2 

- ~-(.r)«'(.i-)A(j-) 

Dr 

, **(■«•)or, , , , ,, ,0'V, , 

H-o-K-( J ')'’(■'■)'• (-*■> —S—(■'•) 

2 o.r Or 


+ 


w l {r)w(r ) < 0 . 


2 A' 2 (.i“) 


(17) 


Therefore, by Lemma I the free system (10M11) with d = 0 
is asymptotically stable and ||v||y -c ||i/||/. v t/ 6 P for every 

A f[x) € H 

In consequence of Theorem 1, we have the following corollary. 
Corollary!. Suppose that the system (!())—(11) is zero-state 
observable for every A/ € il. The system has the locally robust 
disturbance attenuation performance in 14 if there exists a smooth 
scale function A (r) > U such that the Hamilton-Jacobi inequality 

i ) + 5 - 7 -( r )(;;(.(')</' (.r) 

Ox 2 itr 

+ y' 

Or 

+ r, y ' 1 (•')('(•' ) + v |, ) u' / (.r)»'(.r)| < 0 ( 20 ) 

has smooth solution V(.r) > 0(V(0) = 0). 

Remark I: Il is interesting to compare the result in Corollary 1 
with linear case given in | 6 ) and [ 8 |. Consider a linear system with 
parameter perturbation 

x = (A + EE(/)).r +77,/ (21) 

* = Cx (22) 

where unknown time-varying matrix ^(f) satisfies E* (/)£(/) < 
F l F?1 for a given F. Since a smooth solution Y(r) of the 
Hamilton-Jacobi inequality can be given by V(j ) - P.r, where 
F is the nonnegative definite solution of the Riccati inequality 

A 1 P + PA + P[DD’ + A 2 EE 1 )I J + C T C + ~F 1 F < 0 

A* 

(23) 

for a scale A > 0, Corollary 1 implies that the linear system (21H22) 
has the robust 11^ performance if the Riccati inequality has the 
nonnegative definite solution, which has been shown in 16]. 

Remark 2: Note that in nonlinear case the scaling parameter A be 
chosen as a function of .r, which may reduce conservativeness of 
the result. 

We now consider the robust H rx control problem mentioned in 
Section II. Using the condition in Theorem I, the problem can be 
solved by finding such a state feedback controller that makes the 
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closed-loop system dissipative with respect to the supply rate It can 
be done by solving an Hamilton-Jacobi inequality 
Theorem 2 Assume that the system * = /(a ) -f A/( i) -f 
gi{x)d. - ?= hi(jr) is zero-state observable for every A/(a) € i) 
N there ousts a scale smooth function A( i) > 0 such that the 
Hamilton-Jacobi inequality 


0 \ / W/ \ I 1 ^ / V 

■gy('-)/(r)+ 2 ~(j) 


■j x (» ) + A 2 (4 b(l)f' (i) - <Jl(l )f/i ( ' 

+ ^ 11 )M *■) + II ' (»)« (j )| < 0 (24) 


has a smooth solution I'f i) > 0(1 (0) = 0), then a solution of the 
robust control problem is given by 

a(r) = -f/j (i)^p-(r) (25) 

Oi 

Proof Note that the closed loop system of the plant (l)-(2) 
with the state feedback controller u = o( i) is given by 


To show the robust asymptotic stability of the lree system 

• = /( 0 + Qi( i )n( i) + A/( i) (11) 

we consider the solution T (i) > 0 as a Lyapunov function From 
(29) and Assumption 2, we have \ < () v A/(i) € 52 and 1=0 
implies h\ (i) = 0 and n( i) ~ 0 Therefore, the robust asymptotic 
stability follows from the Assumption that pair {/ + A/, /i} is zero 
stale observable tor every A,/ 

Rematk ? As shown in Remark 1 in linear case the solution 
Mi) > 0 of (24) will be given by the solution of the Riccati 
inequality (see the result in f 6 D 


IV C()N( L USJON 

In this note, a sufficient condition for that a nonlinear system 
with uncertainly has robust H^ performance is given based on 
the classical dissipativeness theory Using this condition such a 
state feedback control law is derived based on (he Harmlton-Jacobi 
inequality that the closed loop system has locally robusl distuibancc 
attenuation performance It is also shown that the well known icsulls 
for linear system are special cases of the presented results 


i = /(i) -1- A/( r ) + f/J i )d( i) + r/i( i )fl (26) 

- = /h(0 + *i2(* WO (27) 

Using the Hamilton-Jacobi inequality (24) and the assumptions m 
Section IL we obtain 
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Oi 


(i){/(») + Af(i)+ fjAi) nit)} 


+ ^ {^i (' H- ( 1 )l\ 0 + *i (• hi i) 1 

< -4|U( ' )r 7 (* )^M~( • > - tM(« 

21 | <)t \( i) 




Dll'-11*1011 


+ t 


A 1 1 


(28) 


Use of (25) for n( i) and assumption on the uncertain mapping i) 
gives 

i){f(r) + A/( O + fj 2 { t)o( i)} 

+ £^“(')</i( r)tjl (i)-—-(i) 

+ 2 {^/ (O 4- n 1 (? )A [ 2 ( 1 ) J{hi (/) + A u( i )n( r)} 

<0, v A/(a)€i> (29) 

By Theorem 2 m f2| 

^-( r){/( r) + A/( 1 ) + 1 )n( /)} + * )*h(i)d 

< |(llrf|| J “ll-ll'K v a/o) e u no) 
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Optimal Nonparametric Identification from 
Arbitrary Corrupt Finite Time Series 

Jie Chen, Carl N. Nett, and Michael K. H. Fan 


Abstract-*- In this paper we formulate and solve a worst-case sys¬ 
tem identification problem for single-input, single-output, linear, shift- 
invariant, distributed parameter plants. The available a priori informa¬ 
tion in this problem consists of time-dependent upper and lower bounds 
on the plant impulse response and the additive output noise. The available 
a posteriori information consists of a corrupt finite output time series 
obtained in response to a known, nonzero, but otherwise arbitrary, input 
signal. We present a novel identification method for this problem. This 
method maps the available a priori and a posteriori information into 
an “uncertain model” of the plant, which is comprised of a nominal 
plant model, a bounded additive output noise, and a bounded additive 
model uncertainty. The upper bound on the model uncertainty is explicit 
and expressed In terms of both the /] and Hr^ system norms. The 
identification method and the nominal model possess certain well-defined 
optimality properties and are computationally simple, requiring only the 
solution of a single linear programming problem. 


I. Introduction 

The primary goal of this paper is lo develop a system idenlification 
method for identifying, from available a priori and a posteriori 
informal ion, an “uncertain plant model” which can be used for robust 
control design. Typically, an uncertain model consists of a nominal 
design model, a noise specification, and a model error specification. 
Though the specific nature of the required noise and model error 
specifications is dictated by the specific robust control merhod to 
be applied, nearly all existing robust control design methods require 
that these specifications be stated as explicit, worst-casc/detenninistic 
bounds on the levels of noise and model error [7], | 8 |. For this 
reason a worst-case/deterministic identification approach is adopted 
in this paper. System idenlification methods that take a worsi- 
case/detemiinistic, robust control oriented approach have recently 
received considerable interest; an extensive list of publications on 
these problems may be found in, c.g., [12] and [36] and the references 
therein. 

The specific identification problem considered in this paper per¬ 
tains to single-inpul, single-output, linear, shift-invariant, distributed 
parameter plants. The available a priori information in this problem 
consists of time-dependent upper and lower bounds on the plant 
impulse response and the additive output noise. The available a 
posteriori information consists of a corrupt finite output time series 
obtained in response to a known, nonzero, but otherwise arbitrary 
applied input. Our objective is to develop from the given information 
an uncertain model which has an additive model error structure and 
whose modeling uncertainty is measured in the (\ and system 
norms. Our motivation for these choices of model error structures 
and system norms stems from the fact that they are consistent with 
the currently popular fi and H nL design methods [7], [ 8 |. Recently, 
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there has been a balk of research work devoted to problems of this » 
kind, under the name of identification in ti and . We refer the 
reader to some of the closely related problems and results in, e.g.« 
[12|, [15], [10], [27], [4], [211, [20], [33], [51, [16], [1J, and [29] for 
a glimpse of the work in this area; discussions on these results are 
beyond (he scope of this note. 

Our main contribution in this paper consists of an identification 
algorithm and spveral model error bounds. The identification algo¬ 
rithm maps the given a priori and a posteriori information to an 
identified nominal model. The error bounds are explicit and are 
expressed in terms of both the f j and H n norms. Our algorithm 
possesses certain well-defined optimality properties and is generally 
referred to as an interpolator algorithm in the information-based 
complexity (IBC) theory [311, [32]. It is a standard result in IBC 
that algorithms of the interpolator class are worst-case strongly 
optimal to within a factor of two, in the sense that the worst- 
case error achievable in applying the algorithm to approximate the 
underlying plant is at most a factor of two of the smallest possible 
such error. Moreover, our algorithm also has the advantage that it 
can be constructed in an extremely simple manner; it requires only 
that a single linear programming problem be solved. Technically, our 
development shares some elements with the set membership method 
for parametric identification problems (sec, c.g., [23]-|25] and the 
references therein). 

The notation used throughout this paper is as follows. Let Z 

denote the set of integers, Z+ := {A- € Z ; A* > ()}, and 

Z+,„ := (A' G Z f : 0 < A* < n - 1}. Let R denote the set of real 
numbers, and R" denote the space of n dimensional real vectors. 
Let S„ (aj>) := {.r 6 R M : tn < J’a < h.k 6 Z f Define the 
normed spaces (, {/: Z+ — R | ||/||i := |/(fr)| < *-), 

h ■■= if- Z+ - R | H/ll? := Er-oimf < x). and 
U := {/: Z., — R | ||/|U := |/(*-)| < tc}. Let 

D := {: € C:|:| < 1} and denote H^ := {/:D - C | 

f analytic in D and ||/||//-v := s,i]>. gD |/( :)| < * }. Additionally, 
define the operator T„:f * R" such that [T n f){k) = f(k) for 
A 1 £ 

Finally, a preliminary version of this paper was presented previ¬ 
ously in [3]. 

II. Problem Formulation and Preliminary Results 

We consider the class of causal, single-input, single-output, linear, 
shift-invariant, stable distributed parameter systems. This class of 
systems can be identified with a normed linear space X C (1 which 
contains one-sided sequences and is endowed with a system induced 
norm || ||. A system h 6 X is represented by its impulse response 
{h(k)\klQ* or alternatively, by the corresponding transform /?(c), 
where 1 

( 2 . 1 ) 

The system to be identified is assumed to belong to X. Additionally, 
its impulse response {/?(A , )) a x ~ 0 is assumed to satisfy < h(k) < 
A/+,VA* e Z+, where € h and {A/^ }?L 0 € f i are two 

prespecified sequences which represent the available plant a priori 
information. We may write h 6 (A/ - , A/ + ) C X. 

The experimental procedure considered consists of applying an 
arbitrary nonzero input it 6 to the system h 6 X to generate an 

'This transform corresponds to the standard :-transform evaluated at ]/:. 
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■' output /* * u. An additively corrupted version of b* u is observed over 
r ftmte duration ’L Let the corrupting noise be denoted by v £ f*>. 
experimental procedure can be precisely written as 

Er t {h,v) := r„((A * m) 4 / ). 

LT&e a priori information on the output noise v 6 ^consists of 
o-iiifb;.fff^qpcw?i&ed.ftequonces € C*, and {f* }£r.« € f*. The 

jcpise sequence {r(A*)}J t a . 0 is assumed to satisfy r k < v(k) < fj, 
fijfe € 25+ dr, alternatively, v £ S Ar .(r"’*f + ) C f*. The experiment 
Curator ): X x f,« R“ operates on the ordered pair (/j, v) 
Imd yields an output data record y £ R n ,y[k) := (iE,,(A. v)) k for 
id! Let V be the lower triangular Toeplitz matrix formed 

by the input « 

u( 0 ) 0 0 

n(1) m(0) 0 

F:= 

u(n — 1) ii (I? —2) n (0) 

Then, the output data // € R" can be conveniently expressed as 
y == l r T,, fi 4 T„ where h £ (A/", A/ 4 ), and f € S. X .V*, f 4 ). 

The set of all possible such data is given by 

Y ;= [y £ R" : y = UT„h 4 for some 

h 6 S»(Ar.Af + ).r £ S»(f*.f + )|. 

For each .y € Y, we define the set of indistinguishable systems as 
,P(V){/* € S»(A/‘.A/ 4 ): » = + T„v for some 

«’ € S-v (t v 

Thi$ set contains all plant models which are “unfalsified" and whose 
members cannot be rigorously distinguished by the available a priori 
and a posteriori information. Let P „(?/): = {T„ h:h £ P ( y ) (. Then, 
P„(y) consists of the truncated series in P (y). One may readily 
observe that P tf (y) is a convex polylope in R”, as it is defined via 
a finite set of linear inequalities. 

Given the above a priori and experimental information, our primary 
goal in this paper is to obtain an uncertain model through the 
development of an identification algorithm and the quantification of 
a corresponding identification error. An identification algorithm is 
a mapping’ A r ,: R" —* X which operates on the data record and 
generates a model A»(y) £ X. The identification error associated 
with an algorithm A„ is measured by the norm || • || and is used us 
4 bound on the modeling uncertainty. Several notions concerning 
identification algorithms and errors are adapted below from the 
existing IBC and identification literature (cf. [31], [23], [24], 112 ]): 

• local identification error: ■ )|) sn I^eP( y )ll^ “ 

• global identification error: c(A „;|| ||) := sup y€Y c{ A n ; y\ |H|), 

• optimal local identification error: p*{y; || • ||) 

> :. : 4„;(y)€Xj c(A n ; 01 (J * || )i 

• optimal global identification error: f’(IHI) := inf< /i„ : a*< V )€XI 

. 4^.;IMI)« 

An algorithm Al is said to be locally optimal at y if <i Al iy; || • ||) = 
(P*(y; || * ||). If it is locally optimal at each y £ Y, then it is said to 
be strongly optimal. 

2 hi a broader sense, an identification algorithm acts on the input-output 
data pair (ri, j/), as the input signal plays an important role in identification. 
It is not our aim, however, to discuss the role of input signals in this note. 
Hence, to simplify the notation, we omit the dependence of all the relevant 
notions (e.g.. P(si),.4, # (y). etc.) on the input and the a priori informal!' % 


In the IBC framework, the optimal algorithm and error can be 
alternatively characterized as the center and radius of the set P (y). 
For a given data record tj € Y, let the radius and diameter (31], [13] 
oifPty) be denoted by r(P(y); || * ||) and d{P(y); || ■ ||). respectively. 
Then, the following result is known [31]. 

Fact 2.1: For any data record y< e*(y\ || • || ) - r(P(;</); || • || ) and 
d(P{y); || ■ ID/2 < e*(y;\\ • ||) < r/(P(j/): |] * ||), Furthermore, for 
any p £ P(y), r{P(y); || • ||) < sup /i6P(m) ||/* -p|| < t/(P(y): || ■ ||). 

As in [31], we shall call r(P(i/);|| • ||) and d(P{y); || • ||) the 
local radius and diameter of information. Accordingly, the global 
radius and diameter of information are designated to the quantities 
SU P</€Y »’(P(2/);|H|) and stip veY ^(P(lf): INI)-The center of Ply) 
is the element p £ X such that r(P(y): || ■ ||) = sup,, eT » (ly> \\h — p||. 
If for some A u *p — A„[y) for all y £ Y, then A,, is called the 
central algorithm [31], [23]. Clearly, an algorithm is strongly optimal 
if and only if it is central. In general, it is difficult to find the center 
of P (y). If, however, P(y ) is a symmetric set, then its center can be 
easily found. The set P(iy) is said to be symmetric if there exists 
a p € X such that for any h £ P{y).p 4 h £ P (y) implies 
]> — h 6 P (y). The element p possessing this property is called 
the point of symmetry in P (y). It was shown in [31] that a point 
of symmetry is a center, and hence it defines a central algorithm. 
We shall say that such a central algorithm is symmetrically central. 
It is useful to point out that a symmetrically central algorithm is 
central for all norms, as a point of symmetry is of a purely geometric 
nature. 

In this paper, we are interested in a class of nearly optimal algo¬ 
rithms termed interpolatory algorithms [31]. An algorithm A „: R" —* 
X is said to be interpolatory if A u (y) £ P(//). An appealing property 
of interpolatory algorithms is that they yield an identification error 
that is tight to within a factor of two of the optimal error for 
all possible data records [31]. This is clear from Fact 2.1 and is 
summarized in the following. 

Fact 2.2: Let A v be an algorithm such that A„[y) E P(//). Then, 

<■*(?/: INI) < '(-4„: j/: || • ||) < 2r*(;y; || ■ ||). 

In the sense of Fact 2.2. we say that an interpolatory algorithm 
is strongly optimal to within a factor of two. It is clear that this 
property holds regardless of the norms. Note also from Fact 2.1 that 
the identification error for any interpolatory algorithm is bounded 
from above by r/(P(/y); || • ||) and from below by d(P{y): || ■ ||)/ 2 . 
This fact makes it possible to estimate the error by computing the 
diameter of information. The following characterization [311 may be 
used to compute the global diameter sup yeY r/(P,,(</); || ■ ||). 

Fact 23: Let M k ~ — A/^ and = — fjf for all k £ Z+, lt . 
Consider the nomied space (R". || ■ ||). Then 

sup d(Pv (y); | =2 sup | ( 2 . 2 ) 

l/€ Y ;,fP n(U) 

The key condition implicit in Fact 2.3 is that the hyperrectangle 
S„(Af“,A/’ f ) x S„(e'.iF + ) C R 2 " is a balanced convex set [31]. 
An extension of this result will be given in the next section. 

III. Main Results 

In this section, we shall first formulate and solve a problem 
concerning the consistency of the a priori and a posteriori infor¬ 
mation. This problem bears some similarity to the model validation 
problems studied in [30] and '[28]. Significant differences also exist, 
however, between the two problems. Based on the solution of this 
problem, we construct an interpolatory algorithm and derive bounds 
for the corresponding identification error. Finally, we provide a 
brief discussion on central algorithms and optimal identification 
errors. 
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A. Consistency of Information 

The problem of consistency between the a priori information and 
data record is concerned with the following question: Given the model 
set defined by S ,AI + ), the noise set S oc (f'\f f ), and given 
also a data record y 6 R n , will there exist an /; 6 S-v (M~ , Af 4 1 so 
that y = r„((/j * i/) 4 t )? This problem can be equivalently posed 
as to determining whether the set P(//) is empty. 

Definition 3 .7: A data record y € R" is said to be consistent 
with the plant and noise a priori information ifP (If)*®. 

It is important to note that the data and a priori information should 
be consistent for the identification problem to be meaningful, for if 
P iy) is empty, the identification error will become unbounded. 

It can be readily recognized that P(jt/) ^ 0 if and only if 
P,, I iy) # 0. This observation leads to the following simple necessary 
and sufficient condition that can be used to check the consistency 
between the data and a priori information. 

Lemma 3.1: A data record y € R " is consistent with the a priori 
information if and only if the linear inequalities 

M k <h(k)<Mf, A- = 0,1.- 1. (.1.1) 

k 

f k < - ^2 "(*• 

i D 

k = - 1 (:U ) 


admit a solution. 

Consequently, the consistency problem amounts to solving a finite 
set of linear inequalities, which in turn may be solved efficiently 
using the linear programming method (see, e.g., [261). It is easy to 
derive necessary or sufficient conditions by weakening (3.1M3.2). It 
is also possible to extend this result to solve a companion problem 
which is to compute a minimal noise envelope such that the data and 
a priori information are consistent. The reader is referred to [31 for 
a detailed discussion on these issues. 


B. An Interpolators Algorithm and Error Bounds 

The interpolator algorithm to be proposed below is motivated 
from the IBC theory [31) and requires only solving the data con¬ 
sistency problem. More specifically, it follows from Fact 2.2 and 
Lemma 3.1 that any solution of (3.1) and (3.2) serves as an identified 
model that yields a local identification error to within a factor of two 
of the optimal local error. Hence, the following result is clear. 

Theorem 3.1: Let [h(k)\ J'_,| be a solution to (3.IM3.2). Con¬ 
struct -4„ so that 

I h(k) i) < k < n - i 

(3.3) 

¥l±)LL k > 7 /. 

Then, for any data record y C Y,< ■ ||) < <(A n \y; || ■ ||) < 

*(y \II ■ ||). 

As a result of this construction, the algorithm is clearly 
intcrpolalory, and it can be readily computed by solving a single linear 
program. As pointed out in the preceding section, this algorithm is 
strongly optimal to within a factor of two for all norms. 

Having constructed the above interpolatory algorithm, in the re¬ 
mainder of this section we shall derive explicit bounds for the local 
and global identification errors. Denote 


Lemma 3.2: Let r [r 0 n ■ ■ ■ r ?1 — iJ 7 £ R” be the center of 
Pn(g), and let fiMk be defined by (3.4). Then, the algorithm AJ 
such that 


[A; (//)], 


a 

i 


0 < A- < n - l 
k > n 


is central. Furthermore 

f*(»:||-||i)= \ + r( Pn(if),|HI , >. (3.5) 

k—tl 

Proof: By definition, we have 


'•(P(jk).II ■ III) = iuf sup y'|/i(A-)-/)(A’)| 

hex vi-Piu>fTo 


= inf 

h c R Tl 


n- l 


sup 

I'tPni!/) 


E 


\h(k)-p(k)i 


+ iuf sup \h{k) - p(k)\. 

^/R%rP ( ,) /P „(*) " 


It is easy to sec that P(.i/)/P„( y) is a symmetric set. and its point of 
symmetry is h* , where h*{k) = (1/2)(A/* + Mj~) for all k > u. 
Hence, it follows that 


inf sup V \h(k) - 

/.CX/R- Kl*lv)/Pu(v)^ 


Pik) 1 5 i «-«.-)■ 

k — ti 


The proof is now completed by using the definition of radius and 
Fact 2.1. □ 

Lemma 3.3 ■ Let A„[y) be given by (3.3). Then 


1 _ v ^ 

t ( A„:y ; || ■ ||i) = - hM k + max || T n h - T\, A r ,(i/)||i■ 

A — n 

(3.6) 

Proof: Similar lo that of Lemma 3.2. □ 

It follows from Lemmas 3.2 and 3.3 that the problem of finding 
the central algorithm reduces to that of finding the center of P„ (y), 
and the problem of finding an interpolatory algorithm simply reduces 
to that of finding an element of P „ (//). The latter observation 
is precisely the underlying idea for the algorithm (3.3), whose 
construction for k G Z + „ amounts lo finding an element of P«,(g), 
and lor k > n the center of P(y)/P 7l (y) is selected. It is important 
lo note from these results that the condition {Mf* } 6 fi is 

necessary so that the identification error is finite. This is clear because 
the series bMk and have lhe same convergence 

behavior (see, c.g., |I7|). This condition is always satisfied because 
{M k }r=o € (\ and {A^}^ 6 f,. 

In what follows we shall develop bounds for the diameter of 
P„(//). According to Lemma 3.3, these quantities can then be used 
to calculate bounds for the identification error. With no loss of 
generality, we assume that u(0) ^ 0. It follows readily that V is 
invertible and that T -1 is also a lower triangular Toeplitz matrix. 
We shall write 


V 1 


«ll 0 

(I) (1 0 


°1 

0 


hMk M? — Mk . flfk := ef — . (3.4) 


Li»-1 fi„-2 ■■■ <h)J 


We shall first present the following preliminary lemmas. 


Theorem 3.2: Suppose that A„[y) is given by (3.3). Let h € X 
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t» defined as h(k) (Mf + A/,”)/2, and r € (•* be defined 
as v(k) :=* (f£ + f*)/2, for all A: € Z+. Furthermore, denote 
y a - VT t Jt - r,,i» 

/i»,(lr) := tiiin |^( a A-v0O) + |<u-,|hf,), AA/* |, (3.7) 

(/r) := max |. (3.8) 

Then 

e(A„-.w || • ||//») < f(-4„; (/; || • ||i) 

X I* — 1 

< \ y, nh + - m*». ^ 3 ' 9) 

* k—H k-ti 


as in Theorem 3.2, i.e., y := y - UT»h ~ T n v. Define further 

Y := {y E R" : 5 = UT„ h 4 - T n r for some 

A € S^MA//2.*A//2), P € (-^/2,^/2)}. 

Correspondingly, define the sets 

P(j/) := {A € (—AA//2. bM/2) : i/ = UTJi + T„v 

for some P € (—Ar/2, ^r/2)} 

and P„(jy) := (A, ft: ft E P(j/)}. It follows that y E Y if and only 
if i/ 6 Y and that T„ft E P„(/f) if and only if T n h E Pi,(</). As 
a result, wc have 

sup f/(P„(if),|| ■ ||) = Slip tl { Pv ( ij )< II ■ ID- 

v€Y (/ £Y 


I Proof We show that the bound holds for (( 4 n : y\ || ■ || i), us the 
l Inequality between t-( A, r :,y: ||-|| ) and e {A u \y\ ||-||i) follows from 

[ the well-known fact || ■ ||>/^ < || - ||i [ 6 ]. Toward this end, it suffices 
to prove that </(P,,(*/), ||-||i) < ~ M*)) orP T . (i/) C 

S„(A# + hji u + ft). Note that for any ft € P„(</), we have < 
h(k) < A/* + , and ft(ft) = E^ o 'U-ji/O) + £* =0 J r ( 7 ), where 

‘ k € Z+,„ and v E S x U ). The latter equality is equivalent to 
h(k) - ft(ft) = £j ^ 0 fl^,J5/(7) + «fr-;(i f O) “ f’O))- Hence, 
it follows that -AA/* < h{k) - h(k) < ftAA. and that 

k k 

- ]T |ua-,|<S < h(fr) - ft(fr) 

j-0 o 

! A A 

{ <5Z r '*-jJ'(j) + 5Zl°H-;l /,f J- 

j J-U j -0 

j This implies that h - h £ S rl (7i/, ft„), and hence that h E S„(ft/ + 

) hih lt +h) andPnt.v) C S„(ft/ + ft, ft u + ft). As a result, r/(P„(i/). ||* 

j 111 ) < d(S v (hi + ft.ft,, + ft),|| - III). However 

\ rf(s+ ft. + m.|| ■ Hi) = rf(s .(*,.ft.), ii • ||,) 

I 

? = £(/i„U)-fti(A)). 

i A-0 

’ This completes the proof. □ 

I; Jn addition to the above bound for the local error, we shall also 
i derive below bounds for the global identification error and establish a 
? convergence condition of the proposed interpolatory algorithm. Here 
* an algorithm -4„ is said to be convergent [ 12 ) if it satisfies the 
| property 

j 

5 Bin ,(.4„;|M|) = 0 

■ ft < - 0 

J 

> where b* := max* ftr*. Note that the condition {ftA A} T- o € A is 
necessary for any algorithm to be convergent. We shall first need the 
following extension of Fact 2.3. 

Lemma 3.4: Let ft and P be defined as in Theorem 3.2. Then 

Hup//(P„(g),|| ■ ||) 2 sup \\T n h - r M A||. ( 3 . 10 ) 

WCY Uh+lnfi) 

Proof ■ For any h E (A/“, A/ + ) and v E we 

define h := h — h and v t> - P. Then, h E (AJ“, A/ + ) if and 
r‘ only if h E and v E S x (f~\f + ) if and only 

if P E Soe(-^f*/2 ,bf/2)* Similarly, for any y E Y, let y be defined 


Since S r , (— bM/2. AA//2) x S„ (—Ar/2, Af/2) is a balanced convex 
set, wc may invoke Fact 2 3. This gives 

sup r/(P„ (i/), || • ||) = 2 sup HA.MI 
V^V / „ hf P T| (0) 

= 2 sup ||7 n h — 1 „ //|| 

i„i>ep t if /„/(+/„.) 


which establishes (3.10) □ 

Theorem 3.3 Let 4„ (,//) be given by (1 }), and let h Ua and 
be defined by (3.4). Then 


J-f^ftATi + i .uax^*/, <'(-4„ : |M|i) 


^9E m, ‘ + Z*‘ ai,) 


where 


" a := mm 




Furthermore, 4 ?l will be convergent in f i if {MA 1 a" o mono- 
tonically decreasing sequence 

Proof From Lemma 3.3, we have 

1 52 ^3A + ^ suj) f/(P n (g). || • ||i) 

2 rr: 2 

< c(/4„;|| • ||i) < ^ Y' + , >uiw/(P I ,(v).|| • ||i). 

2 « eY 

It follows from Lemma 3.4 that 


sup d(P rt l 

(»).«■ 

ll>) 




vfY 






= 2 ina\ | 

[r _l 


I’ll 1 : r r ” 

'r.d- 

- r) 


e s, 

■(-£ 

fiM \ 

• 2 J’ 

ir e s 

!*(»" 

= 2 max < 

[lIK" 

1 Tr. I'H 1 : 

« 




•">} 


,=D 


. bMk c e f f ' f 

k~0 L-0 J 
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m 


This gives the upper bound in (3.11). The lower bound is established 
by observing that 


V > j-n 

B-t)} 


< 


i’ e So 


- \ o/^_J miU {E t«A-, 

To establish the convergence result, define / lo be the largest integer 
k E Z+ such that bMi > l f L I- Then, for any k > /, 

^A/a < hr : = niinj/, //}. Then, it follows that 

/* —> x as bf — ♦ 0 and /i —* x , Consider now weakening the upper 
bound in (3.11). We have 

'( 4,.: || ■ III ) < ~ /S ^ A + 5Z Illill l <S<r Kl- } 

k-n A-0 t )-U ) 

x 

< +/'M/,-• 


Since {M/* is a monolonically decreasing sequence, it follows 
from 117, Theorem 3.3.1, p. 61) that —* 0 as /* —* x. 

Also. Ylk-t* —♦ II when /* —> x, because {/a IT-o € f|. 
Therefore, the last upper bound goes to zero as Af —» 0 and w —► x.. 
This completes the proof. I/ 1 

Additionally, the upper bound in (3.11) clearly bounds the global 
identification error • -||» v ). Hence the convergence condition 

in Theorem 3.3 is also sufficient for the interpolator algorithm lo 
be convergent in // v . A similar lower bound will show thal the 
condition {bMk)k o £ ^i is necessary for the identification error to 
he finite in H * . To see this, we first give a condition under which 
the H ■*. and (i norms of a transfer function coincide. The proof for 
this result is simple and is thus omitted. 

Lemma 3.5 Suppose thal h E X satisfies i ) h{k) > O.VA E Z + , 
ii) h{k) < O.VA' E Z 4 , or iii) h[k)h(k+ 1) < U.VA- E Z+. Then, 
l|M-)||//. = ||/«||i. 

The following result gives a lower bound for the identification 
error in the H ^ case. 

Corollary 3.1 * Let A„(/y) be given by (3.3), and let M/* and 
be defined by (3.4). Then 


< (A„; || ■ II//*,) > \ max j]T ^A^. j inax^ ->a |- (3.12) 


However 


^(P(f/)/P»(f/)-|| ■ = Mip ||/i(i) -p(^)||// oc 

h i*€P(»)/P„(v) 


> 



From Lemma 3.5. we have II 

Therefore, r(P(i/)/P n (//), || . || /fne ) = 0/2) Mi*. This 
proves the fact that e(.4 fl ;|| • ||//^) > (1/2)^A/*. 
Additionally, note that </(P(i/),|| ■ ||//^) > r/(P(j/),|| ■ ||,) > 
(/(P„(j/), || ■ ||a). It follows analogously as in the proof of Theorem 
3.3 that 


sup d(P TI (//),|| • ||a) 

vGY 



This establishes that r(A„:|| • > (1 /2 )-^a for all k E Z+,„. 

The proof is now completed. □ 


C. The Central Algorithm and Optimal Error 

We now turn to the central algorithm and optimal identification 
error. Our purpose in this section is to demonstrate, by considering 
the case of (i norm, that the problem of solving the central algorithm 
and optimal error may, in general, be computationally infeasible, so 
is the problem of computing exactly the identification error and the 
diameter of information. This should give certain justification to our 
pursuit of an interpolator algorithm and error bounds. 

From Lemma 3.2, it is clear thal finding the central algorithm in 
the (\ case requires computing the center of P„(</) and that the 
optimal identification error corresponds to the radius of the same set. 
Since P„ (.</) is a convex polytope, it can be expressed in terms of its 
vertices [35]. Let the vertices of P„Lv) be denoted by the collection 
{// 1 . * ■ •. h 1tl j, where h, E P,, (//) for 1 < / < m. Then 

r„(!l) = A, h,:h, e P,.(//) . A, > °-E" V = 1 }• < 3 - 13 > 

The following result gives a characterization of /■(P„(,y), || • ||i) 
in terms of the vertices of P,,(//), The proof is simple and thus is 
omitted. 

Proposition 3.J. Let P„(i/) he described by (3.13). Then 


Proof: Since P(i/)/P, ,(.</) is a symmetric set, it follows from 
|32, p. Ill that 

<‘(P(.v)/P!,(.</)■ II ■ Hu,*) = £<f(P(j/)/P„(</).|| ■ ||/<«,) 

< i</(P(.v)/P„(j/).|| • ||i) 

k — n 


r(P M (?/),|| ■ ||i) = minimize h (3.14) 

subject to ■"* 

It is not difficult to see that the minimization problem in (3.14) 
amounts to a linear program, which, however, requires explicit use of 
the vertices of P„ ( y ). The problem of finding the vertices of a convex 
polytope has been well studied; see [22], [25], and [34] and the 
references therein. An important result in 122] slates that the number 
of vertices of a polytope increases exponentially with its dimension. 
According to this result, the task of generating all the vertices of 
Pti ( y) and hence the problem of computing the central algorithm and 
optimal error via the linear program (3.14) becomes computationally 
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intractable, Alternatively, the complexity in computing the tedius 
r(P„ ( y ), || * ||i) may also be observed by noting that its computation 
involves maximizing a convex function over a convex polytope. A 
recent result [2] in combinatorial theory establishes rigorously that the 
problem of this class isJ computationally intractable, in the sense that 
its computational complexity grows exponentially with the dimension 
of the polytope. It is clear that the problems of computing exactly 
the identification error and the diameter of information both belong 
to this class, and the exact computation of these quantities is also 
^'Computationally intractable. 


TV. Identification with Impulse and Step Inputs 

The general framework developed in the preceding sections will 
now be examined in the more specific context where the following 
two important signals will be taken as experimental inputs. The first 
input is the unit impulse signal, denoted by 1 / / 


«/(*): = 


A=0 

k > r 


The second input is the unit step signal, denoted by ik 


1 / s ( k ) := 1 , k £ Z-f . 

The identification problem will first be considered with the general 
a priori information, and then that corresponding to the class of 
exponentially stable systems. 


A. Identification with General a priori Information 

In using the unit impulse signal as an experimental input, the f ( 
identification problem for a class of exponentially stable systems has 
been studied in detail in [15], Both the central algorithm and global 
optimal error were given for the f i case in these references. Our 
development in this section constitutes a generalization to 115j. and 
more importantly, we show that the central algorithm and optimal 
error are actually the same in both the (i and H ^ cases when the 
impulse signal is used. 

Consider first the set P{y) corresponding to ui. Notice trivially 
that r is an identity matrix. As a result, the set P„(i/) becomes a 
hyperrectangle in R n 

P„(jV) = {T n lr. Ilia x{/y(A) - rj, M k | < h(k) 

< min|//(A ) - }. k e Zf „ (. 


Clearly, the set P M (/y), and therefore P(</), are symmetric. For any 
k € Z+,i, lei h u {k) = min{,v( k) - f k . A/^}, and h t (k) = 
tuax{iKA-) - ft i A/*“}. Then, the point of symmetry of P(t/) is 
at e € P(y). where r{k) := (J/ 2 )(/*„( k) + ln(k)) for k € Z f 
and r(A*) := (AJ+ + Mf )/2 for A > n. This gives a symmetrically 
central algorithm. 

Theorem 4.1: Let it = w /. Also, let <4* (,y) be given by 


[X(v))fr 


( 


i (h„(k)+h,m 

2 


k < n — i 


k > ?i. 


(4.1) 


Then, .4* is symmetrically central and hence strongly optimal for 
any norm. 

Note that the strong optimality of this algorithm was only estab¬ 
lished in the (% case in [IS], Our derivation shows that it is strongly 
optimal for any norm. Additionally, our following result gives explicit 


expressions of the optimal identification errors in both the (\ and 
Hex cases. 

Theorem 4.2: Let v = uj. Then: 

i) for any data record jy 

INI//,) = <*(#: ll-lli) 

= \ + \- , " ( * -)> - (4 ' 2) 
^ K - n k .-0 

ii) Furthermore 

'*(11-Ik. > = <*<ll-ll.) 

1 rx. 1 r» - 1 

= \ y, i Y. ^ i- 

k - 1 ) 

Proof: We first establish (4.2M4.3) tor the fi case. In this 
regard, the proof for (4.2) follows from the fact that .4*0/) is the point 
of symmetry of P(i/). It can be readily verified that (4.2) gives the 
radius of P( y). Indeed, the radius r(P(iy), || ■ || i) = max, (t p, v) \\h - 
.4„(i/)||i is achieved at /»*. where h*(k) := h„(k) for k fc Z-+ „ 
and h*(k ) := for k > n. Since P(yy) is a symmetric set, we 
have r(P(/t), || ■ ||i) = (1/2 )r/(P(/y). || ■ ||,) for any // (see. e.g., [32, 
p. 11|). The proof for (4.3) follows from this observation and using 
Lemma 3.4. To show that i* ( y: || ■ ||// „ ) = f 4 (/y: || ■ || i), one simply 
notes that h*{k) - [.4„(i /)]a > 0 lor all k £ Z+. The result then 
follows by applying Lemma T5. □ 


B. Identification of Exponentially Stable Systems 

We now consider a class of exponentially stable systems. The 
plant a priori information corresponding to this class of systems 
consists of two constants t) < M < -x and > 1, so that 
-Mf — Alf = U/// for all h e Zj [I5|. Following |I5], we 
shall also assume a uniform noise bound < > l) for all measurement 
instants: — (f “ = r for all A- £ Z + . As a result, we have 

MV* = 2 U/y/, and = 2 f. A direct application of Theorem 4.2 
then yields the following simplified expression of the optimal global 
erroi in the impulse input case. The result for (i case is precisely 
the one given in [15J. 

Corollary 4.1 Let it = n /. Denote n * — (1 / In p ) ln( 3 f/(), and 
iif — min {/i. [iC *]}. Then 

.*<IHI^) = '*(IHI.> = »" + — ry • (4 ’ 4 > 

In the step input case, we give an explicit expression of the 
global diameter of information for both the f i and H x identification 
problems, hence providing an estimate tight to within a factor of 
two of the global optimal error. We summarize this result in the 
following theorem. 

Theorem 4.3 . Let a = ns. Let also 


. = r« M<f 

^ (jnitiJ//, otherwise 

where n* := (l/lw p)ln( A//2f). Then 


(4.5) 


sup d{P{y)i || ■ || ) = sup d(P[y ), || • ||i) 
v€Y M ev 

M 

—ii 

V''-"' + 7^t7T 


// s < 1 
11 s >1. 

(4.6) 
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Proof: Again, we first prove the result for the f j case. In this 
regard, we notice that for u = u s 


F " 1 = 


r 1 0 

-1 1 


... o 

... o 

-1 lJ 


Therefore, h E P(0) if and only if \h(k)\ < M/p k for all k E Z+, 
and h(D) = r(0), h{k) = v(k) — r(k — 1) for k E Z+,„, where 
\t'{k) | < f. It follows from Fact 2.3 that 


- sup r/(P(i/).|| ■ ||,) 

1 Y 


{e 


v{k) - r(k - 1 )|: |i«(*)| < 

M 


!*’(*■) - i'(k - 1 )| < 


*l\ + Y v 

< unn{ f , .1/) + Y min { 2 ,. ^ ; 

A — ] K I J A — m 7 


.u 

,A ' 


If us = 0 (i.e., A/ < r) or /is = 1 (A/ > r, 2r < M/ft), then 
we have 


1 sup ll(P(jl). || • ||, I 

2 vFY 


< II sf- 



= 7/ sf + 


fl" S 


A/_ 

Mp - 1 ) 


It is easy to see that the equality can actually he attained by the 
sequence {r(A , )}I - 0 1 


f r(0) A*=r() 

\r(A-- 1 ) + (-!)* M/ f i k 1 < k < n - 1 


(4.7, 


where /»(()) = Af for //s = 0, and r(0) = < for us = 1. Hence 
(4.6) holds for the case #; s = 0 and //s = 1. Consider now the case 
us > 1. It follows that 


\ sup i7(P(,(/). || ■ ||i) < f + 2(//s 
- Vi y 


~ 1 )€ + 


V A/ 

utr, 


= ( 2n s ~ 1 )f + 


- 1 )’ 


Again, equality can be attained by the sequence [v{k)}’/_Q t con¬ 
structed as 


Kns 

\/’(fr - 1 ) + (-1)4/// ns < k < n - 1 ' 


(4.8) 


Hence (4.6) also holds for the case us > 1. This completes the proof 
for the (i case. To show that the global diameters are identical in both 
the G and H cases, we note that the sequences constructed in (4.7) 
and (4.8) lead to h{k)h(k + 1 ) < 0, where h(k) = v(k) - v{k - 1 ) 
for all k E Z-\ „. The result then follows from Lemma 3.5. □ 


V. Conclusion 

In this paper we have formulated and solved a worst- 
case/deterministic system identification problem. Our main effort 
has been concentrated on developing an intcrpolatory algorithm 
and bounds for the identification error. An important reason in our 
pursuit of the interpolatory algorithm and error bounds has been 
that the optimal algorithm and error is generally very difficult to 
compute. In contrast, the interpolatory algorithm, albeit suboptimal, 
can be computed easily by solving a linear program. 


The proposed algorithm is “tuned” [12] to the plant and noise 
a priori information. It is possible, as in [15], to construct an 
algorithm tuned to only the plant a priori information. Note that 
in the present formulation the past input has been assumed to be 
zero. Nevertheless, this assumption can be relaxed, and the present 
results can be extended to the more general setting, as evidenced 
by an extension in [ 11 ]. which appears to be directly built on the 
present approach and techniques. Future extensions building on this 
work may also be pursued by considering mixed parametric and 
nonparametric identification problems [19], [18], ]9). Finally, we note 
the recent work in this area on input design and sample complexity 
in e.g., [51, [29]. [16], and [1]. 
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Ahtstraett^-Eor multi-input multi-output nonlinear systems whose free 
dynamics are Lyapunov stable, we show how the problem of global 
Stabilization via dynamic output feedback can be solved by using the 
technique of input saturation. The power of this technique is also 
Illustrated by solving the problem of global stabilization via bounded 
state feedback for affine nonlinear systems with stable unforced dynamics. 
Analogous results are established for discrete-time nonlinear systems. 
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I. Introduction 

Consider multi-input multi-output (MIMQ) nonlinear systems de¬ 
scribed by differential equations of the form 


x = Ax + 

i^i 

(1.1) 

'C 

11 

r* 

V; 

(1.2) 


where x 6 R" is the state, v, € R, 1 < i < m , are the control 
inputs and y 6 R H is the system output, A E R" * T ' and C E R" Xu 
are constant matrices, and y, : R" —► R '\1 < i < in, are smooth 
functions. In this paper, we assume that the nonlinear systems under 
consideration, which include bilinear systems as a particular case, 
satisfy the following hypothesis: 

HI) There exists a positive definite matrix P such that 
A r P + PA < 0. 

In other words, the unforced dynamic system of ( 1 . 1 ) is assumed 
to be Lyapunov stable. 

The purpose of this paper is to study the following two problems of 
global stabilization for a nonlinear system (I. I)-(1.2) with Hypothesis 
HI). 

Problem I: When is there a smooth state feedback or a bounded 
state feedback control law which renders the equilibrium .r = 0 of 
system ( 1 . 1 ) globally asymptotically stable? 

In the case where the state x is not available for measurement 
but only the output y is measurable, the following problem becomes 
more realistic and rather important. 

Problem 2: When does there exist a dynamic compensator 

C = »(<$.!/)• 9(0.0) = 0 

(1.3) 

„ = 0 (£L 0 ( 0 ) ~ 0 

such that the closed-loop system (LI H I 3) is globally asymptotically 
stable at the equilibrium (.r ,0 = ( 0 , 0 )? 

In the case of single-input single-output (SISO) bilinear systems 
(i.e., //(.r) = 13 .r + A"), these two problems have been investigated 
by Gauthier-Kupka in |7], which improves the nonlinear separation 
principle initially developed by Vidyasagar (17] in the bilinear 
case. More specifically, Gauthier and Kupka [7] develop sufficient 
conditions for global stabilization of SISO bilinear systems via 
smooth state feedback by using the Jurdjevic-Quinn technique [ 8 ]. 
Based on the Jurdjevic-Quinn type feedback control law, they 
then present a "‘small stabilizing control law.” This, in turn, leads 
to a separation principle for a class of SISO bilinear systems 
whose free dynamics are dissipative. The result of [7j is nice, 
and the proof is clever but somewhat unsatisfactory because it is 
only applicable to a SISO system. In addition, it requires that for 
W = {.r: .v r (A‘ P + PA).r = 0} D {*: x r P(D* + N) = 
0 } 

dim span (a#, atf Aj (Bx + .V). - ■ ■ ,<ul Aj .(Bx + A'). ■ • ■) 

= n Vx 6 H* - {0} (1.4) 

i.e., the ad-condition is satisfied on W -{()}, 


.00 © 1995 IEEE 
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Not© that requirement (1.4) is too strong even for a linear system. 
More explicitly, it is easy to see that for the linear system f 1 2] 


: x = Ax H- bu = 


’0 

0 

0 ‘ 


V 

0 

-1 

2 

* + 

0 

0 

0 

-1 


0 


dim si>an{ Aj\ b, Ah, A 2 h } 


f 

0 


Y 


o' 


'o' 

) U = {.r 6 R n : l k At V:{r) = 0 = x T PA k x,' 

= dim span < 

—x> + 2x :i 


0 


0 

. 

0 

>< 2 - 1 <*<>•} 

l 



0 


0 


0 

J S = {.r€ li": P-r) = 0 


With the vector fields A.r. g } ( x ), • • -, g nt [x) we associate the distri¬ 
bution 

D — .spaii|r?Y/5 ljL .//,(.r): 0 < k < it - 1 . 1 < / < m j. (2.1) 

Let il and S denote the sets associated with V'(.r) £ ~x r Px, P > 0 


( 2 . 2 ) 


Thus, ad-condition (1.4) can never be satisfied. If, however, we use 
Corollary A.2, which is a slight generalization of Theorem 1 given in 
[12], a straightforward calculation shows that for V{x) = .'■f+-»* 2+.! , 3 


and 


Slyj — {./■ G R 1 : X'2 — .i*:j = ()} 

5v = {.r e R 3 : J'l =0}. 


Vr G D. ()<('</’- 1) 


With these notations in mind, we can easily deduce from Appendix 
A sufficient conditions for the solution of global stabilization via 
smooth stale feedback or bounded state feedback for continuous-time 
multi-input nonlinear systems of form (LI). 

Theorem I: Suppose a continuous-time multi-input nonlinear 
system (1.1) satisfies Hypothesis HI). If 12 fl S = {()}, then 


Hence, S2>jD^V = {0}. Thus O. is stabilizable by u ~ - LiA (.r) — 

—2.ri. 

In this paper, we address Problems 1 and 2 for MIMO nonlinear 
control systems (1.1)—(1.2), with the property HI). In particular, 
a solution to these problems is proposed. The approach employed 
in the paper is based on Lyapunov analysis, LaSalle's invariance 
principle, and the center manifold theory |5|. in the spirit of f7). 
Our solution to Problem 2 depends heavily on the technique of 
input saturation, which has recently obtained a strong renewed 
interest in the linear control literature [13], |I5)-|16J. The power 
of this technique is first illustrated by solving the problem of 
global stabilization via bounded stale feedback for affine nonlinear 
control systems. The result incorporates and generalizes a number of 
well-known stabilization theorems [I]. [ 6 |. |8]-|9J. [ 12 J. and [141 
proposed in the nonlinear control literature for global asymptotic 
stabilization of certain classes of affine systems. Since the material 
is somehow independent of the theme of this paper, we put it in 
Appendix A. Appendix B includes a useful boundedness criterion 
for perturbed systems. Section II deals with Problems 1 and 2 for 
MIMO continuous-lime nonlinear systems of the form (I. I HI.2) 
and we show how the ad-condition can be removed. Discrete-time 
counterparts of systems (I.1ML2) are treated in Section III. Section 
IV includes the concluding remarks of this paper. 

The preliminary version of this work was presented at the 33rd 
IEEE Conference on Decision and Control [10]. 

II. Solutions of Global Stabilization Problems 

In this section, we present a solution to Problems 1 and 2 slated 
in the previous section. To describe our main results in this paper, 
we need to recall some notations related to nonlinear systems of the 
form ( 1 . 1 ). 

For smooth vector fields Ax and </,(.r),I < / < m, on R", 
[Ar, (/, ( i )] denotes their Lie bracket and we define inductively 

a Aril, ( JT ) = < 7 , ( x ), ■ • •. <A \ r 1 </, ( j -) 

= [A.r.nd\ T ,iAr)]. k = 0.1.-•• 

for / = 1, ■ ■ ■, rw. If V : R” —► R is a C (r > 1 ) smooth function, the 
Lie derivative of V with respect to the vector field A.r is defined by 
LaxV(x) = ttjtAx. In a similar manner, higher order Lie derivatives 
can be defined inductively as follows 

L° A ,Y{x).= V(r),---,L k A yV(x) 

= L Ax .(lXVU)) !<*<»•-I- 


i) the equilibrium x = 0 of system (1.1) is globally asymptotically 
stabilized by the bounded state feedback control law 

u"{x) = - sgu 1 Pf/ f (x)^ 

X min ^|.r 1 P<j, (,r) |, o ^, 1 < i < m (2.3) 

where n is an arbitrarily positive real number. 

As an immediate consequence of (2.3), 

ii) system (l.l) is globally asymptotically stabilizable via the 
smooth state feedback control law 

a = -[r/i (./•).* ■',// Ill (.r)] y Px * ^g 1 (,r)Px. (2,4) 


Proof: Let V(x) ~ -x 1 Px with P > 0. Conclusion i) follows 
from Theorem A.l proposed in the appendix; ii) is a consequence 
of Corollary A.2. 

In the case where the state x of system (1.1) is not available tor 
measurement, the feedback control schemes proposed in Theorem 
1 cannot be applied directly, thus it is necessary and important to 
develop sufficient conditions for Problem 2 to be solvable. In the 
remainder of this section, we assume that the pair (A,C) of the 
input-output nonlinear system (1.1 >—(2.2) is observable. Then, it can 
be shown that a dynamic compensator of the form (1.3), which 
consists of a “Luenberger-type" nonlinear observer and a bounded 
state feedback control strategy, solves Problem 2 for MIMO nonlinear 
systems ( 1.1 M 1 . 2 ). □ 

Theorem 2: Consider continuous-time MIMO nonlinear systems 
of the form (1.1 HI. 2) satisfying Hypothesis HI). Suppose the pair 
( 4,C) is observable and iln $ = {()}. Assume that the functions 
j/i (■»•).! < i < or, are global Lipschitz V.r G R". Then for any 
sufficiently small a > 0 , a dynamic compensator 


i = M + +K{y- CO 

i- I 

u, = ii “(£), 1 < i < w 


(2.5) 


is such that the closed-loop system formed by ( 1 . 1 ), ( 1 . 2 ), and ( 2 . 5 ) 
is globally asymptotically stable at the equilibrium (;r,£) = ( 0 , 0 ), 
where i/'/(‘) is given by (2.3) and the gain matrix K is designed in 
such a way that A - AC is Hurwitz. 

Proof: Let e denote the error signal 
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f Then, the closed-loop system can be represented as * 

c - u - ao+£ - </,(x))«r(o . (2.6) 

t = At + 2*gimv?W - KCe. (2.7) 

■ 1=1 

h Since .4 - KC is Hurwitz, there exists a unique positive definite 
(i v .;iiiMrtrix R satisfying the Lyapunov equation 

. {A - Kcfn + nu - KC) = -I. 

Thus a straightforward calculation proves that 

■ • , in 

— Re) = -r T e + 2e r J?^(js(«)- ft (jr))*r(0 


< - 2«Hfl|| 52/A 


-<(■' Re) < -V'e <-— 

dt ’~2 ~ 2k„ m AB) 


e T Re. 


MO)). 


V(t) = t T Pt > A mill (P)||£|| 2 


ar 


at 


< 2||P|| \\t\\ 


and 


■■■/*'|M(f)||df< /"°M||A'C||exp(-^) fit 
Jo Jo 


< + 00 . 


(0.4(f)) be the corresponding trajectory,. Obviously, this trajectory 
is characterized by the folio wing differential equation 


e'(o = ■•»€(o + ^fl.(f(o)«: , (i(f)) 


(2.9) 


which has been proved to be globally asymptotically stable at. 5' = 0. 
In other words, the global asymptotic behavior of the closed-loop 
system (2.6H2.7) at (c,5) = (0,0) is completely determined by the 
flow on the invariant manifold governed by system (2.9) [5], Since the 
latter is globally asymptotically stable, so is the closed-loop system 
(2.6M2.7). 

As an immediate consequence, we have the following global 
separation principle for MIMO bilinear systems. 

Corollary I: Consider a continuous-time MIMO bilinear system 


x = 


m v 

Ax + y^( B, ,r + A t t )i/,- I 

c, J‘ 


( 2 . 10 ) 


because |u" (0 | < n and ||<r/< (.r) - pi (Oil < ft||.t - 5|| for o > 0 
and A > 0, i = !,•••, m. Choosing a < —to -, yields 


This, in turn, implies 

.'A„,i„(f?)[|c|| 2 < e 7 Re 

Hence 

lk</)H < M exp(-vlf) for, M > 0 and /I > 0. (2.8) 

Let d(t) = ~IiCe(t) in (2.7). Recall that by Theorem 1 system 
(2,7) with d(t) = 0 is globally asymptotically stable at £ = 0. In 
particular, for V(£) 4 £ 7 P£ and fit) = -4£ + nM)K(t) 

Lj V(t) = f T (t)P( + t T Pf(t) < 0. 

Moreover, it is easy to check that in this case 


With this in mind, we apply Theorem B.l given in Appendix 6 to 
system (2.7) and then conclude that ||5(t)|| < N Vf > O, for some 
JV > 0. 

$o far, we have shown that every trajectory (<?(#),4(f)) of 
the closed-loop system (2.6H2.7) is bounded V/ > 0 and 
y(d0),f(0)) € R T ‘ xH". To show, (e, 4) = (0,0) is a global 
asymptotically stable equilibrium of (2.6), we let (e(f),4(f» be a 
trajectory of system (2.6H2.7) with the initial value (e(0),5(0)). 
Let r° denote its ir-limit set. Clearly, r° is nonempty, compact, 
and invariant because (e(f).,4(t» is bounded Vf > 0. In addition, 
it follows from (2.8) that limr-** e(t) = 0. Therefore, any point 
in r° must be a pair of the farm (0.5(f)). Let (0,5) C r° and 


Suppose Assumption HI) holds. Suppose the pair (.4,0 is observ¬ 
able and i! n S = {0}. Then a dynamic compensator of the form 
(2.5) with 0,(5) = B r £ 4- A\-, 1 < i < m, renders the closed-loop 
system (2.5)-(2.10) globally asymptotically stable. 


III. The Discrete-Time Case 

The goal of this section is to show how discrete counterparts of 
the stabilization results established in Section II can be carried out 
for a class of MIMO discrete-time nonlinear systems of the form 

x (k + 1) = ,4x (+ a ,(.«• (*))«,(*) (3.1) 

1- 1 

!/(*) = C.r( k) (3.2) 

whose free dynamic system .v(k + l) = Ax{k) is Lyapunov stable, 
so that there is a positive definite matrix P satisfying 

H2) .4 7 P.4 - P < 0. 

Unlike the case of linear systems, the generalization from continu¬ 
ous lime to discrete time in the nonlinear context is infamous for its 
bifurcation into two cases—the routine extension and the technically 
impossible with relatively few exceptions. This is primarily caused 
by a nonlinear nature. In [3], we have pointed out that one cannot 
transport the stabilization results via smooth stale feedback for 
continuous-time affine systems (|1). [6], [8]-~[9], [I2|, [14]) to the 
discrete-time case. The reason why this is impossible has been 
explained in [2]-[4] from a passive system point of view. In what 
follows, on the basis of our previous work [3], we construct a bounded 
state feedback control law by the use of a modified saturation design 
technique. This, in turn, leads to a global separation principle for 
discrete-time nonlinear systems (3.1M3.2). 

To begin with, we introduce the sets ild and Sd which correspond 
to the sets defined in (2.2) 

ih = {.r £ R" U'*fU T PA - P).4'.r = 0. 

( = 0 . 1 . 

= {x 6 R" 

1 < i < m..( = 0,1, • • ■) 

In addition, set 


(A ,+ 'jn) r PgAA r r) = 0. 


(3.3) 


»(*) - [<7J (■*•), -' ■ .tfm (•»■)]. 


(3.4) 


We are now in the position to state and prove our main result in 
this section. To keep the exposition simple and highlight the central 
idea behind our design approach, we first study the single-input case. 
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Theorem J: Consider a single-input discrete-time nonlinear sys¬ 
tem (3.1) with the property H2). Suppose il,i H S<i = {0}. Then 
i) the equilibrium .r = 0 of system (3.1) is globally asymptotically 
stabilized by the bounded state feedback control law 

lt r - _ lllill 0fl f {■r)PA.r|,n)sgii(<y / (x)PAjt ) 

1 + iU r (-r)P<lU) 

for any a > (). (3.5) 


ii) As a consequence of i), the equilibrium .r = 0 of (3.1) is 
globally asymptotically stabilized by the smooth stale feedback 
control law 


() 1 (.r)P Lr 
J + : 2 U l {.r)Pfl(-r)' 


(3.6) 


Proof: Consider a proper Lyapunov function l'(.r) — 7 j ,; P.r* 
Then the difference of \ r (.r) along the trajectory of the closed-loop 
system (3.1M3.5) is 


±Y(,r(k)) = Y(.r(kA l))-YU(k)) 

+ u^k),, 1 (Mk))P-\.r(k) 

+ .r( A-))i/„( A-)- (3.7) 


By Assumption H2), it is clear that 
A V(.r) < n \ fj 1 (.< )PA.r 


It,[ ( 1 + ( -I* )/ T> jry( .1 II n - it /, 


(3.8) 


s 

Substituting the feedback control law (3.5) into (3.8) yields 

min (|// # (.r)P Lr|. n)sgn(/y 7 (.»*)P.Lr) 


jd» < -M- - 


1 + V (./' )P//(-r) 

tin(l/y 7 (j-)PAr|,r 
1 -|- 7 i f/ / (.1 )Pr/(.r) 




Note that |.r| = .rsgn.i. With this in mind, we deduce from the 
inequality above that 


A\ (.r)< -nf t 


min (|ry / (.r)P i.r|. n) 

1 + V 


x (|// 7 {>t‘)P M - min (if/ 7 (.»)P-‘l.r|, t\ j j < 0. (3.9) 


This proves that the closed-loop system (3.1)-(3,5) is globally stable 
at ,r = 0. To show global asymptotic stability, wc need to distinguish 
two cases. First, suppose niin(|i/ / (.r)PAj j.n) = n. Then it follows 
from (3.9) that 


ALU) < -ul 


(l + i;/'(.r)Pf/(.r)) 4 


t < 0 


which implies that the origin of the closed-loop system is globally 
asymptotically stable. In the case of miu(| g 1 (j)P .l.ij.n) = 
|f/ / (,r)P.4.r|, by LaSalle’s theorem, all trajectories of the closed- 
loop system approach to the largest invariant set 1 contained in the 
zero locus ol ALU). From (3.7) and (3.9), we see that AV(.r) = 0 
implies for all k = 0.1, ■ * ■ 

i/„(A) = 0. .)■' PA - P>r(A) = 0 

and 

fl 1 {•<(k))PA.v(k) — 0. 


This, in turn, results in Vj*( 0) = .r 

(A k f) 1 (A 1 PA — P)A k jr = 0 and </ (A k ,c)PA k + '.r = 0. 

0 f. 


In view of the assumption it n S = {0}. we conclude that / = {r € 
R": AV(.r) s= 0} — (0). This completes the proof of Theorem 3 i), ' 

Let a = +x, conclusion ii) follows from i) immediately. 

Similar to the continuous-time case, we can establish a global 
separation principle for a S1SO discrete-lime nonlinear system of 
the form (3.1 >-(3.2), by using the bounded state feedback control 
strategy proposed in Theorem 3. 

Theorem 4: Consider a discrete-lime S1SO nonlinear system 
(3.1 M3.2) which satisfies Hypothesis H2). Suppose the pair (.4.C) 
is observable and iht ft S,i “ {0|. Suppose the function </(,/*) is 
global Lipschitz on R". Then for any sufficiently small n > 0, a 
Luenberger-observer-like based output feedback control law 

t.(k + 1) = Ai(k) A //(*(*))«„(*) 4- A 'Uf(k) - ct(k-)) ’ 

,, mind// 7 U(A))/M£(A-)|,o) > (3 10) 

} i + & (tmraum 

Xsf.n(,/'(t(A))P.4((fr)) 

renders the equilibrium (.».£) = (0.0) of the closed-loop system 
(3.1 H3.2M3.IO) globally asymptotically stable, where K is a con¬ 
stant matrix such that A - KC is Hurwitz. 

Proof: Let < {k) =■ i(k) — r(k). Then the closed-loop system 
(3.1 M3.2M3.10) can be expressed as 

<(k+ 1) = (A - hC)( (k) A (uiUk)) - <j(.v(k)))u t Ak)\ 
l(k + 1) = A£[k) A U(t[k))u f Ak \- KC<(k) ]' 

(3.11) 


By assumption, there is a positive definite matrix /? such that 
(A-KC) 1 IHA-KC')-U =-I 
because 4 - KC is Hurwitz. Let ll’(r (A)) — 4. 1 (A-)H*>(A-). Then 
Al!'(r(A)) = »’(<(*•+ 1)) - !!'(< (A-)) 

~ - .'ll' , (A)||‘ i + < 1 (A)(.4 — KC) 1 
x /f(i/U(A))- ;;(.r(A))).i„(A-) 

+ 7 «/.<A-)<//U(A)) - uiJ'U'))) 1 
x 7?(f/U(A*)) - <l{'i'(k)))u,Ak). 

From Lipschitzness of </(•) and |u,,(A*)| < n, we deduce that 

AU'(r(A')) < — F|| ( (A)|| J 

x (1 — 2i,.Y||(.4 - A'f’)7?|| - ii' ! .V 2 ||77||) 

where X is the Lipschitz constant associated with »/(■)• Obviously, 
it is possible to choose n > 0 sufficient small so that for some 
0.0 < H < l 

ATT (1 Ik)) = 1 (A + 1 )R<‘(A + 1) - <■' (A')Re(K-)) 

<-^'(7-)R<‘(A)£0. 

This, in turn, implies 

< 1 (A-)Rc(H < (1 -#)('''(*•- ])Re(A- - 1) 

< •■■ < (l-«()V r (0)Re(0). 

Thus 

||> (A-)|| < Ma k for k = 0,1. • • •, M > 0 and 0 < a < 1. (3.12) 

On the other hand, recall that by Theorem 3 [or inequality (3.9)], 
the following relationship 

IM«<*) + i/U(A , ))«..(*)||/> < ||£(A)||if*. k = 0.1. “ (3.13) 
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to satisfied. Without loss of generality, Let P ^ / in (3.f3). Then, 
we deduce from (3.11) and (3.13) that 

\w +Dii < \\Am+#w))* n m +iia-cv(*)h 

<||e(A-)ll + ^/||A-C’||<--- 

(3.i4) 

From (3.12) and (3.14), we conclude that all trajectories of the closed- 
loop system (3.11) are bounded. Since liuu— *_ i (A ) = 0, the same 
arguments as the ones used in the proof of Theorem 2 show that 
every bounded trajectory of system (3.11) eventually converges to 
the invariant manifold characterized by the difference equation 

C(*+ D* .*«*) +0(C(*))w„(*> 


which is globally asymptotically stable. Thus the proof is complete. 

Finally, we turn our attention to the multi-input case. In this case, 
the input saturation technique developed in Theorem 3 cannot be 
applied directly to the multi-input discrete-time systems, because 
there exists a couple matrix [/ + \g l (.r)P</(.r)]” ] which appeared 
in the smooth state feedback control law. This is in sharp contrast to 
the case of continuous-time affine nonlinear systems (see Theorem 
1). Nevertheless, using a modified state feedback control law, we can 
prove that it is possible to extend Theorems 3 and 4 to discrete-time 
M1MO nonlinear systems of the form (3.1)—(3.2)- 

TheoremS: A multi-input discrete-time nonlinear system (3.1), 
which satisfies Hypothesis H2) and the assumption. HtHS.i = {0 [, is 
always globally asymptotically stabilizable via smooth state feedback, 
In particular, a possible choice is 


u 


i + \a l 


(I* (.s)P.4.i. 


(3.15) 


Moreover, system (3.1) can also be globally asymptotically stabilized 
by the bounded state feedback control law 


«n(.r)— -o / + ( Jt’)Pg(.r) 


ij 1 (jr)FA,v 

TTjj7TrT7M7F- f "* n> 11 < " <1 ' 


(3.16) 


Theorem 6: Under Assumptions H2) and .VOW = {()}, a 
discrete-time M1MO nonlinear system (3.1H3.2) can be globally 
asymptotically stabilized by the dynamic compensator 

*<* + 1) = AUk) + gU(h))u(k) 4- Iv(yU') - C{(*)) \ 

«(*) = ««(<(*)) J U 1 


with any sufficient small n > 0, provided that the pair M.O is 
observable and <M-r) is global Lipschitz for 1 < / < m, where 
u,»(*) is given by (3.16) and A' is such that .4 - I\C is Hurwitz. 

The proofs of Theorems 5 and 6 are strongly reminiscent of the 
proofs of Theorems 3 and 4 and therefore are left to the reader as 
an exercise. 

Remark I: As discussed in the last section, we note that discrete¬ 
time nonlinear systems (3.1H3.2) include bilinear systems as a 
particular case, a class of systems which has attracted considerable 
attenuation in the last two decades. Therefore, it is easy to see that 
all the global stabilization results developed in this section can be 
directly applied to discrete-time bilinear systems. Indeed, simply 
replacing g,(.r) by D,.r -f A", in Theorems 3-6 yields desired results 
for bilinear systems. It must be noted that Theorem 6 has provided 
a solution to the open problem raised in [11], namely the problem 
of global asymptotic stabilization via dynamic output feedback for 
MIMO bilinear systems. 


IV, Conclusion 

In this paper, we have presented smooth and bounded state 
feedback control schemes which globally asymptotically stabilize 
a clas$ of continuous-time affine nonlinear systems whose free 
dynamics are Lyapunov stable. Based on an arbitrarily small bounded 
state feedback control law thus developed, we have been able to prove 
that a class of MIMO nonlinear systems which are “controllable” and 
“observable” to globally asymptotically stabilizable by a nonlinear 
Luenberger-observer-like based output feedback control law. The 
crucial point behind the development of a nonlinear enhancement 
of global separation principle to to use the technique of input 
saturation, which has attracted considerable attention in the literature 
recently in [13], [15], and [16], The stabilization results established 
in continuous-time nonlinear systems have also been extended to 
their discrete-time counterparts. It must be pointed out that such 
generalizations arc neither routine nor trivial. This certainly raises 
the value of Section 111. 


Appendix A 

Global Stabilization or- Affinf 
Systems by Bounded Statf Feedback 

Consider an affine nonlinear conirol system 

til 

1': Jr- f{s)+ '£ n,(.< )„, (A.I) 

I— I 

where ./ € R” is the state and i/, 6 R. 1 < / < >n, are control 
inputs, /: R" —► R N , and g,: R" —> R", 1 < / < /»», are smooth 
mappings, with /(()) = 0. Suppose there exists a("(/ > 1) proper 
and positive definite function V: R M —► R such that 

H) L,n.i)<n v.i e R' 1 

i.e.. the unforced dynamics of (A.I) are Lyapunov stable. Then the 
following global stabilization resuh via bounded state feedback can 
be established. 

Theorem At. An affine nonlinear control system (A.I) with prop¬ 
erly H) is always globally asymptotically stabilizable via arbitrarily 
small bounded state leedback provided lhat D 5\ = {0}, where 
and S\ denote the sets 

= {.,■ 6 R” : L)Y{jr)- 0. 1 < A- < ,•} 

S\ =■ {.!■ 6 R" : l)L r \ (.!■) = 0. Vr g D. 0<A<r-l} 

(A.2) 


the distribution D is defined by 

D = ii,: 0 < k < n — 1.1 < , < (A.3) 

In particular, a typical bounded state feedback conirol law is given by 

«! l (j') = -sgn(Z, f/i \ r (^))min(|Ay ( V(,r)|,o), l < t < in (A.4) 

where n is an arbitrarily positive real number. 

Proof • Substituting the bounded stale feedback conirol law 
(A.4) into system (A.I) yields the closed-loop system 

rn 

i = /(.r) - y^jr/i(j-)sg»(L gi r(.r))min(|£^,r(.r)|.n). (A.5) 

Clearly, the equilibrium r = 0 of the closed-loop system is Lyapunov 
stable. As a matter of fact, a routine calculation proves that 

til 

n.r) = LfV(.r) - ^i„ i r(.r)sgii(I„ i r(.r)) 
x min(|L,, V(x)|,,i). 
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Observe that u* sgn(.r) = |,r|. With this in mind, it follows from 
Assumption H) that 


Y(x) = L f VU) - \L q> V(.r)| min( V(.r)|. o) < 0 (A.6) 

i -i 

Thus the claim follows. 

To show that the equilibrium x = 0 of (A.5) is globally asymptot¬ 
ically stable, we let Y{x) = 0. This yields 

— and L< h l’(-r) = 0, 1 < /' < m. (A.7) 

Hence u','(x) = 0 for 1 < i < tn. Let j*(/,.ro) be the trajectory of 

the free dynamics x = f(x) V/ > 0 starting from .r(0) - .r u . Let r° 
denote its ^-limit set. Since L/V(.r) < 0, r° is nonempty, compact 
and invariant. Therefore for any initial .7 € r°, the corresponding 
trajectory x(t..r) of .r = /(.r) stays in r° forever. Note that V(.r(/)) 
is positive definite and nonincreasing along every trajectory of the 
free dynamics. Thus lint/ f > 0. By continuity of 

V, Y{x) = >i for every .7- — lint, x{tj,x o) in / ,(J . In other words, 
Y{x) is constant over r°. In particular, V/ > 0 

L) r (.!■(/. J)) = 0 V.r £ r°. 1 < A- < r. (A.8) 

On Ihc other hand, since I. li l(.r) vanishes V.r t {./■; l'(.r) = ()}, 

so docs L„,Y(r) V.r £ r". Thus l„Y(.<-(t..r)) = 0 V/ > 0.1 < 
/ < ///. This, together with (A.7), implies that for r = [/.;/,] 


L-\’{.r(t..f)) = L i L, h ) ,r)) — L„,LiY(. r(t..v)) 

= 0. 1 < / < i/i. 

By induction, it is straightforward to prove that 

// f L r r(.i((..r))- 0 Vr £ £>*. and 0 < A- < r - J. (A.9) 

According to the arguments above, we conclude from (A.8HA.9) 
and LaSalle's Invariant Principle that all trajectories of the closed- 
loop system (A.6) eventually approach the largest invariant set 1 of 
{.r € R": V = Of, which is contained in S\ fl By assumption, 
ih fl 5s, — {0}. This implies I = (.#■ € R”: V = 0[ — {0}, thus 
completing the proof of global asymptotic stability. 

Let n = + ^. We deduce immediately from Theorem A.l the 
following result which is a slight extension of Theorem 1 proposed 
in [12]. 

Corollary A.2: An affine nonlinear system (A.l) whose free 
dynamics are Lyapunov stable can always be globally asymptotically 
stabili/able by smooth state feedback if = {()}. In particular, 

a smooth state feedback control law is given by 

u = ~(L„Y (.,■))' ^ -\L , yi / ,J'(.r)] ? . (A. 10) 


Appendix B 

A Boundedness Criterion for Perturbed Systems 

Theorem B.l: Consider a perturbed system described by differ¬ 
ential equations of the form 


Jr = /Lr) + </(/). (B.l) 


Suppose there exists a C 1 function V:R" —* R, with T(0) = 0, 
such that 


Al) 


r<.r)> 


0Y 

Ox 


< n 2 |k|| and LfV{x) < 0. 


Assume that the disturbance fl: R —> R" is piecewise continuous and 
satisfies 


A2) 


/ IW)||rff <« 

JQ 


:i < +c* 


where u,,l < / < 3, are positive real constants. Then all the 
trajectories of system (B.l) are bounded. 

Proof; By Assumption Al) 


Hence 


Y = L,Y{r)+~d(t)<^-,l(t). 

Ox ox 


V(x(t)) < H-i’o) + J ^7 d(T)<h 

r II ai 


< V(.r 0 ) + 


Ox 


||^(r)||dr. 


Using Assumption Al) again, we have 

"i|k(f)|| J < Y(xu) 4- Ii 2 f ||.r(r)|| ||d(r)||r/r. 
./ n 


(B.2) 


Thus 


"i||j’(f)|| < fl i \\x{i )||~ + rf i 

<ui+l (.ro) + n *2 f ||.r(r )|| ||i/( r)||d T . 

Ja 

This, in turn, implies 

11-1(011 < “i + '>2 / ||.r(r)|| ||r/(r)||dr 
./ o 

for some o, > 0, i — 1,2, From the Bellman-Gronwell inequality, 
we obtain the estimate 


|| J -(f)||<n,r n - , io ll ' , ' r)||, ' T 


Vf > 0. 


(B.3) 


Because of A2), Theorem B.l follows from (B.4). 

Remark . A similar argument shows that Theorem B.l remains true 
il Al) is replaced by the following assumption 

I OV 


A) V(.r) > <ii ||.i 


IP/ + 1 


Ox 




and 


LfV(x) < 0, for any positive integer </. 
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Optimal Control for Systems with 
Deterministic Production Cycles 

Jian-Qiang Hu and Dong Xiang 


Abstract —In this paper we consider a failure prone production system 
with deterministic production cycles. The objective is to find the optimal 
production rate to minimize long-run average cost We first show that the 
optimal control policy belongs to a type of switching curve policy which 
has a special structural property and can be characterized by a single 
parameter. We then establish a relationship between the surplus process 
of the system under the optional control policy and the workload process 
of a D/G/t queue, based on which the existing results in queueing theory 
which can be applied to obtain the steady-state probability distribution 
of (be surplus process under the optimal control policy. 


1. Introduction 

.' Consider a production system which has a single machine and 
produces a single part-type. The system meets a constant demand 
r rate d and backlog is allowed. The machine has two states: up 
„ and down, which are denoted by one and zero, respectively. The 
; produced parts are represented by a fluid flow. When the machine is 
f} up, it can produce at any rate between zero and a maximum value r. 
* Denote the production surplus at time t by -Y,; a positive value of 
„ X t represents inventory while a negative value represents backlog. 
\ Let Of 6 {0,1} be the state of the machine at time t and u t be the 
controlled production rate of the machine at time t under a control 
policy 7T. X t can then be characterized by the following differential 

1 1 1 1 
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equation 


rLY, 

dt 


v t — d 


(I) 


^where 0 < u t < r> f r. For each control policy 7r, we use ./* to denote 
its long-run average expected cost 


*K = 


lim 

V — ex. 



(C + Xt +C-\r)dt 


( 2 ) 


where -Y, + = max(.Yi.O), A < = max(-A’i.O) and C + , C are 
two nonnegative constants. The goal is to find a control policy n* 
to minimize •/.. 

A few variations of the above problem (e.g., systems with multiple 
machines and/or multiple part types) have been studied by many 
researchers (see [1], [2] and references therein). The key assumption 
used in almost all previous work, with a few exceptions [3]-[91, 
is that the machine state process {n/} is a Markov process, i.e., 
the machine up and down times are exponentially distributed. Under 
such an assumption, the system can be modeled as a system with 
jump Markov disturbances based on Rishel’s formulation (|3]), and 
it can be shown [10J that the optimal control is a hedging point 
policy. The Markov assumption, however, is nol very realistic in 
many applications. So far, only limited work has been done for 
general systems. 

In this paper we consider the one machine and one part-type 
system in which the machine has deterministic up lime and general 
down time. A production system with deterministic up time, which 
is similar to ours, is also studied by Meyer a at [11J. The system 
we consider here can be ideally used to model production systems 
in which production is only interrupted after (approximately) a fixed 
amount ol time since it starts. For example, consider a production 
system in which preventive maintenance is performed afier the system 
is in opeiation for a fixed amount of time. If the system rarely fails 
due lo its scheduled preventive maintenance, then its up time is 
approximately deiermimslic. Another example is an inventory system 
which replenishes its inventory continuously (with no interruption) 
but only receives demands (orders) periodically. In such a system 
each period corresponds to one up time and an order (a random 
variable) received al the end of each period corresponds lo total 
demands accumulated during one down time. It is also worth pointing 
out that the periodic review system with limited capacity studied 
in inventory theory (1121) is very similar lo our system and their 
relationship is investigated m a recent paper by Fu and Hu 114). 
As we shall see, the well-known hedging point policy is no longer 
optimal for our system. Our analysis shows, however, that its optimal 
control policy belongs to a class of so-called switching curve policies 
whose switching curves have a very simple structure and can be 
characterized by a single parameter Furthermore, we show that the 
surplus process of the system operated under a switching curve policy 
is related to the workload process of a D/G/l queue, a result sim^r 
to the one established in [15] for the one machine and one part-type 
system under a hedging point policy (also see 116]—[ 19] for similar 
results on the relationship of fluid models to the workload process 
of queueing systems). This enables us to use the existing results in 
queueing theory to obtain the steady-state distribution function of 
the surplus process for the class of parameterized switching curve 
policies to which the optimal control belongs. 

If {AV. / > 0} has steady-state probability distribution function 
F(.r) under a control policy it, then the average expected cost J* 


! 
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can be computed as the expected cost with respect to -F(.rh i.e M 




(C 


+ J- + + c~ 


)(lF{x). 


(3) 


Therefore, with the steady-state probability we obtained we can 
finally convert the original optimal control problem into an optimisa¬ 
tion problem with a single parameter, which is much simpler and 
easier to solve. 

The rest of this paper is organized as follows. In Section II, we 
first establish a structural property for the optimal control. Then, 
we discuss the switching curve policies and show that the optimal 
switching curve can be characterized by a single parameter. In Section 
III, we show how the surplus process of the system under a switching 
curve policy is related to the workload process of a D/C/ 1 queue. 
This relationship is used in Seciion IV to obtain the steady-state 
probability distribution of the surplus process for the system under 
the class of switching curve policies to which the optimal switching 
curve policy belongs. A simple example with exponential machine 
down times is provided in Section V. 


II. Thl Optimal Control Policy 


In this section, we first establish a structural properly for the 
optimal control based on sample path analysis. Then we discuss a 
class of so-called switching curve policies. Using the structural prop¬ 
erly established, we can show that when restricting ourselves to the 
switching curve policies, the switching curve foi the optimal control 
belongs to a class of switching curves which can he characterized by 
a single parameter. 

Intuitively, the .-.uuctural property of the optimal control we are 
about to establish can be interpreted as follows: to minimize the 
surplus cost, one needs to maintain a nonnegative surplus at the end 
of each machine up period to anticipate future capacity shortages 
brought by machine failures while keeping the surplus level close 
to zero as much as possible. Most of our derivations arc quite 
straightforward. We hope that the accompanying figures greatly aid 
in the understanding of the derivations. 

Suppose 7r* is the optimal control policy. Given the machine stale 
process {a,}, lei 1i/ 4 1 be the optimal control process generated under 
the optimal control policy n *, and [ A7 } be the corresponding surplus 
process determined by (I) with if/ being replaced by nj. 

Lemma I: There exists an optimal control {//* [ that satisfies the 
following condition 


v if A7 < 0-rti ^ 1; 

' \> <i if v; =lln, = 1. 


(4) 


Condition (4) can he explained as follows: When the surplus is 
negative the production rate should be set at the maximum rate r 
so that the surplus can be brought back to zero level as quickly 
as possible; when the surplus is at zero level we should keep il at 
a nonnegative level as long as we can. In other words, we should 
Always prevent the surplus from being negative. Based on Lemma 1, 
vft shall henceforth assume the optimal control we consider satisfies 
(4). 

Proof: If {ii7} does not satisfy (4), we can construct another 
optimal control based on {</, 4 ( which satisfies (4). Suppose n = 1, 
Let us first consider the case A7 U < 0. We denote the first hitting 
time lo 0 from A7 (J by 

to = inf{r.A7 = (it > to}. 


By definition, we have X/ < 0 for t £ [to. to). We construct 



if i € [fn- /,'!] and .Y, < 0; 


Ui - l dot. 

if / £ [fo./Si and A\ = 0; 

(5) 

1 .r. 

otherwise 




Fig. I. Comparison between Xf and X ( during \lo- f o\ for the case 

< n. 


where {A/} is the surplus process associated with {i// J. Under 
the control (<//}, the machine produces at the maximum rate r (if 
possible) until the surplus level reaches zero and then remains at 
zero (if possible) until t[ J. Therefore, we have u f{i = r (in fact, {«/} 
satisfies (4) at least during the time interval [G.ffi]) and 


f > A if t € [G* ); 

\— AT. otherwise 


(see Fig. 1), from which we immediately obtain 


L 


<c+.v,‘ -4-r .w ut 
< / (c ,+ (.\7) + + t ,- (.Y;r).w 

./n 


toi any T > 0. 


( 6 ) 


That is to say {u, [ is also an optimal control process. 
For the case A7 U = 0, wc define 

to = inf f/: A7 > 0. t > to }- 


If = to. wc then have ilXf/dt >‘11, which implies uj > r/. If 
G to, we have A', 4 < 0 for / £ {to-tl\ h in which case we construct 
{<//} based on (5) with t{\ being replaced by fj. The rest of the proof 
remains the same. □ 

Lemma 2: There always exists an optimal control [a* | such that 
it only lakes three values 0. </, and r. and its value increases with 

respect to the age of the machine up lime when the surplus level 

becomes nonnegative. 

Proof: Denote the (deterministic) machine up time by D. We 
consider a lime interval [G,G] during which ihe machine is up, 
where G is the epoch at which the machine up period is initiated 
and 1-j = t] + l) (i.e., G is the time epoch at which the machine up 
period ends). Unless otherwise stated, in what follows we shall focus 
on the interval [G- G]. Since {»/*} satisfies (4), it is not difficult for 
us to verify lhat 

i) If \,* < 0, then A7, > A', 4 , ; 

ii) If X? > 0 for some t £ [G-G]» then A’ 4 (t) > 0 for all 

r £ [LG]- 

Therefore, we have the following three cases: 

1) ATj < 0 and Xf < 0. In this case A * < 0 for all t 6 [G< G]. 

which implies u 4 — r for all t £ [G,G]. 

2) A7j < 11 an d Xf 2 > 0. Let t\ be the first hitting time to zero 

from A' 4 ,. It immediately follows that u* = r for t £ 

since A 4 < 0 for t £ [G.f}). Also it is clear that A7 > 0 
for t £ [ft.G], We now construct another control {n,} bused 
on { u\ | as follows 

r 'I- if X;j(r-d)); 

ui - < r. if i € [ti - - </), 

otherwise. 


We can easily verify lhat the surplus process {.V#} associated 
with the control {a,} satisfies 


X, 



if/€[G^-Y # V(r-rf),G]; 

otherwise 
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queue with service times {t<t „r/: ri = 1,2*...} and inlerarrival times 
nal,2,...}. Since t„,n(r- d) is equal to a constant 
Dir — d), the GI/G/ 1 queue is in fact a D/G/l queue, Therefore, 
we can apply various methods developed for single server queues 
in the literature to obtain the steady-state probability distribution 
functions of Y,i „ and Y„ „ (hence A,, rl and A\,,„) (e.g., see |20]). In 
the following sections, we shall simply assume that the steady-state 
' probability distributions of X,t „ and A’„,„ arc available. 


IV. Distribution Fi jnction 

In this section, we consider the switching curve policy (7). We use 
the level crossing technique to derive the steady-state distribution 
function of the surplus process under (7). We should point out, 
however, that the technique can also be used to obtain the steady-state 
distribution function of the surplus process under a general switching 
curve policy though the derivation becomes a bit more involved. 

Denote the steady-state distribution function of A’„ „ by 
and its density function by f\ u (.r). Define 

Pi(j') - the number of .r downcrossings of the process 
{ X, (during [0./]; 

I //(.*■) = the number of r-uperossings of the process 
(A' f ( during [()./]. 

Notice that the accumulated time that the process { X, ( takes value 
between [,r. .r 4- b.r] during [0. /] is equal to 

//i (■>■)*■!■ 

l -~d + ‘ i/ ’ 

Therefore, the steady-state probability density lunclion of {A/} is 
given by 

. , . r 1 (HtU) , PiiJ)\ f . , n 

f\ i-r) = Inn - —.4-: foi .i < : and .i f 0. 

i-' t \ r - d d J 

(9) 


(The interested reader is referred to [16) for a rigorous derivation ol 
(13).) Note that 


li,„ “i'i = lh„ 

/ - f I — -k 

it then follows from (13) that 


r 

lim 

i - v 



t 

r 

liin 

Ut„ „ 

d(r - d) 


T lt 


PiUJ 

t 


for .r < : and .»■ ^ 0. 


( 10 ) 


(Recall T„ „ + i is the epoch at which the (it + 1 )th machine up tunc 
initiated.) To calculate lini„ Mi u n n (r)/T u #l + i, we consider an 
down and up cycle [T lt T„ „ + i]. First, it is obvious that the process 
{A'/} can have at most one ./-upcrossing during [T„ , ( .T„ „ +J ] (in 
fact during [T„ „, 7^ „ + i]). Define 


r(j) 


{ 


IJ- 

0, 


» - t! ' 


if j* > 0. 
if .v < 0. 


(See Fig. 5.) Then { X t } has one .r-uperossing during [7 , lli „. T,i ,,-j. i] 
if and only if ./■ — (/■ — d)D < A’, IiM < 4- dr(.r). Therefore 


ii 

u, . H (j ) = ^l(.r- (/• — tl)D < ,Y„ ,, < Jt + df(jr)). 


It follows that 

AW* 


lint 


1 


/(,. _ d) T u Tlf i n 
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Xt 



X ^ l(.r - ~ d)D < X u u < .r 4- dc[j')) 

1 

= b[F\ u (.v 4- dr(,r)) - F\ u (.v - (r - d)D)] 

for .r < : and r ^ 0 (11) 

where b j= r/((D 4- R)d( r - d)) and /? is the average machine down 
time. Taking the normalization factor into consideration, we have 

F\ (0) — \ - j / v (12) 

I^et .7 be the run-long average expected cost associated with the 
switching curve policy |7|, then 

7. = I (f 4 .r 4 +C-X )/x(r ),Ij. 

Therefore, the optimal switching curve (i.e., the optimal value of 
:) can be obtained by solving the following optimization problem: 
min 


V. An liXAMRLb 

In this section, wc study a simple example in which the machine 
down time is exponenlially distributed with mean 7? and rate /*( = 
1//?). We will use the results obtained in the previous section to 
calculate j \ (.t ) and J . Since now the down time is exponentially 
distributed, the corresponding D/G/\ queue becomes a D/M/\ with 
the mean interarrival time (/ - d)D and the mean service time 7?r/. 
Hence we have (120]) 

= "K tor < • 

where n is the unique solution to the following equation in the range 

0 < <r < 1 

a _ f -<.-./j/j<i 

Based on |I5| we obtain 

= f “d - -mini ’ ' » )))/il 

__ ( -(!-»),,(- , 

for .r < i and : + 0. (13) 

When - < d(r — d)D/t\ (13) gives 
/\U) = 

f(, ■,l)l))/rl'j j f Q < j. < 

U ( -<'_ r -(l-"l,.<,-.IU>/.A j f r < 0 




7*6 
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^ _ d , d _ -fi( l-rr) /l 

j/(l-rr) yi 2 (1 — <r) J y/ 2 (1 — <r 


/^(l - (r) 1 


-*<( 1 —*») /W/i -/i(l-ir)(r-rf)0//i 


When « > d(i - <J)D/i , (11) gives 

/\U) = 


b(l-t ■'+*<' 

if „ — </(i — d)D/r <i<«, 

*)»•(»< - >/<'< - 0)-^) _ r —(i -*)# ( - +(»-0O)/f^ 

V if 0 < r < — r/(r — <l)D/u 

\ K <i )/^i_ r G ')#•( 0/>//^ 


If i <0 


|i( l-*) / f / M «•)! OH/ I 


/<( 1 - <t) 

x — <r)/i + 1) — i 1,(1 

+ " , '" r) " 

/,•*(] - fr)- 1 

M J? -,(i »)//,,_ ,(i 

hC + d J (i-it) J Df 2 

~ 2r* l,(!-«> + 

+ !£l£lirj!i i (, "'( tt-t 

//■*(] - <r)' ! r i V 


#<(i <*)( t)t 


( 1 ») //|| /(!—«•)( 


( 1 ) - 1 
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Analysis and Control of Nonlinear Infinite Dimensional Systems 

- V. Barbu. (San Diego, CA: Academic Press, 1993). Reviewed by 
H O. Fat tor in i. 

1. Difffrenti^ Equations in Linear Spaces 

Lei il be a domain with boundary V in in -dimensional Euclidean 
space R ? ". An initial or initial-boundary value problem such as 

///(/, ,r) - Ai/(/, .r). //((), .r) — u 0 (.r) (j* € il) (1) 

«(.i\f) = U (-ren (2) 

(.r — (./■ i. .r_>. ••■../■„,), A the Laplacian in the space variables) can 
be studied in the form of an “ordinary differential*’ evolution equation 

n'(0 - Au(t). »/(()) “ i/(i (3) 

in a suitable function space E\ u{i) takes values in E and A = A, the 
boundary conditions included in rhe definition of the domain D{ A) of 
A or of the space E, For instance, we may take E ~ L 2 {il), D(A) 
consisting of all u E L 2 {il) such that Ai/ E L 2 {il) (in Ihe sense of 
distributions) and ii = 0 on I\ Alternately, we may set E = 

(the space of all continuous functions on the closure 12 lhat vanish at 
the boundary T); in this case, the boundary condition (2) is included 
in the definition of the space rather than that of D(A). 

The formal similarity of (3) with a linear, constant coefficient 
system of ordinary differential equations gives insight into the original 
problem (1)—(2): for instance, we may expect to be able to write the 
solution in the form </(f) = S(t) = exp(M)i/n, which is true if 
the exponential is correctly interpreted. This approach to evolution 
problems for partial differential equations was initiated by Hille 
f9] and Yosida [20] under the semigroup theory point of view of 
studying the exponential operator equatioh S{* + t) = S{m)S(I ), 
5(0) = F; their main result, the Hille-Yosida theorem, characterizes 
the infinitesimal generator A = lim/,_o/i -l (S(M - /) of S(t) 
in terms of the resolvent operator (A/ - A)‘ ] . Equivalently, it 
characterizes the operators A (not necessarily bounded or everywhere 
defined) such that the initial value problem (3) is well posed; this 
means solutions exist for a dense set of initial data and depend 
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continuously on these initial data. (The Hille-Yosida theorem was 
given full generality by R. S. Phillips, I. Miyadera, and W. Feller). 
Among the many subsequent highlights of the linear theory, we 
mention the introduction of dissipative operators in Hilbert space f19J 
(corresponding to an important particular case of the Hille-Yosida 
Theorem) and their Banach space extension in [16]. 

Semigroup theory was extended to nonlinear operators by Komura 
in 1111 in Hilbert spaces, soon after by Kato [10] in certain “smooth*’ 
Banach spaces, and finally by Crandall and Liggett [5] in arbitrary 
Banach spaces (see 1171 for more information). A class of equations 
(2) lying between the linear and the fully nonlinear is that of 
semilinear and quasilinear equations; these can be treated by a 
combination of linear semigroup methods combined with results on 
integral equations and fixed point theory. See [8) and 118] for more 
information on these equations and on linear semigroup theory. 

The author of the present book has contributed many important 
results on nonlinear semigroup theory as well as the first textbook in 
this area [1| (the Romanian version appeared in 1974). Part of this 
work can be considered as an update of [ 1]. It includes a short review 
of linear semigroup theory, the key results of nonlinear semigroup 
theory, and numerous applications to partial differential equations. 


II. Infinite Dimensional Opitmai Control Theory 

Finite dimensional control theory deals with systems described by 
ordinary differential equations; infinite dimensional control theory 
studies systems described by partial differential equations (and also by 
functional differential equations). Control theory of partial differentia^ 
equations began to be practiced by engineers in the fifties, anc 
was raised to a higher mathematical level in the Soviet Union by 
Butkovskii and others during the sixties [31. Using a more abstrac 
approach, [15| brought to bear on control theory the extensive theory 
of boundary value problems developed in the previous decade by 
Magenes and the author. At about the same time, [6], [7] a differer. 
approach to evolution control problems was initiated, based on the 
Banach space version (3) of the equations. Since then, mathematical 
optimal control theory of partial differential equations has developed 
along these two lines, sometimes combining both approaches. There 
are problems (such as those described by elliptic equations or elliptic 
variational inequalities) where the evolution approach (3) is nc 
germane; however, many other tools of functional analysis apply 
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For several years the emphasis was on obtaining versions ot Pontrya- 
j gin’s maximum principle for optimal control problems ot all types 
Recently, much work has been done on the dynamic programming 
approach to control of partial differential equations which extends 
Che finite dimensional theory in 121 Potentially the most promising 
approach (it provides closed-loop feedback solutions), it was partly 
heuristic even in hntte dimensions New developments in nonlinear 
analysis such as viscosity solutions ot nonsmooth nonlinear partial 
‘I differential equations have plugged many of the mathematical gaps 
, and put the theory on a turner footing, although much lemains to 
' be done 

The present book comprises both control ot steady stale and 
evolution systems Among the first are optimal control ot elliptic 
equations and variational inequalities, including the Signonm prob 
lem r the obstacle problem and tree boundary problems Treatment ot 
evolution problems includes both results of maximum principle type 
* and the dynamic programming approach 

We mention in passing that the ‘engineering ’ control theory of 
partial differential equations has gone its own way although often 
intersecting with the mathematical theory, it usually approaches 
problems by discretization and application ol finite dimensional 
theory, sometimes leaving aside rigorous proofs On this it must be 
said, however that mathematics lags behind the applications in this 
area as in others There are works such as |4] that straddle both fields 
they adopt in some ways the engineering point of view but provide 
mathematical rigor as well 

III Conclusions 

Infinite dimensional optimal control theory within acceptable math 
ematttal standards^requires a considerable overhead Without count 
Ing measut^lheory, a great deal ol partial differential equations and 
functional analysis must be absorbed before one can face even the 
Simplest problems The same can be said of the abstract theory 
of evolution equations l his book provides a clear example filled 
exploration into the two subjects written by a mam protagonist in 
th<? latest developments The work can be used as both a textbook in 
infinite dimensional control theory or in a modern course in partial 
differential equations presenting much of the recent research into 
such subjects as the dynamic programming approach We believe its 
main audience will consist ot mathematicians but there are in many 
advanced graduate students in engineering that could profit from it 
It is a welcome contribution in a field not very well represenled 
There are other works on the market mostly in such other subjects 


a$ stabilization by feedback [4], f 13], the linear—quadratic problem 
(14], or controllability [12] This book is a welcome and timely 
addition to the literature 
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